Advanced Application

AI-Augmented Discovery

Individual nodes serve individual users. But who serves the network itself? Network operators can add an optional external layer that monitors cross-population patterns, generates hypotheses, tests them by querying users, and continuously refines similarity definitions. No supercomputer required—but add one and it supercharges the network.

By Christopher Thomas Trevethan January 16, 2026

What Individual Nodes Do vs. What Network Operators Can Do

Your node—your AI agent—helps you with your issue. It routes you to similar patients in your semantic space. It synthesizes outcomes from patients like you. It optimizes treatment for your condition. That's the core QIS value: quadratic intelligence scaling for individual benefit.

But here's what your individual node can't do: see correlations across the entire network. Spot patterns that span different conditions. Request specific additional insight from thousands of nodes outside your bucket to test a hypothesis. Refine similarity definitions that affect everyone.

That's a different layer. That's what network operators—pharma companies, health systems, research consortiums—can add on top. External AI Augmentation gives them the ability to monitor population-level patterns, generate hypotheses about what's working, test those hypotheses by querying nodes across the network, and continuously refine similarity definitions to make the whole network smarter.

This IS adding new capability. Individual nodes serve individual users. The external layer serves the network itself.

The Insight Harvesting Loop

Network operators can add a monitoring layer that watches aggregated outcome packets flowing across the entire network—not just one bucket, but all of them. Population-level patterns. Cross-condition correlations. Anonymized. Privacy-preserved. This could be a supercomputer, a cloud AI model, or an analyst with a dashboard and spreadsheet skills. The mechanism is the same; the scale varies.

This layer does five things that individual nodes cannot:

1. Pattern Discovery

Monitor aggregated outcome packets across the entire network—all buckets, all conditions. Spot unexpected correlations that no individual node would ever see. Example: patients in bucket X (Stage 3 KRAS+ colorectal) who also happen to report high omega-3 intake are showing 23% better response rates. That correlation isn't in the similarity template. The network operator found it by watching population-level data flow.
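A scan like this can be sketched in a few lines, assuming outcome packets are plain objects and optional metadata appears as boolean flags (all names here are hypothetical, not the system's actual schema):

```javascript
// Hypothetical correlation scan: for each optional metadata flag, compare
// response rates between packets that carry the flag and those that don't.
function scanCorrelations(packets, flags) {
  const findings = [];
  const rate = group => group.filter(p => p.responded).length / group.length;
  for (const flag of flags) {
    const withFlag = packets.filter(p => p[flag]);
    const without = packets.filter(p => !p[flag]);
    if (withFlag.length === 0 || without.length === 0) continue;
    // Delta in response rate is the raw signal; validation comes later.
    findings.push({ flag, delta: rate(withFlag) - rate(without), n: withFlag.length });
  }
  // Largest absolute deltas first: these become hypothesis candidates.
  return findings.sort((a, b) => Math.abs(b.delta) - Math.abs(a.delta));
}
```

A real implementation would add sample-size floors and multiple-comparison corrections before promoting any delta to a hypothesis.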

2. Hypothesis Generation

AI generates hypothesis: "Omega-3 fatty acids may potentiate treatment response in KRAS+ colorectal cancer patients." This isn't from a clinical trial. It emerged from real-time population data. The AI is doing what researchers do—noticing patterns and asking "why?"

3. Hypothesis Testing

Here's where the hypothesis gets confirmed or rejected. The network already has the infrastructure—testing uses the same QIS routing to create controlled comparisons.

4. Validation

Results come back. Does the effect replicate across subgroups? Does it hold when controlling for confounders? If yes—signal confirmed. If not—hypothesis rejected. Move on to the next correlation.

5. Template Refinement

Validated patterns propagate back to the network. The similarity template updates: now "omega-3 intake" is a weighted factor for KRAS+ colorectal patients. Every future patient benefits from the discovery. The network got smarter.
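The refinement step can be sketched as a weighted similarity function whose weight map grows as factors validate (field names and weights here are hypothetical, not the system's actual scoring):

```javascript
// Hypothetical weighted similarity: each template field carries a weight,
// and similarity is the weighted fraction of fields that match.
function similarity(template, patient, weights) {
  let matched = 0;
  let total = 0;
  for (const [field, weight] of Object.entries(weights)) {
    total += weight;
    if (template[field] === patient[field]) matched += weight;
  }
  return total > 0 ? matched / total : 0;
}

// Template refinement: a validated factor enters the weight map.
const weightsV1 = { disease: 1.0, stage: 1.0, kras: 1.0 };
const weightsV2 = { ...weightsV1, omega3_intake: 0.3 }; // validated factor added
```

With weightsV2, two patients who differ only on omega-3 intake are slightly less similar than before the refinement, so routing starts to separate them.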

How Hypothesis Testing Actually Works

This is the technical core. A correlation has been spotted. A hypothesis has been formed. Now: is it real or just noise? Is it causal or just confounded? QIS provides three testing mechanisms—each uses existing infrastructure:

Method A: Template Split Test

Create two template versions—one includes the hypothesized variable, one doesn't. Route identical patient profiles through both. Compare which template produces better outcome matches. A/B testing at the routing layer.
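One simple way to implement the split, sketched here with hypothetical names, is deterministic 50/50 assignment at the routing layer so the same query always lands in the same arm:

```javascript
// Hypothetical A/B split at the routing layer: half of incoming queries use
// the baseline template, half use the variant with the hypothesized field.
function assignTemplate(queryId, baseTemplate, variantTemplate) {
  // Deterministic split via a simple string hash of the query id,
  // so retries and follow-ups stay in the same arm.
  let hash = 0;
  for (const ch of queryId) hash = (hash * 31 + ch.charCodeAt(0)) >>> 0;
  return hash % 2 === 0
    ? { arm: "A", template: baseTemplate }
    : { arm: "B", template: variantTemplate };
}
```

Outcome quality per arm is then compared once both arms accumulate enough matches.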

Method B: Matched Cohort Query

Query two cohorts: patients WITH the hypothesized factor vs. patients WITHOUT, matched on all other variables. Compare their outcomes. Classic epidemiological design, executed via QIS queries.

Method C: Prospective User Query

Push opt-in questions to matching nodes. Ask users directly for data that doesn't exist in current packets. Collect new data points prospectively. Validate as responses accumulate.

Detailed Walkthrough: Testing the Omega-3 Hypothesis

Note: This is a hypothetical example to illustrate how the system works—not a medical claim.

Let's trace exactly how hypothesis testing works, step by step.

The Setup

Observation

Patients in the KRAS+ colorectal bucket who report high omega-3 intake show 23% better response rates than those who don't.

Hypothesis

"Omega-3 fatty acids improve treatment response in KRAS+ colorectal cancer patients."

Alternative Explanation (Confounding)

Maybe patients who take omega-3 also exercise more, eat healthier overall, have higher socioeconomic status, or are more compliant with treatment. The correlation could be spurious.

Method B in Action: Matched Cohort Comparison

The AI queries the network for two groups, matched on ALL template variables except omega-3:

// Define the base template (everything EXCEPT the hypothesized variable)
const baseTemplate = {
  disease: "colorectal_cancer",
  stage: 3,
  kras: 1,              // KRAS mutant
  msi: 0,               // MSI-stable
  age_range: "60-70",
  performance_status: "ECOG_0-1",
  treatment_line: 1,
  prior_chemo: false
};

// COHORT A: High omega-3 intake (from nutrition app metadata)
const cohortA = network.query({
  ...baseTemplate,
  omega3_intake: "high"   // >1000mg/day
});
// Returns: 847 patients, avg_similarity = 0.94

// COHORT B: Low/no omega-3 intake (control group)
const cohortB = network.query({
  ...baseTemplate,
  omega3_intake: "low"    // <200mg/day
});
// Returns: 2,134 patients, avg_similarity = 0.93

Both cohorts are matched on disease, stage, mutations, age, performance status, treatment line. The only systematic difference is omega-3 intake. Now compare their outcomes:

// Synthesize outcomes from both cohorts
const outcomesA = synthesize(cohortA);
// {
//   median_pfs: 14.2 months,
//   response_rate: 0.71,
//   serious_ae_rate: 0.18,
//   n: 847
// }

const outcomesB = synthesize(cohortB);
// {
//   median_pfs: 11.8 months,
//   response_rate: 0.54,
//   serious_ae_rate: 0.21,
//   n: 2134
// }

// Statistical comparison
const comparison = {
  pfs_delta: "+20.3%",        // (14.2 - 11.8) / 11.8
  response_delta: "+31.5%",   // (0.71 - 0.54) / 0.54
  p_value: 0.003,
  confidence_interval: "95% CI: [12%, 29%]"
};

Validation: Subgroup Replication

One comparison isn't enough. To rule out confounding, the AI runs the same matched comparison across multiple subgroups:

By Age

50-60: +18% effect ✓
60-70: +22% effect ✓
70-80: +19% effect ✓

By Treatment

FOLFOX: +21% effect ✓
FOLFIRI: +17% effect ✓
Single-agent: +24% effect ✓

By Geography

US: +20% effect ✓
EU: +19% effect ✓
Asia: +23% effect ✓

By Insurance

Private: +21% effect ✓
Medicare: +18% effect ✓
Medicaid: +22% effect ✓

If the effect only appeared in wealthy patients or specific regions, that would suggest confounding. But when it replicates across all subgroups with consistent effect size? The signal is real.
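The subgroup check above can be expressed as a simple consistency rule (the function name and thresholds are illustrative, not the system's actual statistics):

```javascript
// Hypothetical replication check: the effect counts as replicated if it
// clears a minimum in every subgroup AND the subgroups agree within a
// tolerance band (a wide spread suggests confounding).
function replicates(subgroupEffects, minEffect, maxSpread) {
  const values = Object.values(subgroupEffects);
  const allClearMinimum = values.every(v => v >= minEffect);
  const spread = Math.max(...values) - Math.min(...values);
  return allClearMinimum && spread <= maxSpread;
}

const byAge = { "50-60": 0.18, "60-70": 0.22, "70-80": 0.19 };
replicates(byAge, 0.15, 0.10); // true: consistent across age subgroups
```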

Method C: Prospective Validation

For patients who don't have nutrition app data, the AI pushes an opt-in question:

// Push prospective question to nodes without omega-3 data
const question = {
  id: "omega3_intake_prospective_001",
  target: "kras+_colorectal_stage3_no_nutrition_data",
  text: "During your treatment, have you regularly taken omega-3 supplements or eaten fatty fish (salmon, mackerel, sardines)?",
  options: [
    { value: "never",   label: "Never or rarely" },
    { value: "monthly", label: "A few times per month" },
    { value: "weekly",  label: "1-3 times per week" },
    { value: "daily",   label: "Daily or almost daily" }
  ],
  consent: "Your response helps improve treatment recommendations for future patients.",
  expires: "2026-02-15"
};

network.pushQuestion(question);
// 4,218 eligible nodes receive question
// Response rate: ~68% = 2,868 responses

As responses come in, the AI correlates them with outcomes:

// Prospective results (dose-response relationship)
const prospectiveResults = {
  daily:   { n: 412, response_rate: 0.73, delta: "+35%" },
  weekly:  { n: 891, response_rate: 0.64, delta: "+18%" },
  monthly: { n: 743, response_rate: 0.58, delta: "+7%" },
  never:   { n: 822, response_rate: 0.54, delta: "baseline" }
};

// Dose-response gradient confirms hypothesis
// More omega-3 → better outcomes → causation likely

Decision Gate: Accept, Reject, or Escalate

ACCEPT hypothesis if:

• Effect size >15% AND p-value <0.01
• Effect replicates across ≥3 independent subgroups
• Prospective data confirms retrospective pattern
• Dose-response relationship observed

REJECT hypothesis if:

• Effect disappears when controlling for confounders
• Fails to replicate across subgroups
• Prospective data contradicts retrospective finding

ESCALATE to formal clinical trial if:

• Effect is large (>30%) and consistent
• Mechanism is plausible
• Regulatory approval would require RCT validation
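Taken together, the three gates could be encoded as one decision function. This is a sketch using the thresholds from the lists above; the reject branch is simplified to "fails any accept criterion," and all names are hypothetical:

```javascript
// Hypothetical decision gate over a validated hypothesis result.
function decide(result) {
  const { effectSize, pValue, replicatedSubgroups,
          prospectiveConfirms, doseResponse } = result;

  // Accept criteria: effect >15%, p<0.01, ≥3 subgroups, prospective
  // confirmation, and a dose-response gradient.
  const meetsAccept = effectSize > 0.15 && pValue < 0.01 &&
                      replicatedSubgroups >= 3 &&
                      prospectiveConfirms && doseResponse;
  if (!meetsAccept) return "reject";

  // Large, consistent effects get escalated toward a formal trial.
  if (effectSize > 0.30) return "escalate";
  return "accept";
}
```

On the omega-3 numbers from the walkthrough (+20.3% PFS delta, p = 0.003, four subgroup families, prospective dose-response), this gate lands on "accept".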

In this case: Hypothesis accepted. Omega-3 intake becomes a weighted factor in the KRAS+ colorectal template. Future patients with high omega-3 intake route to similar patients. The network learned something new.

What just happened: The AI ran what's essentially a retrospective cohort study + prospective validation using QIS as the data infrastructure. No central database access. No 3-year enrollment period. No IRB delay for initial hypothesis screening. The matched cohorts already existed in the network—the AI just queried them. Time from hypothesis to preliminary validation: hours to days, not years.

Another Example: The Blueberry Discovery

Note: This is also a hypothetical example to demonstrate the discovery loop—not a medical claim.

Same Loop, Different Hypothesis

Pattern Detected

Different network, different discovery. AI monitoring aggregate outcomes spots that patients with high anthocyanin consumption (blueberries, blackberries, purple cabbage) have 31% better progression-free survival. This wasn't in any template—it emerged from optional nutrition app metadata.

Hypothesis Formed

"Anthocyanin consumption improves treatment outcomes through [unknown mechanism]."

Test via User Question

For patients WITHOUT nutrition apps, push simple opt-in question: "How often have you consumed blueberries, blackberries, or purple vegetables during treatment?" Daily / Weekly / Monthly / Never.

Results

2,847 responses. Daily: +34%. Weekly: +18%. Monthly: +7%. Never: baseline. Dose-response confirmed.

Template Updated

"anthocyanin_intake" added as weighted factor. Network improves. Every future patient benefits.

Same infrastructure. Same logic. Different discovery. The loop runs continuously.

No Supercomputer Required (But It Helps)

Different Scales, Same Logic

A network operator doesn't need a supercomputer to do this. An analyst with a dashboard, spreadsheet skills, and domain expertise can run this exact loop manually—monitor aggregated outcomes, spot correlations, form hypotheses, design user questions, analyze results, update templates.

The supercomputer (or cloud AI) scales it up: process millions of packets in parallel, spot subtle correlations across the entire population in seconds, run thousands of hypothesis tests simultaneously, automate the whole cycle.

But the fundamental capability—cross-network correlation detection, hypothesis testing via user queries, similarity refinement—that's what the external layer adds. Individual nodes don't do this. They serve their users. The external layer serves the network.

QIS Doesn't Replace Analytics—It Supercharges It

Traditional data analytics isn't going away. It still has its use cases. But for real-time, precision, scalable insight—the kind where you need to match someone to their exact cohort and synthesize outcomes instantly—QIS delivers results that traditional pipelines simply cannot match.

Here's what's interesting: one of the best use cases for traditional analytics is monitoring QIS networks themselves. That's exactly what this external AI augmentation layer does. Traditional tools analyzing QIS outcome streams.

But consider the economics: How many tokens or how much compute does it take to analyze one person's raw data—their full medical history, genomic profile, lifestyle factors, treatment records? Now compare that to analyzing a tiny outcome packet—a few structured fields, maybe 500 bytes.

The math is obvious: you can analyze orders of magnitude more outcome packets than raw records within your existing context windows and compute budgets. QIS doesn't eliminate traditional analytics—it gives analytics an entirely new field to operate in, at a scale that was previously impossible.
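A rough back-of-envelope makes the ratio concrete. The bytes-per-token rate and the raw record size are assumptions, chosen only to show the order of magnitude:

```javascript
// Back-of-envelope: how many 500-byte outcome packets fit in the byte
// budget of one raw medical record? Illustrative numbers only.
const bytesPerToken = 4;                  // rough LLM tokenization assumption
const packetBytes = 500;                  // one structured outcome packet
const rawRecordBytes = 5 * 1024 * 1024;   // assume a 5 MB raw record

const packetTokens = packetBytes / bytesPerToken;            // ~125 tokens
const packetsPerRecordBudget = rawRecordBytes / packetBytes; // ~10,000 packets
```

Even under conservative assumptions, one raw record's budget covers roughly ten thousand packets—the "orders of magnitude" claim in plain arithmetic.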

Technical Implementation

For those building this:

Template Versioning

Track template changes. v1.0 → v1.1 when new field added. Outcomes tagged with template version for accurate comparison.
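A minimal sketch of version tagging (the structure is hypothetical, not a defined packet format):

```javascript
// Hypothetical version-tagged outcome: every packet records which template
// version it was matched under, so v1.0 and v1.1 outcomes are never mixed.
const templateV11 = {
  version: "1.1",
  fields: ["disease", "stage", "kras", "omega3_intake"] // new field in 1.1
};

function tagOutcome(outcome, template) {
  return { ...outcome, template_version: template.version };
}

const packet = tagOutcome({ response: true, pfs_months: 14.2 }, templateV11);
```

Aggregation jobs can then group by template_version before comparing outcome quality across versions.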

A/B Testing

Run multiple template versions simultaneously. 50% of new queries use v1.0, 50% use v1.1. Compare outcome quality.

Graduated Collection

Only request data needed for current hypothesis. Don't ask for full dietary history—ask about anthocyanins. Minimize user burden.

Consent Flows

User questions are opt-in. Clear explanation of why the question is being asked. Response is voluntary. No penalty for skipping.

Hypothesis Registry

Log all hypotheses tested: timestamp, evidence, result, confidence. Prevents re-testing rejected hypotheses. Creates audit trail.

Effect Thresholds

Don't update templates for noise. Require minimum effect size (e.g., >15% outcome delta) AND statistical significance (p<0.01) AND replication across subgroups.

// Hypothesis test example: User questionnaire trigger
function testHypothesis(hypothesis, targetCohort) {
  // Generate targeted questionnaire
  const question = {
    id: "anthocyanin_intake_001",
    text: "How often do you consume blueberries, blackberries, or purple vegetables?",
    options: ["Never", "Monthly", "Weekly", "Daily"],
    hypothesis_id: hypothesis.id,
    expires: "2026-02-09"
  };

  // Push to matching nodes (opt-in)
  const targetNodes = network.query(targetCohort);
  targetNodes.forEach(node => {
    if (node.acceptsQuestions) {
      node.pushQuestion(question);
    }
  });

  // Collect responses, validate hypothesis
  return collectResponses(question.id)
    .then(responses => validateHypothesis(hypothesis, responses));
}

What This Enables

Continuous Network Improvement

Every hypothesis that validates becomes a template refinement. Every template refinement makes matching more precise. Every precise match improves outcomes. The network gets smarter with every cycle.

Network operators—pharma companies, health systems, research consortiums—can continuously improve their similarity definitions based on real-time population data. Not guessing what matters. Discovering what matters.

The same loop applies across use cases:

Treatment optimization: Which dosing schedules work best for which patient subtypes? Refine templates accordingly.
Side effect prediction: What patient characteristics predict adverse reactions? Add those factors to similarity.
Drug interactions: Which combinations show unexpected synergy or interference? Create new buckets.
Lifestyle factors: What behaviors correlate with better outcomes? Weight them in routing.
Environmental triggers: Do regional factors affect treatment efficacy? Incorporate geography.

Every pattern discovered is a potential similarity refinement. Every refinement makes the network more useful. The loop runs continuously.

Patent: "Insight Harvesting Loop: External AI Augmentation of Distributed Insight Swarm for Self-Evolving Pattern Intelligence" (63/873,521)

Two layers, complementary functions. QIS serves individual users—aggregating locally to form outcome packets, routing them to similar cases, synthesizing insights, optimizing treatment within their semantic space. External AI Augmentation serves network operators—spotting cross-population correlations, testing hypotheses via user queries, and continuously refining similarity definitions to make the whole network smarter. Individual intelligence compounds. Network intelligence evolves.

Two Layers, One System

QIS serves individuals—matching you to patients like you. External AI serves the network—finding patterns, testing hypotheses, refining similarity. Together: continuous discovery infrastructure at planetary scale.
