AutoRadionuclide · In-silico Discovery Engine
MIBG/NET Flagship Demo Campaign
Campaign mibg-net-demo-001 · Run 16140108 · May 25, 2026
In-silico demonstration only — no wet lab, no real isotope
All scoring functions are frozen heuristics, not validated predictive models. The wet-lab step is a stub that returns heuristic scores plus Gaussian noise. MIBG + I-131 (Azedra) is a real FDA-approved therapy; the scores shown here are illustrative engine outputs, not clinical measurements.
How the loop works
OuterLoop (AutoResearch meta-loop)
- 1Ask LLM: propose ONE strategy modification
- 2Apply modification to in-memory StrategyConfig
- 3Run InnerLoop — one discovery cycle
- 4Compare campaign score before vs. after
- 5Keep if improved (Δ > 0); revert if not
- 6Record modification + rationale in append-only ledger
InnerLoop (one discovery cycle)
- 1generate_candidates() — design module + LLM
- 2score_all() — frozen harness (never agent-editable)
- 3policy.rank() — acquisition function + diversity
- 4safety_check() — isotope/chelator feasibility
- 5wet_lab.submit_and_wait() — stub in this run
- 6update_surrogates() — GP refitted with new observations
Frozen harness = the benchmark spec. The planner may read it but never modify it. Every decision is recorded in an append-only SQLite ledger — rows are never updated or deleted.
Turn-by-turn outer loop
Increase exploration weight
exploration_weight: 1.5 → 2
Insufficient diversity in recent batches; higher kappa increases exploration.
Focus on validated NET vectors
prioritized_targets: [] → ["NET"]
NET targeting vectors showing strongest objective improvements.
Switch acquisition function to EI
acquisition_function: "UCB" → "EI"
UCB may be over-exploring; EI focuses on high-probability improvements.
Score is non-decreasing by design: the outer loop reverts any modification that does not improve the campaign score (Δ ≤ 0). The score plateaus here because MIBG+none+I-131 is the only unique resolvable construct in the declared building-block space — this is honest scientific behaviour.
Candidate construct
Norepinephrine transporter (NET) ligand; directly radioiodinated
Vector: MIBG · Chelator: none · Isotope: I-131
NC(=N)NCc1cccc(I)c1
C8H10IN3
PubChem CID 60860 (iobenguane)
RDKit descriptors (GP surrogate input)
| Molecular weight (Da) | 275.093 |
| Wildman-Crippen logP | 1.274 |
| TPSA (Ų) | 61.900 |
| H-bond donors | 3.000 |
| H-bond acceptors | 1.000 |
| Rotatable bonds | 2.000 |
| Ring count | 1.000 |
| Fraction sp³ C | 0.125 |
Morgan-2 fingerprint: 28 active bits / 2048 total
Heuristic objective scores
Isotope physics (I-131)
53 (iodine)
8.02 days
β⁻ (encoded: 0)
IAEA Live Chart of Nuclides
Retrospective benchmark
Engine places 4/7 known-good agents in top 4. Random baseline: 0.44. This confirms scoring machinery ranks known-good agents above known-poor ones at a rate better than chance. NOT a validated predictive model.
9 compounds (3 approved · 4 clinical · 2 illustrative failures). Benchmark confirms scoring machinery ranks known-good agents above known-poor ones — this does NOT establish validated predictive power for novel compounds.
Provenance
| Run ID | 16140108 |
| Model provider | mock-deterministic-v1 |
| Featurizer version | 1.0.0 |
| MIBG SMILES source | PubChem CID 60860 (iobenguane) |
| Model calls this run | 7 |
| Ledger entries this run | 27 |
| Ledger entries (all runs) | 212 |
Every ledger row is immutable (INSERT-only). Model ID, prompt version, scoring version, config hash, and random seed are recorded per decision. Export generated from: scripts/export_run.py
Honest limits
- ✗Scoring functions are frozen heuristics — not validated predictive models.
- ✗Metal coordination chemistry is NOT modeled (coordination geometry, thermodynamic stability, kinetic inertness).
- ✗Radiation dose profile (LET, β⁻/α/Auger particle energy, DNA damage) is NOT captured.
- ✗Benchmark rank accuracy 0.57 vs. random baseline 0.44 — confirms wiring, not predictive power.
- ✗StubWetLab returns frozen-harness scores plus Gaussian noise — no real radiochemistry.
- ✗DOTA and NOTA produce identical Morgan-2 fingerprints (ring-size difference invisible at radius 2).
- ✗Large peptide targeting vectors (DOTATATE, PSMA-617, FAPI-46) omitted from registry pending independent verification.
Limitations are encoded in the source — every scoring function that lacks a validated predictive model is tagged HEURISTIC or PLACEHOLDER in its returned ObjectiveValue.