AI for Science: Drug Discovery, Protein Design & Autonomous Research

active

N/A

6 entriessince 2026-03-19

Tracks AI as a scientific research substrate — not AI economics (see ai-economic-transformation), but AI as the replacement for human-hours in hypothesis generation, molecular design, and experimental throughput. Coverage: generative protein binders (Nvidia Proteina, Baker lab, ESMFold), AI-designed neoantigens (Evaxion EVX-01), AI-autonomous drug pipelines (Insilico Medicine, Eli Lilly partnership), personalized therapeutics democratization (ChatGPT+AlphaFold home use cases), foundation models for biology (AlphaGenome et al.), and the autonomous-scientist frontier (OpenAI's internal framing of "ChatGPT moment for biology").

ai-drug-designprotein-designalphafoldautonomous-sciencefoundation-modelsbiotech

Analytical Briefing

2026-04-30

AI for Science: from research tool to research loop
April's two biggest infrastructure bets — Biohub's $500M virtual cell push and DOE's Genesis Mission lab at PNNL — make AI the experimenter, not the tool inside the experiment.

For two decades AI in biology meant a sharper instrument inside a human-led research loop: a sequence aligner, a structure predictor, a search ranker. April 2026 closed that loop. On April 29 Biohub announced a $500M, five-year Virtual Biology Initiative, anchored by NVIDIA and the Allen, Arc, Broad and Sanger institutes — explicitly aimed at a predictive model of the human cell where mutations and drugs are tested in silico before anything goes near a wet lab. A day later DOE commissioned AMP2 at Pacific Northwest National Lab and gave Ginkgo Bioworks $47M for the larger M2PC successor — 97 robots and 100+ instruments by 2030, running anaerobic microbial experiments around the clock with humans largely out of the building.

The two bets are complementary. Biohub goes after the data foundation: cellular biology has roughly billion-cell datasets, predictive cell models likely need orders of magnitude more, and nobody yet knows the scaling-law slope for biology the way OpenAI knows it for text. PNNL and Ginkgo go after throughput: if you need much more data, you need labs that generate it without humans handling pipettes. Robots run the experiments models propose; models train on the data robots produce.

Pharma has been quietly paying for the same thesis. Eli Lilly's March 2026 deal with Insilico Medicine — up to $2.75B, $115M upfront — bought not a single molecule but the pipeline: 42 AI models that pick targets, design candidates, and predict trial success. Insilico's lead asset reached clinical trials in 18 months against the standard four to six years. It's the first nine-figure-upfront pharma deal where the asset is a discovery process, not a compound.

The "autonomous scientist" is real now, if narrow. AI-Scientist-v2 runs the full cycle — literature review against Semantic Scholar, novelty filter, agentic experimental search, paper drafting with figures and citations — in roughly 15 hours and $140 per cycle, on a paper that passed peer review. A Russian consortium claims a paper accepted at an A-tier IT venue with 90% of the work automated. OpenAI has pre-announced an "autonomous AI researcher" for fall 2026, pitched as a way to bootstrap the next model generation. The risk Gonzo-ML flagged in March still applies: LLM-written literature summaries get scraped back into training data — citogenesis — and look authoritative without any human ever checking.

One demonstration is harder to file. An Australian ML engineer with no medical training designed and produced a personalized mRNA cancer vaccine for his dog using ChatGPT and AlphaFold: tumor sequencing, prediction of which tumor-specific peptides the immune system would see, and a manufactured dose. The dog responded therapeutically. The personalized cancer-vaccine pipeline that took multi-million-dollar labs a decade is now reproducible by one motivated person with off-the-shelf AI tools, and there is no safety frame for that yet.

Stanford's AI Index 2026 puts numbers on what shifted in twelve months. AI agents complete 66% of OSWorld tasks — a generic computer-use benchmark — against 72% for humans, up from 12% a year ago. SWE-bench Verified, the coding benchmark stuck near 60% in early 2025, is essentially saturated. Claude Opus 4.6 and Gemini 3.1 Pro both cross 50% on Humanity's Last Exam, where o1 scored 8.8% eighteen months earlier. The Foundation Model Transparency Index dropped from 58 to 40 over the same window: the labs got more capable and less open at exactly the same rate.

Tracked Metrics

500$M over 5 years

biohub virtual biology commitment

2026-04-29Biohub

→

Signals

Timeline

2026-04-30

milestone

PNNL commissions AMP2 autonomous microbial lab; Ginkgo wins $47M for Genesis-Mission-aligned M2PC successor

The U.S. Department of Energy commissioned the Anaerobic Microbial Phenotyping Platform (AMP2) at Pacific Northwest National Laboratory — described by PNNL as the world's largest autonomous-capable…

autonomous_labginkgo_bioworkspnnldoegenesis_missionrobotic_sciencehigh_throughputmicrobial_phenotypingwet_labinfrastructure

PNNL DOE PR Newswire (Ginkgo/PNNL)SynBioBeta @liftfeed

2026-04-29

milestone

Biohub launches $500M Virtual Biology Initiative with NVIDIA, Allen, Arc, Broad, Sanger to build predictive AI cell models

Chan Zuckerberg Biohub announced the Virtual Biology Initiative on April 29, 2026, anchored by a $500M Biohub commitment over 5 years: $400M for internal data generation and next-generation…

ai_for_biologybiohubczinvidiavirtual_cellopen_datafoundation_modelinfrastructurealex_rivesesmallen_institutebroadsangerarc_institute

Biohub PR Newswire (Biohub)@ov_ser

2026-04-15

milestone

Stanford AI Index 2026: agents do 66% of computer tasks vs 72% human; SWE-bench 60→100% in a year

The Stanford AI Index 2026 (released April 15) makes two points that are hard to unsee together. First: on OSWorld, a general-purpose computer-use benchmark, AI agents now complete 66% of tasks…

ai_indexstanfordbenchmarkswe_benchosworldjagged_frontierfoundation_modeltransparencyliterature

→ ai-economic-transformation

MIT Technology Review (AI Index 2026)@anti_agi @forklogai

2026-04-01

breakthrough

AI-Scientist-v2: first full-loop autonomous research pipeline produces peer-reviewed paper

AI-Scientist-v2 executes the full research cycle end-to-end: literature review against Semantic Scholar, hypothesis generation with novelty-filtering, agentic tree search over experimental plans,…

ai_scientistautonomous_researchairiopenaibrockmancitogenesissemantic_scholaragentic_tree_search

@gsmelin SputnikInt @sbmarketing (Brockman interview)@neurozeh

2026-03-29

milestone

Eli Lilly–Insilico Medicine $2.75B deal — 28 AI-generated drugs, 14 already in clinical trials

Eli Lilly signs a deal worth up to $2.75B with Insilico Medicine for AI-designed drugs, with $115M paid upfront and the rest tied to pipeline milestones plus tiered royalties. Lilly gets exclusive…

ai_drug_designinsilicoeli_lillydealcommercializationpharma

→ ai-economic-transformation → glp1-longevity-therapeutics

Insilico Medicine press release Financial Times (via @Stimme_der_Vernunft)ML Times

2026-03-19

innovation

Individual engineer designs personalized mRNA cancer vaccine for pet dog using ChatGPT + AlphaFold

Australian tech-startup founder Paul Cunningham — an ML engineer, not a physician — designed a personalized mRNA cancer vaccine for his dog Rosie (aggressive mast cell cancer) using ChatGPT and…

ai_drug_designalphafoldchatgptdemocratizationneoantigenpersonalized_medicine

→ cancer-immunotherapy-advances

UNSW News pcr.news