Tag: esm3 biorxiv

Chai-2 bioRxiv: Comparing AI Biology Preprints Ahead of Peer Review

The Chai-2 bioRxiv preprint, posted by Chai Discovery on 5 July 2025, reports a 16% hit rate in fully de novo antibody design — more than 100-fold above prior computational methods — but like the ESM3 and Geneformer foundation models it sits alongside, the claim has not yet cleared peer review. All three are part of a wider pattern: AI biology foundation models are increasingly disseminated as bioRxiv preprints first, journal articles later (if at all), which changes how institutions, publishers, and funders must scrutinise their claims.

A bioRxiv preprint is a manuscript posted to the Cold Spring Harbor Laboratory’s biology preprint server before, or instead of, formal peer review. This article compares how Chai-2, ESM3, Geneformer, EvolvePro, and AlphaFold-Multimer have each used that route, and what the differences mean for reproducibility.

What is Chai-2, and why was it posted as a bioRxiv preprint?
How do ESM3 and Geneformer differ from Chai-2 in preprint dissemination?
Why does preprint-first publication intensify reproducibility scrutiny?
Common questions about AI biology preprints on bioRxiv
What are the implications for institutions, publishers, and funders?

What is Chai-2, and why was it posted as a bioRxiv preprint?

Chai-2 is a multimodal generative model from Chai Discovery that designs antibodies and nanobodies from scratch, taking a target structure and epitope as input and returning a complete antibody design. The original preprint, “Zero-shot antibody design in a 24-well plate”, reported a 16% success rate in de novo design against 52 diverse targets, completed from AI design to wet-lab validation in under two weeks.

Chai Discovery followed with an updated bioRxiv preprint on 29 November 2025, “Drug-like antibody design against challenging targets”, reporting that more than 86% of designed full-length monoclonal antibodies showed developability profiles comparable to approved therapeutics. Neither preprint has yet been published in a peer-reviewed journal. The company has since raised a $130 million Series B round, taking total funding above $225 million at a $1.3 billion valuation, according to Genetic Engineering & Biotechnology News.

How do ESM3 and Geneformer differ from Chai-2 in preprint dissemination?

ESM3 and Geneformer address different biological scales entirely, and their publication paths diverge from Chai-2’s in an instructive way. ESM3, from EvolutionaryScale, is a general-purpose protein language model trained on roughly 2.78 billion protein sequences with a 98-billion-parameter flagship configuration. It was posted as a preprint before its 2025 publication in Science — meaning it eventually completed the peer-review cycle that Chai-2’s antibody preprints have not yet reached.

Geneformer operates at the cellular level rather than the molecular level. Built on a transformer-encoder architecture pretrained across tens of millions of single-cell RNA-sequencing profiles, it classifies cell types and predicts disease-relevant genes. Its foundational description, credited to Christina Theodoris and colleagues, circulated as a preprint before formal publication in Nature in 2023.

EvolvePro and AlphaFold-Multimer extend the comparison further. EvolvePro is a few-shot protein-engineering framework that uses language-model embeddings to guide directed evolution from very few labelled variants, disseminated via bioRxiv. AlphaFold-Multimer, Google DeepMind’s extension of AlphaFold2 for multi-chain complex prediction, is the starkest case: its 2021 bioRxiv preprint (Evans et al.) has been cited thousands of times and underpins structural biology workflows worldwide, yet it has never been published in a peer-reviewed journal.

Model	Domain	bioRxiv posting	Weight access	Peer-review status
Chai-2	De novo antibody design	v1 Jul 2025; updated Nov 2025	Platform/API access, not fully open weights	Preprint only
ESM3	General protein sequence/structure/function	Preprint, then Science (2025)	Smaller checkpoints open; 98B flagship gated via Forge API	Peer-reviewed
Geneformer	Single-cell transcriptomics	Preprint, then Nature (2023)	Fully open-weight release	Peer-reviewed
EvolvePro	Few-shot directed protein evolution	bioRxiv preprint	Open code/model release	Preprint at time of posting
AlphaFold-Multimer	Multi-chain complex structure prediction	bioRxiv preprint (2021)	Code and weights open-sourced	Never published in a peer-reviewed journal

Why does preprint-first publication intensify reproducibility scrutiny?

Preprint-first publication compresses the interval between a headline result and its public citation, which is valuable for fast-moving fields but removes a layer of independent verification before claims circulate. AlphaFold-Multimer shows this can persist indefinitely: a preprint can become de facto infrastructure without ever completing formal review.

Model weight access varies sharply: Geneformer and AlphaFold-Multimer are fully open, while Chai-2 and ESM3’s largest configuration require platform or API access, limiting independent replication of the exact reported result.
Benchmark scale differs: Chai-2’s 16% hit rate is drawn from a company-run benchmark across 52 targets, not an externally adjudicated challenge such as CASP or CAPRI.
Versioning matters: Chai-2’s updated November 2025 preprint extends claims to full-length monoclonal antibodies, meaning readers must track which version underlies any given statistic.

For research administrators and institutional evaluators, the practical implication is that a citation to “Chai-2” or “ESM3” is not self-evidently a citation to peer-reviewed work — the preprint status, version, and weight-access terms all need checking before the claim is treated as settled.

Common questions about AI biology preprints on bioRxiv

Is the Chai-2 bioRxiv preprint peer-reviewed?

No. As of publication, both Chai-2 preprints — the July 2025 original and the November 2025 update — remain bioRxiv preprints. Neither has completed formal peer review, so the reported 16% hit rate and 86% developability figures should be read as company-reported, not journal-vetted, results.

Has ESM3 been published in a peer-reviewed journal?

Yes. ESM3 was first circulated as a preprint before EvolutionaryScale’s results were published in Science in 2025, giving it a completed peer-review path that Chai-2’s antibody-design claims currently lack.

What is Geneformer used for?

Geneformer analyses single-cell RNA-sequencing data to classify cell types, model gene regulatory networks, and identify disease-relevant genes, using a transformer architecture trained on large single-cell transcriptome corpora rather than protein or antibody sequences.

What is the difference between Chai-2 and AlphaFold-Multimer?

AlphaFold-Multimer predicts the 3D structure of existing multi-chain protein complexes, while Chai-2 generates entirely new antibody sequences and structures for a chosen target — structure prediction versus de novo generative design.

What are the implications for institutions, publishers, and funders?

Research administrators citing Chai-2, ESM3, Geneformer, or comparable models in grant reports, technology assessments, or institutional communications should distinguish preprint claims from peer-reviewed findings explicitly, note the exact preprint version, and record whether model weights are open or platform-gated. Publishers and editors evaluating manuscripts that build on these models should likewise verify which version of the underlying preprint is cited, since headline metrics can shift between versions.

The broader lesson is structural rather than model-specific: as AI biology moves faster than journal review cycles, the preprint-to-journal gap itself becomes a due-diligence checkpoint that institutions, funders, and publishers now need to track as routinely as they track the results themselves.

July 3, 2026

scGPT bioRxiv: AI Biology Models Bypass Review

scGPT bioRxiv preprints, alongside ESM3, AlphaFold-Multimer, Geneformer, EvolvePro and Chai-2, illustrate a 2026 pattern: AI foundation models for biology now reach bioRxiv months or years before — and sometimes instead of — formal peer review, shifting scrutiny onto the research community itself.

A foundation model in biology is a large neural network pretrained on a broad corpus of sequence, structure or single-cell data, then fine-tuned for specific downstream tasks such as cell-type annotation, protein design or complex-structure prediction. bioRxiv is the open-access preprint server, now operated by the nonprofit openRxiv, where most of these models first appear.

What is the bioRxiv wave of AI biology preprints?
Which models are driving this trend?
Why publish before peer review?
What are the research-integrity and attribution risks?
Answer-first Q&A
Implications for institutions and publishers

What is the bioRxiv wave of AI biology preprints?

Since 2021, a cluster of high-profile AI foundation models for biology has appeared first as bioRxiv preprints rather than journal articles. scGPT, ESM3, AlphaFold-Multimer, Geneformer, EvolvePro and Chai-2 each disclosed model weights, training corpora and benchmark results on bioRxiv before, or without, completing formal peer review.

This is not unique to biology, but the scale is notable. bioRxiv’s bioinformatics collection alone now holds over 42,000 preprints, and many of the field’s most-cited foundation-model papers spent a year or more circulating in preprint form before any journal version existed.

Which models are driving this trend?

Each model targets a different layer of biology — from single cells to protein complexes — but all six followed the same preprint-first disclosure pattern, with varying paths to formal review.

Model	Domain	bioRxiv preprint date	Peer-review status	Headline result
scGPT	Single-cell multi-omics	1 May 2023	Nature Methods, 2024	Pretrained on over 10 million cells; preprint drew 1,490+ citations before formal publication
ESM3	Protein sequence/structure/function	2 July 2024	Science, January 2025	Generated esmGFP, a novel fluorescent protein only 58% identical to its nearest known relative
AlphaFold-Multimer	Protein complex structure	4 October 2021	Still bioRxiv-only	67% success rate on heteromeric interfaces despite ubiquitous structural-biology use
Geneformer	Single-cell network biology	No precursor preprint; v2 update posted August 2024	Nature, 31 May 2023	Pretrained on Genecorpus-30M, 29.9 million single-cell transcriptomes
EvolvePro	Protein engineering	17 July 2024	Still bioRxiv-only	2- to 515-fold activity gains across five therapeutic proteins
Chai-2	Antibody and miniprotein design	6 July 2025	Still bioRxiv-only	16% hit rate in de novo antibody design, over 100x prior computational methods

Two patterns stand out. First, Geneformer’s core 2023 paper went directly to Nature without a bioRxiv precursor, showing the pattern is not universal. Second, AlphaFold-Multimer, EvolvePro and Chai-2 remain, as of mid-2026, without any confirmed journal record despite being cited and deployed across thousands of downstream studies.

Why publish before peer review?

Competitive priority and speed dominate. Posting to bioRxiv creates a timestamped, public record of a result the moment it exists, which matters in a field where multiple labs often chase the same architecture within weeks of each other.

Immediate community stress-testing of code, weights and benchmark claims, often faster than a journal’s reviewer pool can respond.
Priority establishment ahead of competing labs working on the same problem class.
Faster onward use: downstream researchers can build on and cite a preprint immediately rather than waiting through a multi-month review cycle.

Journals have adapted to this reality. Many now formally accept bioRxiv-posted work, and scGPT’s own trajectory — a 2023 preprint that drew over 1,490 citations before its 2024 Nature Methods publication — shows how much scientific traffic a foundation model can carry while still formally unreviewed.

What are the research-integrity and attribution risks?

The lack of independent review before wide reuse is the core risk. A 2026 bioRxiv preprint on researcher perceptions found that scientists rely heavily on author reputation, rather than review status, as their main heuristic for judging a preprint’s credibility — a fragile substitute for structured peer review, particularly for tools other labs adopt wholesale.

Attribution is a related, distinct problem. When a foundation model like Chai-2 or ESM3 generates a candidate sequence that a human team then validates experimentally, contributor-credit questions arise: who conceived the method, who ran validation, and who is accountable for the claim. Both the International Committee of Medical Journal Editors and the Committee on Publication Ethics have stated that AI tools cannot be listed as authors, because they cannot take responsibility for the work’s accuracy or integrity.

Structured contributor-role frameworks help resolve this. CASRAI originated the CRediT contributor role taxonomy in 2014, and the standard is now stewarded by NISO as ANSI/NISO Z39.104-2022. Applying CRediT roles to preprint co-authorship — distinguishing methodology, software, validation and formal analysis — gives institutions a documented way to assign human accountability even when an AI foundation model contributed materially to the output. See the broader CRediT framework overview and CASRAI’s authorship resources for related guidance.

Answer-first Q&A

Has the scGPT bioRxiv preprint been peer reviewed?

Yes. The original scGPT preprint was posted to bioRxiv on 1 May 2023 and later passed formal peer review, publishing in Nature Methods in 2024. The preprint itself had already drawn more than 1,490 citations while still formally unreviewed.

Why do AI foundation models for biology publish on bioRxiv before peer review?

Competitive pressure and pace drive it. Posting to bioRxiv establishes priority and lets the wider research community stress-test claims, code and weights immediately, rather than waiting the months or years a formal peer-review cycle can take in a fast-moving field.

Is AlphaFold-Multimer peer reviewed?

No confirmed journal record exists for AlphaFold-Multimer itself; DeepMind’s preprint has remained on bioRxiv since 4 October 2021. It is nonetheless used routinely across structural biology — a stark example of a foundational tool that never completed formal peer review.

Who owns bioRxiv?

bioRxiv is operated by openRxiv, an independent nonprofit that assumed ownership from Cold Spring Harbor Laboratory in March 2025. The transfer aimed to secure the preprint server’s long-term governance as its role in disseminating AI foundation model research has grown.

Implications for institutions and publishers

Research offices and publishers now need explicit policy on how preprinted AI foundation models are cited, credited and re-used before formal review completes. Institutional research-integrity offices should treat a bioRxiv-only model — such as AlphaFold-Multimer, EvolvePro or Chai-2 — as provisionally validated, not settled science, when it underpins funded work or clinical-adjacent claims.

Research administrators managing grant compliance and output tracking should build preprint-status checks into their reporting workflows; CASRAI’s research administration resources outline how contributor-role and output-tracking practices adapt to fast-moving, preprint-first fields. As more foundation models follow this path, the distinction between “published” and “peer reviewed” will matter more, not less, for research integrity.

July 3, 2026