Tag: biorxiv api

  • bioRxiv Alerts: Email, RSS or API Options

    bioRxiv alerts let researchers and developers track newly posted preprints in a chosen subject area without manually rechecking the site — the three core options are subject-category email alerts, per-category RSS/Atom feeds, and the public bioRxiv API, each suited to a different workflow. bioRxiv is the preprint server for biology operated by openRxiv, a nonprofit dedicated to advancing scientific communication, and it exposes the same underlying content through all three channels plus social feeds on Bluesky, Mastodon and X.

    This guide compares the four practical ways to follow new bioRxiv postings — email alerts, RSS/Atom feeds, the REST API, and social feeds — so you can pick the right combination for a literature-monitoring workflow, a lab dashboard, or an automated pipeline.

    What are bioRxiv’s alert options?

    bioRxiv is a preprint server for the biological sciences; a preprint is a complete scientific manuscript posted online before, or without, formal peer review. Because thousands of preprints are posted every week across dozens of subject categories, bioRxiv publishes the same feed of new content through four distinct channels rather than a single notification system.

    Each channel trades off timeliness, filtering precision and technical effort differently. Email alerts and RSS feeds are built for passive monitoring by individual researchers; the API is built for developers who need structured metadata inside another tool; social feeds suit anyone already working inside those platforms.

    How do bioRxiv email alerts work?

    Email alerts are the lowest-effort option for an individual researcher who wants a periodic digest. You sign up on the bioRxiv Alerts page, select one or more of bioRxiv’s roughly 30 subject categories — from Bioinformatics to Zoology — and bioRxiv emails you when matching preprints are posted.

    • Alerts can be scoped to a subject category, a keyword search, or a specific author.
    • You can add or remove subject-area alerts at any time from the same sign-up page, without deleting your account.
    • No bioRxiv account or login is required simply to receive category alerts — the sign-up form only asks for an email address.

    This makes email alerts the right default for anyone who wants new preprints in their inbox without building or maintaining anything.

    How do bioRxiv RSS feeds work?

    bioRxiv’s Alerts/RSS page publishes an Atom 1.0 feed for each subject category, plus a combined feed across all categories. Each feed returns only the most recent 30 posts for that category — a hard limit set by bioRxiv, not a filter you can extend — so an RSS reader that checks infrequently can silently miss older items once more than 30 new preprints accumulate.

    Feeds can be combined by chaining subject categories with a plus sign in the URL, and multi-word category names use an underscore in place of a space. For example, a feed combining Genomics and Bioinformatics takes the form:

    • http://connect.biorxiv.org/biorxiv_xml.php?subject=genomics+bioinformatics

    This lets a single feed reader subscription cover several adjacent subject areas — useful for interdisciplinary groups — without needing separate subscriptions per category.

    What can the bioRxiv API do that alerts and RSS can’t?

    The bioRxiv API is a pull-based REST interface returning structured JSON metadata — DOI, title, authors, category, posting date and abstract — for preprints on bioRxiv and medRxiv. Unlike email alerts or RSS, it has no built-in subject-category filter parameter and no push/webhook mechanism: a developer must query by date interval or DOI and filter the returned category field client-side.

    That distinction matters for anyone building automated tooling:

    • The API suits scheduled polling jobs, institutional repository harvesters, and research-tool dashboards that need structured metadata, not just a headline and link.
    • RSS and email alerts remain the simpler choice for a single researcher who only wants to read new titles as they appear.
    • Because the API is pull-based, any “alert” built on top of it requires you to run your own polling schedule and de-duplication logic.

    Detailed field definitions and endpoint syntax are published in bioRxiv’s own API documentation, which developers should consult directly before building a production integration.

    Should you follow bioRxiv on Bluesky, Mastodon or X?

    bioRxiv also mirrors new postings to social platforms, and this is where the biggest recent change sits — one that generic alert guides tend to miss. Beyond the long-standing X/Twitter account (@biorxivpreprint, over 140,000 followers, plus a dedicated account per subject category), bioRxiv now runs an equivalent set of per-category streams on Bluesky (e.g. biorxiv-bioinfo.bsky.social) and Mastodon (e.g. biorxiv_bioinfo on biologists.social).

    This matters because X restricted free API access in 2023, which reduced the reliability of X-based bots and dashboards that many labs had built to watch subject feeds. Bluesky and Mastodon’s open, API-friendly protocols make them a more dependable base for anyone building a custom preprint-monitoring bot today, rather than a nice-to-have alternative.

    Which option should you choose?

    The right channel depends on how much filtering precision you need and how much technical effort you are willing to invest.

    Channel Best for Filtering Setup effort Key limitation
    Email alerts Individual researchers wanting a digest Subject, keyword, author None (email only) No login needed, but digest cadence isn’t real time
    RSS/Atom feed Feed-reader users, interdisciplinary groups Subject category, combinable Low (add feed URL) Capped at the most recent 30 posts per category
    REST API Developers, institutional tools, dashboards None built-in; filter client-side High (build a polling job) Pull-based only, no webhook/push
    Bluesky/Mastodon/X Social monitoring, bot-building Per subject-category account Low–Medium X reach reduced since 2023 API restrictions

    For most individual researchers, subject-category email alerts remain the simplest reliable option. Developers building institutional or lab-wide monitoring tools should combine the API for structured metadata with RSS as a lightweight fallback.

    Common questions about bioRxiv alerts

    Why are my bioRxiv email alerts not working?

    Missed bioRxiv alerts are usually caused by an out-of-date subject-category selection, an alert email landing in a spam or promotions folder, or an expired confirmation link. Re-visiting the bioRxiv Alerts page and re-confirming your chosen categories resolves most cases.

    Do I need a bioRxiv account or login to set up alerts?

    No account or login is required for basic email alerts — only an email address. A bioRxiv account is only needed for actions like submitting a manuscript, posting a comment, or managing an author profile, not for receiving subject-area notifications.

    Does bioRxiv have a public API for developers?

    Yes. bioRxiv publishes a public REST API returning JSON metadata — including DOI, title, category and abstract — for content on bioRxiv and medRxiv. It is pull-based, so developers must schedule their own queries rather than receive push notifications.

    Should I track bioRxiv or arXiv for my subject area?

    Choose based on discipline, not preference: bioRxiv covers biology-specific subject categories, while arXiv covers physics, mathematics, computer science and quantitative biology. Researchers working across both fields — for example in computational biology — often need alerts from both servers rather than treating them as interchangeable.

    What this means for research-monitoring workflows

    Preprint volume keeps growing across biology subject categories, and no single channel covers every use case. A researcher who only needs a daily digest is well served by email alerts; a developer building a literature-surveillance tool for an institution needs the API’s structured metadata and should plan for its pull-based, polling architecture from the outset. Teams that previously relied solely on X-based bots should treat the 2023 API restrictions as a prompt to add Bluesky or Mastodon, or the official RSS feed, as a more durable foundation.

    Research administrators supporting open-scholarship workflows can pair these tracking methods with broader terminology in the CASRAI Dictionary when documenting how preprints fit into an institution’s research-administration processes.

  • Research Square vs bioRxiv: Ownership & Fees

    Research Square vs bioRxiv is, at its core, a nonprofit-versus-commercial question: Research Square is a preprint platform owned by the for-profit publisher Springer Nature, while bioRxiv and medRxiv are nonprofit servers now governed by openRxiv, an independent 501(c)(3) that took over from Cold Spring Harbor Laboratory (CSHL) in March 2025. Both are free for authors to use, but the ownership structure behind each one shapes fees, licensing control, data governance and long-term archival continuity in ways that matter for anyone advising authors on where to post.

    A preprint server is an online platform where researchers deposit manuscripts before, or independently of, formal peer review. Research Square, bioRxiv and medRxiv are three of the most widely used servers in the life, health and biomedical sciences, and authors are increasingly asked to choose between them without understanding what sits behind each brand.

    What Is the Core Difference Between Research Square and bioRxiv?

    The core difference is legal ownership and mission accountability, not scope or screening rigour. Research Square traces to American Journal Experts (AJE); Springer Nature took a minority stake in the Research Square platform in 2018, became majority owner in 2020, and completed full acquisition of Research Square Company in 2022. It is, today, a wholly commercial subsidiary of a for-profit publishing group.

    bioRxiv was founded in 2013 by John Inglis and Richard Sever at CSHL, a nonprofit research institution. medRxiv followed in 2019 as a partnership between CSHL, Yale University and BMJ. In March 2025, governance of both servers passed from CSHL to openRxiv, a newly formed independent nonprofit whose stated mission is “creating opportunities for sharing, discovering, and advancing preprints in the life and health sciences” — with a dedicated board and a Scientific and Medical Advisory Board of researchers overseeing policy.

    Feature Research Square bioRxiv / medRxiv (via openRxiv)
    Governing entity Springer Nature (for-profit publisher) openRxiv (independent nonprofit, 501(c)(3))
    Platform launched 2016, under Research Square Company bioRxiv 2013; medRxiv 2019
    Ownership shift Minority stake 2018 → majority 2020 → full acquisition 2022 Transitioned from CSHL to independent nonprofit, March 2025
    Author posting fee Free Free
    Sustainability model Cross-subsidised by Springer Nature publishing and AJE author-services revenue Philanthropic and institutional grants (Chan Zuckerberg Initiative, Sergey Brin Family Foundation, Robert Lourie Foundation, partner universities)
    Default licence CC-BY 4.0 required for all preprints Author’s choice: CC0, CC-BY, CC-BY-NC, CC-BY-ND, CC-BY-NC-ND, or no reuse without permission
    Journal integration In Review, tied to 1,000+ participating journals No equivalent journal-submission integration
    Bulk text-and-data-mining access No published bulk TDM programme; access via Crossref metadata and the site Monthly XML/PDF corpus via a requester-pays AWS S3 bucket, plus a public metadata API
    Long-term preservation Portico Portico

    Who Pays, and How Is Each Platform Funded?

    Neither model charges authors to post a preprint — that much is identical. What differs is where the money to run the platform comes from, and what that implies about future incentives. Research Square’s operating costs are absorbed by Springer Nature’s commercial publishing business and by AJE’s paid author-services division (editing, translation and related products), which Research Square continues to cross-sell alongside free preprint posting.

    openRxiv, by contrast, depends on renewable philanthropic and institutional grants rather than a parent company’s revenue. Its principal funders include the Chan Zuckerberg Initiative, the Sergey Brin Family Foundation, the Robert Lourie Foundation and a consortium of supporting universities including Caltech, MIT, Stanford, Yale and the University of Washington. That is a genuine trade-off, not a straightforward win for either side:

    • Research Square’s commercial backing gives it predictable, revenue-linked funding, but ties its long-term direction to Springer Nature’s corporate strategy.
    • openRxiv’s nonprofit funding is mission-locked by governance structure, but depends on grant renewal cycles rather than a guaranteed revenue stream.

    Who Owns and Controls Author Data?

    Ownership of the underlying manuscript stays with authors on both platforms — this is not a copyright grab by either side. The meaningful difference is licensing control and third-party data access. Research Square requires every posted preprint to carry a CC-BY 4.0 licence, which is the most permissive open licence and maximises reuse rights for readers, but leaves authors no choice in the matter.

    bioRxiv and medRxiv give authors a menu of licence options — CC0, CC-BY, CC-BY-NC, CC-BY-ND, CC-BY-NC-ND, or a “no reuse without permission” setting — and authors can change the licence on an existing preprint after posting. That is more author control, though funders that mandate CC-BY (a growing norm, including under several cOAlition S-aligned policies) require authors to actively select it rather than receiving it by default.

    The two models also diverge sharply on bulk data access. openRxiv publishes a full monthly XML/PDF text-and-data-mining corpus through a requester-pays AWS S3 bucket, alongside a public metadata API — an open-infrastructure commitment consistent with nonprofit, grant-funded governance. Research Square does not publish an equivalent bulk TDM feed; third-party discovery of Research Square content runs through Crossref DOI metadata and the platform’s own search interface rather than a dedicated open corpus.

    What Long-Term Archival Guarantees Does Each Model Offer?

    Both platforms use the same third-party preservation service: Portico provides perpetual-access archiving for preprints posted to Research Square, bioRxiv and medRxiv alike, so the archive itself is not where the two models diverge.

    The real difference is organisational continuity risk. A commercial platform’s archival commitments are ultimately corporate policy that could change with ownership or strategy; a nonprofit platform’s commitments are set by a mission-bound board, though it carries the separate risk of grant-funding renewal. Advising authors on a multi-decade preprint record means treating “who governs the archive” as distinct from “where is the archive stored.”

    Common Questions About Research Square and bioRxiv

    Is bioRxiv reputable?

    Yes. bioRxiv is widely cited across molecular and cell biology, screens submissions for plagiarism and non-scientific content, and is now governed by openRxiv, an independent nonprofit with a Scientific and Medical Advisory Board. Its reputation rests on community adoption and transparent, nonprofit governance rather than commercial incentives.

    Does bioRxiv count as published?

    No. A bioRxiv or medRxiv preprint is not peer-reviewed and does not constitute formal publication. The ICMJE treats preprints as legitimate scholarly communication, not duplicate publication, but funders and REF-style assessment exercises generally still require the peer-reviewed version for compliance credit.

    Is bioRxiv a preprint?

    bioRxiv is not itself a preprint — it is the server that hosts preprints. A preprint is the individual manuscript version posted before or independent of peer review; bioRxiv is the nonprofit infrastructure, now under openRxiv, that makes that posting possible for life-science research.

    What are the alternatives to bioRxiv?

    Alternatives include medRxiv for clinical and public-health research, Research Square for multidisciplinary and journal-integrated posting, and repository-style options such as arXiv, the Open Science Framework, Figshare and Zenodo. The right choice depends on discipline, human-subjects status and whether journal-integrated posting matters.

    What This Means for Authors and Research Administrators

    For most authors, the nonprofit-versus-commercial distinction will not change whether posting is free — it usually is, on both models. It should change how administrators frame the advice they give:

    • Explain that Research Square’s mandatory CC-BY licence maximises reuse but removes licensing choice, while bioRxiv/medRxiv give authors more control over which licence applies.
    • Flag that researchers doing large-scale corpus analysis will find far richer bulk access through openRxiv’s TDM feeds than through Research Square.
    • Note that archival preservation (Portico) is equivalent across models — the open question is who controls future platform policy, not the archive.
    • Treat commercial ownership as a disclosure point, not a disqualifier: Springer Nature’s backing gives Research Square’s In Review workflow journal-integration value a nonprofit model does not replicate.

    As more research administration offices build formal preprint guidance into their researcher-facing documentation, the originating business model behind a server deserves the same disclosure as its discipline coverage or screening depth. Authors are entitled to know not just where their manuscript will sit, but who ultimately governs the platform holding it — a nonprofit board answerable to a research mission, or a commercial parent answerable to shareholders.

    Last updated: 3 July 2026.

  • openRxiv Explained: Why bioRxiv and medRxiv Went Independent

    openRxiv is the independent, researcher-led nonprofit that has run bioRxiv and medRxiv since March 2025, replacing Cold Spring Harbor Laboratory’s institutional stewardship with a six-member board, diversified funding, and a mandate to keep both preprint servers free to read and free to post. The spin-off was designed to insulate two of biomedicine’s most-used pieces of open-research infrastructure from dependence on any single institution or funder — a governance question every standards body and infrastructure provider eventually has to answer.

    openRxiv is the independent nonprofit, launched on 11 March 2025, that now stewards the bioRxiv and medRxiv preprint servers on behalf of the global research community, rather than as a programme of a single host institution.

    What is openRxiv, and what does it actually run?

    openRxiv is the organisational and legal home of two preprint servers: bioRxiv, covering life sciences, and medRxiv, covering health and clinical research. Neither server changed its submission process, screening policy, or URL when the transition happened — researchers post to biorxiv.org and medrxiv.org exactly as before.

    What changed is who is accountable for the platforms’ survival. bioRxiv was founded in 2013 at Cold Spring Harbor Laboratory (CSHL); medRxiv followed in 2019 as a joint initiative between CSHL, Yale University, and BMJ. Both grew into the dominant preprint venues for biomedicine, and by 2025 that success had outgrown the administrative capacity of a single laboratory to sustain indefinitely.

    Why did bioRxiv and medRxiv leave Cold Spring Harbor Laboratory?

    CSHL’s own account of the move calls it a “natural evolution,” not a rupture. Bruce Stillman, CSHL’s President and CEO, joined openRxiv’s board rather than severing ties, and co-founders John Inglis and Richard Sever moved with the platforms into the new entity.

    The stated rationale centres on three risks that concentrated stewardship inside one institution:

    • Sustainability risk — a single laboratory’s budget cycle is not designed to guarantee decades of continuity for global research infrastructure.
    • Governance risk — decisions about screening policy, features, and funding priorities benefited from a board drawn from outside CSHL alone.
    • Funder-concentration risk — the platforms needed a structure that could accept diversified funding without any one funder gaining outsized influence.

    openRxiv formally launched as an independent nonprofit on 11 March 2025, with the Chan Zuckerberg Initiative (CZI) providing three years of seed funding for the transition, according to openRxiv’s own governance Q&A published that May. In October 2025, arXiv — the physics, mathematics, and computer science preprint server run by Cornell University — joined openRxiv in submitting a joint response to a National Institutes of Health Request for Information on preprints, signalling a wider coalition forming around shared preprint-infrastructure interests, though arXiv itself remains a separate service.

    Who governs openRxiv, and who pays for it?

    openRxiv is governed by a six-member board of directors: Scott Fraser (University of Southern California and the CZI Imaging Institute), Edith Heard (Francis Crick Institute), Jeff Huber (Triatomic Capital), Harlan Krumholz (Yale School of Medicine; medRxiv co-founder), Bruce Stillman (CSHL), and Shirley Tilghman (Princeton University). A separate Scientific and Medical Advisory Board, chaired by John Inglis with medRxiv co-founder Theo Bloom as deputy, advises on content policy.

    The funding question is where most scrutiny has landed, given CZI’s long involvement with both servers before the spin-off:

    Question openRxiv’s public answer (governance Q&A, May 2025)
    How long has CZI funded the servers? Eight years for bioRxiv, four years for medRxiv, plus three years of dedicated seed funding for the openRxiv transition itself.
    Does CZI have editorial or operational control? No. openRxiv states funding agreements carry no stipulations affecting editorial or operational independence.
    How much board influence does CZI hold? One of six directors (Scott Fraser) has a CZI affiliation; the board is not CZI-appointed as a bloc.
    Is openRxiv against traditional peer review? No — openRxiv reports roughly 75% of bioRxiv and medRxiv preprints go on to formal peer-reviewed publication, with direct-submission links to 350 journals.

    openRxiv itself frames the governance model as a direct answer to funder-concentration concerns: the organisation states its mission is to be “governed by and for the research community, not a single funder, founder, or any one stakeholder.” Whether a philanthropic vehicle tied to a single tech-sector family remains structurally sufficient as the largest funder of a nonprofit intended to resist single-funder capture is a debate that predates this specific spin-off and will likely recur as openRxiv pursues its stated goal of diversifying revenue further.

    What is openRxiv Labs, and what launched in June 2026?

    openRxiv Labs launched on 1 June 2026 as a structured experimentation programme sitting on top of the core bioRxiv and medRxiv infrastructure. Rather than running many small tests at once, openRxiv committed to a small number of larger, hypothesis-driven pilots with predefined success metrics and durations, publishing results — including failures — openly on a dedicated Labs blog.

    The first Labs pilot, built with the platform Curvenote, tests an interactive preprint-reading interface layered onto openRxiv’s existing corpus of preprints, figures, and metadata. openRxiv named a broad partner list for the programme, including CZI, CSHL, the Sergey Brin Family Foundation, Caltech, CNRS, Fred Hutchinson Cancer Center, Imperial College London, MIT, Stanford, the University of Washington, and Vrije Universiteit Amsterdam — underscoring that the funder-diversification effort begun at launch has continued into 2026 rather than stalling after the initial CZI seed grant.

    Answer-first questions people are asking about openRxiv

    Who is the CEO of openRxiv?

    Dr Tracy Teal is openRxiv’s first Chief Executive Officer, appointed on 18 August 2025 after serving as interim COO since the March 2025 launch. She previously led The Carpentries and Dryad, two established open-research infrastructure nonprofits, giving her direct prior experience running community-governed scientific platforms.

    Who owns medRxiv?

    No single institution “owns” medRxiv today. It was founded in 2019 by Cold Spring Harbor Laboratory, Yale University, and BMJ, but operational and governance responsibility now sits with openRxiv, the independent nonprofit created specifically to steward it and bioRxiv without institutional or single-funder control.

    Is medRxiv a credible source?

    medRxiv preprints are screened but not peer-reviewed, so they should be cited with that caveat clearly stated. openRxiv reports around 75% of postings eventually complete formal peer review; until then, findings represent unverified claims from qualified researchers, useful for rapid awareness but not equivalent to a published, peer-reviewed article.

    What is openRxiv, in one line?

    openRxiv is the independent 501(c) nonprofit, launched 11 March 2025, that operates bioRxiv and medRxiv under a six-member board and a diversified-funding mandate, replacing their prior status as programmes hosted by Cold Spring Harbor Laboratory.

    What the openRxiv spin-off means for research-infrastructure stewardship

    The openRxiv case is a useful reference point for any organisation weighing how to govern shared research infrastructure once it outgrows its founding institution. The pattern — an originating body incubates a tool, the tool becomes essential community infrastructure, and stewardship then transfers to an independent, multi-stakeholder body — is not unique to preprints.

    CASRAI originated the CRediT contributor role taxonomy in 2014. The standard is now stewarded by NISO as ANSI/NISO Z39.104-2022. That is the same “originator, not owner” pattern openRxiv is now navigating in public: CSHL originated bioRxiv and medRxiv, and stewardship has since passed to a body structured explicitly to prevent any one funder, founder, or institution from controlling research infrastructure the whole field depends on.

    For research administrators and institutional leaders, the practical takeaway is to watch governance structure, not just funding source, when assessing an infrastructure provider’s long-term reliability. A named, multi-institutional board; published funding-independence commitments; and open reporting of pilot outcomes (as with openRxiv Labs) are the concrete signals worth checking — independent of who wrote the first cheque.