The credit authorship taxonomy (CRediT) is largely absent from arXiv and bioRxiv preprints because neither platform has an editorial office empowered to enforce it, neither offers a dedicated contribution-metadata field, and a preprint is not yet a fixed version of record. CRediT statements are collected later, when a manuscript reaches a journal that mandates them.
CRediT is a controlled vocabulary of 14 defined contributor roles used to describe, role by role, what each named author actually did on a research output. CASRAI originated the CRediT contributor role taxonomy in 2014, and the standard is now stewarded by NISO as ANSI/NISO Z39.104-2022.
Contents
- What is the CRediT authorship taxonomy?
- Why don’t arXiv and bioRxiv require CRediT statements?
- The submission and metadata gap behind the absence
- What changes when a preprint reaches a CRediT-mandating journal?
- Answer-first Q&A
- Implications for research administrators
- Outlook: will preprint servers adopt CRediT?
What Is the CRediT Authorship Taxonomy?
CRediT (Contributor Roles Taxonomy) assigns one or more of 14 standard role labels — Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, and Writing – review & editing — to each named contributor on a research output.
- CASRAI originated the taxonomy in 2014 to complement, not replace, traditional authorship bylines.
- NISO approved it as ANSI/NISO Z39.104-2022, the current formal reference standard.
- It is licensed CC-BY 4.0 and is distinct from the ICMJE authorship criteria, which govern who qualifies as an author at all rather than what each author contributed.
The taxonomy is now embedded in the submission systems of major publishers, including Elsevier, Wiley, Taylor & Francis, Sage and Nature Portfolio journals — almost always at the point of formal peer-reviewed submission or acceptance, not at the preprint stage.
Why Don’t arXiv and bioRxiv Require CRediT Statements?
Preprint servers skip CRediT largely because they have no editorial office analogous to a journal’s. arXiv and bioRxiv operate a lightweight moderation or screening check — confirming the submission is on-topic and not obviously unscientific — rather than the editorial and peer-review workflow that gives journals a natural checkpoint at which to demand a structured contributorship disclosure.
A second reason is version-of-record ambiguity. A preprint can be revised multiple times before, or instead of, formal publication, and co-authorship or individual roles can change between versions — for example when a reviewer at the eventual journal requests new experiments performed by a newly added contributor. Locking a CRediT statement to an early preprint version risks misrepresenting the contributions behind the paper that ultimately gets cited.
Neither arXiv nor bioRxiv has published an official policy explaining the omission; the absence reflects infrastructure and governance gaps rather than a stated objection to the taxonomy itself.
The Submission and Metadata Gap Behind the Absence
The practical blocker is metadata architecture. arXiv collects author information as a single free-text field with no dedicated structure for role-level contribution data. bioRxiv and medRxiv, run by Cold Spring Harbor Laboratory, capture somewhat richer structured metadata — including funder information — but likewise have no CRediT field in their submission forms.
This differs from what happens downstream. Crossref’s deposit schema supports embedding CRediT contributor-role metadata against a published journal article’s DOI record, which is how a reader can eventually see machine-readable contribution data attached to the version of record. Preprint DOI records typically carry no equivalent CRediT element, because the preprint servers do not populate it and have no requirement to.
| Feature | arXiv / bioRxiv (preprint) | Typical CRediT-mandating journal |
|---|---|---|
| Screening body | Moderators (topic/scope check) | Editorial board + peer reviewers |
| Author metadata field | Free-text author list | Structured CRediT role fields in submission system |
| Version status | Multiple revisable versions | Single accepted version of record |
| CRediT statement required | No | Often yes, per publisher policy |
| DOI metadata (CRediT roles) | Generally absent | Supported via Crossref deposit schema |
What Changes When a Preprint Reaches a CRediT-Mandating Journal?
Once a manuscript that began life as an arXiv or bioRxiv preprint is accepted by a journal that mandates CRediT, the contribution statement is captured during that journal’s own submission or production workflow — not retrofitted onto the preprint record itself.
Authors typically complete role selections in the publisher’s manuscript system (for example, at revision or acceptance stage), and the resulting statement appears on the published article page and, where supported, in the article’s Crossref-deposited metadata. bioRxiv and medRxiv link out to the published version once available, but the CRediT statement itself lives with the publisher’s version of record, not the earlier preprint.
Answer-First Q&A
What is the CRediT taxonomy?
The CRediT taxonomy is a standardised, 14-role controlled vocabulary — covering roles such as Conceptualization, Investigation, and Writing – original draft — used to describe each named author’s specific contribution to a research output, distinct from authorship order or byline position.
What are the 14 roles of the CRediT taxonomy?
The 14 roles are Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, and Writing – review & editing, as defined under ANSI/NISO Z39.104-2022.
Do preprints need a CRediT statement?
No. Neither arXiv nor bioRxiv currently requires a CRediT statement, since neither maintains the editorial enforcement mechanism or the structured metadata field that journals use to collect this information at submission or acceptance.
What happens to author contributions when a preprint is later published?
The CRediT statement is generated at the journal stage, through the publisher’s own submission system, and appears on the published version of record — it is not added retroactively to the original preprint page on arXiv or bioRxiv.
Implications for Research Administrators and Institutions
Institutions relying on contributorship data for research assessment, promotion cases, or authorship-dispute resolution should treat preprints as an incomplete contributorship record. The Contributor Roles Taxonomy resource maintained at CASRAI’s CRediT contributor roles hub and CASRAI’s broader authorship guidance both point research offices toward the published, CRediT-tagged version rather than the preprint when contributorship needs to be verified or cited formally.
- Do not assume a preprint’s author order reflects final contribution roles — roles can shift before formal publication.
- Check the journal’s published version, and its Crossref metadata where available, for the authoritative CRediT statement.
- Use CASRAI’s research administration dictionary to confirm terminology when drafting institutional authorship policy.
Outlook: Will Preprint Servers Adopt CRediT?
Momentum toward richer preprint metadata is real but has so far concentrated on discoverability and version-linking rather than contributorship. Until arXiv or bioRxiv add a structured contribution field, and until a body with editorial standing is prepared to enforce it, CRediT statements will remain a journal-stage artefact rather than a preprint-stage one. Research offices and funders that want contributor-level accountability earlier in the research lifecycle will need to look to journal policy, not preprint infrastructure, for now.