v2026.1714 entries · CC-BY 4.0
CRediT role · ANSI/NISO Z39.104-2022 · Research & analysis

Data curation

Management activities to annotate (produce metadata), scrub data and maintain research data (including software code, where it is necessary for interpreting the data itself) for initial use and later re-use.

— ANSI/NISO Z39.104-2022, CRediT, Contributor Roles Taxonomy

In practice — medical research context

When this role applies

In biomedical and clinical-research contexts, this role typically attaches to the team member(s) who annotated, cleaned, or maintained the research data, including any software code needed to interpret it. It is distinct from acknowledgement of those who contributed materials, technical service, or general support.

Authors may hold multiple CRediT roles on a single paper. The optional degree qualifier (lead / equal / supporting) clarifies relative responsibility when several authors share a role.

In the CASRAI Dictionary, the Data curation role connects to the controlled-vocabulary concepts of Data steward role (in DMP), Project metadata, DataCite metadata schema, PID metadata, and Open data.

Encoding

Use in your systems

JATS XML <role> element
xml
<role vocab="credit"
      vocab-identifier="https://casrai.org/credit"
      vocab-term="Data curation"
      vocab-term-identifier="https://casrai.org/credit/roles/data-curation" />
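The same element can be generated rather than hand-written. The following is a minimal sketch using Python's standard `xml.etree.ElementTree`; the helper name `credit_role_element` and the slug rule (lowercase, spaces to hyphens) are assumptions made for illustration and are not defined by JATS or CRediT. The URIs are the ones shown on this page.

```python
import xml.etree.ElementTree as ET

# Vocabulary URI as it appears in the snippet on this page.
CREDIT_VOCAB_URI = "https://casrai.org/credit"


def credit_role_element(term: str) -> ET.Element:
    """Build a JATS-style <role> element for a CRediT term (hypothetical helper)."""
    # Assumption: the term URI slug is the lowercased term with spaces
    # replaced by hyphens, e.g. "Data curation" -> "data-curation".
    slug = term.lower().replace(" ", "-")
    return ET.Element(
        "role",
        {
            "vocab": "credit",
            "vocab-identifier": CREDIT_VOCAB_URI,
            "vocab-term": term,
            "vocab-term-identifier": f"{CREDIT_VOCAB_URI}/roles/{slug}",
        },
    )


role = credit_role_element("Data curation")
print(ET.tostring(role, encoding="unicode"))
```

Building the element from a single term string keeps `vocab-term` and `vocab-term-identifier` from drifting apart, which is easy to get wrong when the attributes are typed by hand.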
Schema.org DefinedTerm (JSON-LD)
json
{
  "@context": "https://schema.org",
  "@type": "DefinedTerm",
  "@id": "https://casrai.org/credit/roles/data-curation",
  "name": "Data curation",
  "identifier": "https://casrai.org/credit/roles/data-curation",
  "description": "Management activities to annotate (produce metadata), scrub data and maintain research data (including software code, where it is necessary for interpreting the data itself) for initial use and later re-use.",
  "inDefinedTermSet": "https://casrai.org/credit",
  "url": "https://casrai.org/credit/roles/data-curation",
  "license": "https://creativecommons.org/licenses/by/4.0/",
  "bibliographicCitation": "CASRAI Editorial Board (2026). Data curation — CRediT contributor role (v2022.1). CASRAI. https://casrai.org/credit/roles/data-curation",
  "citation": "https://casrai.org/credit/citation",
  "dateModified": "2022-08-01",
  "version": "2022.1",
  "inLanguage": "en",
  "audience": {
    "@type": "Audience",
    "audienceType": "Researchers, editors, publishers, librarians"
  }
}
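A consuming system may want to confirm that the URI-bearing fields of such a record agree before storing it. The sketch below is one way to do that with Python's standard `json` module; the function name `uri_fields_agree` and the choice of fields to compare are assumptions for illustration, and the dictionary is a trimmed copy of the record above.

```python
import json

ROLE_URI = "https://casrai.org/credit/roles/data-curation"

# Trimmed copy of the DefinedTerm record shown above.
defined_term = {
    "@context": "https://schema.org",
    "@type": "DefinedTerm",
    "@id": ROLE_URI,
    "name": "Data curation",
    "identifier": ROLE_URI,
    "inDefinedTermSet": "https://casrai.org/credit",
    "url": ROLE_URI,
}


def uri_fields_agree(term: dict) -> bool:
    """Return True when @id is present and matches identifier and url."""
    term_id = term.get("@id")
    if term_id is None:
        return False
    return term.get("identifier") == term_id and term.get("url") == term_id


# Round-trip through JSON text, as a receiving system would see it.
payload = json.dumps(defined_term, indent=2)
parsed = json.loads(payload)
print(uri_fields_agree(parsed))
```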
Crossref deposit (5.5 schema)
xml
<contributor sequence="additional" contributor_role="author">
  <given_name>Jane</given_name>
  <surname>Doe</surname>
  <ORCID>https://orcid.org/0000-0002-XXXX-XXXX</ORCID>
  <credit_roles>
    <credit_role vocab-term-identifier="https://casrai.org/credit/roles/data-curation">Data curation</credit_role>
  </credit_roles>
</contributor>
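Reading the roles back out of a record like this is a useful check on a deposit pipeline. The sketch below parses the snippet exactly as printed above using Python's standard `xml.etree.ElementTree`. It assumes the element and attribute names shown on this page and no XML namespaces; a real deposit file is likely to declare namespaces, in which case the tag lookups would need qualifying. The function name `extract_credit_roles` is hypothetical.

```python
import xml.etree.ElementTree as ET

# The contributor record as printed on this page (placeholder ORCID kept as-is).
SNIPPET = """
<contributor sequence="additional" contributor_role="author">
  <given_name>Jane</given_name>
  <surname>Doe</surname>
  <ORCID>https://orcid.org/0000-0002-XXXX-XXXX</ORCID>
  <credit_roles>
    <credit_role vocab-term-identifier="https://casrai.org/credit/roles/data-curation">Data curation</credit_role>
  </credit_roles>
</contributor>
"""


def extract_credit_roles(xml_text: str) -> list[tuple[str, str]]:
    """Return (role label, role URI) pairs found in a contributor record."""
    root = ET.fromstring(xml_text.strip())
    return [
        (el.text.strip(), el.get("vocab-term-identifier", ""))
        for el in root.iter("credit_role")
        if el.text  # skip empty <credit_role/> elements
    ]


roles = extract_credit_roles(SNIPPET)
print(roles)
```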

Identifiers

Persistent identifiers

Canonical URI: https://casrai.org/credit/roles/data-curation
JATS vocab: credit
CASRAI PID

Frequently asked

Common questions about Data curation

What does the "Data curation" CRediT role mean?
Management activities to annotate (produce metadata), scrub data and maintain research data (including software code, where it is necessary for interpreting the data itself) for initial use and later re-use.
How do I cite "Data curation" in an author contribution statement?
Per ANSI/NISO Z39.104-2022, list the contributor's name followed by their CRediT role(s). Example: "A. Smith — Data curation". You can add an optional degree qualifier (lead / equal / supporting) when several authors share a role. See CASRAI's author-statement guidance at https://casrai.org/credit/for-authors for templates and worked examples.
What is the canonical machine-readable URI for "Data curation"?
The canonical URI is https://casrai.org/credit/roles/data-curation. Use this in JATS XML (vocab-term-identifier), Crossref deposits (the vocab-term-identifier attribute on credit_role), JSON-LD (@id, identifier, and url), and any system that stores contributor metadata. CASRAI's encoding guides cover JATS, Crossref Schema 5.5, and JSON-LD at https://casrai.org/credit/jats and https://casrai.org/credit/json-ld.
Can one author hold "Data curation" plus other CRediT roles?
Yes — CRediT roles are not mutually exclusive. A single contributor typically holds 3–4 roles. The optional "lead / equal / supporting" qualifier clarifies the contributor's relative responsibility within each role they share with others.
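The answers above describe a name followed by one or more roles, each with an optional degree qualifier. A minimal sketch of that formatting in Python follows. The function name `format_contribution`, the colon separator, and the parenthesised qualifier are assumptions for illustration only; journals set their own layout, and the example on this page uses a dash after the name instead.

```python
from typing import Optional

# Degree qualifiers named on this page.
ALLOWED_DEGREES = {"lead", "equal", "supporting"}


def format_contribution(name: str, roles: list[tuple[str, Optional[str]]]) -> str:
    """Format one contributor's CRediT roles as a single statement line."""
    parts = []
    for role, degree in roles:
        if degree is None:
            parts.append(role)  # qualifier is optional
        elif degree in ALLOWED_DEGREES:
            parts.append(f"{role} ({degree})")
        else:
            raise ValueError(f"unknown degree qualifier: {degree!r}")
    return f"{name}: " + ", ".join(parts)


statement = format_contribution(
    "A. Smith",
    [("Data curation", "lead"), ("Formal analysis", None)],
)
print(statement)
```

Rejecting unknown qualifiers at formatting time keeps free-text variants such as "primary" or "minor" out of the published statement.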

Cite this role

Cite the Data curation role

These citations reference this specific role page as a standards-body definition aligned with ANSI/NISO Z39.104-2022. For guidance on citing CRediT generally, the underlying standard, or worked statement examples, see How to cite CRediT.

APA 7

CASRAI Editorial Board (2026). Data curation — CRediT contributor role (v2022.1). CASRAI. https://casrai.org/credit/roles/data-curation

Vancouver

CASRAI Editorial Board. Data curation — CRediT contributor role [Internet]. CASRAI; 2026 [cited 2026]. Version v2022.1. Available from: https://casrai.org/credit/roles/data-curation

Chicago (author-date)

CASRAI Editorial Board. 2026. "Data curation — CRediT contributor role." Version v2022.1. CASRAI. https://casrai.org/credit/roles/data-curation.

BibTeX

@misc{casrai_credit_data_curation_2026,
  author       = {{CASRAI Editorial Board}},
  title        = {{Data curation — CRediT contributor role}},
  year         = {2026},
  version      = {v2022.1},
  publisher    = {CASRAI},
  howpublished = {\url{https://casrai.org/credit/roles/data-curation}},
  note         = {Aligned with ANSI/NISO Z39.104-2022. Licensed CC-BY 4.0.}
}

Notable examples

Papers exemplifying the Data curation role

These published papers demonstrate the Data curation CRediT role in practice. Each example links to its DOI and explains why the role applies.

  1. Example 1 of 4

    Scientific Data · vol. 3 · article 160018

    DOI: 10.1038/sdata.2016.18

    The defining contemporary articulation of what data curation is for. Wilkinson and colleagues set out the Findable-Accessible-Interoperable-Reusable principles that now govern reusable research data. The paper is itself a data-curation manifesto, and the authorship structure (large international working group) is exactly the pattern CRediT was designed to make legible. Data curation is no longer invisible plumbing — this paper made the case.

  2. Example 2 of 4

    Nucleic Acids Research · vol. 30 · pp. 207-210

    DOI: 10.1093/nar/30.1.207

    The foundational paper for GEO, one of the world's largest curated public repositories of high-throughput gene-expression data. The authors describe the metadata schema, deposition pipeline, and annotation standards that make tens of thousands of subsequent re-use studies possible. This is data curation as primary research output: annotation, scrubbing, and maintenance of data for initial use and re-use — the exact CRediT definition.

  3. Example 3 of 4

    Nature · vol. 489 · pp. 57-74

    DOI: 10.1038/nature11247

    ENCODE's flagship paper describes a curated, integrated annotation of functional elements across the human genome — the product of years of metadata definition, quality control, and data scrubbing across hundreds of contributors. The consortium model demands that the data-curation contribution be made explicit. CRediT-style attribution is essentially mandatory for large-scale curation projects of this kind.

  4. Example 4 of 4 (illustrative)

    Nucleic Acids Research · vol. 51 · pp. D523-D531

    DOI: 10.1093/nar/gkab1112

    UniProt is the canonical curated protein knowledgebase, and its biennial Nucleic Acids Research updates document the continuous data-curation work: annotation, quality control, schema evolution, and integration with adjacent resources. These update papers are essentially CRediT Data Curation made visible — the authorship list is the curation team.

Examples curated by the CASRAI editorial board. Inclusion is illustrative of the role, not an endorsement of methodology or claims. Pre-CRediT-era papers are flagged illustrative.

Translations

Available in

The canonical, normative version of Data curation is this English page (ANSI/NISO Z39.104-2022). Community-drafted translations are available below for working reference; the English text prevails in case of any divergence.

All translations are community drafts. CRediT has not been officially translated by NISO.

Dictionary

Related CASRAI Dictionary terms

The Data curation role connects to these CASRAI Dictionary terms — the controlled-vocabulary concepts a contributor in this role typically works with. Each links to its canonical definition with worked examples and machine-readable encodings.

Adoption

Who collects Data curation — by publisher

The Data curation role is collected by the major publishers that have adopted CRediT portfolio-wide.

Related

Continue with CRediT

Explore the full taxonomy

The other 13 CRediT roles, grouped

Each role has its own canonical-definition page with encoding examples and FAQs.

Adopted by research universities worldwide

  • University of Cambridge
  • Columbia University
  • University of Edinburgh
  • Harvard University
  • Massachusetts Institute of Technology
  • University of Oxford
  • Princeton University
  • Stanford School of Medicine
  • University College London
