Skip to main content
v2026.1714 entries · CC-BY 4.0

Reference · CRediT integration

Crosswalks: CRediT ↔ DataCite, MARC, ORCID, Schema.org

Authoritative mapping tables for research administrators integrating the Contributor Roles Taxonomy (ANSI/NISO Z39.104-2022) into four widely-used metadata systems. Each row is one CRediT role; each column is the corresponding term in the target vocabulary.

CRediT is a fourteen-role controlled vocabulary for contributor credit on a journal article. It is rarely the only vocabulary a system needs. A repository depositing datasets has to express contribution in DataCite’s contributorType; a library catalogue must use MARC 21 relator codes; an ORCID profile carries work-level contributorship in ORCID’s own API model; a publisher emitting structured data on the open web encodes contribution in Schema.org. The four tables below record how the fourteen CRediT roles cross-walk into each of these systems.

Three of the four mappings are imperfect by design. DataCite’s contributorType is dataset-centric; MARC’s relator codes predate research-output vocabularies; Schema.org’s Role is a generic pattern, not a controlled vocabulary. Only the ORCID mapping is 1:1, because ORCID adopted CRediT verbatim in API v3.0. Each row records the closest match plus a note explaining the compromise.

1. CRediT ↔ DataCite contributorType

DataCite’s Metadata Schema 4.6 defines contributorType as a controlled vocabulary on the <contributor> element (§7). The values are: ContactPerson, DataCollector, DataCurator, DataManager, Distributor, Editor, HostingInstitution, Other, Producer, ProjectLeader, ProjectManager, ProjectMember, RegistrationAgency, RegistrationAuthority, RelatedPerson, Researcher, ResearchGroup, RightsHolder, Sponsor, Supervisor, Translator, WorkPackageLeader. Authorship of the deposited resource itself is carried by <creators> and is therefore not a contributorType value.

Most CRediT roles have no exact DataCite analogue; the table records the closest primary match and, where useful, a fallback. Where DataCite expresses something CRediT cannot (e.g. RightsHolder, WorkPackageLeader), no CRediT row exists for it.

#CRediT roleDataCite contributorType (primary)FallbackNotes
1ConceptualizationProjectLeaderResearcherProjectLeader when the conceptualiser is the PI; otherwise Researcher. DataCite has no “conceptualizer” type — the role is subsumed under project leadership for dataset deposits.
2Data curationDataCuratorDirect 1:1 correspondence — the only CRediT role that maps cleanly to DataCite without compromise.
3Formal analysisResearcherDataCuratorResearcher when analysis produced new findings; DataCurator when the work was applied to existing data preparation rather than new statistical work.
4Funding acquisitionSponsorProjectLeaderFunding is normally recorded via <fundingReference> (Crossref Funder ID), NOT contributorType. Use Sponsor only when the funding-acquirer is a person to be credited and no separate funder record exists.
5InvestigationDataCollectorResearcherDataCollector is the closest DataCite type for the experimental / fieldwork sense of Investigation; Researcher when investigation was theoretical or computational.
6MethodologyResearcherProjectMemberNo methodology-specific DataCite type. Researcher carries the contribution; capture the methodological specificity in <contribution> free text if needed.
7Project administrationProjectManagerDirect 1:1 correspondence.
8ResourcesDataManagerHostingInstitutionDataManager for individuals providing materials, samples, or instrumentation; HostingInstitution when the contribution is institutional (compute, archive, repository).
9SoftwareResearcherOtherFor software outputs deposited as their own DataCite record use resourceTypeGeneral="Software" and Researcher; for software cited from a dataset record, use Researcher and reference the software DOI via relatedIdentifier.
10SupervisionSupervisorDirect 1:1 correspondence.
11ValidationResearcherOtherNo DataCite analogue for replication/verification. Researcher with a free-text note is the conventional placement.
12VisualizationResearcherOtherDataCite has no Visualizer / Illustrator type. Use Researcher; the produced figures themselves can be deposited as separate resources with resourceTypeGeneral="Image".
13Writing — original draftResearcherOtherDataCite carries authorship of the deposited resource itself via <creators>, not contributorType. For drafting of an associated narrative (e.g., a data paper), use Researcher and link via relatedIdentifier.
14Writing — review & editingEditorResearcherEditor is appropriate where the contribution is editorial review of the dataset documentation or associated data paper; Researcher for substantive scientific revision.

Canonical source: DataCite Metadata Schema 4.6 — Kernel PDF · CASRAI: DataCite federation.

2. CRediT ↔ MARC 21 Relator Codes

The MARC Code List for Relators is maintained by the Library of Congress and contains 270+ three-letter codes. Relators predate research-output vocabularies and the closest matches are often imperfect; the recommended encoding is to record the CRediT term verbatim in MARC field 720 $e (Added Entry — Uncontrolled Name, role) and use the closest MARC code in field 700 $4 (Added Entry — Personal Name, relator). For institutional contributors use 710 instead of 700.

#CRediT roleMARC relator code(s)Code label(s)Notes
1Conceptualizationrth, autResearch team head; AuthorNo exact MARC relator for the conceptualisation role. "rth" (Research team head) is closest when the conceptualiser was the PI; "aut" is over-broad but commonly used in legacy catalogues. Recommend CRediT in 720 $e and link by 700 $4 rth.
2Data curationdtm, curData manager; Curator"dtm" (Data manager) is the operational match; "cur" (Curator) is appropriate when the work is curatorial selection/annotation. Prefer "dtm" for CRediT alignment.
3Formal analysisanlAnalystDirect correspondence. "anl" was added partly to accommodate statistical-analysis credits.
4Funding acquisitionfnd, spnFunder; Sponsor"fnd" (Funder) when the credited person/body provided the funds; "spn" (Sponsor) when they brokered or championed funding without being the source. Often institutional, not personal — encode in 710 not 700.
5Investigationdtc, fldData contributor; Field director"dtc" (Data contributor) for data/evidence collection; "fld" (Field director) for fieldwork-led investigation. Choose per the actual activity recorded.
6Methodologyrth, rtmResearch team head; Research team memberNo methodology-specific relator. "rth" / "rtm" carry the team-membership sense; the methodological specifics belong in a 500-series note or a CRediT 720 entry.
7Project administrationpmn, ppmProduction manager; Project manager"ppm" (Project manager, added 2018) is the precise match. "pmn" (Production manager) is publishing-pipeline-specific and only appropriate where coordination was editorial.
8Resourcesdon, spnDonor; SponsorNo materials-provision relator. "don" (Donor) for physical materials/samples; "spn" for institutional resource provision. Imperfect; document explicitly.
9SoftwareprgProgrammerDirect correspondence for the coding aspect of CRediT Software. Use "prg" plus a separately tagged software identifier (URL or DOI) in field 856 or 024.
10Supervisionths, rthThesis advisor; Research team head"ths" for academic supervision of a student researcher; "rth" for general research-team leadership. Pick by context; ths/rth can both appear if applicable.
11Validationrev, ctgReviewer; Cartographer (replication)No clean MARC code for validation/replication. "rev" (Reviewer) covers verification activity; "ctg" is wrong-domain and not recommended. Best practice: 720 $e "Validation" with $4 rev.
12Visualizationill, artIllustrator; Artist"ill" for figures, charts, and data visualisations; "art" for original artwork. Neither perfectly captures "data visualization for scientific publication"; "ill" is the closer match.
13Writing — original draftautAuthorDirect correspondence. Use 100/700 $4 aut.
14Writing — review & editingedtEditorDirect correspondence. Use 700 $4 edt. Distinguish from "edc" (Editor of compilation) which is collection-level.

Canonical source: Library of Congress — MARC Code List for Relators. For full-text MARC encoding examples, see also the LoC bibliographic format documentation for fields 700 and 720.

3. CRediT ↔ ORCID work contributorship

ORCID adopted CRediT as the controlled vocabulary for work-level contributor roles when it released API v3.0 (announcement: ORCID — Contributor Roles Taxonomy (CRediT) in ORCID, 2018). The ORCID contributor-role element on the work model takes the canonical NISO URI directly. The mapping is therefore 1:1 by design: each CRediT role has exactly one ORCID role value, which is its casrai.org/credit URI.

When a CRediT-tagged publication is registered with Crossref or DataCite carrying CRediT in the deposit metadata, Crossref auto-pushes the role into the contributor’s ORCID record without further action by the publisher. This is the single largest channel by which CRediT propagates into the persistent-identifier graph.

Canonical source: ORCID — CRediT in ORCID; ORCID XML schema (orcid-model), element work:contributor-attributes/work:contributor-role.

4. CRediT ↔ Schema.org Role

Schema.org has no contributor-role controlled vocabulary; instead it provides the generic Role intermediate type that wraps any property to attach an additional roleName (and optionally startDate / endDate). The pattern recommended by Schema.org for credit-with-role is to express each contributor as a Role instance nested inside the author or contributor property of a CreativeWork (or its subclass ScholarlyArticle).

The recommended values for the Role properties when encoding CRediT are:

  • @typeRole
  • roleName — the CRediT role name (e.g. Data curation)
  • description — the NISO definition for that role, verbatim
  • url — the canonical casrai.org/credit URI
  • [author | contributor] — a nested Person with name and (where available) identifier as the ORCID iD
#CRediT roleSchema.org Role.roleNameSchema.org Role.url
1Conceptualization"Conceptualization"https://casrai.org/credit/roles/conceptualization
2Data curation"Data curation"https://casrai.org/credit/roles/data-curation
3Formal analysis"Formal analysis"https://casrai.org/credit/roles/formal-analysis
4Funding acquisition"Funding acquisition"https://casrai.org/credit/roles/funding-acquisition
5Investigation"Investigation"https://casrai.org/credit/roles/investigation
6Methodology"Methodology"https://casrai.org/credit/roles/methodology
7Project administration"Project administration"https://casrai.org/credit/roles/project-administration
8Resources"Resources"https://casrai.org/credit/roles/resources
9Software"Software"https://casrai.org/credit/roles/software
10Supervision"Supervision"https://casrai.org/credit/roles/supervision
11Validation"Validation"https://casrai.org/credit/roles/validation
12Visualization"Visualization"https://casrai.org/credit/roles/visualization
13Writing — original draft"Writing — original draft"https://casrai.org/credit/roles/writing-original-draft
14Writing — review & editing"Writing — review & editing"https://casrai.org/credit/roles/writing-review-editing

Worked example — JSON-LD for a ScholarlyArticle

The example below encodes a paper with two authors. The first is credited with Conceptualization, Methodology, and Writing — original draft; the second with Data curation and Writing — review & editing. Each role is a separate Role instance inside the author array, so the same person appears multiple times — once per role.

{
  "@context": "https://schema.org",
  "@type": "ScholarlyArticle",
  "@id": "https://doi.org/10.1234/example.2026.0001",
  "headline": "An illustrative paper demonstrating CRediT in Schema.org",
  "datePublished": "2026-05-19",
  "author": [
    {
      "@type": "Role",
      "roleName": "Conceptualization",
      "url": "https://casrai.org/credit/roles/conceptualization",
      "description": "Ideas; formulation or evolution of overarching research goals and aims.",
      "author": {
        "@type": "Person",
        "name": "Ada Lovelace",
        "identifier": "https://orcid.org/0000-0002-1825-0097"
      }
    },
    {
      "@type": "Role",
      "roleName": "Methodology",
      "url": "https://casrai.org/credit/roles/methodology",
      "description": "Development or design of methodology; creation of models.",
      "author": {
        "@type": "Person",
        "name": "Ada Lovelace",
        "identifier": "https://orcid.org/0000-0002-1825-0097"
      }
    },
    {
      "@type": "Role",
      "roleName": "Writing — original draft",
      "url": "https://casrai.org/credit/roles/writing-original-draft",
      "description": "Preparation, creation and/or presentation of the published work, specifically writing the initial draft (including substantive translation).",
      "author": {
        "@type": "Person",
        "name": "Ada Lovelace",
        "identifier": "https://orcid.org/0000-0002-1825-0097"
      }
    },
    {
      "@type": "Role",
      "roleName": "Data curation",
      "url": "https://casrai.org/credit/roles/data-curation",
      "description": "Management activities to annotate (produce metadata), scrub data and maintain research data (including software code, where it is necessary for interpreting the data itself) for initial use and later re-use.",
      "author": {
        "@type": "Person",
        "name": "Charles Babbage",
        "identifier": "https://orcid.org/0000-0001-2345-6789"
      }
    },
    {
      "@type": "Role",
      "roleName": "Writing — review & editing",
      "url": "https://casrai.org/credit/roles/writing-review-editing",
      "description": "Preparation, creation and/or presentation of the published work by those from the original research group, specifically critical review, commentary or revision — including pre- or post-publication stages.",
      "author": {
        "@type": "Person",
        "name": "Charles Babbage",
        "identifier": "https://orcid.org/0000-0001-2345-6789"
      }
    }
  ]
}

Canonical sources: schema.org/Role · schema.org/ScholarlyArticle · the Schema.org pattern for nested roles is documented in the Schema.org data model — Roles note.

How to apply these crosswalks

The four tables solve overlapping but distinct problems. A research administrator integrating CRediT into a production system usually faces one or two of them at once, not all four. The decision flow below maps the common cases.

  • Depositing a dataset, software release, or instrument with DataCite — use Table 1. Author credit goes in <creators>; non-author contribution goes in <contributors> with the closest contributorType. Carry the CRediT term itself in a separately addressable place: either <contribution> free text (DataCite 4.6 introduced this) or a related resourceTypeGeneral="Text" data paper that does carry CRediT in its Crossref deposit.
  • Cataloguing a research output in a MARC-based library system — use Table 2. Record the closest MARC relator in 700/710 $4; record the precise CRediT term verbatim in 720 $e with subfield $4 holding the same relator code as a cross-reference. This preserves the relator-code search index while keeping the CRediT term recoverable for downstream consumers.
  • Pushing contributor credit to ORCID records — use Table 3. In practice, this is not done directly: deposit CRediT in the Crossref or DataCite metadata at publication, and Crossref auto-propagates the roles into each contributor’s ORCID record. Direct API writes are appropriate only for back-fills or for outputs that have no Crossref / DataCite deposit.
  • Emitting structured data on the open web (JSON-LD on article landing pages, repository pages) — use Table 4. Each Role-with-Person tuple is one Role instance in the author or contributor array; a contributor with three roles appears three times. Set url to the casrai.org/credit URI so consumers can dereference the canonical definition.

Versioning and provenance

The mappings in Tables 1, 2, and 4 are editorial: they reflect the closest available correspondence in vocabularies that were not designed against each other. The mapping in Table 3 is structural: ORCID adopted CRediT verbatim. For any specific implementation that depends on a mapping being correct, refer to the canonical source linked under each table and confirm against the current schema version. The DataCite schema in particular has evolved (4.4 → 4.5 → 4.6); the MARC relator list is appended to roughly annually.

Related pages

Adopted by research universities worldwide

University of Cambridge logoColumbia University logoUniversity of Edinburgh logoHarvard University logoMassachusetts Institute of Technology logoUniversity of Oxford logoPrinceton University logoStanford School of Medicine logoUniversity College London logoUniversity of Cambridge logoColumbia University logoUniversity of Edinburgh logoHarvard University logoMassachusetts Institute of Technology logoUniversity of Oxford logoPrinceton University logoStanford School of Medicine logoUniversity College London logo
  • University of Cambridge logo
  • Columbia University logo
  • University of Edinburgh logo
  • Harvard University logo
  • Massachusetts Institute of Technology logo
  • University of Oxford logo
  • Princeton University logo
  • Stanford School of Medicine logo
  • University College London logo

View CASRAI adoption →