Reference · CRediT integration
Crosswalks: CRediT ↔ DataCite, MARC, ORCID, Schema.org
Authoritative mapping tables for research administrators integrating the Contributor Roles Taxonomy (ANSI/NISO Z39.104-2022) into four widely-used metadata systems. Each row is one CRediT role; each column is the corresponding term in the target vocabulary.
CRediT is a fourteen-role controlled vocabulary for contributor credit on a journal article. It is rarely the only vocabulary a system needs. A repository depositing datasets has to express contribution in DataCite’s contributorType; a library catalogue must use MARC 21 relator codes; an ORCID profile carries work-level contributorship in ORCID’s own API model; a publisher emitting structured data on the open web encodes contribution in Schema.org. The four tables below record how the fourteen CRediT roles cross-walk into each of these systems.
Three of the four mappings are imperfect by design. DataCite’s contributorType is dataset-centric; MARC’s relator codes predate research-output vocabularies; Schema.org’s Role is a generic pattern, not a controlled vocabulary. Only the ORCID mapping is 1:1, because ORCID adopted CRediT verbatim in API v3.0. Each row records the closest match plus a note explaining the compromise.
1. CRediT ↔ DataCite contributorType
DataCite’s Metadata Schema 4.6 defines contributorType as a controlled vocabulary on the <contributor> element (§7). The values are: ContactPerson, DataCollector, DataCurator, DataManager, Distributor, Editor, HostingInstitution, Other, Producer, ProjectLeader, ProjectManager, ProjectMember, RegistrationAgency, RegistrationAuthority, RelatedPerson, Researcher, ResearchGroup, RightsHolder, Sponsor, Supervisor, Translator, WorkPackageLeader. Authorship of the deposited resource itself is carried by <creators> and is therefore not a contributorType value.
Most CRediT roles have no exact DataCite analogue; the table records the closest primary match and, where useful, a fallback. Where DataCite expresses something CRediT cannot (e.g. RightsHolder, WorkPackageLeader), no CRediT row exists for it.
| # | CRediT role | DataCite contributorType (primary) | Fallback | Notes |
|---|---|---|---|---|
| 1 | Conceptualization | ProjectLeader | Researcher | ProjectLeader when the conceptualiser is the PI; otherwise Researcher. DataCite has no “conceptualizer” type — the role is subsumed under project leadership for dataset deposits. |
| 2 | Data curation | DataCurator | — | Direct 1:1 correspondence — the only CRediT role that maps cleanly to DataCite without compromise. |
| 3 | Formal analysis | Researcher | DataCurator | Researcher when analysis produced new findings; DataCurator when the work was applied to existing data preparation rather than new statistical work. |
| 4 | Funding acquisition | Sponsor | ProjectLeader | Funding is normally recorded via <fundingReference> (Crossref Funder ID), NOT contributorType. Use Sponsor only when the funding-acquirer is a person to be credited and no separate funder record exists. |
| 5 | Investigation | DataCollector | Researcher | DataCollector is the closest DataCite type for the experimental / fieldwork sense of Investigation; Researcher when investigation was theoretical or computational. |
| 6 | Methodology | Researcher | ProjectMember | No methodology-specific DataCite type. Researcher carries the contribution; capture the methodological specificity in <contribution> free text if needed. |
| 7 | Project administration | ProjectManager | — | Direct 1:1 correspondence. |
| 8 | Resources | DataManager | HostingInstitution | DataManager for individuals providing materials, samples, or instrumentation; HostingInstitution when the contribution is institutional (compute, archive, repository). |
| 9 | Software | Researcher | Other | For software outputs deposited as their own DataCite record use resourceTypeGeneral="Software" and Researcher; for software cited from a dataset record, use Researcher and reference the software DOI via relatedIdentifier. |
| 10 | Supervision | Supervisor | — | Direct 1:1 correspondence. |
| 11 | Validation | Researcher | Other | No DataCite analogue for replication/verification. Researcher with a free-text note is the conventional placement. |
| 12 | Visualization | Researcher | Other | DataCite has no Visualizer / Illustrator type. Use Researcher; the produced figures themselves can be deposited as separate resources with resourceTypeGeneral="Image". |
| 13 | Writing — original draft | Researcher | Other | DataCite carries authorship of the deposited resource itself via <creators>, not contributorType. For drafting of an associated narrative (e.g., a data paper), use Researcher and link via relatedIdentifier. |
| 14 | Writing — review & editing | Editor | Researcher | Editor is appropriate where the contribution is editorial review of the dataset documentation or associated data paper; Researcher for substantive scientific revision. |
Canonical source: DataCite Metadata Schema 4.6 — Kernel PDF · CASRAI: DataCite federation.
2. CRediT ↔ MARC 21 Relator Codes
The MARC Code List for Relators is maintained by the Library of Congress and contains 270+ three-letter codes. Relators predate research-output vocabularies and the closest matches are often imperfect; the recommended encoding is to record the CRediT term verbatim in MARC field 720 $e (Added Entry — Uncontrolled Name, role) and use the closest MARC code in field 700 $4 (Added Entry — Personal Name, relator). For institutional contributors use 710 instead of 700.
| # | CRediT role | MARC relator code(s) | Code label(s) | Notes |
|---|---|---|---|---|
| 1 | Conceptualization | rth, aut | Research team head; Author | No exact MARC relator for the conceptualisation role. "rth" (Research team head) is closest when the conceptualiser was the PI; "aut" is over-broad but commonly used in legacy catalogues. Recommend CRediT in 720 $e and link by 700 $4 rth. |
| 2 | Data curation | dtm, cur | Data manager; Curator | "dtm" (Data manager) is the operational match; "cur" (Curator) is appropriate when the work is curatorial selection/annotation. Prefer "dtm" for CRediT alignment. |
| 3 | Formal analysis | anl | Analyst | Direct correspondence. "anl" was added partly to accommodate statistical-analysis credits. |
| 4 | Funding acquisition | fnd, spn | Funder; Sponsor | "fnd" (Funder) when the credited person/body provided the funds; "spn" (Sponsor) when they brokered or championed funding without being the source. Often institutional, not personal — encode in 710 not 700. |
| 5 | Investigation | dtc, fld | Data contributor; Field director | "dtc" (Data contributor) for data/evidence collection; "fld" (Field director) for fieldwork-led investigation. Choose per the actual activity recorded. |
| 6 | Methodology | rth, rtm | Research team head; Research team member | No methodology-specific relator. "rth" / "rtm" carry the team-membership sense; the methodological specifics belong in a 500-series note or a CRediT 720 entry. |
| 7 | Project administration | pmn, ppm | Production manager; Project manager | "ppm" (Project manager, added 2018) is the precise match. "pmn" (Production manager) is publishing-pipeline-specific and only appropriate where coordination was editorial. |
| 8 | Resources | don, spn | Donor; Sponsor | No materials-provision relator. "don" (Donor) for physical materials/samples; "spn" for institutional resource provision. Imperfect; document explicitly. |
| 9 | Software | prg | Programmer | Direct correspondence for the coding aspect of CRediT Software. Use "prg" plus a separately tagged software identifier (URL or DOI) in field 856 or 024. |
| 10 | Supervision | ths, rth | Thesis advisor; Research team head | "ths" for academic supervision of a student researcher; "rth" for general research-team leadership. Pick by context; ths/rth can both appear if applicable. |
| 11 | Validation | rev, ctg | Reviewer; Cartographer (replication) | No clean MARC code for validation/replication. "rev" (Reviewer) covers verification activity; "ctg" is wrong-domain and not recommended. Best practice: 720 $e "Validation" with $4 rev. |
| 12 | Visualization | ill, art | Illustrator; Artist | "ill" for figures, charts, and data visualisations; "art" for original artwork. Neither perfectly captures "data visualization for scientific publication"; "ill" is the closer match. |
| 13 | Writing — original draft | aut | Author | Direct correspondence. Use 100/700 $4 aut. |
| 14 | Writing — review & editing | edt | Editor | Direct correspondence. Use 700 $4 edt. Distinguish from "edc" (Editor of compilation) which is collection-level. |
Canonical source: Library of Congress — MARC Code List for Relators. For full-text MARC encoding examples, see also the LoC bibliographic format documentation for fields 700 and 720.
3. CRediT ↔ ORCID work contributorship
ORCID adopted CRediT as the controlled vocabulary for work-level contributor roles when it released API v3.0 (announcement: ORCID — Contributor Roles Taxonomy (CRediT) in ORCID, 2018). The ORCID contributor-role element on the work model takes the canonical NISO URI directly. The mapping is therefore 1:1 by design: each CRediT role has exactly one ORCID role value, which is its casrai.org/credit URI.
When a CRediT-tagged publication is registered with Crossref or DataCite carrying CRediT in the deposit metadata, Crossref auto-pushes the role into the contributor’s ORCID record without further action by the publisher. This is the single largest channel by which CRediT propagates into the persistent-identifier graph.
| # | CRediT role | ORCID contributor-role value (API v3.0+) |
|---|---|---|
| 1 | Conceptualization | https://casrai.org/credit/roles/conceptualization |
| 2 | Data curation | https://casrai.org/credit/roles/data-curation |
| 3 | Formal analysis | https://casrai.org/credit/roles/formal-analysis |
| 4 | Funding acquisition | https://casrai.org/credit/roles/funding-acquisition |
| 5 | Investigation | https://casrai.org/credit/roles/investigation |
| 6 | Methodology | https://casrai.org/credit/roles/methodology |
| 7 | Project administration | https://casrai.org/credit/roles/project-administration |
| 8 | Resources | https://casrai.org/credit/roles/resources |
| 9 | Software | https://casrai.org/credit/roles/software |
| 10 | Supervision | https://casrai.org/credit/roles/supervision |
| 11 | Validation | https://casrai.org/credit/roles/validation |
| 12 | Visualization | https://casrai.org/credit/roles/visualization |
| 13 | Writing — original draft | https://casrai.org/credit/roles/writing-original-draft |
| 14 | Writing — review & editing | https://casrai.org/credit/roles/writing-review-editing |
Canonical source: ORCID — CRediT in ORCID; ORCID XML schema (orcid-model), element work:contributor-attributes/work:contributor-role.
4. CRediT ↔ Schema.org Role
Schema.org has no contributor-role controlled vocabulary; instead it provides the generic Role intermediate type that wraps any property to attach an additional roleName (and optionally startDate / endDate). The pattern recommended by Schema.org for credit-with-role is to express each contributor as a Role instance nested inside the author or contributor property of a CreativeWork (or its subclass ScholarlyArticle).
The recommended values for the Role properties when encoding CRediT are:
@type—RoleroleName— the CRediT role name (e.g. Data curation)description— the NISO definition for that role, verbatimurl— the canonicalcasrai.org/creditURI[author | contributor]— a nestedPersonwithnameand (where available)identifieras the ORCID iD
Worked example — JSON-LD for a ScholarlyArticle
The example below encodes a paper with two authors. The first is credited with Conceptualization, Methodology, and Writing — original draft; the second with Data curation and Writing — review & editing. Each role is a separate Role instance inside the author array, so the same person appears multiple times — once per role.
{
"@context": "https://schema.org",
"@type": "ScholarlyArticle",
"@id": "https://doi.org/10.1234/example.2026.0001",
"headline": "An illustrative paper demonstrating CRediT in Schema.org",
"datePublished": "2026-05-19",
"author": [
{
"@type": "Role",
"roleName": "Conceptualization",
"url": "https://casrai.org/credit/roles/conceptualization",
"description": "Ideas; formulation or evolution of overarching research goals and aims.",
"author": {
"@type": "Person",
"name": "Ada Lovelace",
"identifier": "https://orcid.org/0000-0002-1825-0097"
}
},
{
"@type": "Role",
"roleName": "Methodology",
"url": "https://casrai.org/credit/roles/methodology",
"description": "Development or design of methodology; creation of models.",
"author": {
"@type": "Person",
"name": "Ada Lovelace",
"identifier": "https://orcid.org/0000-0002-1825-0097"
}
},
{
"@type": "Role",
"roleName": "Writing — original draft",
"url": "https://casrai.org/credit/roles/writing-original-draft",
"description": "Preparation, creation and/or presentation of the published work, specifically writing the initial draft (including substantive translation).",
"author": {
"@type": "Person",
"name": "Ada Lovelace",
"identifier": "https://orcid.org/0000-0002-1825-0097"
}
},
{
"@type": "Role",
"roleName": "Data curation",
"url": "https://casrai.org/credit/roles/data-curation",
"description": "Management activities to annotate (produce metadata), scrub data and maintain research data (including software code, where it is necessary for interpreting the data itself) for initial use and later re-use.",
"author": {
"@type": "Person",
"name": "Charles Babbage",
"identifier": "https://orcid.org/0000-0001-2345-6789"
}
},
{
"@type": "Role",
"roleName": "Writing — review & editing",
"url": "https://casrai.org/credit/roles/writing-review-editing",
"description": "Preparation, creation and/or presentation of the published work by those from the original research group, specifically critical review, commentary or revision — including pre- or post-publication stages.",
"author": {
"@type": "Person",
"name": "Charles Babbage",
"identifier": "https://orcid.org/0000-0001-2345-6789"
}
}
]
}Canonical sources: schema.org/Role · schema.org/ScholarlyArticle · the Schema.org pattern for nested roles is documented in the Schema.org data model — Roles note.
How to apply these crosswalks
The four tables solve overlapping but distinct problems. A research administrator integrating CRediT into a production system usually faces one or two of them at once, not all four. The decision flow below maps the common cases.
- Depositing a dataset, software release, or instrument with DataCite — use Table 1. Author credit goes in
<creators>; non-author contribution goes in<contributors>with the closestcontributorType. Carry the CRediT term itself in a separately addressable place: either<contribution>free text (DataCite 4.6 introduced this) or a relatedresourceTypeGeneral="Text"data paper that does carry CRediT in its Crossref deposit. - Cataloguing a research output in a MARC-based library system — use Table 2. Record the closest MARC relator in
700/710 $4; record the precise CRediT term verbatim in720 $ewith subfield$4holding the same relator code as a cross-reference. This preserves the relator-code search index while keeping the CRediT term recoverable for downstream consumers. - Pushing contributor credit to ORCID records — use Table 3. In practice, this is not done directly: deposit CRediT in the Crossref or DataCite metadata at publication, and Crossref auto-propagates the roles into each contributor’s ORCID record. Direct API writes are appropriate only for back-fills or for outputs that have no Crossref / DataCite deposit.
- Emitting structured data on the open web (JSON-LD on article landing pages, repository pages) — use Table 4. Each Role-with-Person tuple is one
Roleinstance in theauthororcontributorarray; a contributor with three roles appears three times. Seturlto thecasrai.org/creditURI so consumers can dereference the canonical definition.
Versioning and provenance
The mappings in Tables 1, 2, and 4 are editorial: they reflect the closest available correspondence in vocabularies that were not designed against each other. The mapping in Table 3 is structural: ORCID adopted CRediT verbatim. For any specific implementation that depends on a mapping being correct, refer to the canonical source linked under each table and confirm against the current schema version. The DataCite schema in particular has evolved (4.4 → 4.5 → 4.6); the MARC relator list is appended to roughly annually.








