Skip to main content
v2026.1714 entries · CC-BY 4.0
Dictionary termStablev2026.2

Persistent data identifier

A persistent identifier (typically a DataCite DOI, but also IGSN for samples, Handle, ARK, or other PID-scheme identifier) assigned to a research dataset to support stable citation, attribution, and resolution over the long term.

ByCASRAI Editorial Board
· Last updated 21 May 2026

Examples

Worked examples

  • Is an instance

    A DataCite DOI 10.5061/dryad.xxxx assigned to a Dryad dataset.

  • Is an instance

    A Handle-based identifier issued by a Fedora repository for a deposited dataset.

Counter-examples

Looks similar, but isn't

  • Not an instance

    A relative path on a personal webpage is not a persistent data identifier.

  • Not an instance

    An internal database row ID is not a persistent data identifier unless published with a PID.

Editorial commentary

While ‘persistent identifier’ is the general term across all entity types, ‘persistent data identifier’ specifically denotes PIDs minted for datasets and their parts. The DataCite ecosystem is the dominant infrastructure, but Handle-based repositories (older repositories on Fedora) also issue persistent data identifiers. The Force11 Joint Declaration’s principle 4 (‘Unique Identification’) makes the persistent data identifier the operational anchor of data citation.

References

  • B
  • r
  • a
  • s
  • e
  • J
  • .
  • ,
  • D
  • a
  • t
  • a
  • C
  • i
  • t
  • e
  • a
  • g
  • l
  • o
  • b
  • a
  • l
  • r
  • e
  • g
  • i
  • s
  • t
  • r
  • a
  • t
  • i
  • o
  • n
  • a
  • g
  • e
  • n
  • c
  • y
  • f
  • o
  • r
  • r
  • e
  • s
  • e
  • a
  • r
  • c
  • h
  • d
  • a
  • t
  • a
  • ,
  • 2
  • 0
  • 0
  • 9
  • .
  • F
  • o
  • r
  • c
  • e
  • 1
  • 1
  • D
  • a
  • t
  • a
  • C
  • i
  • t
  • a
  • t
  • i
  • o
  • n
  • S
  • y
  • n
  • t
  • h
  • e
  • s
  • i
  • s
  • G
  • r
  • o
  • u
  • p
  • .

Also known as

Data PID · Dataset DOI

Machine-readable encodings

Use in your systems

JATS XML <role> element
xml
<role vocab="credit"
      vocab-identifier="https://casrai.org/dictionary/"
      vocab-term="Persistent data identifier"
      vocab-term-identifier="https://casrai.org/dictionary/term/persistent-data-identifier" />
Schema.org DefinedTerm (JSON-LD)
json
{
  "@context": "https://schema.org",
  "@type": "DefinedTerm",
  "name": "Persistent data identifier",
  "identifier": "https://casrai.org/dictionary/term/persistent-data-identifier",
  "description": "A persistent identifier (typically a DataCite DOI, but also IGSN for samples, Handle, ARK, or other PID-scheme identifier) assigned to a research dataset to support stable citation, attribution, and resolution over the long term.",
  "inDefinedTermSet": "https://casrai.org/dictionary/domain/casrai-dictionary/",
  "url": "https://casrai.org/dictionary/term/persistent-data-identifier",
  "sameAs": [
    "Data PID",
    "Dataset DOI"
  ],
  "license": "https://creativecommons.org/licenses/by/4.0/"
}

Adopted by research universities worldwide

University of Cambridge logoColumbia University logoUniversity of Edinburgh logoHarvard University logoMassachusetts Institute of Technology logoUniversity of Oxford logoPrinceton University logoStanford School of Medicine logoUniversity College London logoUniversity of Cambridge logoColumbia University logoUniversity of Edinburgh logoHarvard University logoMassachusetts Institute of Technology logoUniversity of Oxford logoPrinceton University logoStanford School of Medicine logoUniversity College London logo
  • University of Cambridge logo
  • Columbia University logo
  • University of Edinburgh logo
  • Harvard University logo
  • Massachusetts Institute of Technology logo
  • University of Oxford logo
  • Princeton University logo
  • Stanford School of Medicine logo
  • University College London logo

View CASRAI adoption →