Skip to main content
v2026.1714 entries · CC-BY 4.0
CASRAI

Definition · Plain-language

Data dictionary

A data dictionary is a centralised repository of metadata that records the definitions, formats, relationships and permitted values of an organisation’s data elements, giving everyone a shared, authoritative reference.

CASRAI research-methods explainer — Data dictionary

The step most authors miss

Doing CRediT right? Don’t stop at the statement.

A CRediT statement credits you inside one paper. The recognition CRediT was built for happens when those roles are tied to you, persistently. Sign in with your ORCID — free — and claim your CRediT contributions on casrai.org, the home of the standard. They become a verified, portable part of your identity, not a line that disappears into one PDF.

Free: claim your contributions, then export a journal-ready CRediT statement, schema.org structured data, JATS XML, CSV or BibTeX — and preview your public profile. A membership publishes that profile publicly and verifies the journals you serve.

What a data dictionary records

A data dictionary captures, for each data element, a precise definition, its data type and format, its relationships to other elements and its permitted values or value range. It may also record ownership, source system and any business rules. The purpose is to remove ambiguity: where one team’s revenue might differ from another’s, the dictionary fixes a single agreed meaning. It is a foundational metadata artefact that supports consistent reporting, integration and analysis across systems.

Relationship to ISO 11179

The international standard ISO/IEC 11179 describes metadata registries and a structured way to specify and register data elements so they are unambiguous and reusable across organisations. A data dictionary applies the same idea at the organisational level: it is, in effect, a registry of agreed data-element definitions. Aligning a dictionary with these registry principles — clear naming, definitions and permitted values — makes the organisation’s data more interoperable and easier to share with partners and standards-based systems.

Dictionary, catalogue and glossary

These terms overlap but differ in focus. A data dictionary describes data elements at a structural level — fields, types and allowed values. A business glossary defines business terms in plain language. A data catalogue is a broader inventory of data assets, often pulling in dictionary, glossary and lineage metadata to aid discovery. In practice a mature organisation maintains all three, with the data dictionary providing the detailed, element-level definitions the others reference.

Key facts

At a glance

  • Definition: a centralised repository of data-element metadata
  • Records: definitions, data types, formats, relationships, allowed values
  • Purpose: a single authoritative meaning for each field
  • Related standard: ISO/IEC 11179 metadata registries
  • Distinct from: a business glossary (plain-language terms)
  • Supports: consistent reporting, integration and analysis

Common misconceptions

What people often get wrong

Often heard: A data dictionary and a data catalogue are the same thing.

Actually: A data dictionary describes individual data elements — their definitions, types and allowed values. A data catalogue is a broader inventory of data assets for discovery, which often references dictionary content.

Often heard: A data dictionary is just technical documentation for developers.

Actually: It serves business users too, fixing agreed definitions and permitted values. Its value is shared meaning across business and technical teams, not only schema documentation.

Often heard: Once built, a data dictionary rarely needs updating.

Actually: Definitions, systems and rules evolve. A dictionary must be governed and maintained, usually by data stewards, or it becomes inaccurate and people stop trusting it.

Referenced across the research world

University of Cambridge logoColumbia University logoUniversity of Edinburgh logoHarvard University logoUniversity of Oxford logoPrinceton University logoStanford School of Medicine logoUniversity College London logoORCID logoCrossref logoUniversity of Cambridge logoColumbia University logoUniversity of Edinburgh logoHarvard University logoUniversity of Oxford logoPrinceton University logoStanford School of Medicine logoUniversity College London logoORCID logoCrossref logo
  • University of Cambridge logo
  • Columbia University logo
  • University of Edinburgh logo
  • Harvard University logo
  • University of Oxford logo
  • Princeton University logo
  • Stanford School of Medicine logo
  • University College London logo
  • ORCID logo
  • Crossref logo

View CASRAI adoption →