Skip to main content
v2026.1714 entries · CC-BY 4.0
CASRAI

Explainer · Plain-language

Controlled Vocabulary: Definition, Meaning & Examples | CASRAI

A controlled vocabulary is a carefully curated list of terms used to index, tag, and organize information consistently. It prevents ambiguity and ensures that users can locate relevant resources regardless of synonym differences.

CASRAI plain-language explainers — clear answers to recurring research-administration questions

The step most authors miss

Doing CRediT right? Don’t stop at the statement.

A CRediT statement credits you inside one paper. The recognition CRediT was built for happens when those roles are tied to you, persistently. Sign in with your ORCID — free — and claim your CRediT contributions on casrai.org, the home of the standard. They become a verified, portable part of your identity, not a line that disappears into one PDF.

Free: claim your contributions, then export a journal-ready CRediT statement, schema.org structured data, JATS XML, CSV or BibTeX — and preview your public profile. A membership publishes that profile publicly and verifies the journals you serve.

Solving the synonym problem

In natural language, the same concept can be referred to by multiple names: e.g., 'cancer', 'neoplasms', and 'malignant tumors'. Without a controlled vocabulary, a search for 'cancer' would miss papers tagged only with 'neoplasms'. Curators map all synonyms to a single authorized term, ensuring complete search recall.

Types of controlled vocabularies

Controlled vocabularies exist on a spectrum of complexity: flat term lists (simple list of valid words), authority files (such as personal name lists), taxonomies (hierarchical parent-child trees), and thesauri (associative networks linking terms through broader, narrower, and related-term relationships).

Application in database search

Scholarly databases rely heavily on controlled vocabularies to power search capabilities. For example, MEDLINE uses MeSH terms, and Embase uses Emtree. Professional searchers use these index terms in search strings to construct highly sensitive and specific search strategies for systematic reviews.

Key facts

At a glance

  • Definition: A curated, standardized list of terms used to tag and index content
  • Purpose: Eliminates natural-language ambiguity, synonyms, and variations
  • Key types: Taxonomies, thesauri, authority files, flat term lists
  • Examples: MeSH (medicine), LCSH (library science), Getty Vocabularies
  • Why it matters: Powers highly precise database queries and systematic searches

Common misconceptions

What people often get wrong

Often heard: A controlled vocabulary is the same as a search-engine keyword list.

Actually: No — user keywords are natural language and unconstrained, while controlled vocabularies are strictly restricted and structured lists managed by metadata experts.

Often heard: Using a controlled vocabulary makes database searching harder.

Actually: No — once understood, it makes searching far easier and more comprehensive because a single authorized term automatically pulls together all synonym variations.

LAC

Partner Deal

LAC Health Supplies Mobile App

Referenced across the research world

University of Cambridge logoColumbia University logoUniversity of Edinburgh logoHarvard University logoUniversity of Oxford logoPrinceton University logoStanford School of Medicine logoUniversity College London logoORCID logoCrossref logoUniversity of Cambridge logoColumbia University logoUniversity of Edinburgh logoHarvard University logoUniversity of Oxford logoPrinceton University logoStanford School of Medicine logoUniversity College London logoORCID logoCrossref logo
  • University of Cambridge logo
  • Columbia University logo
  • University of Edinburgh logo
  • Harvard University logo
  • University of Oxford logo
  • Princeton University logo
  • Stanford School of Medicine logo
  • University College London logo
  • ORCID logo
  • Crossref logo

View CASRAI adoption →