Definition · Plain-language
Clinical data management (CDM)
Clinical data management (CDM) is the discipline of collecting, cleaning and organising the data generated during a clinical trial so that the final dataset is accurate, complete, reliable and ready for statistical analysis.
The step most authors miss
Doing CRediT right? Don’t stop at the statement.
A CRediT statement credits you inside one paper. The recognition CRediT was built for happens when those roles are tied to you, persistently. Sign in with your ORCID — free — and claim your CRediT contributions on casrai.org, the home of the standard. They become a verified, portable part of your identity, not a line that disappears into one PDF.
Free: claim your contributions, then export a journal-ready CRediT statement, schema.org structured data, JATS XML, CSV or BibTeX — and preview your public profile. A membership publishes that profile publicly and verifies the journals you serve.
What clinical data management covers
Clinical data management spans the full life cycle of trial data, from study start-up to database lock. Early activities include designing the case report form and building the electronic data-capture system, writing the data-management plan and specifying validation rules. During conduct, the team monitors incoming data, runs edit checks, raises and resolves queries on missing or inconsistent values, and reconciles external data such as laboratory results and serious-adverse-event records. The aim throughout is a dataset whose quality can be trusted for analysis and regulatory submission.
Data validation and query management
A central task in CDM is finding and resolving discrepancies. Automated edit checks flag values that are out of range, internally inconsistent or missing, while manual review catches issues the rules cannot. Each flagged item generates a query routed back to the investigative site for clarification or correction, with every change tracked in an audit trail. This iterative cleaning continues until the data are as accurate and complete as practicable, supporting the reliability of any conclusions drawn from the trial.
Standards and database lock
CDM relies heavily on data standards, most prominently those from CDISC, which define how trial data should be structured, named and exchanged so that datasets are consistent and interoperable across studies and reviewers. Medical terms for events and medications are coded to standard dictionaries such as MedDRA. Once cleaning is complete and quality criteria are met, the database is formally locked — no further changes are permitted without a controlled process — and the clean dataset is released to the statisticians for analysis.
Key facts
At a glance
- Definition: Collecting, cleaning and managing trial data for reliable analysis.
- Stands for: Clinical data management (CDM).
- Goal: High-quality, reliable, statistically sound study data.
- Core tasks: Data validation, query management, medical coding, reconciliation.
- Standards: CDISC data standards; MedDRA for medical-term coding.
- End point: Database lock before statistical analysis.
Common misconceptions
What people often get wrong
Often heard: Clinical data management is just typing trial results into a spreadsheet.
Actually: CDM is a structured quality discipline involving data-capture design, validation rules, query resolution, medical coding and a controlled database lock — far beyond simple data entry.
Often heard: Once data are entered they can be edited freely at any time.
Actually: Changes are tracked through an audit trail, and after database lock no further changes are allowed without a formal, controlled unlock procedure. This protects data integrity.
Often heard: CDM and statistical analysis are the same activity.
Actually: CDM produces a clean, validated, locked dataset; statistical analysis is a separate downstream step performed on that dataset by statisticians.
Going deeper







