DMP Guide: Wellcome Trust for Education Research
Learn how to design a fully compliant Data Management Plan (DMP) that satisfies Wellcome Trust open-data policies. Explore optimal file formats, metadata mapping, and repository selection for Education Research research data.
1. Funder Policy & Open Data Compliance
In alignment with international open-science mandates, Wellcome Trust requires all principal investigators to submit a comprehensive Data Management Plan (DMP) with their grant application. A robust DMP details how research data will be collected, processed, documented, stored, shared, and preserved both during and after the project.
Funder-Specific Mandate Directive
The **Wellcome Trust (Wellcome Trust)** mandates that all research data generated from funded **Education Research** projects must be managed and preserved. Wellcome requires an Outputs Management Plan (OMP) covering data, code, and physical materials, which must be shared upon publication.
Verified Funder Open-Science Portfolio
Based on independent, open-science bibliometric data from OpenAlex, the Wellcome Trust (Wellcome Trust) oversees a massive scholarly ecosystem with over 271,435 published research outputs under their funding catalog, accumulating over 20,935,177 citations across the global scientific record. To protect the public's investment in this massive knowledge corpus, the funder strictly enforces FAIR data management and open repository deposits, making compliance with this DMP protocol mandatory for all awarded grants.
For projects in the field of Education Research, managing data correctly is essential not only for compliance, but also to support peer-review validation and reproducibility. All DMPs must be submitted through the Wellcome Funding platform portal, using standard institutional guidelines.
2. Data Types, Formats, and Metadata Standards
A high-quality DMP must explicitly identify the types of data that will be generated and specify open, non-proprietary file formats to ensure long-term usability. For Education Research, datasets typically range from raw observational measurements to curated computational models.
Since studies in **Education Research** rely heavily on human survey panels and interviews, participant privacy is a paramount concern. The DMP must document explicit consent agreements, transcript scrubbing techniques, and firewalled database storage to meet **Wellcome Trust** standards.
To guarantee discoverability, datasets should be documented using standardised metadata schemas that map to the Education branch of scholarly vocabularies. This ensures indexers and crawlers can crawl and identify research outputs accurately.
| DMP Component | Custom Target Value for Education Research |
|---|---|
| Preferred File Formats | CSV (assessment matrices), MP4 (classroom logs), TXT (transcripts), SAV (SPSS surveys) |
| Metadata Schema Standard | DDI standard, Dublin Core Metadata Element Set |
| Target Scientific Repositories | ERIC, ICPSR, Zenodo, Figshare, and directory servers mapped in ERIC (Education Resources Information Center) |
3. Step-by-Step DMP Construction Protocol
When preparing your DMP for a Wellcome Trust proposal, structure your document around these core sections:
- Data Collection and Generation:
Describe the methodology, instrumentation, or software used to collect or generate new data. Detail quality assurance and quality control measures implemented at your facility. - Documentation and Metadata:
Explain how the data will be documented, including accompanying read-me files, data dictionaries, and laboratory notebooks. Specify the metadata standards to be utilized (using DDI standard, Dublin Core Metadata Element Set as standard). - Ethics, Intellectual Property, and Consent:
Address how sensitive or confidential datasets will be handled. Detail anonymisation processes, access controls, and compliance with institutional ethics boards. - Storage, Backups, and Security:
State where data will be stored during active research. Detail automated backup schedules, server redundancies, and access authorisation protocols. - Long-Term Preservation and Archiving:
Select the digital repository for post-project archiving (such as ERIC, ICPSR, Zenodo, Figshare, and directory servers mapped in ERIC (Education Resources Information Center)). Confirm that the repository supports persistent identifiers (handles/DOIs) and provides secure preservation.
Open Science Workflows, Data Curation & Repositories
A compliant data management plan dmp for research projects in Education Research must detail the specific data collection methods and the precise data curation standards that govern the project. By configuring customizable dmptool workflows, research teams can seamlessly align their files with Wellcome Trust's schemas. Adhering to Wellcome Trust requirements means detailing how raw files undergo data cleaning, how researchers verify ongoing data integrity, and which tools handle automated data wrangling. Additionally, a standardized data dictionary must be compiled to guarantee metadata clarity. From a storage perspective, researchers must evaluate storing datasets in a highly indexed data warehouse versus a scalable data lake, analyzing the trade-offs of a data lake vs data warehouse architecture for downstream data analysis and exploratory data analysis inside the Wellcome Funding platform workspace. To ensure permanent access, datasets will be deposited in the dryad data repository, hosted as figshare datasets, or archived via a secure zenodo data upload, enabling inclusion in the data citation index and fulfilling standard nsf data management plan and local Wellcome Trust requirements. We will enforce structured data versioning protocols using the open science framework osf to streamline reproducible data sharing that is fully compliant with fair data principles examples. Additionally, all collaborative research must respect the care data principles and uphold indigenous data sovereignty care policies, keeping community interest central to the data lifecycle. Implementing this storage layout satisfies compliance protocols overseen by the Wellcome Trust data audit team.
4. Frequently Asked Questions
Are we required to share all raw data from our research?
No, Wellcome Trust policies generally recognise that some data cannot be shared publicly due to privacy, security, intellectual property, or commercialisation constraints. In such cases, your DMP must justify why certain datasets are restricted and describe how metadata will still be made discoverable.
Who owns the research data generated under this grant?
Data ownership is typically held by the host institution, subject to co-ownership clauses in collaborative projects. However, Wellcome Trust guidelines require that data be made as openly available as possible under open licensing, such as Creative Commons or Open Data Commons.
DMP Specifications
FAIR Principles
Your plan must align with the FAIR Principles:
- Findable: Rich metadata and persistent DOIs.
- Accessible: Free retrieval via standard protocols.
- Interoperable: Open formats and vocabulary alignment (such as DDI standard, Dublin Core Metadata Element Set).
- Reusable: Clear data licensing and reuse guidelines.







