Skip to main content
v2026.1714 entries · CC-BY 4.0
CASRAI
Data Governance & Open Science

DMP Guide: NSF for Anthropology & Ethnography

Learn how to design a fully compliant Data Management Plan (DMP) that satisfies National Science Foundation open-data policies. Explore optimal file formats, metadata mapping, and repository selection for Anthropology & Ethnography research data.

1. Funder Policy & Open Data Compliance

In alignment with international open-science mandates, National Science Foundation requires all principal investigators to submit a comprehensive Data Management Plan (DMP) with their grant application. A robust DMP details how research data will be collected, processed, documented, stored, shared, and preserved both during and after the project.

Funder-Specific Mandate Directive

The **National Science Foundation (NSF)** mandates a formal Data Management Plan (DMP) submitted via **Research.gov** for all **Anthropology & Ethnography** proposals. Under the active federal data access policies, scientific data must be preserved and made openly available no later than the date of associated publication or project end.

Verified Funder Open-Science Portfolio

Based on independent, open-science bibliometric data from OpenAlex, the National Science Foundation (NSF) oversees a massive scholarly ecosystem with over 1,723,295 published research outputs under their funding catalog, accumulating over 72,920,494 citations across the global scientific record. To protect the public's investment in this massive knowledge corpus, the funder strictly enforces FAIR data management and open repository deposits, making compliance with this DMP protocol mandatory for all awarded grants.

For projects in the field of Anthropology & Ethnography, managing data correctly is essential not only for compliance, but also to support peer-review validation and reproducibility. All DMPs must be submitted through the Research.gov portal, using standard institutional guidelines.

2. Data Types, Formats, and Metadata Standards

A high-quality DMP must explicitly identify the types of data that will be generated and specify open, non-proprietary file formats to ensure long-term usability. For Anthropology & Ethnography, datasets typically range from raw observational measurements to curated computational models.

Data outputs in **Anthropology & Ethnography** typically consist of historical records, gridded data, or structured text documents. DMPs must outline plans to archive these files in open, non-proprietary formats to avoid software lock-in under **NSF** projects.

To guarantee discoverability, datasets should be documented using standardised metadata schemas that map to the Anthropology branch of scholarly vocabularies. This ensures indexers and crawlers can crawl and identify research outputs accurately.

DMP ComponentCustom Target Value for Anthropology & Ethnography
Preferred File FormatsMP3/WAV (field recordings), MP4 (video observations), TXT (fieldnotes), PDF/A (clippings)
Metadata Schema StandardDublin Core, TEI (Text Encoding Initiative) guidelines
Target Scientific RepositoriesZenodo, Figshare, Harvard Dataverse, and directory servers mapped in Anthropological Literature & Scopus

3. Step-by-Step DMP Construction Protocol

When preparing your DMP for a NSF proposal, structure your document around these core sections:

  1. Data Collection and Generation:
    Describe the methodology, instrumentation, or software used to collect or generate new data. Detail quality assurance and quality control measures implemented at your facility.
  2. Documentation and Metadata:
    Explain how the data will be documented, including accompanying read-me files, data dictionaries, and laboratory notebooks. Specify the metadata standards to be utilized (using Dublin Core, TEI (Text Encoding Initiative) guidelines as standard).
  3. Ethics, Intellectual Property, and Consent:
    Address how sensitive or confidential datasets will be handled. Detail anonymisation processes, access controls, and compliance with institutional ethics boards.
  4. Storage, Backups, and Security:
    State where data will be stored during active research. Detail automated backup schedules, server redundancies, and access authorisation protocols.
  5. Long-Term Preservation and Archiving:
    Select the digital repository for post-project archiving (such as Zenodo, Figshare, Harvard Dataverse, and directory servers mapped in Anthropological Literature & Scopus). Confirm that the repository supports persistent identifiers (handles/DOIs) and provides secure preservation.

Open Science Workflows, Data Curation & Repositories

To secure approval from National Science Foundation, the investigator's data management plan dmp must clearly justify chosen data collection methods and adhere to active data curation standards. Integrating digital dmptool workflows helps automate compliance reporting via the Research.gov portal. Investigators must outline procedures for post-collection data cleaning, strict audits of data integrity, and programmatic data wrangling to transform raw outputs into clean models. Furthermore, a descriptive data dictionary must be provided to define the database schema. The DMP must justify whether files are catalogued in a structured data warehouse or kept as raw files in a flexible data lake, discussing how a data lake vs data warehouse decision impacts subsequent data analysis and programmatic exploratory data analysis for Anthropology & Ethnography. Upon completion, data will be submitted to the dryad data repository, published as figshare datasets, or preserved via a zenodo data upload to be registered in the global data citation index and satisfy nsf data management plan guidelines and regional NSF open-science rules. Researchers are required to publish systematic data versioning protocols through the open science framework osf to facilitate long-term reproducible data sharing in line with fair data principles examples. If data is collected from specialized regions, the plan must comply with the care data principles and respect indigenous data sovereignty care rights to meet National Science Foundation ethical benchmarks. Implementing this storage layout satisfies compliance protocols overseen by the NSF data audit team.

4. Frequently Asked Questions

Are we required to share all raw data from our research?

No, NSF policies generally recognise that some data cannot be shared publicly due to privacy, security, intellectual property, or commercialisation constraints. In such cases, your DMP must justify why certain datasets are restricted and describe how metadata will still be made discoverable.

Who owns the research data generated under this grant?

Data ownership is typically held by the host institution, subject to co-ownership clauses in collaborative projects. However, NSF guidelines require that data be made as openly available as possible under open licensing, such as Creative Commons or Open Data Commons.

DMP Specifications

Funding BodyNSF (United States)
Submission ToolResearch.gov
ROR Funder ID021nxhr62
Crossref Funder ID100000001
Discipline FocusAnthropology & Ethnography
Target Index DBAnthropological Literature & Scopus

FAIR Principles

Your plan must align with the FAIR Principles:

  • Findable: Rich metadata and persistent DOIs.
  • Accessible: Free retrieval via standard protocols.
  • Interoperable: Open formats and vocabulary alignment (such as Dublin Core, TEI (Text Encoding Initiative) guidelines).
  • Reusable: Clear data licensing and reuse guidelines.

Referenced across the research world

University of Cambridge logoColumbia University logoUniversity of Edinburgh logoHarvard University logoUniversity of Oxford logoPrinceton University logoStanford School of Medicine logoUniversity College London logoORCID logoCrossref logoUniversity of Cambridge logoColumbia University logoUniversity of Edinburgh logoHarvard University logoUniversity of Oxford logoPrinceton University logoStanford School of Medicine logoUniversity College London logoORCID logoCrossref logo
  • University of Cambridge logo
  • Columbia University logo
  • University of Edinburgh logo
  • Harvard University logo
  • University of Oxford logo
  • Princeton University logo
  • Stanford School of Medicine logo
  • University College London logo
  • ORCID logo
  • Crossref logo

View CASRAI adoption →