Princeton University Repository: DataSpace
As the primary digital gateway for Princeton University's scholarly assets in United States, DataSpace serves as the central institutional repository. In this technical handbook, we review standard deposit procedures, compliance paths for open-science mandates, and specific repository configurations optimized for DSpace users.
1. Institutional Archiving & Preservation Strategy
Operating on the DSpace repository platform, DataSpace at Princeton University utilizes a robust schema framework to index and serve research outputs. As leading institutional repository software, DSpace facilitates OAI-PMH harvesting, allowing global indexers to seamlessly ingest metadata records from United States.
To safeguard scholastic materials against systemic loss, Princeton University implements advanced digital preservation strategies governed by the OAIS reference model for DataSpace. By collecting essential preservation metadata (such as provenance and hardware requirements) and performing routine integrity audits, the library in United States guarantees data permanence. This pro-active approach highlights the core distinction of a modern depository vs repository schema, where files are actively preserved rather than merely dumped.
Verified Institutional Impact Metrics
Based on independent indexing data from the open-science catalog OpenAlex, Princeton University has recorded a cumulative corpus of 211,211 publications which have received over 26,726,181 citations globally. This volume highlights the critical role of DataSpace in providing open access to a massive stream of global knowledge. With an institutional h-index of 1530 and a two-year mean citedness score of 6.92, submissions deposited here carry a highly visible citation trajectory.
All submissions to DataSpace undergo systematic verification by the university library team. This ensures compliance with publisher embargoes, rights-retention policies, and copyright licenses (predominantly Creative Commons CC-BY or CC-BY-NC).
2. Metadata Mapping: Simple Dublin Core Alignment
The indexing backbone of DataSpace is strictly configured around the Dublin Core metadata standard to catalog outputs from Princeton University. Each deposit record is structured according to the Dublin Core metadata element set, ensuring that the schema incorporates standard Dublin Core metadata terms for rapid cross-archive mapping inside United States.
To maintain schema health across DataSpace, Princeton University implements strict metadata repository quality controls. Each incoming manuscript or dataset is processed by a server-side metadata cleaner to enforce field completion, followed by a manual metadata scrubber review by dedicated data librarians. Resolving semantic errors before publication protects the repository's ranking in search indexes for United States.
The repository cataloging team of DataSpace maps submissions to the Library of Congress Subject Headings (LCSH) to create a standardized subject indexing framework for Princeton University. This deliberate thesaurus construction ensures that research themes from United States are searchable across global networks. All metadata profiles are stored in the widely supported MARC21 format to facilitate automated sharing.
| Dublin Core Element | Preserved Value / Standard | Function & Mapping |
|---|---|---|
| dc.title | Full Article / Book Title | Main headline as registered in the publication record |
| dc.creator | Author(s) names & ORCID iD | Linked explicitly to the author's CRediT contribution roles |
| dc.publisher | Princeton University Library Services | The entity making the resource accessible in United States |
| dc.identifier | Handles / persistent URLs | Local institutional handle mapping to OAI-PMH networks |
3. Frequently Asked Questions (FAQ)
What is the correct protocol for co-author attribution during deposit?
When submitting to DataSpace, you must include all authors listed on the final manuscript. It is highly recommended to declare each co-author's CRediT roles in the metadata form or the publication description.
Are datasets supported alongside text papers?
Yes, DataSpace supports a wide array of file formats, including research datasets, code repositories, and supplemental documents. If your dataset is extremely large, the library services team will coordinate with your department to allocate specialized cold storage.
Repository Specs
Open-Science Mandates
In line with Plan S, the Nelson Memo, and regional mandates, all publicly funded publications produced at Princeton University must be deposited in DataSpace with no embargo. Ensure your metadata contains correct funder acknowledgements to avoid audit flags.







