Columbia University Repository: Academic Commons
Depositing your research outputs in Academic Commons, the official institutional repository of Columbia University, is a critical pathway to achieving green open access and satisfying funder mandates in United States. This guide outlines the submission and curation workflows required to preserve and disseminate your academic publications and associated datasets in accordance with Fedora/Hyrax standards.
1. Institutional Archiving & Preservation Strategy
The portal Academic Commons runs on a customized instance of Fedora/Hyrax, ensuring that search engines, open science crawlers, and citations compilers can index all documents deposited by Columbia University.'
The university library's digital preservation strategies at Columbia University are designed to prevent technological obsolescence. By encoding preservation metadata (such as PREMIS elements) and maintaining master files in uncompressed archival formats, Academic Commons guarantees that publications and data remain renderable in United States. This framework defines the structural difference between a simple depository vs repository model, where Academic Commons actively manages integrity and accessibility rather than merely serving as static storage.
Verified Institutional Impact Metrics
Based on independent indexing data from the open-science catalog OpenAlex, Columbia University has recorded a cumulative corpus of 386,225 publications which have received over 46,951,905 citations globally. This volume highlights the critical role of Academic Commons in providing open access to a massive stream of global knowledge. With an institutional h-index of 1720 and a two-year mean citedness score of 4.63, submissions deposited here carry a highly visible citation trajectory.
All submissions to Academic Commons undergo systematic verification by the university library team. This ensures compliance with publisher embargoes, rights-retention policies, and copyright licenses (predominantly Creative Commons CC-BY or CC-BY-NC).
2. Metadata Mapping: Simple Dublin Core Alignment
Discoverability of Columbia University's publications relies entirely on rich metadata. Submissions to Academic Commons utilize the Dublin Core metadata standard (specifically the Dublin Core metadata element set and standard Dublin Core metadata terms). This structure ensures that search engines, open-science harvesters, and citation indexes in United States can crawl, parse, and cite your work accurately.
To maintain schema health across Academic Commons, Columbia University implements strict metadata repository quality controls. Each incoming manuscript or dataset is processed by a server-side metadata cleaner to enforce field completion, followed by a manual metadata scrubber review by dedicated data librarians. Resolving semantic errors before publication protects the repository's ranking in search indexes for United States.
To align with international standards, publications at Columbia University are catalogued using the Library of Congress Subject Headings (LCSH), establishing a highly structured controlled vocabulary for Academic Commons. This systematic approach to thesaurus construction and metadata indexing enables robust cross-disciplinary discovery in United States. Furthermore, records are mapped to the MARC21 format for library catalog sharing.
| Dublin Core Element | Preserved Value / Standard | Function & Mapping |
|---|---|---|
| dc.title | Full Article / Book Title | Main headline as registered in the publication record |
| dc.creator | Author(s) names & ORCID iD | Linked explicitly to the author's CRediT contribution roles |
| dc.publisher | Columbia University Library Services | The entity making the resource accessible in United States |
| dc.identifier | Handles / persistent URLs | Local institutional handle mapping to OAI-PMH networks |
3. Frequently Asked Questions (FAQ)
What is the correct protocol for co-author attribution during deposit?
When submitting to Academic Commons, you must include all authors listed on the final manuscript. It is highly recommended to declare each co-author's CRediT roles in the metadata form or the publication description.
Are datasets supported alongside text papers?
Yes, Academic Commons supports a wide array of file formats, including research datasets, code repositories, and supplemental documents. If your dataset is extremely large, the library services team will coordinate with your department to allocate specialized cold storage.
Repository Specs
Open-Science Mandates
In line with Plan S, the Nelson Memo, and regional mandates, all publicly funded publications produced at Columbia University must be deposited in Academic Commons with no embargo. Ensure your metadata contains correct funder acknowledgements to avoid audit flags.







