Explainer · Plain-language
What is an institutional repository?
An institutional repository (IR) is a digital archive maintained by a university or research organisation to collect, preserve, and provide open access to the intellectual output of its staff and students. Institutional repositories are the backbone of the green open access route, allowing authors to self-archive accepted manuscripts and other research outputs. They play a central role in meeting funder and government open access mandates.
The step most authors miss
Doing CRediT right? Don’t stop at the statement.
A CRediT statement credits you inside one paper. The recognition CRediT was built for happens when those roles are tied to you, persistently. Sign in with your ORCID — free — and claim your CRediT contributions on casrai.org, the home of the standard. They become a verified, portable part of your identity, not a line that disappears into one PDF.
Free: claim your contributions, then export a journal-ready CRediT statement, schema.org structured data, JATS XML, CSV or BibTeX — and preview your public profile. A membership publishes that profile publicly and verifies the journals you serve.
Origins and governance
Institutional repositories emerged in the early 2000s as a practical response to the serials pricing crisis and the growing open access movement. MIT launched DSpace in 2002 in partnership with Hewlett Packard; EPrints was developed at the University of Southampton under Stevan Harnad and became the standard for UK repositories throughout the 2000s. Both platforms are free, open-source, and supported by active communities. DSpace is now stewarded by Lyrasis, a US-based non-profit library consortium, while EPrints development is maintained by the University of Southampton and its commercial arm. In the United Kingdom, the registry OpenDOAR (Directory of Open Access Repositories), maintained by JISC, catalogues thousands of institutional repositories worldwide and provides quality criteria and policy data. JISC also funds shared services and training that support UK repository managers.
Green open access and self-archiving
The primary function of an IR is to support green open access — the self-archiving of research by authors in an institutional or subject repository. Researchers typically deposit either a preprint, an accepted manuscript (the author's final peer-reviewed text, before publisher typesetting), or in some cases the version of record. Which version may be deposited, and after what embargo period, depends on the publisher's policy as summarised by SHERPA/RoMEO. Research England and UK Research and Innovation (UKRI) have mandated open access through institutional repositories or subject repositories as part of their REF and grant conditions. SPARC Europe advocates for strong self-archiving rights across European institutions and has campaigned for rights-retention policies that bypass publisher embargo restrictions entirely. Subject repositories — such as arXiv (physics and mathematics), PubMed Central (biomedical sciences), and Europe PMC — serve a similar discovery function but are discipline-specific rather than institution-specific. Many funder mandates require deposit in both an IR and an appropriate subject repository.
Interoperability and discovery
Institutional repositories expose their metadata using OAI-PMH (Open Archives Initiative Protocol for Metadata Harvesting), which allows aggregators to collect records from thousands of repositories into a single searchable index. BASE (Bielefeld Academic Search Engine), operated by Bielefeld University Library, indexes over 350 million documents from more than 10,000 sources including most major institutional repositories. CORE, a UK-based aggregator funded by JISC and Jisc-member institutions, harvests full-text content from institutional repositories and makes it searchable and available via API. More recently, institutional repositories have integrated with CRIS (Current Research Information Systems) such as Pure, Symplectic Elements, and Worktribe, which can automatically trigger deposit workflows when a new publication is recorded. The Samvera framework (formerly known as Hydra) provides a Ruby-on-Rails-based alternative to DSpace and EPrints that is popular in North America, with Hyku as its multi-tenant cloud variant.
Key facts
At a glance
- DSpace launched 2002 (MIT + Hewlett Packard); now maintained by Lyrasis
- EPrints developed at University of Southampton; first release 2000
- OpenDOAR (JISC): global registry of open access repositories
- SHERPA/RoMEO: database of publisher self-archiving and embargo policies
- OAI-PMH: the harvesting protocol used by almost all institutional repositories
- BASE (Bielefeld Academic Search Engine): indexes 350M+ documents from 10,000+ repositories
Common misconceptions
What people often get wrong
Often heard: An institutional repository is the same as a subject repository like arXiv.
Actually: Institutional repositories are managed by individual universities and collect all output types from that institution regardless of discipline. Subject repositories like arXiv or PubMed Central are discipline-specific and accept deposits from authors across many institutions. Funder mandates often require deposit in both.
Often heard: Self-archiving in an institutional repository breaches copyright.
Actually: Most major publishers permit self-archiving of accepted manuscripts, typically after an embargo period. SHERPA/RoMEO summarises each publisher's policy. Rights-retention strategies, increasingly adopted by UK institutions following UKRI's open access policy, allow immediate deposit of accepted manuscripts by preserving the author's right to do so in the funding agreement.
Often heard: Depositing in an institutional repository replaces journal publication.
Actually: An institutional repository stores and provides access to research outputs — it is not a publisher or peer-review venue. Deposited manuscripts are typically the accepted author version of a journal article; the repository provides access, not peer review. The journal remains the publication of record.
Going deeper
Related CASRAI guidance
- What is open access? →
- What is a data repository? →
- What is a preprint? →
- What is an embargo period? →
- Institutional vs subject repository →







