Skip to main content
v2026.1714 entries · CC-BY 4.0
CASRAI

Explainer · Plain-language

What is an institutional repository?

An institutional repository (IR) is a digital archive maintained by a university or research organisation to collect, preserve, and provide open access to the intellectual output of its staff and students. Institutional repositories are the backbone of the green open access route, allowing authors to self-archive accepted manuscripts and other research outputs. They play a central role in meeting funder and government open access mandates.

CASRAI plain-language explainers — clear answers to recurring research-administration questions

The step most authors miss

Doing CRediT right? Don’t stop at the statement.

A CRediT statement credits you inside one paper. The recognition CRediT was built for happens when those roles are tied to you, persistently. Sign in with your ORCID — free — and claim your CRediT contributions on casrai.org, the home of the standard. They become a verified, portable part of your identity, not a line that disappears into one PDF.

Free: claim your contributions, then export a journal-ready CRediT statement, schema.org structured data, JATS XML, CSV or BibTeX — and preview your public profile. A membership publishes that profile publicly and verifies the journals you serve.

Origins and governance

Institutional repositories emerged in the early 2000s as a practical response to the serials pricing crisis and the growing open access movement. MIT launched DSpace in 2002 in partnership with Hewlett Packard; EPrints was developed at the University of Southampton under Stevan Harnad and became the standard for UK repositories throughout the 2000s. Both platforms are free, open-source, and supported by active communities. DSpace is now stewarded by Lyrasis, a US-based non-profit library consortium, while EPrints development is maintained by the University of Southampton and its commercial arm. In the United Kingdom, the registry OpenDOAR (Directory of Open Access Repositories), maintained by JISC, catalogues thousands of institutional repositories worldwide and provides quality criteria and policy data. JISC also funds shared services and training that support UK repository managers.

Green open access and self-archiving

The primary function of an IR is to support green open access — the self-archiving of research by authors in an institutional or subject repository. Researchers typically deposit either a preprint, an accepted manuscript (the author's final peer-reviewed text, before publisher typesetting), or in some cases the version of record. Which version may be deposited, and after what embargo period, depends on the publisher's policy as summarised by SHERPA/RoMEO. Research England and UK Research and Innovation (UKRI) have mandated open access through institutional repositories or subject repositories as part of their REF and grant conditions. SPARC Europe advocates for strong self-archiving rights across European institutions and has campaigned for rights-retention policies that bypass publisher embargo restrictions entirely. Subject repositories — such as arXiv (physics and mathematics), PubMed Central (biomedical sciences), and Europe PMC — serve a similar discovery function but are discipline-specific rather than institution-specific. Many funder mandates require deposit in both an IR and an appropriate subject repository.

Interoperability and discovery

Institutional repositories expose their metadata using OAI-PMH (Open Archives Initiative Protocol for Metadata Harvesting), which allows aggregators to collect records from thousands of repositories into a single searchable index. BASE (Bielefeld Academic Search Engine), operated by Bielefeld University Library, indexes over 350 million documents from more than 10,000 sources including most major institutional repositories. CORE, a UK-based aggregator funded by JISC and Jisc-member institutions, harvests full-text content from institutional repositories and makes it searchable and available via API. More recently, institutional repositories have integrated with CRIS (Current Research Information Systems) such as Pure, Symplectic Elements, and Worktribe, which can automatically trigger deposit workflows when a new publication is recorded. The Samvera framework (formerly known as Hydra) provides a Ruby-on-Rails-based alternative to DSpace and EPrints that is popular in North America, with Hyku as its multi-tenant cloud variant.

Key facts

At a glance

  • DSpace launched 2002 (MIT + Hewlett Packard); now maintained by Lyrasis
  • EPrints developed at University of Southampton; first release 2000
  • OpenDOAR (JISC): global registry of open access repositories
  • SHERPA/RoMEO: database of publisher self-archiving and embargo policies
  • OAI-PMH: the harvesting protocol used by almost all institutional repositories
  • BASE (Bielefeld Academic Search Engine): indexes 350M+ documents from 10,000+ repositories

Common misconceptions

What people often get wrong

Often heard: An institutional repository is the same as a subject repository like arXiv.

Actually: Institutional repositories are managed by individual universities and collect all output types from that institution regardless of discipline. Subject repositories like arXiv or PubMed Central are discipline-specific and accept deposits from authors across many institutions. Funder mandates often require deposit in both.

Often heard: Self-archiving in an institutional repository breaches copyright.

Actually: Most major publishers permit self-archiving of accepted manuscripts, typically after an embargo period. SHERPA/RoMEO summarises each publisher's policy. Rights-retention strategies, increasingly adopted by UK institutions following UKRI's open access policy, allow immediate deposit of accepted manuscripts by preserving the author's right to do so in the funding agreement.

Often heard: Depositing in an institutional repository replaces journal publication.

Actually: An institutional repository stores and provides access to research outputs — it is not a publisher or peer-review venue. Deposited manuscripts are typically the accepted author version of a journal article; the repository provides access, not peer review. The journal remains the publication of record.

Referenced across the research world

University of Cambridge logoColumbia University logoUniversity of Edinburgh logoHarvard University logoUniversity of Oxford logoPrinceton University logoStanford School of Medicine logoUniversity College London logoORCID logoCrossref logoUniversity of Cambridge logoColumbia University logoUniversity of Edinburgh logoHarvard University logoUniversity of Oxford logoPrinceton University logoStanford School of Medicine logoUniversity College London logoORCID logoCrossref logo
  • University of Cambridge logo
  • Columbia University logo
  • University of Edinburgh logo
  • Harvard University logo
  • University of Oxford logo
  • Princeton University logo
  • Stanford School of Medicine logo
  • University College London logo
  • ORCID logo
  • Crossref logo

View CASRAI adoption →