Skip to main content
v2026.1714 entries · CC-BY 4.0
Dictionary termTrack BStablev2026.2

Aggregator service

A service that harvests, harmonises, and re-exposes metadata and (sometimes) content from many upstream sources, providing a unified search, browse, or query interface across the aggregated corpus; canonical examples include OpenAIRE, BASE, CORE, and OpenAlex.

ByCASRAI Editorial Board
· Last updated 21 May 2026

Examples

Worked examples

  • Is an instance

    BASE (Bielefeld Academic Search Engine) aggregating from thousands of repositories worldwide.

  • Is an instance

    OpenAlex aggregating publication metadata from Crossref, ORCID, ROR, and crawled sources.

Counter-examples

Looks similar, but isn't

  • Not an instance

    A single repository serving only its own holdings is not an aggregator.

  • Not an instance

    A search engine that indexes general web content (without harvesting structured metadata) is not, strictly, a research-information aggregator.

Editorial commentary

Aggregators sit downstream of repositories and CRIS systems. They harvest via OAI-PMH, ResourceSync, REST APIs, and dumps; perform entity deduplication and metadata enrichment; and expose the unified corpus through their own APIs, dashboards, and search interfaces. Aggregators are critical to discoverability — they make the union of millions of small repositories visible as a single corpus — but their dependence on upstream metadata quality means that good repository metadata is what makes aggregators useful.

References

  • Knoth P., Pontika N., 'Aggregating Research Papers from Publishers' Systems to Support Text and Data Mining', CORE / Open University working papers.

Also known as

Metadata aggregator · Discovery service

Machine-readable encodings

Use in your systems

JATS XML <role> element
xml
<role vocab="credit"
      vocab-identifier="https://casrai.org/dictionary/"
      vocab-term="Aggregator service"
      vocab-term-identifier="https://casrai.org/dictionary/term/aggregator-service" />
Schema.org DefinedTerm (JSON-LD)
json
{
  "@context": "https://schema.org",
  "@type": "DefinedTerm",
  "name": "Aggregator service",
  "identifier": "https://casrai.org/dictionary/term/aggregator-service",
  "description": "A service that harvests, harmonises, and re-exposes metadata and (sometimes) content from many upstream sources, providing a unified search, browse, or query interface across the aggregated corpus; canonical examples include OpenAIRE, BASE, CORE, and OpenAlex.",
  "inDefinedTermSet": "https://casrai.org/dictionary/domain/research-data-infrastructure/",
  "url": "https://casrai.org/dictionary/term/aggregator-service",
  "sameAs": [
    "Metadata aggregator",
    "Discovery service"
  ],
  "license": "https://creativecommons.org/licenses/by/4.0/"
}

Adopted by research universities worldwide

University of Cambridge logoColumbia University logoUniversity of Edinburgh logoHarvard University logoMassachusetts Institute of Technology logoUniversity of Oxford logoPrinceton University logoStanford School of Medicine logoUniversity College London logoUniversity of Cambridge logoColumbia University logoUniversity of Edinburgh logoHarvard University logoMassachusetts Institute of Technology logoUniversity of Oxford logoPrinceton University logoStanford School of Medicine logoUniversity College London logo
  • University of Cambridge logo
  • Columbia University logo
  • University of Edinburgh logo
  • Harvard University logo
  • Massachusetts Institute of Technology logo
  • University of Oxford logo
  • Princeton University logo
  • Stanford School of Medicine logo
  • University College London logo

View CASRAI adoption →