v2026.1714 entries · CC-BY 4.0
CASRAI
Dictionary term · Track B · Stable · v2026.2

Data hub

A central node in a data ecosystem that aggregates, harmonises, and brokers access to data from multiple upstream sources, exposing the harmonised data to downstream consumers via curated APIs, query interfaces, or download endpoints.

By CASRAI Editorial Board
· Last updated 21 May 2026

Examples

Worked examples

  • Is an instance

    EMODnet as a marine data hub aggregating from member institutes across Europe.

  • Is an instance

    Health Data Research UK's Innovation Gateway as a hub indexing UK health datasets and brokering access requests.

Counter-examples

Looks similar, but isn't

  • Not an instance

    A point-to-point data integration between two systems is not a data hub.

  • Not an instance

    A single original-source repository is not a hub — it is a source.

Editorial commentary

A data hub follows a hub-and-spoke architectural pattern. Unlike a federated infrastructure, where data stay distributed at their sources, a data hub typically centralises a working copy of upstream data and applies harmonisation and quality control to it. Unlike a data lake, which is schema-on-read, a hub is usually opinionated about the harmonised model it presents to consumers. In research contexts, examples include the Health Data Research UK Innovation Gateway, the European Marine Observation and Data Network (EMODnet), and many discipline-specific aggregators.
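The hub-and-spoke pattern described above can be illustrated with a minimal Python sketch. Everything in it is hypothetical and chosen only for illustration: the two upstream "institutes", their field names and units, the `HarmonisedRecord` model, and the `DataHub` class are assumptions, not part of any real hub's API. It shows the three moves the definition names: aggregating from several sources, harmonising into one opinionated model, and exposing a query interface to consumers.

```python
from dataclasses import dataclass
from typing import Callable, Iterable


@dataclass(frozen=True)
class HarmonisedRecord:
    """The single model the hub presents to consumers (hypothetical)."""
    source: str
    station_id: str
    temperature_c: float


def from_institute_a(row: dict) -> HarmonisedRecord:
    # Hypothetical source A already reports Celsius under "temp_c".
    return HarmonisedRecord("institute_a", row["id"], float(row["temp_c"]))


def from_institute_b(row: dict) -> HarmonisedRecord:
    # Hypothetical source B reports Fahrenheit under "tempF"; convert on ingest.
    celsius = (float(row["tempF"]) - 32.0) * 5.0 / 9.0
    return HarmonisedRecord("institute_b", row["station"], round(celsius, 2))


class DataHub:
    """Keeps a harmonised working copy of upstream data and answers queries."""

    def __init__(self) -> None:
        self._records: list[HarmonisedRecord] = []

    def ingest(
        self,
        rows: Iterable[dict],
        harmonise: Callable[[dict], HarmonisedRecord],
    ) -> None:
        # Aggregation: each spoke supplies rows plus its own mapping function.
        for row in rows:
            self._records.append(harmonise(row))

    def query(self, min_temperature_c: float) -> list[HarmonisedRecord]:
        # Consumers see only the harmonised model, never the source formats.
        return [r for r in self._records if r.temperature_c >= min_temperature_c]


hub = DataHub()
hub.ingest([{"id": "A1", "temp_c": "11.5"}], from_institute_a)
hub.ingest([{"station": "B7", "tempF": "50"}], from_institute_b)
warm = hub.query(min_temperature_c=10.5)
```

A federated design would instead forward the query to each source at request time, and a data lake would store the raw rows and defer the mapping to the reader. Here the mapping runs once at ingest, which is what makes the hub opinionated about its model.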

References

  • Smith G., Designing Data Hubs, Communications of the ACM 64(11), 2021.

Machine-readable encodings

Use in your systems

JATS XML <role> element
xml
<role vocab="credit"
      vocab-identifier="https://casrai.org/dictionary/"
      vocab-term="Data hub"
      vocab-term-identifier="https://casrai.org/dictionary/term/data-hub" />
Schema.org DefinedTerm (JSON-LD)
json
{
  "@context": "https://schema.org",
  "@type": "DefinedTerm",
  "name": "Data hub",
  "identifier": "https://casrai.org/dictionary/term/data-hub",
  "description": "A central node in a data ecosystem that aggregates, harmonises, and brokers access to data from multiple upstream sources, exposing the harmonised data to downstream consumers via curated APIs, query interfaces, or download endpoints.",
  "inDefinedTermSet": "https://casrai.org/dictionary/domain/research-data-infrastructure/",
  "url": "https://casrai.org/dictionary/term/data-hub",
  "sameAs": [],
  "license": "https://creativecommons.org/licenses/by/4.0/"
}
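
As one possible way to consume the JSON-LD record above, the following Python sketch parses an abridged copy of it with the standard library and builds a small name-to-identifier lookup. The function name `load_defined_term`, the choice of required keys, and the `glossary` dictionary are assumptions made for illustration; a production system would more likely use a dedicated JSON-LD processor.

```python
import json

# Abridged copy of the DefinedTerm record shown above.
record_json = """
{
  "@context": "https://schema.org",
  "@type": "DefinedTerm",
  "name": "Data hub",
  "identifier": "https://casrai.org/dictionary/term/data-hub",
  "inDefinedTermSet": "https://casrai.org/dictionary/domain/research-data-infrastructure/",
  "sameAs": []
}
"""


def load_defined_term(text: str) -> dict:
    """Parse a record and check the fields this sketch relies on."""
    record = json.loads(text)
    if record.get("@type") != "DefinedTerm":
        raise ValueError("expected a schema.org DefinedTerm record")
    missing = [key for key in ("name", "identifier") if not record.get(key)]
    if missing:
        raise ValueError(f"record is missing required keys: {missing}")
    return record


term = load_defined_term(record_json)

# Case-insensitive lookup from term name to its persistent identifier.
glossary = {term["name"].casefold(): term["identifier"]}
```

Looking up `glossary["data hub"]` then yields the term's identifier URL, which can be stored alongside local metadata instead of a free-text label.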