{"id":1179,"date":"2026-06-13T09:00:00","date_gmt":"2026-06-13T09:00:00","guid":{"rendered":"http:\/\/localhost\/wp\/trusted-repositories-and-the-eosc-where-research-data-should-live\/"},"modified":"2026-06-13T09:00:00","modified_gmt":"2026-06-13T09:00:00","slug":"trusted-repositories-and-the-eosc-where-research-data-should-live","status":"publish","type":"post","link":"https:\/\/casrai.org\/wp\/trusted-repositories-and-the-eosc-where-research-data-should-live\/","title":{"rendered":"Trusted repositories and the EOSC: where research data should live"},"content":{"rendered":"<p>Open and FAIR data has to live somewhere, and the choice of <em>where<\/em> is not a clerical detail. A dataset deposited on a personal web page, a lab server, or a service that may not exist in five years is, for the purposes of long-term reuse, lost. The question of where research data should live is the question of <strong>trusted repositories<\/strong>, and the European answer to coordinating them is the <strong>EOSC<\/strong>. This article maps the landscape, drawing on the <a href=\"\/dictionary\/domain\/data-infrastructure\">data-infrastructure domain<\/a>.<\/p>\n<h2>What makes a repository trustworthy<\/h2>\n<p>Not every place that can store a file is fit to be the home of the scholarly record. A <strong>trusted digital repository<\/strong> is one assessed against a recognised trust framework, demonstrating that it has the organisational and technical capability to preserve and provide access to data over the long term. Trust here is not a vibe; it is a set of demonstrable properties \u2014 a sustainability plan, preservation procedures, persistent identifiers, clear access conditions, and the organisational continuity to outlast any individual project or grant.<\/p>\n<p>The most widely recognised certification of these properties is <strong>CoreTrustSeal<\/strong>, a community-governed assessment that a repository meets the core requirements of trustworthy data stewardship. 
A CoreTrustSeal certification is a concrete signal a funder or researcher can rely on: it means an independent process has checked that the repository can actually do what &#8220;long-term preservation&#8221; implies. When a funder mandate says data must go to a <em>trusted<\/em> repository, CoreTrustSeal is the most common way that word is given operational meaning.<\/p>\n<h2>The repository taxonomy: generalist and domain<\/h2>\n<p>Trusted repositories come in two broad kinds, and choosing well between them is one of the most consequential data-management decisions a researcher makes.<\/p>\n<ul>\n<li>A <strong>generalist repository<\/strong> accepts data from any discipline. Zenodo, Figshare, and Dryad are the familiar examples: they mint a DOI, accept almost any data type, and provide a reliable, citable home when no specialist option exists. They are the right default for the long tail of research data that has no natural disciplinary home.<\/li>\n<li>A <strong>domain repository<\/strong> is discipline-specific, built around the data types, standards, and community of a particular field. GenBank for nucleotide sequence data is the archetype; there are equivalents across crystallography, astronomy, social science, proteomics, and more. A domain repository adds what a generalist cannot: discipline-specific metadata standards, validation, and a community of expert users who will actually find and reuse the data.<\/li>\n<\/ul>\n<p>The practical rule that funders increasingly articulate is: deposit in the appropriate <em>domain<\/em> repository where one exists, and fall back to a trusted <em>generalist<\/em> repository where it does not. A sequence belongs in GenBank, not in a generic store; a one-off dataset with no community home belongs in a generalist repository with a DOI rather than on a server that will be decommissioned.<\/p>\n<h2>The EOSC: coordinating the federation<\/h2>\n<p>Individual trusted repositories are necessary but not sufficient. 
A researcher also needs to <em>find<\/em> the right one, move data and compute between services, and trust that the pieces interoperate. In Europe, the coordinating layer for this is the <strong>European Open Science Cloud (EOSC)<\/strong> \u2014 a federation of research-data services rather than a single monolithic platform.<\/p>\n<p>The EOSC&#8217;s model is federation: an <strong>EOSC node<\/strong> is a service provider connected to the federation, and an <strong>EOSC service<\/strong> is something offered through its catalogue \u2014 a repository, a compute resource, a data-management tool. The aspiration is that a researcher can discover trusted repositories, deposit data, and compose data with compute across institutional and national boundaries, through a coordinated catalogue rather than a patchwork of disconnected services. The EOSC is, in effect, the European attempt to make &#8220;where should this data live?&#8221; answerable through one front door onto many trustworthy providers. It is not the only such effort \u2014 the African Open Science Platform pursues a comparable continental federation \u2014 but it is the most developed.<\/p>\n<h2>The human layer: stewards and custodians<\/h2>\n<p>Infrastructure does not curate itself, and an honest account of where data should live has to name the people. A <strong>data steward<\/strong> is the professional responsible for data quality, governance, and ongoing curation \u2014 the role that makes the difference between data that is merely deposited and data that is genuinely reusable. A <strong>data custodian<\/strong> holds legal or operational responsibility for the data. 
Around them sit the structured agreements that govern sharing: a <strong>data sharing agreement<\/strong> setting the conditions under which data moves between parties, an <strong>embargo period<\/strong> deferring public access after deposit, and <strong>access controls<\/strong> distinguishing open, restricted, and metadata-only data.<\/p>\n<blockquote>\n<p>A trusted repository with no data steward behind the data is a safe building with empty rooms. Preservation is an organisational commitment carried out by people, not a property that storage acquires on its own.<\/p>\n<\/blockquote>\n<h2>Why this connects to FAIR and to identifiers<\/h2>\n<p>Where data lives is what makes the <a href=\"\/learn\/what-is-fair-data\">FAIR principles<\/a> operational. Findability depends on the repository minting a persistent identifier and exposing good metadata; accessibility depends on stable resolution and clear access conditions; interoperability and reusability depend on the standards a domain repository enforces. A trusted repository is, in practice, the machine that turns the FAIR aspiration into a deposited reality \u2014 which is why the choice of repository, and the trust signal of CoreTrustSeal, matters as much as the decision to share at all. The repository is also where the data&#8217;s persistent identifier enters the broader graph that links it to the project, the people, and the funding.<\/p>\n<h2>Where shared vocabulary fits<\/h2>\n<p>The terms in this domain are used loosely in funder mandates and policies \u2014 &#8220;trusted&#8221;, &#8220;appropriate&#8221;, &#8220;long-term&#8221; all mean different things to different bodies, and &#8220;generalist&#8221; versus &#8220;domain&#8221; is often left implicit. A shared, federated vocabulary that defines these precisely, pointing to CoreTrustSeal for the trust framework and to the EOSC for the federation model, is what lets a data-sharing requirement be stated unambiguously and checked. 
Supplying that definitional layer is the role the <a href=\"\/dictionary\">CASRAI dictionary<\/a> is designed to play.<\/p>\n<h2>What to do now<\/h2>\n<p>For researchers: deposit in the appropriate domain repository where one exists, otherwise a CoreTrustSeal-certified generalist repository, and never a personal or project server for the long term. For institutions: invest in data stewards, not just storage. For funders and standards work: give &#8220;trusted repository&#8221; operational meaning through certification and shared vocabulary, and support the federations that make trustworthy services findable.<\/p>\n<h2>Related reading<\/h2>\n<ul>\n<li><a href=\"\/dictionary\/domain\/data-infrastructure\">Data-infrastructure domain<\/a><\/li>\n<li><a href=\"\/learn\/what-is-fair-data\">What is FAIR data?<\/a><\/li>\n<li><a href=\"\/for-authors\/persistent-identifiers\">Persistent identifiers for authors<\/a><\/li>\n<li><a href=\"\/dictionary\">The CASRAI dictionary<\/a><\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>FAIR data has to live somewhere, and not every place is fit to hold it. 
Trusted digital repositories, CoreTrustSeal certification and the European Open Science Cloud set out where research data should be deposited.<\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_casrai_contributor_statement":"","_casrai_contributors_json":"","_article_doi":"","_article_license":[],"_article_funding":[],"_casrai_article_id":"","_casrai_registry_status":"","_casrai_registry_date":"","footnotes":""},"categories":[1],"tags":[99,358,356,162,359,355,357,354],"credit_role":[],"dictionary_domain":[22],"class_list":["post-1179","post","type-post","status-publish","format-standard","hentry","category-uncategorized","tag-coretrustseal","tag-data-steward","tag-domain-repository","tag-eosc","tag-fair-data","tag-generalist-repository","tag-research-data-infrastructure","tag-trusted-digital-repository","dictionary_domain-data-infrastructure"],"_links":{"self":[{"href":"https:\/\/casrai.org\/wp\/wp-json\/wp\/v2\/posts\/1179","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/casrai.org\/wp\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/casrai.org\/wp\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/casrai.org\/wp\/wp-json\/wp\/v2\/comments?post=1179"}],"version-history":[{"count":0,"href":"https:\/\/casrai.org\/wp\/wp-json\/wp\/v2\/posts\/1179\/revisions"}],"wp:attachment":[{"href":"https:\/\/casrai.org\/wp\/wp-json\/wp\/v2\/media?parent=1179"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/casrai.org\/wp\/wp-json\/wp\/v2\/categories?post=1179"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/casrai.org\/wp\/wp-json\/wp\/v2\/tags?post=1179"},{"taxonomy":"credit_role","embeddable":true,"href":"https:\/\/casrai.org\/wp\/wp-json\/wp\/v2\/credit_role?post=1179"},{"taxonomy":"dictionary_domain","embeddable":true,"href":"https:\/\/casrai.org\/wp\/wp-json\/wp\/v2\/dictionary_
domain?post=1179"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}