Tag: POSI

  • Open scholarly infrastructure and the POSI principles

    Modern scholarship runs on infrastructure that most researchers never think about. When you cite a paper by its DOI, look up a colleague by their ORCID iD, find a repository in a registry, or rely on a dataset resolving years after it was deposited, you are depending on registries, identifier systems and metadata services that quietly hold the whole enterprise together. Like any infrastructure — roads, water, power — it is invisible when it works and catastrophic when it fails. And infrastructure carries particular risks: it can be captured by a single commercial owner, locked behind paywalls, allowed to decay through underfunding, or simply shut down. The Principles of Open Scholarly Infrastructure (POSI) were written to address exactly these risks. This article explains them and where they sit in relation to the data infrastructure domain of the CASRAI Dictionary.

    Why infrastructure is a special case

    The reason infrastructure needs its own principles is that the ordinary market does not protect it well. A piece of scholarly infrastructure — an identifier registry, say — becomes valuable precisely because everyone adopts it, which makes it hard to replace and gives whoever controls it considerable power. If that controller raises prices, restricts access, or is acquired and redirected, the whole community that depends on it is exposed, having built years of practice and data around something it does not govern. The danger is not hypothetical: the history of scholarly communication includes services that were widely relied upon and then acquired, closed or paywalled. POSI exists to make the organisations running open infrastructure resilient against these failure modes by design, not by goodwill.

    The three pillars of POSI

    The POSI principles are organised around three concerns: governance, sustainability, and insurance.

    • Governance asks who controls the infrastructure and on whose behalf. POSI calls for the community served to be represented in governance, for the organisation to be a non-discriminatory stakeholder so that no single interest dominates, and for transparency in how decisions are made. The aim is that the people who depend on a service have a genuine say in its direction.
    • Sustainability asks whether the organisation can last. POSI favours business models that do not depend on a single revenue source or funder, that build sufficient reserves to survive disruption, and that generate surplus to invest in their own future — rather than living grant to grant or relying on one large patron whose withdrawal would be fatal.
    • Insurance asks what happens if the organisation fails anyway. This is POSI’s most distinctive idea: that open data and open source are a form of insurance. If the underlying data are available under an open licence and the software is open source, then even if the original organisation collapses, the community can pick up the work and continue it. Open licensing turns a single point of failure into something recoverable.

    Why the ‘insurance’ idea matters most

    The insurance pillar is what makes POSI more than a wish-list. Governance and sustainability reduce the chance of failure; insurance limits the damage when it happens regardless. By insisting that the data be openly licensed and the code open source, POSI ensures that no organisation running open infrastructure can hold the community hostage — because the community always retains the ability to fork the software and continue with the data. This is what genuinely distinguishes open infrastructure from a well-intentioned but proprietary service: the credible ability to walk away and rebuild. It converts trust-by-promise into trust-by-construction.

    POSI in practice

    POSI is not merely aspirational; a number of the organisations that operate the identifier and metadata services scholarship relies on have publicly adopted the principles and report against them, examining their own governance, funding and openness and disclosing where they fall short. That practice of self-assessment and public reporting is itself part of the value: it gives the community a way to judge whether a piece of infrastructure it depends on is actually being run in a durable, accountable way, rather than taking assurances on faith. For institutions choosing which services to build on, POSI alignment is a meaningful signal of long-term reliability.

    Infrastructure, federation and the wider network

    Open infrastructure is rarely a single monolith; it is a network of interoperating services that must work together — identifiers resolving against registries, metadata flowing between systems, repositories exchanging records. That federated character is precisely why open standards and open governance matter so much: interoperability across many organisations is only sustainable when no single one controls the connecting standards. Our work on federation describes how distributed services interoperate while remaining independently governed, which is the practical expression of the POSI mindset applied to a network rather than a single service.

    A shared foundation worth protecting

    The deeper message of POSI is that the systems holding scholarship together are a shared foundation, and that foundations require deliberate care to remain open, durable and accountable. Governance that represents the community, sustainability that does not depend on a single point of support, and the insurance of open data and open code together make infrastructure something a community can rely on for the long term. Recognising and crediting the often-invisible work of building and maintaining this infrastructure — through structured contribution such as the CRediT taxonomy — is part of valuing it properly. The consistent vocabulary that lets these services describe and exchange information reliably is maintained in the CASRAI Dictionary, itself a small piece of the open infrastructure it documents.