Skip to main content
v2026.1714 entries · CC-BY 4.0
Dictionary termTrack BStablev2026.2

Software Heritage archive

A non-profit international initiative based at Inria that systematically crawls, archives, and preserves the world's publicly available source code, including its full version-control history, and issues persistent identifiers (Software Hash Identifiers, SWHIDs) to every archived artefact.

ByCASRAI Editorial Board
· Last updated 21 May 2026

Examples

Worked examples

  • Is an instance

    A SWHID swh:1:dir:... cited in a paper to refer to the exact directory state used in an experiment.

  • Is an instance

    Software Heritage's continuous mirroring of GitHub repositories.

Counter-examples

Looks similar, but isn't

  • Not an instance

    A snapshot kept solely on the original developer's laptop is not a Software Heritage archive object.

  • Not an instance

    A Zenodo DOI for a software release is a different identifier scheme, complementary to SWHIDs.

Editorial commentary

Software Heritage was launched in 2016 with the stated mission of preserving all source code as part of humanity’s cultural and scientific heritage. Its archive covers over 250 million origins (GitHub, GitLab, Bitbucket, PyPI, Debian, and many other forges and package indexes). Every archived object — file content, directory, revision, release, snapshot — gets a SWHID (e.g. swh:1:cnt:…), an intrinsic identifier derived from the content’s cryptographic hash, that allows the artefact to be cited and retrieved independently of any single forge.

References

  • D
  • i
  • C
  • o
  • s
  • m
  • o
  • R
  • .
  • ,
  • Z
  • a
  • c
  • c
  • h
  • i
  • r
  • o
  • l
  • i
  • S
  • .
  • ,
  • S
  • o
  • f
  • t
  • w
  • a
  • r
  • e
  • H
  • e
  • r
  • i
  • t
  • a
  • g
  • e
  • :
  • W
  • h
  • y
  • a
  • n
  • d
  • H
  • o
  • w
  • t
  • o
  • P
  • r
  • e
  • s
  • e
  • r
  • v
  • e
  • S
  • o
  • f
  • t
  • w
  • a
  • r
  • e
  • S
  • o
  • u
  • r
  • c
  • e
  • C
  • o
  • d
  • e
  • ,
  • i
  • P
  • R
  • E
  • S
  • 2
  • 0
  • 1
  • 7
  • .
  • s
  • o
  • f
  • t
  • w
  • a
  • r
  • e
  • h
  • e
  • r
  • i
  • t
  • a
  • g
  • e
  • .
  • o
  • r
  • g
  • .

Also known as

Software Heritage · SWH

Machine-readable encodings

Use in your systems

JATS XML <role> element
xml
<role vocab="credit"
      vocab-identifier="https://casrai.org/dictionary/"
      vocab-term="Software Heritage archive"
      vocab-term-identifier="https://casrai.org/dictionary/term/software-heritage-archive" />
Schema.org DefinedTerm (JSON-LD)
json
{
  "@context": "https://schema.org",
  "@type": "DefinedTerm",
  "name": "Software Heritage archive",
  "identifier": "https://casrai.org/dictionary/term/software-heritage-archive",
  "description": "A non-profit international initiative based at Inria that systematically crawls, archives, and preserves the world's publicly available source code, including its full version-control history, and issues persistent identifiers (Software Hash Identifiers, SWHIDs) to every archived artefact.",
  "inDefinedTermSet": "https://casrai.org/dictionary/domain/research-data-infrastructure/",
  "url": "https://casrai.org/dictionary/term/software-heritage-archive",
  "sameAs": [
    "Software Heritage",
    "SWH"
  ],
  "license": "https://creativecommons.org/licenses/by/4.0/"
}

Adopted by research universities worldwide

University of Cambridge logoColumbia University logoUniversity of Edinburgh logoHarvard University logoMassachusetts Institute of Technology logoUniversity of Oxford logoPrinceton University logoStanford School of Medicine logoUniversity College London logoUniversity of Cambridge logoColumbia University logoUniversity of Edinburgh logoHarvard University logoMassachusetts Institute of Technology logoUniversity of Oxford logoPrinceton University logoStanford School of Medicine logoUniversity College London logo
  • University of Cambridge logo
  • Columbia University logo
  • University of Edinburgh logo
  • Harvard University logo
  • Massachusetts Institute of Technology logo
  • University of Oxford logo
  • Princeton University logo
  • Stanford School of Medicine logo
  • University College London logo

View CASRAI adoption →