Editorial · CASRAI · Research data infrastructure

Linked Data and Knowledge Graphs in Scholarly Research: RDF, SPARQL, Wikidata and ORKG

Linked data technologies developed by the World Wide Web Consortium — principally the Resource Description Framework and the SPARQL query language — provide a principled way to connect research objects, express relationships between them, and make them queryable by machines as well as humans. Applied to scholarly research, these technologies underpin initiatives such as the Open Research Knowledge Graph at TIB Hannover, Wikidata’s role as a community-maintained scholarly knowledge base, and the widespread adoption of Schema.org’s Dataset markup to make research datasets discoverable by search engines. Understanding how linked data connects papers, datasets, authors, funders and institutions into a navigable web of scholarly knowledge helps research administrators and data infrastructure professionals appreciate the foundations of machine-readable research information systems.

ByCASRAI Editorial Board

Published 1 Jan 1970· 1 minute read

In summary

Linked data technologies developed by the World Wide Web Consortium — principally the Resource Description Framework and the SPARQL query language — provide a principled way to connect research objects, express relationships between them, and make them queryable by machines as well as humans. Applied to scholarly research, these technologies underpin initiatives such as the Open Research Knowledge Graph at TIB Hannover, Wikidata’s role as a community-maintained scholarly knowledge base, and the widespread adoption of Schema.org’s Dataset markup to make research datasets discoverable by search engines. Understanding how linked data connects papers, datasets, authors, funders and institutions into a navigable web of scholarly knowledge helps research administrators and data infrastructure professionals appreciate the foundations of machine-readable research information systems.

Related editorial in this domain

More on Research data infrastructure

21 Jun 2026

Identifiers for Things, Not Just Papers: IGSN and PIDINST

Persistent identifiers are familiar for articles, datasets, and people, but the physical objects of research, the rock cores, water samples, and the instruments that measure them, have long lacked stable references. The IGSN for samples and the PIDINST work for instruments extend persistent identification to the physical world, making physical research objects findable, citable, and connectable to the data they produce.

21 Jun 2026

Anonymising research data: k-anonymity, differential privacy and the re-identification risk

Sharing data about people without exposing the people themselves is one of the hardest problems in research data management. This article distinguishes anonymisation from pseudonymisation, explains the privacy models researchers actually use, k-anonymity, l-diversity and differential privacy, and introduces the practical guidance from the UK Anonymisation Network (UKAN) and the ICO’s anonymisation code. It also confronts the uncomfortable reality that re-identification is often easier than it looks.

20 Jun 2026

Big Data and the Vs of Data Explained for Research

Big data describes datasets so large, fast or varied that traditional tools cannot handle them. This guide explains the defining Vs, from volume and velocity to veracity and value, how distributed processing copes, and what big data means for research and FAIR data.

Linked Data and Knowledge Graphs in Scholarly Research: RDF, SPARQL, Wikidata and ORKG

More on Research data infrastructure

Identifiers for Things, Not Just Papers: IGSN and PIDINST

Anonymising research data: k-anonymity, differential privacy and the re-identification risk

Big Data and the Vs of Data Explained for Research

Medical Supplies At Your Fingertips