Editorial · CASRAI · Research data infrastructure
Documenting social science data: the Data Documentation Initiative (DDI)
Survey and microdata are only reusable if every variable, code and question is documented. The Data Documentation Initiative (DDI) is the international metadata standard that makes that possible for social, behavioural and economic data. This article distinguishes DDI Codebook from DDI Lifecycle, explains variable-level metadata, and shows why major archives such as the UK Data Service, ICPSR and GESIS built their holdings on it, and how DDI fits the wider metadata landscape.
Related editorial in this domain
More on Research data infrastructure
Identifiers for Things, Not Just Papers: IGSN and PIDINST
Persistent identifiers are familiar for articles, datasets, and people, but the physical objects of research, the rock cores, water samples, and the instruments that measure them, have long lacked stable references. The IGSN for samples and the PIDINST work for instruments extend persistent identification to the physical world, making physical research objects findable, citable, and connectable to the data they produce.
Anonymising research data: k-anonymity, differential privacy and the re-identification risk
Sharing data about people without exposing the people themselves is one of the hardest problems in research data management. This article distinguishes anonymisation from pseudonymisation, explains the privacy models researchers actually use, k-anonymity, l-diversity and differential privacy, and introduces the practical guidance from the UK Anonymisation Network (UKAN) and the ICO’s anonymisation code. It also confronts the uncomfortable reality that re-identification is often easier than it looks.
Big Data and the Vs of Data Explained for Research
Big data describes datasets so large, fast or varied that traditional tools cannot handle them. This guide explains the defining Vs, from volume and velocity to veracity and value, how distributed processing copes, and what big data means for research and FAIR data.








