Definition · Plain-language
What is SPSS?
SPSS (Statistical Package for the Social Sciences) is a software package developed by IBM, widely used in research for quantitative data analysis.
The step most authors miss
Doing CRediT right? Don’t stop at the statement.
A CRediT statement credits you inside one paper. The recognition CRediT was built for happens when those roles are tied to you, persistently. Sign in with your ORCID — free — and claim your CRediT contributions on casrai.org, the home of the standard. They become a verified, portable part of your identity, not a line that disappears into one PDF.
Free: claim your contributions, then export a journal-ready CRediT statement, schema.org structured data, JATS XML, CSV or BibTeX — and preview your public profile. A membership publishes that profile publicly and verifies the journals you serve.
History and acquisition by IBM
First released in 1968, SPSS was developed to make statistical analysis accessible to social science researchers. Over the decades, it became the industry standard in universities worldwide. In 2009, IBM acquired the software and rebranded it as IBM SPSS Statistics. Despite the rise of modern programming languages, SPSS remains highly popular because of its legacy status in academia, its comprehensive support for traditional statistical methods, and its user-friendly interface that shields researchers from coding syntax. It is widely used in psychology, sociology, and marketing to analyse survey data and perform hypothesis testing without programming experience. Its widespread use ensures that students can find extensive training resources and textbooks tailored specifically to the software.
The three primary windows
Working in SPSS involves navigating three separate windows, each saving a different file type. The Data Editor window (.sav) contains the active dataset and is split into the Data View (rows of cases, columns of variables) and the Variable View (where variable types, labels, and missing values are defined). The Viewer window (.spv) displays the results of any analysis, including tables and charts. Finally, the Syntax Editor window (.sps) allows advanced users to write, save, and run code command scripts, which is crucial for reproducing analyses, automating repetitive tasks, and documenting data transformations. Understanding how these files interact is key to managing statistical projects and maintaining research transparency throughout the data lifetime.
Core features and limitations
SPSS excels at descriptive statistics, bivariate analyses (such as t-tests, ANOVA, and correlation), and general linear modelling. It features powerful data management capabilities, including sorting, merging, and restructuring datasets. However, it has significant limitations compared to open-source tools: its graphics engine produces rigid plots, it struggles with very large datasets, and its proprietary licence is expensive for independent researchers. Many research institutions and universities are consequently shifting toward open-source programming languages like R or user-friendly open-source platforms like Jamovi and JASP to reduce licensing costs and improve research flexibility. This shifting landscape encourages a broader transition toward transparent, community-supported tools that support the modern standards of reproducible scientific research.
Key facts
At a glance
- Definition: a graphical statistical software package widely used in the social sciences
- Developer: originally developed in 1968; acquired and maintained by IBM since 2009
- Data format: uses the .sav file extension to store data and variable definitions
- Interface: features a spreadsheet-like GUI with drop-down menus for statistical tests
- Syntax: supports syntax scripting (.sps files) to automate and document workflows
- Limitations: expensive licensing fees, rigid custom plotting, and poor support for big data
Common misconceptions
What people often get wrong
Often heard: SPSS is only useful if you do not know how to write code.
Actually: Whilst the GUI is its main draw, SPSS has a full syntax scripting language (.sps) that allows researchers to write programs, automate repetitive analyses, and ensure their work is reproducible.
Often heard: SPSS files (.sav) can only be opened and read within the SPSS program itself.
Actually: Many other statistical tools, including R (via the 'haven' package), Python (via 'pandas'), Jamovi, and JASP, can directly import and parse SPSS .sav files, preserving variable labels.
Common questions
FAQ
Why is SPSS so popular in psychology and social sciences?+
Its popularity is largely historical and educational. SPSS was one of the first packages with a graphical interface, making it easy to teach to non-programmers. Universities continue to license it, maintaining its status as the default tool in those disciplines.
What are the files generated by SPSS?+
SPSS creates three main files: .sav files for datasets (which store the actual data along with variable metadata like value labels), .spv files for output (containing tables and graphs from analyses), and .sps files for syntax code scripts.
Going deeper







