Statistics is the discipline concerned with collecting, organising, analysing, interpreting and presenting data. At its core it is the science of reasoning under uncertainty: it provides methods for drawing conclusions about a whole population from a limited sample, and for quantifying how much confidence those conclusions deserve. Statistics underpins quantitative research across every field, from medicine and economics to ecology and the social sciences.
Descriptive versus inferential statistics
The discipline divides into two broad branches. Descriptive statistics summarise and describe the features of a dataset without claiming anything beyond it. Measures of central tendency such as the mean, median and mode, measures of spread such as the range and standard deviation, and visual summaries such as histograms all belong here. Descriptive statistics tell you what the data at hand look like.
Inferential statistics go further: they use a sample to make estimates or test claims about a larger population that has not been fully observed. Estimation, hypothesis testing, confidence intervals and regression modelling are all inferential tools. The defining feature of inference is that it carries uncertainty, and statistics provides the machinery to measure that uncertainty rather than ignore it.
| Branch | Purpose | Typical tools |
|---|---|---|
| Descriptive | Summarise observed data | Mean, median, standard deviation, charts |
| Inferential | Draw conclusions about a population | Confidence intervals, hypothesis tests, regression |
Populations and samples
The distinction between a population and a sample is fundamental. A population is the entire set of units a researcher wishes to understand: all adults in a country, every transaction in a year, all stars in a galaxy. A sample is a subset of that population actually measured. Because studying an entire population is usually impractical, researchers work from samples and infer to the whole. A numerical fact about a population is a parameter; the corresponding figure calculated from a sample is a statistic, and statistics as a discipline is largely the study of how well sample statistics estimate population parameters.
Estimation and hypothesis testing
Two complementary tasks dominate inferential work. Estimation asks how large a quantity is and how precisely we know it, producing point estimates and interval estimates such as confidence intervals. Hypothesis testing asks whether the data are compatible with a specific claim, typically a null hypothesis of no effect, and summarises that compatibility with measures such as p-values. Both rest on the idea that random sampling produces variation, and that this variation can be modelled probabilistically.
Variability and probability
Underlying all of statistics is the recognition that data vary. Two samples from the same population will rarely give identical results, and statistics describes this sampling variation using probability. Measures such as the standard deviation quantify spread within data, while probability distributions describe how estimates would behave across repeated sampling. This probabilistic foundation is what allows statisticians to attach honest measures of uncertainty to their conclusions.
Why statistics is central to research
Statistics is not an optional add-on to research; it shapes how studies are designed, how large samples need to be, how data are analysed and how findings are reported. Sound statistical practice is essential for reproducibility, because it disciplines researchers against over-interpreting noise and helps others judge whether a result is robust. Poor statistical practice, by contrast, is a recognised driver of irreproducible findings. CASRAI’s work on standardised reporting and the CASRAI dictionary supports clearer, more comparable statistical reporting across the scholarly record, and the reproducibility category tracks developments in this area.
Frequently asked questions
Is statistics a branch of mathematics?
Statistics uses mathematics, particularly probability theory, but it is usually regarded as a distinct discipline. Its focus is on data, inference and the practical business of learning from observation under uncertainty, not on abstract mathematical structure alone.
What is the difference between a parameter and a statistic?
A parameter is a fixed numerical characteristic of a population, such as the population mean. A statistic is the corresponding figure computed from a sample, such as the sample mean. Statistics as a discipline studies how to estimate parameters from statistics.
Why does statistics matter for reproducibility?
Reproducibility depends on whether a reported result reflects a genuine effect or random variation. Statistical methods quantify that uncertainty and guard against over-claiming, so transparent statistical reporting is one foundation of a trustworthy scholarly record. See the CASRAI author guidance for reporting practices.







