Definition · Plain-language

P-value

Q: The p-value is the probability that the null hypothesis is true.

It is not. A p-value is computed assuming the null is true; it gives the probability of the data under that assumption, not the probability of the hypothesis given the data.

Q: A small p-value means the effect is large or important.

No. Statistical significance and practical importance are separate. A trivial effect can be highly significant in a large sample, which is why effect sizes and confidence intervals should also be reported.

Q: A p-value of 0.05 means there is a 5% chance the results occurred by chance.

That is a loose misreading. The 0.05 is the probability of results at least as extreme as observed if the null were true — a conditional probability about the data, not the chance the result is a fluke.

A p-value is the probability of obtaining a result at least as extreme as the one observed, assuming the null hypothesis is true. A small p-value casts doubt on the null.

CASRAI research-methods explainer — P-value

The step most authors miss

Doing CRediT right? Don’t stop at the statement.

A CRediT statement credits you inside one paper. The recognition CRediT was built for happens when those roles are tied to you, persistently. Sign in with your ORCID — free — and claim your CRediT contributions on casrai.org, the home of the standard. They become a verified, portable part of your identity, not a line that disappears into one PDF.

Free: claim your contributions, then export a journal-ready CRediT statement, schema.org structured data, JATS XML, CSV or BibTeX — and preview your public profile. A membership publishes that profile publicly and verifies the journals you serve.

What a p-value actually measures

A p-value answers one narrow question: if the null hypothesis were true, how surprising is the result we obtained? Formally it is the probability, computed under the null, of getting a test statistic at least as extreme as the one seen in the data. A very small p-value means such data would rarely arise by chance under the null, which is taken as evidence against it. Crucially, the p-value is a statement about the data given a hypothesis, not about the hypothesis given the data — a distinction that underlies most misuse of the figure.

Comparing p to the significance level

Researchers fix a significance level, α, before collecting data — most commonly 0.05, though 0.01 or 0.001 are used where stronger evidence is required. If the p-value is less than α, the result is declared statistically significant and the null hypothesis is rejected; if it is greater, researchers fail to reject the null. The α threshold is a convention, not a law of nature: 0.05 simply means accepting a one-in-twenty risk of a false positive. Pre-registering the threshold guards against moving the goalposts once results are in.

What a p-value cannot tell you

A p-value does not measure how large, real or important an effect is — a tiny, trivial effect can be highly significant in a large sample, while an important effect can be non-significant in a small one. It is not the probability that the null hypothesis is true, nor the probability the result occurred by chance. The American Statistical Association’s 2016 statement warned against treating p < 0.05 as a hard line for truth, urging researchers to report effect sizes and confidence intervals alongside p-values.

Key facts

At a glance

Definition: probability of data at least as extreme as observed, assuming H0 is true
Symbol: p (a probability between 0 and 1)
Threshold: compared with the significance level α (commonly 0.05)
Decision: p < α leads to rejecting the null hypothesis
Does NOT give: the probability the null is true, or the size of an effect
Caveat: ASA (2016) warns against using p < 0.05 as a sole measure of importance

Common misconceptions

What people often get wrong

Often heard: The p-value is the probability that the null hypothesis is true.

Actually: It is not. A p-value is computed assuming the null is true; it gives the probability of the data under that assumption, not the probability of the hypothesis given the data.

Often heard: A small p-value means the effect is large or important.

Actually: No. Statistical significance and practical importance are separate. A trivial effect can be highly significant in a large sample, which is why effect sizes and confidence intervals should also be reported.

Often heard: A p-value of 0.05 means there is a 5% chance the results occurred by chance.

Actually: That is a loose misreading. The 0.05 is the probability of results at least as extreme as observed if the null were true — a conditional probability about the data, not the chance the result is a fluke.

Going deeper

Related CASRAI guidance

Hypothesis testing →Confidence interval →Effect size →Null hypothesis →Statistics hub →Standards dictionary →