Editorial · CASRAI · AI and ML research outputs

Supervised vs Unsupervised Learning Explained

Supervised learning trains on labelled data to predict outcomes; unsupervised learning finds structure in unlabelled data. This explainer defines both paradigms, notes reinforcement learning as a third, and compares their tasks, methods and research uses.

ByCASRAI Editorial Board

Published 19 Jun 2026· 4 minute read

Supervised learning is a machine-learning paradigm in which a model is trained on labelled examples — inputs paired with known correct outputs — so that it can predict the output for new, unseen inputs. Its counterpart, unsupervised learning, works with unlabelled data and seeks to discover structure, patterns or groupings without being told what the “right” answer is. The presence or absence of labels is the defining distinction between the two.

Both belong to the wider field of machine learning, and choosing between them depends on whether you have labelled data and what question you are asking. A third paradigm, reinforcement learning, sits apart from both.

Supervised learning: learning from labels

In supervised learning, each training example carries a label: the email is “spam” or “not spam”, the image is “tumour” or “benign”, the house has a known sale price. The algorithm learns a mapping from inputs (features) to outputs (labels), and its performance is judged by how accurately it predicts labels for data it has not seen. Two main task types exist. Classification predicts a discrete category (spam or not, species A, B or C). Regression predicts a continuous quantity (a price, a temperature, a concentration).

Common supervised methods include linear and logistic regression, decision trees, support vector machines and neural networks. The key practical requirement — and often the key cost — is obtaining enough accurately labelled data, which may require expert annotation.

Unsupervised learning: finding structure

Unsupervised learning has no labels. Instead, the algorithm looks for inherent structure in the data. Clustering groups similar items together — for example, segmenting samples into subtypes without prior categories, using methods such as k-means or hierarchical clustering. Dimensionality reduction, such as principal component analysis, compresses many variables into fewer while preserving structure, aiding visualisation and downstream analysis. Because there is no ground-truth label, evaluating unsupervised results is harder and often relies on domain judgement.

Reinforcement learning: a third paradigm

Reinforcement learning differs from both. Here an agent learns by interacting with an environment, taking actions and receiving rewards or penalties, and gradually improving a policy to maximise cumulative reward. It is neither labelled-example learning nor pure pattern discovery; it learns from consequences over time. Reinforcement learning underlies advances in game-playing and robotics, and is noted here for completeness rather than treated in depth.

Comparing the paradigms

Feature	Supervised learning	Unsupervised learning
Data	Labelled (input–output pairs)	Unlabelled
Goal	Predict known outputs	Discover hidden structure
Main tasks	Classification, regression	Clustering, dimensionality reduction
Example methods	Logistic regression, decision trees, SVMs	k-means, hierarchical clustering, PCA
Evaluation	Accuracy against held-out labels	Often qualitative; needs domain judgement

Research uses and good practice

In research, supervised learning suits prediction and classification where labelled outcomes exist — diagnosing from images, predicting properties from features. Unsupervised learning suits exploration — finding subgroups, detecting anomalies, reducing dimensionality before further analysis. The two are often combined: unsupervised methods can pre-process or explore data that a supervised model then uses.

Whichever paradigm is used, the outputs are research outputs that require careful reporting: how data were labelled or collected, how the model was validated, and how results were evaluated. Sharing data, code and methods using consistent terminology supports reproducibility, and our guidance for authors covers documenting such computational work. For foundational background, see our overview of machine learning concepts and methods and the explainer on neural networks and deep learning.

Frequently asked questions

What is the main difference between supervised and unsupervised learning?

Labels. Supervised learning trains on labelled examples — inputs paired with known correct outputs — to make predictions, whereas unsupervised learning works with unlabelled data to discover structure such as clusters or lower-dimensional representations without any predefined answer.

What are classification and regression?

They are the two main supervised tasks. Classification predicts a discrete category, such as spam or not spam. Regression predicts a continuous value, such as a price or temperature. Both learn a mapping from input features to known output labels.

Where does reinforcement learning fit?

It is a separate, third paradigm. An agent learns by acting in an environment and receiving rewards or penalties, improving its policy over time to maximise cumulative reward. It learns from consequences rather than from labelled examples or pure pattern discovery.

Can the two approaches be combined?

Yes, frequently. Unsupervised methods such as clustering or PCA can explore or pre-process data, and a supervised model can then make predictions from the result. Many research pipelines use both, so they are complementary rather than mutually exclusive.

Related editorial in this domain

More on AI and ML research outputs

20 Jun 2026

Quantum Computing: Principles and Research Implications

Quantum computing uses qubits, superposition and entanglement to process information in ways classical computers cannot. This explainer defines the principles, situates the NISQ era, and assesses realistic research implications without overstating present-day capabilities.

19 Jun 2026

Overfitting and Underfitting in Machine Learning Explained

Overfitting happens when a model memorises training data instead of learning general patterns, while underfitting means it is too simple to capture them. This guide explains the bias-variance trade-off, regularisation, cross-validation and the threat to reproducible ML.

19 Jun 2026

Neural Networks and Deep Learning Explained

A neural network is a machine-learning model of interconnected units that transform inputs through weighted layers. This explainer covers neurons, weights and activations, training by backpropagation and gradient descent, and what makes a network “deep”.