Data science & AI · Reference

What is deep learning?

Deep learning is a branch of machine learning that uses neural networks with many layers to learn increasingly abstract representations of data, enabling strong performance on tasks such as image recognition and language understanding.

What makes learning "deep"

The depth in deep learning refers to the number of layers in a neural network through which data is transformed. Each successive layer builds a more abstract representation: in image tasks, early layers may respond to edges, later layers to shapes, and deeper layers to whole objects. This is representation learning — the network discovers useful features automatically rather than relying on humans to design them, which is the key practical advantage over earlier machine-learning methods on perception and language.

How deep networks are trained

Deep networks are trained by backpropagation, an algorithm that computes how each parameter contributes to the output error and adjusts it accordingly, combined with gradient-descent optimisation over many examples.

Common architectures include convolutional neural networks for images, recurrent networks for sequences, and the transformer for language and beyond. Training deep models typically requires large labelled or self-supervised datasets and substantial compute, often on graphics processing units (GPUs).

Why deep learning scaled

Deep neural networks are an old idea, but they became dominant only around 2012, when a deep convolutional network sharply improved image-classification accuracy on the ImageNet benchmark. Three factors converged: far larger labelled datasets, parallel hardware (GPUs) fast enough to train large models, and algorithmic refinements such as better activation functions and regularisation. This combination of data plus compute let deep models surpass methods that depended on hand-crafted features.

Deep learning in research

Deep learning is now a standard research tool in fields from structural biology to astronomy, where it is used for pattern recognition in large datasets. Methodologically, deep models pose challenges: they are data-hungry, can be opaque ("black box"), and may fail unpredictably outside their training distribution. Responsible use involves rigorous held-out evaluation, attention to dataset bias, and — where decisions matter — interpretability analysis and uncertainty estimation. Results are reported as empirical findings to be independently reproduced.

Key facts

At a glance

Field: subfield of machine learning
Core unit: multi-layer (deep) neural networks
Key idea: representation (feature) learning from raw data
Training algorithm: backpropagation with gradient descent
Breakthrough: ImageNet image classification, 2012
Enablers: large datasets plus GPU compute

Common questions

FAQ

What is the difference between machine learning and deep learning?+

Deep learning is a subset of machine learning that specifically uses many-layered neural networks and learns features automatically. Other machine-learning methods often rely on hand-engineered features and simpler models.

Why did deep learning become so successful?+

Deep learning scaled because three things converged after around 2012: much larger datasets, fast parallel hardware (GPUs), and algorithmic improvements. Together these let deep networks outperform methods using hand-crafted features on vision and language tasks.

What is representation learning?+

Representation learning is the automatic discovery of useful features from raw data. In a deep network, each layer learns a more abstract representation, so engineers do not have to design features by hand.

Going deeper

Related on CASRAI

Sources

The step most authors miss

Doing CRediT right? Don’t stop at the statement.

A CRediT statement credits you inside one paper. The recognition CRediT was built for happens when those roles are tied to you, persistently. Sign in with your ORCID — free — and claim your CRediT contributions on casrai.org, the home of the standard. They become a verified, portable part of your identity, not a line that disappears into one PDF.

Free: claim your contributions, then export a journal-ready CRediT statement, schema.org structured data, JATS XML, CSV or BibTeX — and preview your public profile. A membership publishes that profile publicly and verifies the journals you serve.