Definition

Deep Learning

Deep learning is a machine learning approach that uses multi-layer neural networks to learn complex patterns from large datasets.

Updated May 30, 2026Also known as: DL

Short definition

Deep learning is a branch of machine learning based on neural networks with many layers. These layers transform raw inputs into increasingly useful representations, which lets the model handle complex data such as text, images, audio and code.

The word "deep" refers to the number of layers in the neural network. Each layer transforms the input a little further, so the system can learn patterns that would be difficult to describe with hand-written rules.

How it works

A deep learning model receives examples and adjusts millions or billions of internal parameters during training. Early layers may learn simple patterns, while later layers combine them into higher-level features. In computer vision, that can mean moving from edges to shapes to objects. In language models, it can mean moving from tokens to grammar, meaning and task patterns.

Training is the expensive learning phase. Inference is the later usage phase, when the trained model receives a new input and produces a prediction, classification, text, image or other output.

Deep learning is especially useful when the data is messy or high-dimensional: natural language, photos, video, speech, sensor streams and source code. That is why it became the technical foundation for many modern AI systems.

Common deep learning systems

Examples include:

image recognition models that classify objects or detect defects;
speech models that transcribe or generate audio;
recommendation models that rank content or products;
transformer-based language models that generate and analyze text;
diffusion or multimodal models that generate images and video.

Not every AI product needs deep learning. Simpler statistical models, search systems or rules may be cheaper and easier to explain. Deep learning becomes attractive when the task needs pattern recognition at scale.

Example

A medical imaging system can use deep learning to detect suspicious regions in scans. The model is not hand-coded with every possible visual rule; it learns from many labeled examples and then flags patterns that resemble cases seen during training.

Another example is image generation. A model can learn visual patterns from large datasets and then create a new picture from a text prompt. Our coverage of ChatGPT Images 2.0 shows how image models are moving from simple generation toward planning, text rendering and multi-step visual reasoning.

Why it matters

Deep learning made many modern AI breakthroughs possible, including speech recognition, image generation and large language models. It usually needs significant data, computing power and careful evaluation, so it is powerful but not always the simplest or cheapest option.

Its impact comes from flexibility. The same broad approach can support translation, fraud detection, robotics, medical imaging, code generation and creative tools. The tradeoff is that deep learning systems can be hard to interpret, expensive to train and sensitive to the quality of their data.

Limitations

Deep learning models can fail when the real world differs from the training data. They can also learn unwanted bias, overfit to weak patterns, or produce outputs that look plausible but are wrong.

That is why evaluation matters. Teams need test sets, monitoring, human review and clear thresholds for when a model is good enough for a specific use case. A model that is impressive in a demo may still be risky in a production workflow.