Overfitting and Generalization

Level 201 · Intermediate
10 mins

Overfitting occurs when a model performs well on training data but poorly on unseen data.

It means the model has memorized the training examples rather than generalized from them.


🔍 What is Generalization?

A good model:

  • Learns patterns, not noise
  • Performs well on new, real-world data

This ability is called generalization.


📉 Example

Suppose we train a model on 100 examples.

  • It gets 98% accuracy on training data
  • But only 70% accuracy on test data

This gap suggests overfitting.
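The same gap can be reproduced in a few lines. The sketch below (an illustrative NumPy example, not from the course) fits an overly flexible degree-15 polynomial to a small noisy dataset; training error is near zero while test error on fresh samples is noticeably higher.

```python
import numpy as np

rng = np.random.default_rng(0)

# Small noisy dataset: y = sin(x) + noise
x_train = rng.uniform(0, 3, 20)
y_train = np.sin(x_train) + rng.normal(0, 0.3, 20)
x_test = rng.uniform(0, 3, 200)
y_test = np.sin(x_test) + rng.normal(0, 0.3, 200)

# A degree-15 polynomial has enough capacity to chase the noise
coeffs = np.polyfit(x_train, y_train, deg=15)

def mse(x, y):
    return np.mean((np.polyval(coeffs, x) - y) ** 2)

train_mse = mse(x_train, y_train)
test_mse = mse(x_test, y_test)
print(f"train MSE: {train_mse:.3f}, test MSE: {test_mse:.3f}")
```

The large train/test gap is the numerical signature of overfitting: the polynomial threads through the noisy training points instead of tracking the underlying sine curve.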


📊 Visual Intuition

  • Underfitting: Too simple, poor on both train/test
  • Good fit: Balanced performance
  • Overfitting: Too complex, great on train, bad on test

🚨 Signs of Overfitting

  • High training accuracy, low test accuracy
  • Large gap between training and validation loss
  • Model performance degrades on real inputs
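The second sign, a growing train/validation loss gap, can be checked automatically. Here is a minimal sketch; the function name and the gap threshold are illustrative assumptions, not part of any standard library.

```python
def overfitting_gap(train_losses, val_losses, threshold=0.5):
    """Flag epochs where validation loss exceeds training loss by more
    than `threshold` (an assumed cutoff) -- a common overfitting sign."""
    return [epoch
            for epoch, (tr, va) in enumerate(zip(train_losses, val_losses))
            if va - tr > threshold]

# Training loss keeps falling while validation loss turns back up
train = [1.0, 0.6, 0.3, 0.15, 0.05]
val   = [1.1, 0.8, 0.7, 0.80, 0.95]
print(overfitting_gap(train, val))  # → [3, 4]
```

In practice the threshold depends on the loss scale and task, so it is usually tuned per project rather than fixed.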

🛡️ How to Prevent Overfitting

  • More Data — Helps the model see more variation
  • Regularization — Penalize large weights (e.g. L2)
  • Dropout — Randomly disable neurons during training
  • Early Stopping — Stop training when validation loss worsens
  • Simpler Models — Avoid overly complex models

Summary

  • Overfitting = memorizing, not learning
  • Generalization = ability to perform well on new data
  • Prevent with regularization, more data, and validation

Self-Check

  • How do you know a model is overfitting?
  • What is the difference between underfitting and overfitting?
  • How does dropout help reduce overfitting?
