Overfitting and Generalization

Summary

Understand what overfitting means in machine learning and how to detect and prevent it.

Overfitting occurs when a model performs well on training data but poorly on unseen data.

It means the model has memorized the training examples rather than learned patterns that generalize.


🔍 What is Generalization?

A good model:

  • Learns patterns, not noise
  • Performs well on new, real-world data

This ability is called generalization.


📉 Example

Suppose we train a model on 100 examples.

  • It gets 98% accuracy on training data
  • But only 70% accuracy on test data

This large gap between training and test accuracy suggests overfitting.
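
A minimal sketch of how this gap shows up in practice, assuming scikit-learn; the synthetic dataset and the unconstrained decision tree are illustrative choices, not a prescription:

```python
from sklearn.datasets import make_moons
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Small, noisy dataset: easy for an unconstrained model to memorize.
X, y = make_moons(n_samples=100, noise=0.3, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0
)

# A depth-unlimited tree can fit the training set almost perfectly.
model = DecisionTreeClassifier(random_state=0)
model.fit(X_train, y_train)

print(f"train accuracy: {model.score(X_train, y_train):.2f}")  # typically ~1.00
print(f"test accuracy:  {model.score(X_test, y_test):.2f}")    # noticeably lower
# A large train/test gap like this is the classic overfitting signal.
```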


📊 Visual Intuition

  • Underfitting: Too simple, poor on both train/test
  • Good fit: Balanced performance
  • Overfitting: Too complex, great on train, bad on test (see the sketch below)
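
One way to see all three regimes at once is to vary model complexity on the same data, here via polynomial degree in a scikit-learn pipeline. A minimal sketch; the noisy-sine data and the degrees chosen are illustrative assumptions:

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(30, 1))
y = np.sin(X).ravel() + rng.normal(scale=0.2, size=30)  # noisy sine
X_test = rng.uniform(-3, 3, size=(200, 1))
y_test = np.sin(X_test).ravel()

for degree in (1, 4, 15):  # too simple, balanced, too complex
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
    model.fit(X, y)
    train_err = mean_squared_error(y, model.predict(X))
    test_err = mean_squared_error(y_test, model.predict(X_test))
    print(f"degree {degree:2d}: train MSE {train_err:.3f}, test MSE {test_err:.3f}")
# degree 1 underfits (high error on both); degree 15 overfits
# (near-zero train error, larger test error); degree 4 balances the two.
```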

🚨 Signs of Overfitting

  • High training accuracy, low test accuracy
  • Large gap between training and validation loss (a simple check is sketched below)
  • Model performance degrades on real inputs
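
One lightweight way to watch for that gap is to record per-epoch losses and flag the point where validation loss pulls away. A minimal sketch; the `tolerance` threshold and the loss values are made up for illustration:

```python
def overfitting_gap(train_losses, val_losses, tolerance=0.1):
    """Return epochs where validation loss exceeds training loss by `tolerance`.

    Both arguments are per-epoch average losses recorded during training;
    `tolerance` is an illustrative threshold, not a standard value.
    """
    return [
        epoch
        for epoch, (tr, va) in enumerate(zip(train_losses, val_losses))
        if va - tr > tolerance
    ]

# Hypothetical loss curves: training keeps improving while validation
# bottoms out and climbs back up, the classic overfitting signature.
train = [0.90, 0.60, 0.40, 0.25, 0.15, 0.08]
val   = [0.92, 0.65, 0.50, 0.48, 0.55, 0.65]
print(overfitting_gap(train, val))  # -> [3, 4, 5]
```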

🛡️ How to Prevent Overfitting

| Technique      | Description                                 |
|----------------|---------------------------------------------|
| More Data      | Helps model see more variation              |
| Regularization | Penalize large weights (e.g. L2)            |
| Dropout        | Randomly disable neurons during training    |
| Early Stopping | Stop training when validation loss worsens  |
| Simpler Models | Avoid overly complex models                 |
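
Several of these techniques can live in one training loop. A minimal PyTorch sketch combining dropout, an L2 penalty (via `weight_decay`), and early stopping; the architecture, hyperparameters, and random stand-in data are all illustrative assumptions:

```python
import torch
import torch.nn as nn

# Illustrative random data standing in for a real dataset.
torch.manual_seed(0)
X_train, y_train = torch.randn(80, 20), torch.randint(0, 2, (80,))
X_val, y_val = torch.randn(40, 20), torch.randint(0, 2, (40,))

model = nn.Sequential(
    nn.Linear(20, 64),
    nn.ReLU(),
    nn.Dropout(p=0.5),  # Dropout: randomly disables neurons during training
    nn.Linear(64, 2),
)
loss_fn = nn.CrossEntropyLoss()
# weight_decay applies an L2 penalty to the weights (regularization).
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3, weight_decay=1e-4)

best_val, patience, bad_epochs = float("inf"), 5, 0
for epoch in range(200):
    model.train()  # training mode: dropout active
    optimizer.zero_grad()
    loss = loss_fn(model(X_train), y_train)
    loss.backward()
    optimizer.step()

    model.eval()  # eval mode: dropout disabled
    with torch.no_grad():
        val_loss = loss_fn(model(X_val), y_val).item()

    # Early stopping: quit once validation loss stops improving.
    if val_loss < best_val:
        best_val, bad_epochs = val_loss, 0
    else:
        bad_epochs += 1
        if bad_epochs >= patience:
            print(f"early stop at epoch {epoch}, best val loss {best_val:.3f}")
            break
```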


🧠 Summary

  • Overfitting = memorizing, not learning
  • Generalization = ability to perform well on new data
  • Prevent with regularization, more data, and validation

✅ Self-Check

  • How do you know a model is overfitting?
  • What is the difference between underfitting and overfitting?
  • How does dropout help reduce overfitting?