Activation functions add non-linearity to the network, enabling it to learn complex patterns.
Without them, the entire network would just be a linear function!
## What Do They Do?
They transform the output of a neuron:
output = activation(w₁x₁ + w₂x₂ + ... + b)
The type of activation function affects how information flows.
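Here is a minimal sketch of a single neuron in NumPy. The input values, weights, and bias below are made-up examples, and `sigmoid` is just one possible choice of activation:

```python
import numpy as np

def sigmoid(z):
    """Squash a value into the range (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

# Hypothetical example values: two inputs, two weights, and a bias.
x = np.array([0.5, -1.2])
w = np.array([0.8, 0.3])
b = 0.1

z = np.dot(w, x) + b     # weighted sum: w1*x1 + w2*x2 + b
output = sigmoid(z)      # the activation makes the output non-linear in x
print(output)            # ≈ 0.53
```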
## Common Functions
1. Sigmoid
f(x) = 1 / (1 + e^{-x})
- Range: (0, 1)
- Smooth output
- Used in binary classification
2. Tanh
f(x) = (e^x - e^{-x}) / (e^x + e^{-x})
- Range: (-1, 1)
- Centered at 0
- Good for hidden layers
3. ReLU (Rectified Linear Unit)
f(x) = max(0, x)
- Simple and efficient
- Speeds up training
- Most common in deep learning
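To make the three definitions concrete, here is a small NumPy sketch of all of them side by side (the sample inputs are arbitrary):

```python
import numpy as np

def sigmoid(x):
    # Range (0, 1); smooth, saturates for large |x|
    return 1.0 / (1.0 + np.exp(-x))

def tanh(x):
    # Range (-1, 1); zero-centered
    return np.tanh(x)

def relu(x):
    # Zeroes out negatives, passes positives through unchanged
    return np.maximum(0.0, x)

x = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
print("sigmoid:", sigmoid(x))
print("tanh:   ", tanh(x))
print("relu:   ", relu(x))
```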
## Visual Comparison
See how each function transforms the input.
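If you want to plot the comparison yourself, a short matplotlib sketch like the following works (the input range of −5 to 5 is an arbitrary choice):

```python
import numpy as np
import matplotlib.pyplot as plt

x = np.linspace(-5, 5, 200)

plt.plot(x, 1 / (1 + np.exp(-x)), label="Sigmoid")
plt.plot(x, np.tanh(x), label="Tanh")
plt.plot(x, np.maximum(0, x), label="ReLU")
plt.axhline(0, color="gray", linewidth=0.5)
plt.legend()
plt.title("Activation function comparison")
plt.show()
```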
## Summary
| Function | Use Case                   |
|----------|----------------------------|
| Sigmoid  | Binary output              |
| Tanh     | Centered activation        |
| ReLU     | Default for hidden layers  |
## Self-Check
- Why do we need activation functions?
- Which function is most commonly used?
- What's the difference between Sigmoid and Tanh?