Micrograd Implementation

A minimal neural network library with automatic differentiation, inspired by Andrej Karpathy's micrograd. This is a learning implementation that demonstrates the core concepts of automatic differentiation and neural network training from scratch.

🚀 Features

Automatic Differentiation: Scalar-valued autograd engine with backward pass
Neural Network Components: Neurons, layers, and multi-layer perceptrons (MLPs)
Activation Functions: ReLU and Tanh support
Training Loop: Complete gradient-based optimization
Educational: Clean, readable code perfect for understanding backpropagation

📁 Repository Structure

micrograd-implementation/
├── engine.py           # Core Value class with autograd
├── neural_network.py   # Neural network components (Neuron, Layer, MLP)
├── demo.ipynb          # Training example on breast cancer dataset
└── README.md           # This file

🔧 Core Components

Value Class (`engine.py`)

The heart of the autograd engine - a scalar value that tracks gradients:

from engine import Value

# Create values
a = Value(2.0)
b = Value(4.0)

# Operations automatically build computation graph
c = a * b + a.tanh()

# Backpropagation
c.backward()
print(f"dc/da = {a.grad}")  # Gradient of c with respect to a
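
To make the mechanics concrete, here is a minimal sketch of how a scalar autograd node could work. `SketchValue` is a hypothetical stand-in written for this README, not the code in `engine.py`; it assumes each node stores its inputs and a small closure that applies the chain rule, and it covers only `+`, `*`, and `tanh`.

```python
import math


class SketchValue:
    """Hypothetical stand-in for engine.Value; not the repository's code."""

    def __init__(self, data, _children=()):
        self.data = data
        self.grad = 0.0
        self._prev = set(_children)
        self._backward = lambda: None  # leaf nodes have nothing to propagate

    def __add__(self, other):
        out = SketchValue(self.data + other.data, (self, other))

        def _backward():
            # Addition passes the upstream gradient through unchanged.
            self.grad += out.grad
            other.grad += out.grad

        out._backward = _backward
        return out

    def __mul__(self, other):
        out = SketchValue(self.data * other.data, (self, other))

        def _backward():
            # Each factor's local derivative is the other factor.
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad

        out._backward = _backward
        return out

    def tanh(self):
        t = math.tanh(self.data)
        out = SketchValue(t, (self,))

        def _backward():
            # d/dx tanh(x) = 1 - tanh(x)^2
            self.grad += (1.0 - t * t) * out.grad

        out._backward = _backward
        return out

    def backward(self):
        # Topologically sort the graph so a node's gradient is complete
        # before it is pushed down to that node's inputs.
        order, visited = [], set()

        def visit(node):
            if node not in visited:
                visited.add(node)
                for child in node._prev:
                    visit(child)
                order.append(node)

        visit(self)
        self.grad = 1.0
        for node in reversed(order):
            node._backward()


a = SketchValue(2.0)
b = SketchValue(4.0)
c = a * b + a.tanh()
c.backward()
print(a.grad, b.grad)
```

Gradients are added with `+=` rather than assigned because `a` feeds into `c` along two paths (the product and the tanh), and the contributions from both paths must be summed.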

Supported Operations:

Addition: a + b
Multiplication: a * b
Power: a ** n
Division: a / b
Tanh activation: a.tanh()
ReLU activation: a.relu()
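
One common way to validate the backward rule of each operation is to compare it against a finite-difference estimate. The sketch below works on plain floats, so it does not touch `engine.py`; the helper `numeric_grad`, the constants `b` and `n`, and the test point `a0` are all illustrative choices rather than part of the repository.

```python
import math


def numeric_grad(f, x, h=1e-6):
    """Central-difference estimate of df/dx at x."""
    return (f(x + h) - f(x - h)) / (2.0 * h)


# The second operand b and the exponent n are held constant here.
b, n = 4.0, 3.0

# (label, function of a, analytic derivative with respect to a)
rules = [
    ("a + b", lambda a: a + b, lambda a: 1.0),
    ("a * b", lambda a: a * b, lambda a: b),
    ("a ** n", lambda a: a ** n, lambda a: n * a ** (n - 1)),
    ("a / b", lambda a: a / b, lambda a: 1.0 / b),
    ("tanh(a)", math.tanh, lambda a: 1.0 - math.tanh(a) ** 2),
    ("relu(a)", lambda a: max(0.0, a), lambda a: 1.0 if a > 0 else 0.0),
]

a0 = 2.0
max_error = 0.0
for name, f, df in rules:
    error = abs(numeric_grad(f, a0) - df(a0))
    max_error = max(max_error, error)
    print(f"{name:8s} analytic={df(a0):.6f} error={error:.2e}")
```

The same comparison can be run against the real `Value` class by perturbing `a.data` and checking the result against `a.grad` after `backward()`.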

Neural Network (`neural_network.py`)

Neuron: Single perceptron with weights, bias, and activation

from neural_network import Neuron, Layer, MLP

neuron = Neuron(nin=3, nonlin=True)  # 3 inputs, ReLU activation
output = neuron([1.0, 2.0, 3.0])

Layer: Collection of neurons

layer = Layer(nin=3, nout=5)  # 3 inputs, 5 outputs

MLP: Multi-layer perceptron

model = MLP(nin=30, nouts=[16, 16, 1])  # 30->16->16->1 architecture
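
To give a feel for what these components compute, the sketch below reproduces a single neuron's forward pass and the parameter count of an MLP using plain floats. It assumes a fully connected design in which every neuron holds one weight per input plus one bias; `neuron_forward` and `mlp_param_count` are hypothetical helpers, not functions from `neural_network.py`.

```python
def neuron_forward(weights, bias, inputs, nonlin=True):
    """Weighted sum plus bias, optionally passed through ReLU."""
    act = sum(w * x for w, x in zip(weights, inputs)) + bias
    return max(0.0, act) if nonlin else act


def mlp_param_count(nin, nouts):
    """Count weights and biases of a fully connected network."""
    sizes = [nin] + list(nouts)
    total = 0
    for fan_in, fan_out in zip(sizes, sizes[1:]):
        total += fan_out * (fan_in + 1)  # fan_in weights and one bias per neuron
    return total


print(neuron_forward([0.5, -1.0, 0.25], 0.1, [1.0, 2.0, 3.0]))  # ReLU clips the negative sum to 0.0
print(mlp_param_count(30, [16, 16, 1]))  # 496 + 272 + 17 = 785
```

Under these assumptions, the 30->16->16->1 architecture has 785 scalar parameters, each of which would be its own `Value` node in the computation graph.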

🎯 Example Usage

Binary Classification on Breast Cancer Dataset

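from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from neural_network import MLP
from engine import Value

# Load the data and hold out a test set (the 80/20 split is illustrative)
data = load_breast_cancer()
X, y = data.data, data.target
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Standardize features using statistics from the training set only
scaler = StandardScaler()
X_train = scaler.fit_transform(X_train)
X_test = scaler.transform(X_test)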

# Create model
model = MLP(30, [16, 16, 1])

# Training loop
for epoch in range(100):
    # Forward pass
    inputs = [list(map(Value, row)) for row in X_train]
    predictions = [model(x) for x in inputs]
    
    # Loss calculation (mean squared error; no regularization term is applied here)
    loss = sum((pred - target)**2 for pred, target in zip(predictions, y_train))
    loss = loss * (1.0 / len(predictions))
    
    # Backward pass
    model.zero_grad()
    loss.backward()
    
    # Parameter update
    learning_rate = 0.01
    for param in model.parameters():
        param.data -= learning_rate * param.grad
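
Because the loop above regresses raw outputs onto 0/1 targets, one simple way to turn outputs into class labels is to threshold them at 0.5. The sketch below shows that step on plain floats; the `accuracy` helper, the threshold, and the sample scores are illustrative assumptions. With the trained model, the scores would presumably come from something like `[model(x).data for x in inputs]`.

```python
def accuracy(scores, targets, threshold=0.5):
    """Fraction of examples whose thresholded score matches the 0/1 target."""
    correct = sum(
        int(score > threshold) == int(target)
        for score, target in zip(scores, targets)
    )
    return correct / len(targets)


# Made-up scores for illustration only; not results from the notebook.
scores = [0.91, 0.12, 0.58, 0.40]
targets = [1, 0, 0, 1]
print(accuracy(scores, targets))  # 2 of 4 correct -> 0.5
```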

📊 Demo Results

The included Jupyter notebook (demo.ipynb) demonstrates:

Loading and preprocessing the breast cancer dataset
Training a neural network from scratch
Achieving ~95% accuracy on binary classification
Visualizing training progress

🛠️ Installation & Requirements

# Clone the repository
git clone https://github.com/yourusername/micrograd-implementation.git
cd micrograd-implementation

# Install dependencies
pip install numpy scikit-learn matplotlib jupyter

Dependencies:

numpy - For numerical operations
scikit-learn - For datasets and preprocessing
matplotlib - For plotting (demo only)
jupyter - For running the demo notebook

🎓 Learning Objectives

This implementation helps understand:

Automatic Differentiation: How computational graphs track gradients
Backpropagation: The chain rule in action
Neural Network Architecture: Building networks from basic components
Training Process: Forward pass, loss calculation, backward pass, parameter updates
Gradient-based Optimization: How neural networks learn

🔍 Key Implementation Details

Numerical Stability: Uses math.tanh() to avoid overflow errors
Scalar Operations: Each Value wraps a single number and nothing is vectorized, which keeps the code simple at the cost of speed and memory
Educational Focus: Prioritizes clarity over performance
Gradient Accumulation: Parameter gradients are reset with model.zero_grad() before each backward pass, so they do not carry over between training steps
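
The numerical stability point can be illustrated with a short comparison. The sketch assumes the alternative to `math.tanh()` would be the textbook exponential formula; `naive_tanh` is a hypothetical function written for this comparison and is not taken from `engine.py`.

```python
import math


def naive_tanh(x):
    """Textbook formula (e^(2x) - 1) / (e^(2x) + 1), prone to overflow."""
    e = math.exp(2.0 * x)
    return (e - 1.0) / (e + 1.0)


# For moderate inputs the two agree closely.
small_gap = abs(naive_tanh(0.5) - math.tanh(0.5))

# For large inputs math.exp overflows, while math.tanh saturates at 1.0.
try:
    naive_tanh(1000.0)
    naive_overflowed = False
except OverflowError:
    naive_overflowed = True

print(small_gap < 1e-12, naive_overflowed, math.tanh(1000.0))
```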

🚨 Limitations

Performance: Not optimized for large-scale training (use PyTorch/TensorFlow for real projects)
Scalar Operations: No vectorization - slow on large datasets
Limited Functionality: Basic operations only
Educational Purpose: Designed for learning, not production use

🙏 Acknowledgments

This implementation is inspired by Andrej Karpathy's micrograd and his excellent educational content. This project was created as a learning exercise to understand the fundamentals of automatic differentiation and neural networks.

📖 Further Reading

Note: This is an educational implementation. For production neural networks, use established frameworks like PyTorch, TensorFlow, or JAX.
