📚 This repository is intended for educational purposes only. All notebooks are designed to support learning and are companion resources to the YouTube channel.
▲ Click the thumbnail to watch the full playlist
A visual map of all 125 topics with illustrations, formulas, and stage-by-stage grouping — optimised for desktop and mobile.
What's inside: 125 topic cards · 13 learning stages · click-to-expand modals with formulas, use-cases, and notebook links · dark/light theme · mobile-friendly layout
Practical Python examples for 116 Machine Learning and Data Science topics from my YouTube channel.
Difficulty legend: 🟢 Beginner · 🟡 Intermediate · 🔴 Advanced
| 📖 | Topic-wise practical code for every lecture |
| 💻 | Jupyter notebooks runnable locally |
| ☁️ | One-click cloud execution via Binder |
| 🎬 | Direct links to companion YouTube videos |
- 🔹 Keep each topic short, practical, and beginner-friendly.
- 🔹 Map every video to runnable Python code.
- 🔹 Make it easy for subscribers to fork, run, and learn.
.
├── notebooks/
│ ├── 01_linear_regression/
│ │ └── linear_regression_starter.ipynb
│ ├── ... (topics 02–114)
│ ├── 115_real_world_project/
│ │ └── real_world_project_starter.ipynb ← End-to-end capstone
│ ├── 116_advanced_time_series_ml/
│ ├── 117_stream_adaptive_learning/
│ ├── 118_denstream/
│ ├── 119_adwin/
│ ├── 120_vfdt_cvfdt/
│ ├── 121_drift_detection/
│ ├── 122_monic/
│ ├── 123_shap_values/
│ ├── 124_arima_sarima/
│ └── 125_clustream/
├── src/
│ ├── data/
│ ├── ml_basics/
│ └── utils/
├── requirements.txt
├── runtime.txt
├── environment.yml
├── CONTRIBUTING.md
└── README.md
Click any launch binder badge on a topic tile below, or launch the whole repo:
⏳ First build may take a few minutes. If it times out, simply retry.
- Clone the repository.
- Create and activate a virtual environment.
- Install dependencies.
- Start Jupyter Lab.
git clone https://github.com/ranjiGT/ai-machine-learning-codes.git
cd ai-machine-learning-codes
python3 -m venv .venv
source .venv/bin/activate
pip install --upgrade pip
pip install -r requirements.txt
jupyter lab125 topics across 13 themes — click a section to expand. Each tile links to the interactive Binder notebook and its companion YouTube video.
🟦 Supervised Learning — Core Classifiers (01–17) · 🟢 Beginner → 🟡 Intermediate
Prerequisites: Python basics, NumPy, pandas. Start here if you are new to ML.
🟨 Probabilistic & Rule-Based Methods (18–26) · 🟡 Intermediate
Prerequisites: Topics 01–07 (core classifiers).
🟧 Class Imbalance & Evaluation Curves (27–30) · 🟡 Intermediate
Prerequisites: Topics 01–07, 11 (cross-validation).
|
27. Class Imbalance Problem and Solutions |
28. SMOTE |
29. ROC and AUC Curves |
|
30. ISO Accuracy Lines |
🟩 Clustering (31–46) · 🟡 Intermediate
Prerequisites: Topics 06 (PCA), 11 (cross-validation). No labelled data required.
🟪 Active Learning (47–56) · 🔴 Advanced
Prerequisites: Topics 01–17, 27–30 (evaluation).
🔵 Multi-label & Multi-class Classification (57–64) · 🟡 Intermediate
Prerequisites: Topics 01–07, 11.
🟫 Pattern Mining & Statistical Theory (65–74) · 🟡 Intermediate
Prerequisites: Topics 07 (Naive Bayes), basic probability.
⬛ Concept Learning, Algorithms & Regression (75–85) · 🟡 Intermediate
Prerequisites: Topics 01, 03 (Decision Trees), 11.
🔶 Dimensionality Reduction & Lazy Learning (86–90) · 🟡 Intermediate
Prerequisites: Topics 05 (KNN), 06 (PCA).
|
86. Linear Discriminant Analysis (LDA) |
87. t-SNE |
88. Lazy Learning Algorithms |
|
89. Weighted K-NN |
90. Case-Based Reasoning |
🔴 Deep Learning (91–107) · 🔴 Advanced
Prerequisites: Topics 22–23 (Neural Networks), 80 (Gradient Descent). Familiarity with NumPy recommended.
⚫ Advanced & Semi-Supervised Methods (108–113) · 🔴 Advanced
Prerequisites: Topics 01–30, 91–107 (Deep Learning basics).
🎯 Advanced Activation Functions (114–114) · 🔴 Advanced
Prerequisites: Topics 91–107 (Deep Learning).
|
114. Softmax Activation Function |
🏁 End-to-End Real-World Project (115) · 🟡 Intermediate
Prerequisites: Topics 01–30 recommended. This notebook deliberately revisits concepts from across the course.
A capstone notebook that applies the concepts from the entire course to a real dataset.
We predict Heart Disease using the UCI Cleveland dataset, walking through every stage a practitioner faces:
EDA → Preprocessing → PCA → SMOTE → Multi-model benchmarking → Hyperparameter tuning → ROC/AUC → Feature importance
| Concept | Topics revisited |
|---|---|
| Hold-out split & scaling | 01, 13 |
| PCA visualisation | 06 |
| SMOTE for class imbalance | 28 |
| 5-fold cross-validation | 11 |
| GridSearchCV tuning | 14 |
| ROC-AUC evaluation | 29 |
| Classification report | 63, 108 |
|
115. End-to-End Real-World ML Project |
📈 Advanced Time Series ML (116) · 🔴 Advanced
Prerequisites: Topics 01, 11, 14, and basic understanding of lag/rolling features.
A practical advanced notebook focused on modern ML workflows for forecasting:
TimeSeriesSplit → Lag/Rolling features → RF/HistGB/SVR → Multi-step recursive forecasting
| Concept | Topics revisited |
|---|---|
| Time-aware validation | 11 |
| Model tuning mindset | 14 |
| Feature engineering | 01, 80 |
| Forecast diagnostics | 108 |
If you want to co-author with yatchmaster:
- Add
yatchmasteras a collaborator in GitHub repository settings. - Use feature branches and pull requests.
- Add co-author trailers in commit messages when both contribute:
Co-authored-by: yatchmaster <their-email@users.noreply.github.com>
See CONTRIBUTING.md for a full workflow.
Contributions are welcome from subscribers and learners.
Please read CONTRIBUTING.md before opening pull requests.
This project is licensed under the MIT License.