- Constantine, Algeria.
- https://fadibenz.vercel.app/
- in/fadi-benzaima
Highlights
- Pro
Pinned Loading
-
common-crawl-filtering
common-crawl-filtering PublicThis project provides a robust, scalable pipeline for processing thousands of WET raw web data from Common Crawl into a high-quality dataset, and includes a from-scratch transformer implementation …
Jupyter Notebook
-
llm-alignment-reasoning-RL
llm-alignment-reasoning-RL PublicThis repository implements a production-grade evaluation and supervised fine-tuning (SFT) pipeline for measuring and improving Qwen 2.5 0.5B zero-shot performance on the MATH dataset.
Python 1
-
systems-transformer-optimizations
systems-transformer-optimizations PublicThis project implements systems-level optimizations for transformer training, including custom Triton kernels, PyTorch distributed training, optimizer state sharding, and memory/latency benchmarkin…
Python
-
tokenizer_leakage
tokenizer_leakage PublicThis repository serves to document my empirical studies on tokenizer data leakage and how it affects training and downstream tasks.
Jupyter Notebook
-
Momentum-Experimentation
Momentum-Experimentation PublicThis work explores the concept of momentum in gradient descent optimization. It provides a detailed mathematical foundation for understanding how momentum accelerates convergence in gradient-based …
Jupyter Notebook
-
Architectures-From-Scratch
Architectures-From-Scratch PublicA curated collection of deep learning architectures I have implemented from scratch to clearly understand the design choices and the different inductive biases.
If the problem persists, check the GitHub status page or contact support.

