GitHub Explorer

by Alexey Ratnikov

GitHub Explorer

GitHub Explorer|TRENDING COMPARE|FEEDBACK

Back to search

soloshun/llm-from-scratch - GitHub Explorer | GitHub Explorer | Trending | Compare

Back to search

llm-from-scratch

soloshun•PUBLIC

View on GitHub

This repository contains the complete source code, explanations, and visualizations for the "Building LLMs from Scratch" series. Whether you're a beginner curious about how ChatGPT works or an experienced developer wanting to understand transformer architecture deeply, this series will guide you through every component step by step.

pythonpytorch

MIT License

Created on Sep 15, 2025

Updated on Jan 17, 2026

Stars

Forks

Watchers

Open Issues

Repository Health Score

🧡

50/100

Fair

Overall repository health assessment

Score Breakdown

Activity

Slow updates - updated within 3 months

10/30

33%

Recent Commits

Update README.md to reflect the completion of Part 9: Multi-Head Attention, and adjust the status of upcoming sections in the educational series roadmap for clarity and consistency.

Solo Shun•3 months ago

26ce3f4View on GitHub

Update links for Part 8: Causal Attention across README.md, Jupyter notebook, and Python module to ensure consistency and accuracy in documentation. The new links direct to the correct Medium article.

Solomon Eshun•3 months ago

e8a418bView on GitHub

Add Part 8: Causal Attention (Masked Self-Attention) implementation, including updates to README.md, new medium article, Jupyter notebook, and Python module. This part addresses the future leakage problem in autoregressive models by implementing causal masking, ensuring proper attention behavior during text generation.

Solomon Eshun•3 months ago

a69ada9View on GitHub

Update README.md to reflect the completion of Parts 6 (The Attention Mechanism) and 7 (Self-Attention with Trainable Weights), and adjust the status of upcoming sections, ensuring clarity in the educational series roadmap.

Solomon Eshun•3 months ago

d05a238View on GitHub

Update Part 7: Self-Attention with Trainable Weights to include the correct article link across README, notebook, and Python module, ensuring consistency in documentation.

Solomon Eshun•4 months ago

e74dae9View on GitHub

Add Part 7: Self-Attention with Trainable Weights

Solomon Eshun•4 months ago

785a29aView on GitHub

Update README.md and related files to reflect the correct link for Part 6 (The Attention Mechanism). Adjust notebook and Python module to include the updated article link, ensuring consistency across documentation.

Solomon Eshun•5 months ago

b125357View on GitHub

lecture 16

Solomon Eshun•5 months ago

f3fab1aView on GitHub

Update README.md, medium article, and Python module to reflect the correct link for Part 5 (Complete Data Preprocessing Pipeline). Adjust notebook to include the updated article link and add a blank markdown cell for future content.

Solomon Eshun•6 months ago

819d748View on GitHub

Update README.md to include the completion of Part 5 (Complete Data Preprocessing Pipeline) and adjust the roadmap status for upcoming parts, ensuring clarity in the series progression.

Solomon Eshun•6 months ago

eccb59dView on GitHub

Update README.md to reflect the completion of Part 4 (Token Embeddings & Positional Encoding) and adjust the status of the Self-Attention Mechanism. Update links in the notebook and Python module to point to the correct Medium article for Part 4.

Solomon Eshun•6 months ago

e373fabView on GitHub

Refactor vector embedding demonstration notebook to improve clarity and organization, enhancing the user experience for word vector operations and similarity calculations.

Solomon Eshun•6 months ago

2447876View on GitHub

Add Jupyter notebook for vector embedding demonstration, showcasing word vector loading, similarity calculations, and vector operations using Gensim.

Solomon Eshun•6 months ago

9e7cb50View on GitHub

Add torch to requirements.txt for enhanced model support

Solomon Eshun•6 months ago

655bd36View on GitHub

Update README.md to reflect the completion status of Part 3 (Data Pipeline) and adjust the roadmap for upcoming parts, including changes to status indicators and links.

Solomon Eshun•6 months ago

5198a5dView on GitHub

View all commits