Found 324 repositories (showing 30)
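Build a Large Language Model (From Scratch) is an e-book that explores the principles and implementation of large language models in depth. It suits learners who want a thorough understanding of the architecture, training process, and application development of large models such as GPT. To make this valuable textbook accessible to more Chinese readers, I decided to translate it into Chinese and share it as open source on GitHub.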
VizuaraAILabs
Learn the building blocks of how to build gpt-oss from scratch
TatevKaren
BabyGPT: Build Your Own GPT Large Language Model from Scratch. Pre-Training Generative Transformer Models: Building GPT from Scratch with a Step-by-Step Guide to Generative AI in PyTorch and Python
jeroenvlek
Andrej Karpathy's Let's build GPT: from scratch video & notebook implemented in Rust + candle
codewithdark-git
This repository guides you through the process of building a GPT-style Large Language Model (LLM) from scratch using PyTorch. The structure and approach are inspired by the book Build a Large Language Model (From Scratch) by Sebastian Raschka.
wtedw
Build GPT from scratch using Nx and Axon
This repository contains a step-by-step implementation of a ChatGPT-like model from scratch using PyTorch. It covers everything from tokenization and transformer architecture to training and fine-tuning on conversational data. The course is designed for those who want to deeply understand how large language models work and build their own! 🚀
mytechnotalent
Inspired by Andrej Karpathy’s "Let’s Build GPT", this project guides you step‑by‑step to build a GPT from scratch, demystifying its architecture through clear, hands‑on code.
ajeetkharel
Build and Train a GPT-2 from scratch using PyTorch
codewithaarohi
In this lecture, we are going to build our own Mini GPT Language Model from scratch using PyTorch! This is a beginner-friendly, step-by-step implementation of a tiny language model that learns to predict the next word, just like how GPT and other Large Language Models work internally.
danilop
Build a GPT from scratch, one concept at a time, from pure Python to PyTorch, JAX, MLX, and production serving. Inspired by Andrej Karpathy's microGPT.
Bensmail-anis
We build a Mini Generative Pretrained Transformer (GPT) inspired by the "Attention is All You Need" paper (11M params)
naresh-sharma
Build GPT from scratch to understand how LLMs work. A hands-on educational series for software engineers. No ML background required!
Lexxx42
Let's build GPT: from scratch, in code, spelled out.
MarcellusZhao
Train a mini-GPT model from scratch on the Tiny Shakespeare dataset by following Andrej Karpathy's online lecture.
ideaweaver-ai
No description available
aspiringastro
Homage to the "Let's build GPT: from scratch, in code and spelled out" from @karpathy
cmyandlqs
Build GPT-2 from scratch: Pretraining, Finetuning, Text Generation
MohamedGamalElSherbiny
Implementation of a GPT-style language model from scratch, following the concepts and code presented in Sebastian Raschka’s Build a Large Language Model From Scratch. This repository includes step-by-step implementations, experiments, and notes to deepen understanding of transformer-based architectures.
No description available
matlab-deep-learning
MATLAB® adaptation of Andrej Karpathy's online lecture "Let's build GPT: from scratch, in code, spelled out"
KRcpl88
Advanced tutorial on how to build a transformer model from scratch using PyTorch, generated ENTIRELY by GPT 4. Use a transformer model to generate a tutorial on how to build a transformer model!
Elma-dev
No description available
Archistein
GPT: build and pre-training from scratch
PetropoulakisPanagiotis
GPT code - I completed the tutorial for building GPT components by Andrej Karpathy (Let's build GPT: from scratch, in code, spelled out)
Danish-Ahmed007
No description available
abtraore
Let's build GPT from scratch.
ManasNair1209
A genuine attempt to build an SLM from scratch, using GPT-2 weights
Lamorati92
📚 Build and train your own GPT-like Large Language Model from scratch with clear guidance and real code examples.
Implementation of the "Let's build GPT: from scratch, in code, spelled out" video by @karpathy, but in Elixir with Nx and Axon