Found 2,744 repositories(showing 30)
jingyaogong
🚀🚀 「大模型」2小时完全从0训练64M的小参数GPT!🌏 Train a 64M-parameter GPT from scratch in just 2h!
《Build a Large Language Model (From Scratch)》是一本深入探讨大语言模型原理与实现的电子书,适合希望深入了解 GPT 等大模型架构、训练过程及应用开发的学习者。为了让更多中文读者能够接触到这本极具价值的教材,我决定将其翻译成中文,并通过 GitHub 进行开源共享。
duoan
🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.
zakirullin
Tiny GPT implemented from scratch in pure Go. Trained on Jules Verne books. Explained.
pranavjad
gpt-2 from scratch in mlx
Discover the power of GPT-3.5 turbo in presentation creation! 💻🤩 This web application can effortlessly generate captivating PowerPoint presentations. Say goodbye to the struggle of starting from scratch and let AI revolutionize your content creation process. 🎨
JohnMachado11
Building a GPT-like LLM from scratch with PyTorch.
HamzaElshafie
A PyTorch implementation of the GPT-OSS-20B architecture. All components are coded from scratch: RoPE with YaRN, RMSNorm, SwiGLU with clamping and residual connection, Mixture-of-Experts (MoE), Self-Attention with learned sinks, banded attention, GQA, and KV-cache.
tuanlda78202
implement GPT-OSS 20B & 120B C++ inference from scratch on AMD GPUs
kevinpdev
Educational implementation of a small GPT model from scratch in a single Jupyter Notebook
bclarkson-code
Autograd to GPT-2 completely from scratch
AayushSameerShah
This repository contains the collection of explorative notebooks pure in python and in the language that we, humans can read. Have tried to compile all lectures from the Andrej Karpathy's 💎 playlist on Neural Networks - which we will end up with building GPT.
LeeSinLiang
Implementation of GPT from scratch. Design to be lightweight and easy to modify.
VizuaraAILabs
Learn the building blocks of how to build gpt-oss from scratch
TatevKaren
BabyGPT: Build Your Own GPT Large Language Model from Scratch Pre-Training Generative Transformer Models: Building GPT from Scratch with a Step-by-Step Guide to Generative AI in PyTorch and Python
projektjoe
From-scratch implementation of OpenAI's GPT-OSS model in Python. No Torch, No GPUs.
VizuaraAILabs
A truly open version of gpt-oss which shows the entire pre-training from scratch
tag1consulting
A GPT-2 style transformer language model implemented from scratch in Rust for educational purposes.
FareedKhan-dev
Implementation of a GPT-4o like Multimodal from Scratch using Python
jeroenvlek
Andrej Karpathy's Let's build GPT: from scratch video & notebook implemented in Rust + candle
bashnick
A codebase implementing a simple GPT-like model from scratch based on the Attention is All You Need paper.
codewithdark-git
This repository guides you through the process of building a GPT-style Large Language Model (LLM) from scratch using PyTorch. The structure and approach are inspired by the book Build a Large Language Model (From Scratch) by Sebastian Raschka.
johnma2006
Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.
shreydan
Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model was implemented mostly from scratch.
creikey
An existential western RPG about semi-conscious GPT agents. Written in a 3d engine from scratch targeting Wasm/Webgl and Desktop
wtedw
Build GPT from scratch using Nx and Axon
vishnupsatish
Deep learning library (training & inference) and GPT-2 in C++ from scratch.
FareedKhan-dev
Train a 29M parameter GPT from Scratch
This repository contains a step-by-step implementation of a ChatGPT-like model from scratch using PyTorch. It covers everything from tokenization and transformer architecture to training and fine-tuning on conversational data. The course is designed for those who want to deeply understand how large language models work and build their own! 🚀
saqib1707
PyTorch Implementation of GPT-2