Found 187 repositories (showing 30)
zhanshijinwat
Train a 1B LLM with 1T tokens from scratch as a personal project
FareedKhan-dev
A straightforward method for training your LLM, from downloading data to generating text.
wei-potato
Train an LLM from scratch using DeepSpeed, through pretraining and SFT stages, to verify the model's ability to learn knowledge, understand language, and answer questions
marimo-team
A companion to CMU professor Zico Kolter's Intro to Modern AI. Learn the basics of machine learning, then train your own LLM from scratch.
KastanDay
Multi-model video-to-text by combining embeddings from Flan-T5 + CLIP + Whisper + SceneGraph. The 'backbone LLM' is pre-trained from scratch on YouTube (YT-1B dataset).
timmzimm
No description available
131AIClub
Implement and train a Tiny LLM from scratch!
Trained a 114-million-parameter LLM from scratch.
LF-Luis
Personal project to learn how to build and pre-train a modern LLM from scratch.
saketd403
Train LLMs such as GPT and LLaMA from scratch.
Emisaber
A simple but fairly complete (though not exhaustive) project to train an LLM from scratch
merterbak
Train LLM from scratch with SOTA techniques like RoPE, GQA and KV caching.
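Of the techniques this repository names, Rotary Position Embeddings (RoPE) are the most self-contained to illustrate. The following is a minimal NumPy sketch, not code from the repository: the `rope` helper and its half-split channel pairing are illustrative assumptions, and shapes are chosen for clarity rather than realism.

```python
import numpy as np

def rope(x, base=10000.0):
    """Apply rotary position embeddings to x of shape (seq_len, dim), dim even.

    Pairs of channels are rotated by position-dependent angles, so dot
    products between query/key vectors depend only on relative position.
    """
    seq_len, dim = x.shape
    half = dim // 2
    # One frequency per channel pair, geometrically spaced (as in the RoPE paper).
    inv_freq = 1.0 / (base ** (np.arange(half) / half))
    angles = np.outer(np.arange(seq_len), inv_freq)  # (seq_len, half)
    cos, sin = np.cos(angles), np.sin(angles)
    x1, x2 = x[:, :half], x[:, half:]
    # 2-D rotation applied to each (x1, x2) channel pair.
    return np.concatenate([x1 * cos - x2 * sin, x1 * sin + x2 * cos], axis=-1)
```

Because each pair is only rotated, the transformation preserves vector norms, and position 0 (angle zero) is left unchanged.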
adriablancafort
A single script to train an LLM from scratch for less than $100 and run inference through a ChatGPT-like Web Interface or a CLI.
viralcode
Train your own LLM from scratch
minpeter
🦐 A minimal pretraining trainer for LLMs — from scratch.
vvasylkovskyi
An AI LLM project, trained from scratch, that barks
VachanVY
Generative Pretrained Model (GPT) in JAX. A step-by-step guide to training LLMs on large datasets from scratch
AIAnytime
Train a Hindi LLM from scratch in pure PyTorch.
Tuziking
A project to pre-train a small LLM from scratch
Usman-Rafique
LLM Forge: Experimental playground for building and training practical LLMs with limited compute resources. Pre-train from scratch, fine-tune, and generate text using modular GPT-style models
akshatjain07065
This repository contains a small-scale Transformer-based LLM built from scratch using PyTorch and Hugging Face. The model is trained on Wikipedia data and deployed via FastAPI.
petermartens98
Lightweight LLM inspired by Qwen3, built from scratch in PyTorch. Full training pipeline with transformer components including RMSNorm, Rotary Position Embeddings (RoPE), Grouped-Query Attention (GQA), and SwiGLU layers. Trained with hybrid Muon + AdamW optimizer, causal masking, efficient batching, and evaluation tools.
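Two of the components listed above, RMSNorm and SwiGLU, can be sketched in a few lines. This is an assumed NumPy rendering for illustration, not the repository's PyTorch code; the weight shapes and function names are hypothetical.

```python
import numpy as np

def rms_norm(x, weight, eps=1e-6):
    # Scale by the reciprocal root-mean-square of the last axis;
    # unlike LayerNorm, no mean subtraction and no bias term.
    rms = np.sqrt(np.mean(x ** 2, axis=-1, keepdims=True) + eps)
    return x / rms * weight

def swiglu(x, w_gate, w_up, w_down):
    # Gated feed-forward block: SiLU(x @ w_gate) elementwise-gates
    # (x @ w_up), then w_down projects back to the model dimension.
    gate = x @ w_gate
    silu = gate / (1.0 + np.exp(-gate))  # SiLU(z) = z * sigmoid(z)
    return (silu * (x @ w_up)) @ w_down
```

With `weight` set to ones, `rms_norm` output has RMS ≈ 1 per row, which is the invariant the layer exists to enforce.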
mayurnayak1705
No description available
CastorYu
A simple script for training your own hybrid LLM (using an autoregressive model for drafting and a diffusion model for refining).
vukrosic
Code and train an Anthropic Claude-style LLM from scratch.
nabinkhair42
~58M parameter transformer LLM trained from scratch for Git/GitHub developer assistance.
Bilal-Ahmad6
An LLM built from scratch for next-word prediction, trained with self-defined weights.
Jayesh-Dev21
AI web application that trains and builds an LLM from scratch on a custom dataset to assist with fraud detection and portfolio management
fayazkhan121
This is the official UI for UrduGPT — a custom-built English → Urdu translator powered by a Transformer-based LLM trained from scratch using PyTorch.
This project implements an LLM from scratch, using a transformer architecture similar to GPT-2's. The model is trained on a customer-service dataset.