Search Results

Found 39 repositories(showing 30)

gpt2-ml-torch

ghosthamlet

🧡55

Pytorch model for https://github.com/imcaspar/gpt2-ml

Apache-2.0

Python

Updated 2 weeks ago

GPT-2-PyTorch

iyaja

❤️35

A companion repository for the GPT-2 article on the FloydHub blog.

Jupyter Notebook

Updated 1 year ago

torch-gpt-2

CyberZHG

❤️35

Load GPT-2 checkpoint and generate texts in PyTorch

MIT

Python

Updated 2 years ago

gpt-2language-modelnlp+1

gpt2_torch

gzroy

❤️35

Pytorch implementation for gpt2

Python

Updated 1 year ago

gpt2_torch_nucleus

BenjaminWegener

❤️40

text generation using GPT2

MIT

Jupyter Notebook

Updated 2 years ago

NanoGPT

ngocthinh09

🧡65

A from-scratch implementation of GPT-2 built for learning Transformer architectures. Optimized with DDP, Flash Attention, and torch.compile.

Python

Updated 3 days ago

gpt2transformer-decoder

demo-torch-gpt2

CaptainJa

❤️25

No description available

Python

Updated 2 years ago

micrograd_rs

cyyeh

❤️40

tiny torch-like engine implemented in Rust with Python bindings via PyO3 using Cursor agent mode(Opus 4.5, GPT 5.2)

MIT

Rust

Updated 3 months ago

PyTorch-gpt2

ParikshitGehlaut

❤️40

PyTorch implementation of GPT-2 124M model along with training script

MIT

Python

Updated 5 months ago

Reproduced the GPT-2 124M parameter LLM from scratch using PyTorch, referencing ”At tention is All You Need” and GPT papers. Optimized performance with Flash Attention, Torch Compile, Gradient Accumulation, and DDP for multi-GPU training. Evaluated on the Hellaswag dataset.

Python

Updated 1 year ago

gpt2

warrenzha

❤️40

GPT-2 Torch.

MIT

Jupyter Notebook

Updated 1 year ago

LLM_from_scratch

samnet

❤️35

Implementation of GPT 2.0 using Torch

Jupyter Notebook

Updated 7 months ago

GPT2-w-torch

S4vyss

❤️25

No description available

Jupyter Notebook

Updated 1 year ago

GPT2-PyTorch

JulianSprung

❤️35

Pytorch implementation of GPT-2

MIT

Jupyter Notebook

Updated 11 months ago

torch-basic-lm

shamashel

❤️40

Basic language model using torch, based on gpt-2

MIT

Python

Updated 1 year ago

gpt2-based-gpt3-torch-serving

gkswjdzz

❤️25

No description available

Dockerfile

Updated 5 years ago

PyTorch_finetune_GPT2

AnshDhalla1

❤️25

No description available

Python

Updated 7 months ago

PyTorch_finetuning_GPT2

AnshDhalla1

❤️25

No description available

Updated 7 months ago

gpt2-pure-PyTorch

VantaTomat

❤️45

Minimal GPT-2 inference in pure PyTorch (no transformers, no safetensors)

Python

Updated 2 months ago

gpt2_torch_activation_insights

habanoz

❤️25

No description available

Jupyter Notebook

Updated 1 year ago

GPT2-from-Scratch-PyTorch

Parsagh05

❤️20

No description available

Jupyter Notebook

Updated 5 months ago

GPT-2-PyTorch

nafisadipra

❤️30

No description available

MIT

Python

Updated 1 year ago

GPT2-Token-Embedding-with-PyTorch

SehbazSingh

❤️35

This repository demonstrates how to use the GPT-2 tokenizer from OpenAI's `tiktoken` library to tokenize text data, and then apply a simple PyTorch embedding layer on the tokenized input.

Python

Updated 10 months ago

gpt-2-large-torch-serving

gkswjdzz

❤️25

No description available

Dockerfile

Updated 5 years ago

Transformers-from-Scratch-BERT-and-GPT2-in-PyTorch

Doris-QZ

🧡50

No description available

MIT

Jupyter Notebook

Updated 3 weeks ago

GPTuesday_PyTorchFun_ML_C2_Wk2

thaddavis

❤️25

No description available

Python

Updated 1 year ago

miniGPT2-from-scratch-using-pyTorch

mmtq

❤️35

A minimalist implementation of the GPT-2 architecture built entirely from scratch using PyTorch.

MIT

Python

Updated 10 months ago

gpt2llmneural-networks+1

GPTuesday_PyTorchFun_ML_C2_Wk1

thaddavis

❤️25

No description available

Python

Updated 1 year ago

GPTuesday_PyTorchFun_ML_C2_Wk3

thaddavis

❤️25

No description available

Python

Updated 1 year ago

GPT-2-124M-From-Scratch-in-PyTorch

KshitijK288

❤️30

This project implements the GPT-2 (124M) transformer model entirely from scratch using PyTorch, including custom multi-head causal self-attention, LayerNorm, training loop, and text generation. It also supports loading and running OpenAI’s pretrained GPT-2 weights, enabling both training from scratch and pretrained inference.

Jupyter Notebook

Updated 3 months ago

GitHub Explorer

Search Results

gpt2-ml-torch

GPT-2-PyTorch

torch-gpt-2

gpt2_torch

gpt2_torch_nucleus

NanoGPT

demo-torch-gpt2

micrograd_rs

PyTorch-gpt2

GPT2-124M-NanoGPT

gpt2

LLM_from_scratch

GPT2-w-torch

GPT2-PyTorch

torch-basic-lm

gpt2-based-gpt3-torch-serving

PyTorch_finetune_GPT2

PyTorch_finetuning_GPT2

gpt2-pure-PyTorch

gpt2_torch_activation_insights

GPT2-from-Scratch-PyTorch

GPT-2-PyTorch

GPT2-Token-Embedding-with-PyTorch

gpt-2-large-torch-serving

Transformers-from-Scratch-BERT-and-GPT2-in-PyTorch

GPTuesday_PyTorchFun_ML_C2_Wk2

miniGPT2-from-scratch-using-pyTorch

GPTuesday_PyTorchFun_ML_C2_Wk1

GPTuesday_PyTorchFun_ML_C2_Wk3

GPT-2-124M-From-Scratch-in-PyTorch

gpt2-ml-torch

GPT-2-PyTorch

torch-gpt-2

gpt2_torch

gpt2_torch_nucleus

NanoGPT

demo-torch-gpt2

micrograd_rs

PyTorch-gpt2

GPT2-124M-NanoGPT

gpt2

LLM_from_scratch

GPT2-w-torch

GPT2-PyTorch

torch-basic-lm

gpt2-based-gpt3-torch-serving

PyTorch_finetune_GPT2

PyTorch_finetuning_GPT2

gpt2-pure-PyTorch

gpt2_torch_activation_insights

GPT2-from-Scratch-PyTorch

GPT-2-PyTorch

GPT2-Token-Embedding-with-PyTorch

gpt-2-large-torch-serving

Transformers-from-Scratch-BERT-and-GPT2-in-PyTorch

GPTuesday_PyTorchFun_ML_C2_Wk2

miniGPT2-from-scratch-using-pyTorch

GPTuesday_PyTorchFun_ML_C2_Wk1

GPTuesday_PyTorchFun_ML_C2_Wk3

GPT-2-124M-From-Scratch-in-PyTorch