Found 10 repositories (showing 10)
xmu-xiaoma666
🍀 PyTorch implementations of various attention mechanisms, MLPs, re-parameterization, and convolution modules, helpful for further understanding the papers. ⭐⭐⭐
shuuchen
A PyTorch implementation of external attention.
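The external-attention idea this repository implements (from the "Beyond Self-Attention: External Attention Using Two Linear Layers" paper) replaces the query-key-value interaction with two small learnable memories shared across all samples. A minimal NumPy sketch of the mechanism, with illustrative shapes and randomly initialized memories (all names here are assumptions, not taken from the repository):

```python
import numpy as np

def external_attention(x, mk, mv):
    """External attention for a (N, d) token sequence.

    mk: (S, d) key memory, mv: (S, d) value memory, S memory slots.
    Applies the paper's double normalization: softmax over the token
    axis, then L1 normalization over the memory-slot axis.
    """
    attn = x @ mk.T                                  # (N, S) similarities
    attn = np.exp(attn - attn.max(axis=0, keepdims=True))
    attn = attn / attn.sum(axis=0, keepdims=True)    # softmax over tokens
    attn = attn / attn.sum(axis=1, keepdims=True)    # L1-normalize over slots
    return attn @ mv                                 # (N, d) output

rng = np.random.default_rng(0)
N, d, S = 6, 16, 4                                   # tokens, dim, slots
x = rng.standard_normal((N, d))
out = external_attention(x, rng.standard_normal((S, d)),
                         rng.standard_normal((S, d)))
print(out.shape)                                     # (6, 16)
```

Because the memories are parameters rather than per-sample projections, the cost is linear in sequence length instead of quadratic.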
lyp2333
No description available
AriM2001
Goal: Benchmark attention/MLP layers on CPU, GPU, and Apple M-series. Deliverable: Latency, FLOPs utilization, memory bottlenecks. Grading: Benchmark correctness (30%), Comparison depth (40%), Report (30%). Reference: pytorch/examples
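The latency and FLOPs-utilization measurements this assignment asks for can be sketched with nothing but the standard library and NumPy. The snippet below times a matrix multiply with warm-up iterations, then reports mean latency and achieved GFLOP/s; sizes and iteration counts are illustrative assumptions, not the assignment's settings:

```python
import time
import numpy as np

def bench_matmul(n=512, warmup=3, iters=10):
    """Time an (n, n) @ (n, n) float32 matmul; return (latency_s, gflops)."""
    a = np.random.rand(n, n).astype(np.float32)
    b = np.random.rand(n, n).astype(np.float32)
    for _ in range(warmup):                  # warm caches and BLAS threads
        a @ b
    t0 = time.perf_counter()
    for _ in range(iters):
        a @ b
    latency = (time.perf_counter() - t0) / iters
    gflops = 2 * n**3 / latency / 1e9        # 2*n^3 FLOPs per matmul
    return latency, gflops

lat, gf = bench_matmul()
print(f"{lat * 1e3:.2f} ms/iter, {gf:.1f} GFLOP/s")
```

Comparing the achieved GFLOP/s against the hardware's peak gives the FLOPs-utilization figure; on GPU the same loop needs an explicit device synchronization before each timestamp or the timings measure only kernel launch.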
yanyuze123
No description available
yangwusi
No description available
zhonghaochang
No description available
Developed a custom 211,406-parameter autoregressive language model from scratch in PyTorch, implementing core components such as multi-head self-attention, positional embeddings, and layer normalization. Built a character-level tokenizer, custom training loop, and data pipeline using NumPy and PyTorch, with zero external NLP libraries.
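A character-level tokenizer of the kind described is only a few lines of pure Python; this sketch (class and method names are illustrative, not from the project) builds its vocabulary directly from the training text:

```python
class CharTokenizer:
    """Map each distinct character in a corpus to an integer id."""

    def __init__(self, text):
        self.chars = sorted(set(text))                       # vocabulary
        self.stoi = {c: i for i, c in enumerate(self.chars)}  # char -> id
        self.itos = {i: c for c, i in self.stoi.items()}      # id -> char

    def encode(self, s):
        return [self.stoi[c] for c in s]

    def decode(self, ids):
        return "".join(self.itos[i] for i in ids)

tok = CharTokenizer("hello world")
ids = tok.encode("hello")
print(ids, tok.decode(ids))  # decode round-trips back to "hello"
```

The trade-off is a tiny vocabulary at the cost of long sequences, which is why character-level tokenization is common in small from-scratch models like this one.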
Akhan521
🧸 A fully custom GPT-style language model built from scratch using PyTorch and trained on Winnie-the-Pooh! Explored the core mechanics of self-attention, autoregressive text generation, and modular model training, all without relying on any external libraries.
psychias
MiniGPT is a lightweight, self-contained implementation of a GPT-style transformer built entirely in PyTorch, without relying on external deep learning libraries like transformers. This project serves as an educational tool for understanding the inner workings of transformers, self-attention, and language modeling.
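The causally masked self-attention at the heart of all three of these from-scratch GPT projects fits in a short function. A single-head NumPy sketch, with illustrative shapes and weight names (not taken from any of the repositories):

```python
import numpy as np

def causal_self_attention(x, wq, wk, wv):
    """Single-head causal self-attention for a (T, d) token sequence."""
    T, d = x.shape
    q, k, v = x @ wq, x @ wk, x @ wv
    scores = q @ k.T / np.sqrt(d)             # (T, T) scaled dot products
    mask = np.triu(np.ones((T, T), dtype=bool), k=1)
    scores[mask] = -np.inf                    # block attention to the future
    scores = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights = scores / scores.sum(axis=-1, keepdims=True)  # row-wise softmax
    return weights @ v                        # (T, d) attended output

rng = np.random.default_rng(0)
T, d = 5, 8
x = rng.standard_normal((T, d))
wq, wk, wv = (rng.standard_normal((d, d)) for _ in range(3))
out = causal_self_attention(x, wq, wk, wv)
print(out.shape)                              # (5, 8)
```

The upper-triangular mask is what makes generation autoregressive: position t can only attend to positions 0..t, so the model can be sampled one token at a time.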