PicoGPT

A hands-on fork of NanoGPT with FlashAttention-2 CUDA kernels, INT8/INT4 GPTQ quantization, paged KV-cache reuse, and continuous batching, turning a tiny Shakespeare model into a full-speed GPU LLM inference demo.

Created on May 3, 2025

Updated on May 7, 2025

Stars

1

Forks

0

Watchers

1

Open Issues

0

Repository Health Score

❤️

35/100

Poor

Overall repository health assessment

Score Breakdown

Activity

Inactive - no updates in 3+ months

0/30

0%

Recent Commits

Update README.md

I-Hsuan(Ethan) Huang•11 months ago

9186d21View on GitHub

initial update

I-Hsuan Huang•11 months ago

45643c8View on GitHub

Update README.md

I-Hsuan(Ethan) Huang•11 months ago

2e0a3bbView on GitHub

Initial commit

I-Hsuan(Ethan) Huang•11 months ago

95cae4aView on GitHub

View all commits

Community

1 stars, 0 forks

0/30

0%

Documentation

Has description, wiki

15/20

75%

Maintenance

0.0% issue ratio

20/20

100%

Health score is calculated based on activity, community engagement, documentation quality, and maintenance practices

Languages

Python

77.3%

C++

22.7%

Dependencies

No package.json found

This might not be a Node.js project

Top Contributors

1

EthanCornell

User

4

commits

Languages

Python

77.3%

C++

22.7%

Dependencies

No package.json found

This might not be a Node.js project

Top Contributors

1

EthanCornell

User

4

commits

Recent Commits

Update README.md

I-Hsuan(Ethan) Huang•11 months ago

9186d21View on GitHub

initial update

I-Hsuan Huang•11 months ago

45643c8View on GitHub

Update README.md

I-Hsuan(Ethan) Huang•11 months ago

2e0a3bbView on GitHub

Initial commit

I-Hsuan(Ethan) Huang•11 months ago

95cae4aView on GitHub

View all commits

GitHub Explorer

PicoGPT

Score Breakdown