GitHub Explorer

by Alexey Ratnikov

GitHub Explorer

GitHub Explorer|TRENDING COMPARE|FEEDBACK

Back to search

malibayram/llm-from-scratch - GitHub Explorer | GitHub Explorer | Trending | Compare

Back to search

llm-from-scratch

malibayram•PUBLIC

View on GitHub

Mastering Large Language Models: Build Your Own LLM from Scratch

Created on Jun 5, 2025

Updated on Apr 2, 2026

Stars

206

Forks

Watchers

206

Open Issues

Repository Health Score

🧡

65/100

Fair

Overall repository health assessment

Score Breakdown

Activity

Active development - updated this week

30/30

100%

Issues Analytics

Total Issues

All time

Open

91% of total

Closed

Recent Commits

Implement top-p filtering and enhance the generate method in UstaModel for improved token sampling. Adjust parameters for temperature, top_k, and top_p to refine output generation process.

malibayram•9 months ago

9b5bafcView on GitHub

Update execution counts in module_3_2.ipynb for consistent notebook flow and enhance model evaluation output with updated training metrics. Adjust model loading to include device specification for improved compatibility.

malibayram•9 months ago

bfec4f3View on GitHub

second version of the codes prepared

malibayram•9 months ago

467e7caView on GitHub

Implement automatic model download in app.py and update requirements.txt for package version compatibility. Replace local model path with a download method for u_model.pth and upgrade Gradio and Torch versions.

malibayram•9 months ago

e367670View on GitHub

Update model path in app.py for consistency and adjust execution counts in demo.ipynb for accurate notebook flow. Revise readme.md to reflect course title change and expand course details, including learning outcomes and module descriptions.

malibayram•9 months ago

7119c19View on GitHub

Add Gradio interface for Usta Model, including model loading, chat functionality, and example prompts. Create README_HF.md for Hugging Face integration and update requirements.txt. Remove unused files and refactor tokenizer and model structure for improved performance.

malibayram•9 months ago

34cc940View on GitHub

Enhance module_3_2.ipynb by adding model evaluation code, updating training epochs to 1,000,000, and implementing model saving/loading functionality. Update readme.md to include a link for running the model in Colab. Refactor UstaModel to utilize UstaEmbedding for improved embedding management.

malibayram•9 months ago

75ede35View on GitHub

Update module_pytorch_train.ipynb to correct training data indexing, adjust model architecture by simplifying the neural network layers, and enhance output logging with updated accuracy and loss metrics for improved training feedback.

malibayram•9 months ago

4303bdbView on GitHub

Update .gitignore to exclude 'data/' directory and enhance module_3_1.ipynb with additional code cells for model loading and tokenization examples, including execution count adjustments and output updates.

malibayram•9 months ago

ae32defView on GitHub

Refactor UstaModel to incorporate UstaDecoderBlock and update architecture to support multiple layers and linear output head

malibayram•9 months ago

a8d5742View on GitHub

Refactor UstaCausalAttention to UstaMultiHeadAttention and update UstaModel to utilize multi-head attention mechanism with context length support

malibayram•9 months ago

016a713View on GitHub

Add UstaCausalAttention class and update model to use causal attention mechanism

malibayram•9 months ago

29c1061View on GitHub

Refactor text_dataset.py to include DataLoader and remove tokenizer.py