Stars: 3 | Forks: 1 | Watchers: 3 | Open Issues: 0
Overall repository health assessment: no package.json found, so this is likely not a Node.js project.
19 commits
b3b8bfb  feat: :sparkles: add Zero3Layer class for efficient parameter management in distributed training
660d881  feat: add LoRA modules with linear and embedding layers for efficient parameterization
a35351b  feat: add FeedbackAttention module with support for key-value precomputation and positional embeddings
85a0a42  feat: add GLU variants implementation with Tiny Shakespeare dataset and training framework
d47fe7a  feat: :zap: add GPT model implementation with custom optimizer and training configurations
6c95195  feat: add CompressiveTransformer and related classes for enhanced memory compression in transformer models
48e1bd9  feat: add BERTChunkEmbeddings and RetroIndex for enhanced text processing and embedding retrieval
3010ef5  feat: :sparkles: implement Attention with Linear Biases (ALiBi) for input length extrapolation
9fe182f  feat: add Rotary Position Embedding and RotaryPEMultiHeadAttention classes
96a9563  feat: add Relative Multi-Headed Attention implementation with shift functionality
91597ad  feat: enhance TransformerXL with improved forward method and layer normalization
b54585f  docs: add reference link for Transformer XL attention span explanation
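Commit 3010ef5 implements Attention with Linear Biases (ALiBi). The repository's own code is not shown here, so the following is only a minimal sketch of the published ALiBi idea, assuming the standard formulation: each attention head gets a fixed slope from a geometric sequence, and attention logits are penalized linearly by query-key distance.

```python
def alibi_slopes(n_heads):
    # Per-head slopes from the ALiBi paper: for n_heads a power of two,
    # the slopes are 2^(-8*i / n_heads) for i = 1 .. n_heads.
    start = 2.0 ** (-8.0 / n_heads)
    return [start ** i for i in range(1, n_heads + 1)]

def alibi_bias(seq_len, slope):
    # Causal bias added to attention logits before softmax:
    # position i attends to j <= i with penalty -slope * (i - j).
    return [[-slope * (i - j) for j in range(i + 1)] for i in range(seq_len)]
```

Because the bias depends only on distance, a model trained at one sequence length can extrapolate to longer inputs, which matches the commit message's "input length extrapolation".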
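Commit 9fe182f adds Rotary Position Embedding (RoPE) classes. Without the repository source, here is a hedged sketch of the standard RoPE transform, assuming a NumPy implementation: consecutive feature pairs are rotated by position-dependent angles, so relative position is encoded in the inner product of rotated queries and keys.

```python
import numpy as np

def rotary_embed(x, base=10000.0):
    # x: (seq_len, dim) with even dim. Rotate each pair (x[2i], x[2i+1])
    # at position p by angle p * base^(-2i / dim).
    seq_len, dim = x.shape
    half = dim // 2
    inv_freq = base ** (-np.arange(half) * 2.0 / dim)         # (half,)
    angles = np.arange(seq_len)[:, None] * inv_freq[None, :]  # (seq_len, half)
    cos, sin = np.cos(angles), np.sin(angles)
    x1, x2 = x[:, 0::2], x[:, 1::2]
    out = np.empty_like(x)
    out[:, 0::2] = x1 * cos - x2 * sin
    out[:, 1::2] = x1 * sin + x2 * cos
    return out
```

The rotation is norm-preserving and leaves position 0 unchanged; in a full RotaryPEMultiHeadAttention module it would be applied to queries and keys before the attention score computation.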
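Commit 660d881 adds LoRA modules with linear and embedding layers. The repository's classes are not shown here, so this is only an illustrative sketch of the LoRA idea for the linear case, with all names hypothetical: the frozen base weight is augmented by a trainable low-rank update B @ A, scaled by alpha / r, with B initialized to zero so training starts from the base model exactly.

```python
import numpy as np

class LoRALinear:
    # Hypothetical minimal LoRA linear layer (not the repo's class):
    # y = x @ W.T + (x @ A.T) @ B.T * (alpha / r)
    def __init__(self, in_features, out_features, r=4, alpha=8, seed=0):
        rng = np.random.default_rng(seed)
        self.weight = rng.standard_normal((out_features, in_features)) * 0.02  # frozen base weight
        self.A = rng.standard_normal((r, in_features)) * 0.01  # low-rank factor A (trainable)
        self.B = np.zeros((out_features, r))  # B starts at zero, so the LoRA path is initially a no-op
        self.scale = alpha / r

    def __call__(self, x):
        base = x @ self.weight.T
        lora = (x @ self.A.T) @ self.B.T * self.scale
        return base + lora
```

Only A and B would be updated during fine-tuning, which is what makes the parameterization "efficient" in the commit message's sense: r * (in + out) trainable values instead of in * out.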