Found 5,676 repositories (showing 30)
triton-lang
Development repository for the Triton language and compiler
triton-inference-server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Efficient Triton Kernels for LLM Training
JonathanSalwan
Triton is a dynamic binary analysis library. Build your own program analysis tools, automate your reverse engineering, perform software verification or just emulate code.
thu-ml
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.
gpu-mode
Puzzles for learning Triton
TritonHo
A repository storing all slides used in Triton Ho's public presentations and courses.
ELS-RD
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
ByteDance-Seed
Distributed Compiler based on Triton for Parallel Systems
TritonDataCenter
Triton DataCenter: a cloud management platform with first-class support for containers.
TritonDataCenter
A service for autodiscovery and configuration of applications running in containers
fla-org
Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"
flagos-ai
FlagGems is an operator library for large language models implemented in the Triton Language.
triton-inference-server
The Triton TensorRT-LLM Backend
rpcpool
Triton's Dragon's Mouth Yellowstone gRPC service for high-performance Solana streaming
RightNow-AI
Autoresearch for GPU kernels. Give it any PyTorch model, go to sleep, wake up to optimized Triton kernels.
JonathanSalwan
Playing with the Tigress software protection. Break some of its protections and solve their reverse engineering challenges. Automatic deobfuscation using symbolic execution, taint analysis and LLVM.
triton-inference-server
PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.
triton-inference-server
This repository contains tutorials and examples for Triton Inference Server
coderonion
A collection of awesome public projects about Large Language Models (LLM), Vision Language Models (VLM), Vision Language Action (VLA), AI-Generated Content (AIGC), and related datasets and applications.
0xSero
TurboQuant: Near-optimal KV cache quantization for LLM inference (3-bit keys, 2-bit values) with Triton kernels + vLLM integration
triton-inference-server
Triton Python, C++, and Java client libraries, and gRPC-generated client examples for Go, Java, and Scala.
triton-inference-server
Triton backend that enables pre-processing, post-processing, and other logic to be implemented in Python.
SiriusNEO
Puzzles for learning Triton, play it with minimal environment configuration!
BobMcDear
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
HKUSTDial
Trainable fast and memory-efficient sparse attention
JafarAkhondali
Linux kernel module to support Turbo mode and RGB Keyboard for Acer Predator notebook series
triton-inference-server
Triton Model Analyzer is a CLI tool that helps you understand the compute and memory requirements of Triton Inference Server models.
66RING
Flash Attention tutorial written in Python, Triton, CUDA, and CUTLASS
rkinas
A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.