KernelHeim – development ground of custom Triton and CUDA kernel functions designed to optimize and accelerate machine learning workloads on NVIDIA GPUs. Inspired by the mythical stronghold of the gods, KernelHeim is a forge where high-performance kernels are crafted to unlock the full potential of the hardware.
Stars
3
Forks
0
Watchers
3
Open Issues
4
Overall repository health assessment
No package.json found
This might not be a Node.js project
Merge pull request #9 from debashishc/dependabot/pip/protobuf-5.29.5
287f82cView on GitHubMerge pull request #8 from debashishc/dependabot/pip/setuptools-78.1.1
57c2229View on GitHubfeat(flashattention): implement tiled forward pass in torch
d2aa2efView on GitHubdocs(readme): added gitmessage, corrected typos in resources
61fa75aView on GitHub