CUDA C reduction kernels benchmarking with Triton, PyTorch and CUB primitives
Stars
5
Forks
1
Watchers
Open Issues
0
Overall repository health assessment
No package.json found
This might not be a Node.js project
User
11
commits
adding data and plots
5214373
adding updated notebook
2f041d1
bug fix
82104eb
tuned
6f2c05a
adding hybrid cub
964ee05
fixing colab
53b6704
updating triton
9472ae6
adding ILP4
fbdb3eb
adjustments to reduce overheads
5fc6b33
adding analysis
e6d4766
Initial commit
19eac8e