Implement a Pytorch-like DL library in C++ from scratch, step by step
Stars
240
Forks
32
Watchers
240
Open Issues
1
Overall repository health assessment
No package.json found
This might not be a Node.js project
feat(matmul): add 3D x 3D batch matrix multiplication support
9b30ca5View on GitHubfeat(nn): implement Embedding layer with CPU and CUDA support
3a4a289View on GitHubperf(mat): use float4-optimized v7 matmul kernel to reach 89% of PyTorch
1083c6dView on GitHub