Implementing CNN code in CUDA and OpenCL to evaluate its performance on NVIDIA GPUs, AMD GPUs, and an FPGA platform.
Stars
55
Forks
16
Watchers
55
Open Issues
1
Overall repository health assessment
No package.json found
This might not be a Node.js project
58
commits
FPGA implementation seems to work for all three data sets. Need to optimize the data flow for better performance.
fb3f5cdView on GitHubComipling. There are some issues about run time usage of kernel code.
1aba76fView on GitHub