Back to search
Running large language models on a single M1/M2 GPU for throughput-oriented scenarios.
Stars
0
Forks
0
Watchers
0
Open Issues
0
Overall repository health assessment
No package.json found
This might not be a Node.js project
11
commits
Added Python script to profile bandwidth across CPU, GPU and disk.
e9aec0bView on GitHubAdded the core logic to implement tensor computations with PyTorch.
6c76fe1View on GitHubAdded Python code for OPT models configuration and download.
cde8becView on GitHubAdded the Python script to benchmark matrix multiplications.
a86de59View on GitHub