Found 24 repositories(showing 24)
Victarry
No description available
ruipeterpan
Simple PyTorch profiler that combines DeepSpeed Flops Profiler and TorchInfo
yichiche
A tool to parse PyTorch profiler trace files for kernel-level analysis.
zovonoir
A toolkit to parse torch profiler data source and produce spreadsheet
Bobchenyx
LLM Profiling with DeepSpeed Flops Profiler & Torch Profiler
aribornstein
Example of How to Use the PyTorch Profiler with PyTorch Lightning
conda-forge
A conda-smithy repository for torch-tb-profiler.
Livinfly
TorchProf by torch.profiler
Bot1822
No description available
NathanGrimaud
No description available
dhpitt
A guide showing a little hack to get more info out of the torch profiler.
havill
Runs and collects performance and power profiling data for various MIMD optimized CPU/GPU/NPUs
shagunsodhani
No description available
cseduashraful
No description available
Staisha-N
An example of how to use the PyTorch profiler to analyze the performance of a machine learning algorithm.
HansBug
[WIP] Torch Profiler (Based on Verl Project)
James9Luo
One-stop PyTorch GPU profiling & optimization skill|Detect memory leaks, speed up training, fix device mismatches, and maximize GPU utilization Works with Cline / Claude / Continue.dev / VSCode AI|Out-of-the-box|CLI one-click analysis
KasterMist
This is based on the torch_tb_profiler source code and fix some bugs.
bquast
Operation-level profiler for Apple Silicon / MLX. The missing torch.profiler equivalent for the MLX ecosystem.
shivareddy42
GPU Training Pipeline Profiler — PyTorch + mixed precision + torch.compile optimization benchmarks
gfyi1026
CPU vs GPU performance benchmarking using PyTorch, Torch Profiler, and NVIDIA NVBandwidth
rajk97
Profile-driven PyTorch inference optimization: 15.65x speedup on ResNet-50 (RTX 4090). Demonstrates torch.profiler, channels_last, FP16, torch.compile.
A performance profiling study of LLM fine-tuning pipelines. Analyzes DataLoader bottlenecks, CUDA synchronization, and torch.compile (Inductor) speedups using PyTorch Profiler and Weights & Biases.
islamelkadi
GPU profiling baseline for PyTorch training on Amazon EKS Auto Mode — ResNet-18 on CIFAR-10 with torch.profiler, Terraform-provisioned infrastructure, and Kubernetes Job manifests.
All 24 repositories loaded