Zijie Tian
|
f5682ca4a7
|
🔧 chore: add GPU-only profiling script
Add scripts/profile.sh for nsys profiling of GPU-only mode benchmarks.
Usage:
bash scripts/profile.sh # Default: 32K xattn prefill
bash scripts/profile.sh --max-len 65536 --gpu-util 0.7
bash scripts/profile.sh --policy full
bash scripts/profile.sh --bench-decode
Output: results/nsys/bench_<policy>_<len>_<mode>_<timestamp>.nsys-rep
Generated with [Claude Code](https://claude.ai/code)
via [Happy](https://happy.engineering)
Co-Authored-By: Claude <noreply@anthropic.com>
Co-Authored-By: Happy <yesreply@happy.engineering>
|
2026-01-27 05:55:31 +08:00 |
|