Add scripts/profile.sh for nsys profiling of GPU-only mode benchmarks.

Usage:
  bash scripts/profile.sh                                  # Default: 32K xattn prefill
  bash scripts/profile.sh --max-len 65536 --gpu-util 0.7
  bash scripts/profile.sh --policy full
  bash scripts/profile.sh --bench-decode

Output: results/nsys/bench_<policy>_<len>_<mode>_<timestamp>.nsys-rep

Generated with [Claude Code](https://claude.ai/code)
via [Happy](https://happy.engineering)

Co-Authored-By: Claude <noreply@anthropic.com>
Co-Authored-By: Happy <yesreply@happy.engineering>
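
The output naming pattern above can be illustrated with a small sketch. This is not the code in scripts/profile.sh; the function name `build_report_path`, the argument order, the timestamp format, and the use of a raw token count for `<len>` are all assumptions made for illustration.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of the report naming scheme
#   results/nsys/bench_<policy>_<len>_<mode>_<timestamp>.nsys-rep
# Names and formats here are assumptions, not taken from scripts/profile.sh.
set -euo pipefail

# Assemble the report path from its four components.
build_report_path() {
    local policy="$1"     # e.g. "xattn" or "full"
    local max_len="$2"    # e.g. 32768 (assumed to be a raw token count)
    local mode="$3"       # e.g. "prefill" or "decode"
    local timestamp="$4"  # e.g. 20250101_120000 (assumed format)
    printf 'results/nsys/bench_%s_%s_%s_%s.nsys-rep\n' \
        "$policy" "$max_len" "$mode" "$timestamp"
}

# Example: a path for the default-style run, stamped with the current time.
build_report_path xattn 32768 prefill "$(date +%Y%m%d_%H%M%S)"
```

Keeping the timestamp as an explicit argument makes the naming deterministic and easy to check; a real script would more likely compute it once near the top and pass the resulting path to the profiler.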
4.5 KiB
Executable File