Add --model, --gpu-util, and --enforce-eager arguments for flexible vLLM benchmarking comparisons.

Generated with [Claude Code](https://claude.ai/code)
via [Happy](https://happy.engineering)

Co-Authored-By: Claude <noreply@anthropic.com>
Co-Authored-By: Happy <yesreply@happy.engineering>
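
For illustration only, here is a minimal sketch of how the three flags could be declared with `argparse` and mapped onto vLLM engine keyword arguments. The actual benchmark script is not shown in this commit message, so the function names `build_parser` and `to_engine_kwargs`, the help texts, and the default values are all assumptions. The sketch also assumes that `--gpu-util` corresponds to vLLM's `gpu_memory_utilization` setting and `--enforce-eager` to its `enforce_eager` setting.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Declare the three benchmark flags (hypothetical defaults and help text)."""
    parser = argparse.ArgumentParser(description="vLLM benchmark (illustrative sketch)")
    parser.add_argument(
        "--model",
        type=str,
        default="facebook/opt-125m",  # assumed default, not taken from the commit
        help="Model name or path to benchmark.",
    )
    parser.add_argument(
        "--gpu-util",
        type=float,
        default=0.9,  # assumed default, not taken from the commit
        help="Fraction of GPU memory the engine may use (0 to 1).",
    )
    parser.add_argument(
        "--enforce-eager",
        action="store_true",
        help="Run in eager mode instead of capturing CUDA graphs.",
    )
    return parser


def to_engine_kwargs(args: argparse.Namespace) -> dict:
    """Translate parsed flags into keyword arguments for the engine constructor."""
    return {
        "model": args.model,
        "gpu_memory_utilization": args.gpu_util,  # argparse turns --gpu-util into gpu_util
        "enforce_eager": args.enforce_eager,
    }


if __name__ == "__main__":
    cli_args = build_parser().parse_args()
    print(to_engine_kwargs(cli_args))
```

Under these assumptions, the resulting dictionary could be unpacked into the engine constructor (for example `vllm.LLM(**kwargs)`), so two runs that differ only in one flag can be compared directly.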