- Add /sc:git command with smart commit functionality - Add /sc:ultra-think command for deep thinking - Update .claude/rules/ with improved documentation: - commands.md: command usage guidelines - doc-management.md: documentation policy - no-extra-docs.md: documentation creation policy - gpu-testing.md: GPU type detection and testing rules - Update .claude/settings.json with claude-flow MCP configuration 这些改进提供了更好的开发体验和工具支持。
527 B
527 B
Commands
Running (with PYTHONPATH)
For multi-instance development, use PYTHONPATH instead of pip install:
# Run example
PYTHONPATH=/path/to/nano-vllm:$PYTHONPATH python example.py
# Run benchmarks
PYTHONPATH=/path/to/nano-vllm:$PYTHONPATH python bench.py
PYTHONPATH=/path/to/nano-vllm:$PYTHONPATH python bench_offload.py
Config Defaults
max_num_batched_tokens: 16384max_num_seqs: 512kvcache_block_size: 4096gpu_memory_utilization: 0.9enforce_eager: False (enables CUDA graphs)