nano-vllm/.claude/rules/commands.md
Zijie Tian d1bbb7efe2 chore: update claude configuration and rules from tzj/vs_offload
- Add /sc:git command with smart commit functionality
- Add /sc:ultra-think command for deep thinking
- Update .claude/rules/ with improved documentation:
  - commands.md: command usage guidelines
  - doc-management.md: documentation policy
  - no-extra-docs.md: documentation creation policy
  - gpu-testing.md: GPU type detection and testing rules
- Update .claude/settings.json with claude-flow MCP configuration

These improvements provide a better development experience and better tooling support.
2026-01-18 18:56:49 +08:00

# Commands
## Running (with PYTHONPATH)
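When developing against several checkouts at once, set `PYTHONPATH` to the checkout you want instead of running `pip install`, so that each process imports from its own copy: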
```bash
# Run example
PYTHONPATH=/path/to/nano-vllm:$PYTHONPATH python example.py
# Run benchmarks
PYTHONPATH=/path/to/nano-vllm:$PYTHONPATH python bench.py
PYTHONPATH=/path/to/nano-vllm:$PYTHONPATH python bench_offload.py
```
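To see why `PYTHONPATH` suits multi-checkout work, here is a minimal sketch that builds two throwaway "checkouts" of a toy package and imports each one in a fresh interpreter. The package name `demo_pkg`, the directory names, and the helper functions are hypothetical and exist only for this illustration; nothing here touches nano-vllm itself.

```python
import os
import subprocess
import sys
import tempfile
from pathlib import Path


def make_checkout(root: Path, name: str) -> Path:
    """Create a fake checkout holding a tiny package (hypothetical name: demo_pkg)."""
    checkout = root / name
    pkg = checkout / "demo_pkg"
    pkg.mkdir(parents=True)
    # Each copy records which checkout it lives in.
    (pkg / "__init__.py").write_text(f"WHERE = {name!r}\n")
    return checkout


def run_with_pythonpath(checkout: Path) -> str:
    """Import demo_pkg in a fresh interpreter with `checkout` prepended to PYTHONPATH."""
    env = dict(os.environ)
    existing = env.get("PYTHONPATH", "")
    env["PYTHONPATH"] = str(checkout) + (os.pathsep + existing if existing else "")
    result = subprocess.run(
        [sys.executable, "-c", "import demo_pkg; print(demo_pkg.WHERE)"],
        env=env, capture_output=True, text=True, check=True,
    )
    return result.stdout.strip()


with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    checkout_a = make_checkout(root, "checkout_a")
    checkout_b = make_checkout(root, "checkout_b")
    seen_a = run_with_pythonpath(checkout_a)  # imports the copy in checkout_a
    seen_b = run_with_pythonpath(checkout_b)  # imports the copy in checkout_b
    print(seen_a, seen_b)
```

Because the variable is set per process, two terminals can run different checkouts side by side without reinstalling anything.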
## Config Defaults
- `max_num_batched_tokens`: 16384
- `max_num_seqs`: 512
- `kvcache_block_size`: 4096
- `gpu_memory_utilization`: 0.9
- `enforce_eager`: False (CUDA graphs stay enabled; set to True to disable them)
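The defaults listed above can be pictured as a small dataclass. This is only an illustrative sketch: `ConfigSketch` is a hypothetical stand-in, not the project's real configuration class, and the field types are assumptions inferred from the listed values.

```python
from dataclasses import dataclass


@dataclass
class ConfigSketch:
    """Illustrative stand-in for the defaults listed above (not the real class)."""
    max_num_batched_tokens: int = 16384
    max_num_seqs: int = 512
    kvcache_block_size: int = 4096
    gpu_memory_utilization: float = 0.9
    enforce_eager: bool = False  # False leaves CUDA graphs enabled


defaults = ConfigSketch()

# Override a single field, e.g. to run eagerly while debugging.
eager = ConfigSketch(enforce_eager=True)

# Assuming the block size is measured in tokens, a full batch spans this many blocks.
blocks_per_batch = defaults.max_num_batched_tokens // defaults.kvcache_block_size
print(defaults)
print(blocks_per_batch)
```

Keeping the defaults in one place like this makes it easy to see which values a given run has overridden.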