- Add /sc:git command with smart commit functionality - Add /sc:ultra-think command for deep thinking - Update .claude/rules/ with improved documentation: - commands.md: command usage guidelines - doc-management.md: documentation policy - no-extra-docs.md: documentation creation policy - gpu-testing.md: GPU type detection and testing rules - Update .claude/settings.json with claude-flow MCP configuration 这些改进提供了更好的开发体验和工具支持。
23 lines
527 B
Markdown
23 lines
527 B
Markdown
# Commands
|
|
|
|
## Running (with PYTHONPATH)
|
|
|
|
For multi-instance development, use PYTHONPATH instead of pip install:
|
|
|
|
```bash
|
|
# Run example
|
|
PYTHONPATH=/path/to/nano-vllm:$PYTHONPATH python example.py
|
|
|
|
# Run benchmarks
|
|
PYTHONPATH=/path/to/nano-vllm:$PYTHONPATH python bench.py
|
|
PYTHONPATH=/path/to/nano-vllm:$PYTHONPATH python bench_offload.py
|
|
```
|
|
|
|
## Config Defaults
|
|
|
|
- `max_num_batched_tokens`: 16384
|
|
- `max_num_seqs`: 512
|
|
- `kvcache_block_size`: 4096
|
|
- `gpu_memory_utilization`: 0.9
|
|
- `enforce_eager`: False (enables CUDA graphs)
|