Files
nano-vllm/nanovllm
Zijie Tian 7c41032a2e feat: add configurable stride and chunk_size for XAttention BSA
- Add sparse_chunk_size config option (default: 16384)
- Pass stride, chunk_size, use_triton through factory function
- Add --sparse-stride CLI option to test_ruler.py

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-23 10:37:04 +08:00
..
2025-06-15 10:36:45 +08:00
2025-06-15 01:31:24 +08:00
2025-08-31 22:55:34 +08:00