Zijie Tian
|
7c41032a2e
|
✨ feat: add configurable stride and chunk_size for XAttention BSA
- Add sparse_chunk_size config option (default: 16384)
- Pass stride, chunk_size, use_triton through factory function
- Add --sparse-stride CLI option to test_ruler.py
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
|
2026-01-23 10:37:04 +08:00 |
|
Zijie Tian
|
b97b0b96a0
|
[WIP] Before refactor the nanovllm sparse policy.
|
2026-01-19 22:34:44 +08:00 |
|
Zijie Tian
|
b5da802dff
|
[WIP] Before integrate the xattn operator.
|
2026-01-19 21:19:21 +08:00 |
|
Zijie Tian
|
2a6e0a2c02
|
[feat] Added Quest Sparsity Policy.
|
2026-01-07 03:29:21 +08:00 |
|
Zijie Tian
|
c99a6f3d3f
|
[WIP] Before add Quest policy.
|
2026-01-07 02:32:30 +08:00 |
|
Zijie Tian
|
690492e074
|
[WIP] Before refactor policies.
|
2026-01-06 20:47:55 +08:00 |
|
Zijie Tian
|
051f2295c9
|
[feat] Added sparse KVcache feature, NEED VERIFY.
|
2025-12-22 08:51:02 +08:00 |
|