nano-vllm

zijie-tian/nano-vllm

Fork 0

Commit Graph

Author	SHA1	Message	Date
Zijie Tian	f049971f84	✅ test: add hierarchical block sum estimation validation Validate the hierarchical estimation approach for XAttention: - Test 1: Math equivalence (diff = 0.0) between hierarchical and direct - Test 2: Score + threshold selection strategy (replaces mask + voting) - Test 3: Performance benchmark (41x speedup) Uses pure torch + xattn kernels, independent of nanovllm framework. Generated with [Claude Code](https://claude.ai/code) via [Happy](https://happy.engineering) Co-Authored-By: Claude <noreply@anthropic.com> Co-Authored-By: Happy <yesreply@happy.engineering>	2026-01-28 06:24:35 +08:00

Author

SHA1

Message

Date

Zijie Tian

f049971f84

✅ test: add hierarchical block sum estimation validation

Validate the hierarchical estimation approach for XAttention:
- Test 1: Math equivalence (diff = 0.0) between hierarchical and direct
- Test 2: Score + threshold selection strategy (replaces mask + voting)
- Test 3: Performance benchmark (41x speedup)

Uses pure torch + xattn kernels, independent of nanovllm framework.

Generated with [Claude Code](https://claude.ai/code)
via [Happy](https://happy.engineering)

Co-Authored-By: Claude <noreply@anthropic.com>
Co-Authored-By: Happy <yesreply@happy.engineering>

2026-01-28 06:24:35 +08:00

1 Commits