📝 docs: add chunked attention solutions guide and update doc index
Add comprehensive documentation analyzing the 32K chunked offload accuracy issues with proposed solutions covering LSE precision, ring buffer state management, and position encoding validation.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
docs/chunked_attention_solutions.md: 1078 lines changed (normal file)
File diff suppressed because it is too large