Zijie Tian
ed3c8bb4b8
🐛 fix: memory leak in XAttentionBSAPolicy select_blocks
Fix severe memory leak (64GB -> 4GB growth) by:
- Remove unused sparse_metadata storage (was accumulating attn_scores)
- Delete intermediate tensor list (attn_scores_list) after use
- Explicitly delete intermediate tensors before return
Before: 16GB -> 80GB during 128K prefill
After: 16GB -> 19.8GB during 128K prefill
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-23 09:30:18 +08:00
..
2025-12-24 23:48:52 +08:00
2026-01-04 19:37:03 +08:00
2026-01-20 02:27:10 +08:00
2026-01-23 09:30:18 +08:00
2026-01-23 05:21:28 +08:00
2026-01-10 21:03:45 +08:00
2026-01-23 03:01:25 +08:00
2025-12-15 06:58:40 +08:00
2025-06-15 10:36:45 +08:00
2026-01-23 03:36:12 +08:00
2025-06-15 01:31:24 +08:00
2025-08-31 22:55:34 +08:00