nano-vllm/nanovllm
Zijie Tian 8fd25d72d7 Merge perf_opt-1 and perf_opt-2 branches
Combines two performance optimization features:
- perf_opt-1: Cross-layer pipeline for decode (double-buffered layer cache)
- perf_opt-2: Per-layer prefill buffer for async offload

Both features are complementary and improve CPU offload performance.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-07 06:03:44 +08:00