nano-vllm/nanovllm/kvcache at 8df0c7517bc3c02deabebbb34f407c926b0e3bb7 - nano-vllm - Gitea: Git with a cup of tea

zijie-tian/nano-vllm

Files

History

Zijie Tian dc7807a211 [feat] Fixed warmup memory overhead.

2025-12-15 21:39:14 +08:00

..

[feat] Added chunked prefill and kvcache offload mechenism.

2025-12-10 03:47:37 +08:00

__init__.py

[fix] Fixed kvcache offload bugs.

2025-12-10 22:34:00 +08:00

base_manager.py

[feat] Added chunked prefill and kvcache offload mechenism.

2025-12-10 03:47:37 +08:00

chunked_attention.py

[feat] Fixed warmup memory overhead.

2025-12-15 21:39:14 +08:00

gpu_manager.py

[feat] Added chunked prefill and kvcache offload mechenism.

2025-12-10 03:47:37 +08:00

hybrid_manager.py

[feat] Need to optimized with async prefetch.

2025-12-15 06:58:40 +08:00

kernels.py

[feat] Added chunked prefill and kvcache offload mechenism.

2025-12-10 03:47:37 +08:00

offload_engine.py

[feat] Optimized with ASYNC offload.

2025-12-15 07:21:35 +08:00