nano-vllm/nanovllm at bf4c63c7ec93372933e0fcc7fd71f8ea31c55f81 - nano-vllm - Gitea: Git with a cup of tea

zijie-tian/nano-vllm

Files

History

Zijie Tian 82ed34fc2d [opt] optimize nanovllm performance compareable with vllm.

2025-12-25 03:47:07 +08:00

..

[WIP] Added sgDMA operator for scatter kvcache communication.

2025-12-24 23:48:52 +08:00

[refactor] Remove legacy mode path.

2025-12-22 20:17:56 +08:00

[opt] optimize nanovllm performance compareable with vllm.

2025-12-25 03:47:07 +08:00

[opt] optimize nanovllm performance compareable with vllm.

2025-12-25 03:47:07 +08:00

[refactor] Translate into english, void Chinese due to claude.

2025-12-11 00:30:24 +08:00

[feat] Need to optimized with async prefetch.

2025-12-15 06:58:40 +08:00

__init__.py

better

2025-06-15 10:36:45 +08:00

config.py

[WIP] remove num_prefetch_blocks varible.

2025-12-24 18:22:26 +08:00

llm.py

support tensor parallel

2025-06-15 01:31:24 +08:00

sampling_params.py

compile random sampling

2025-08-31 22:55:34 +08:00