nano-vllm/nanovllm at 190df5f70d1cd1223f132a09888900fcf626a31e

zijie-tian/nano-vllm

Zijie Tian 190df5f70d [refactor] Refactor current gpu and cpu block allocation strategy.

2025-12-10 21:23:31 +08:00

[refactor] Refactor current gpu and cpu block allocation strategy.

2025-12-10 21:23:31 +08:00

[refactor] Refactor current gpu and cpu block allocation strategy.

2025-12-10 21:23:31 +08:00

[refactor] Refactor current gpu and cpu block allocation strategy.

2025-12-10 21:23:31 +08:00

support qwen2

2025-11-04 01:44:42 +08:00

[feat] Added num_gpu_blocks limit gpu blocks.

2025-12-10 20:17:42 +08:00

__init__.py

better

2025-06-15 10:36:45 +08:00

config.py

[refactor] Refactor current gpu and cpu block allocation strategy.

2025-12-10 21:23:31 +08:00

llm.py

support tensor parallel

2025-06-15 01:31:24 +08:00

sampling_params.py

compile random sampling

2025-08-31 22:55:34 +08:00