nano-vllm/nanovllm at 76af506956bf1c6276c07d11d867011b98d0b6ef

zijie-tian/nano-vllm

Zijie Tian 76af506956 [claudesquad] update from 'multi-request-2' on 13 Jan 26 02:01 CST

2026-01-13 02:01:07 +08:00

[WIP] Added sgDMA operator for scatter kvcache communication.

2025-12-24 23:48:52 +08:00

[refactor] Refactor the kvcache offload.

2026-01-04 19:37:03 +08:00

[claudesquad] update from 'multi-request-2' on 13 Jan 26 02:01 CST

2026-01-13 02:01:07 +08:00

[claudesquad] update from 'multi-request-2' on 13 Jan 26 02:01 CST

2026-01-13 02:01:07 +08:00

Merge branch 'zijie/add-llama-1': Add multi-model support

2026-01-10 21:20:53 +08:00

[tests] Added test_niah_standalone.py.

2026-01-12 00:16:37 +08:00

[claudesquad] update from 'lw-offload-2' on 08 Jan 26 20:53 CST

2026-01-08 20:53:08 +08:00

__init__.py

better

2025-06-15 10:36:45 +08:00

config.py

[tests] Added test_niah_standalone.py.

2026-01-12 00:16:37 +08:00

llm.py

support tensor parallel

2025-06-15 01:31:24 +08:00

sampling_params.py

compile random sampling

2025-08-31 22:55:34 +08:00