This commit is contained in:
GeeeekExplorer
2025-06-13 13:07:33 +08:00
parent 135d1b38a2
commit 59aa3ff57c
4 changed files with 4 additions and 4 deletions

View File

@@ -4,7 +4,7 @@ A lightweight vLLM implementation built from scratch.
## Key Features
* 🚀 **Fase offline inference** - Comparable inference speeds to vLLM
* 🚀 **Fast offline inference** - Comparable inference speeds to vLLM
* 📖 **Readable codebase** - Clean implementation under 1,200 lines of Python code
***Optimization Suite** - Prefix caching, Torch compilation, CUDA graph, etc