Commit Graph

47 Commits

Author SHA1 Message Date
GeeeekExplorer
2f21442653 support qwen2 2025-11-04 01:44:42 +08:00
GeeeekExplorer
db1b49dce4 add logo and trendshift 2025-11-04 00:45:10 +08:00
GeeeekExplorer
6ef2a4f630 compile random sampling 2025-08-31 22:55:34 +08:00
GeeeekExplorer
df99418f7d simplify 2025-08-31 20:02:51 +08:00
Xingkai Yu
6a6d217de7 Merge pull request #67 from PeterDing/fix/decoding-positions
fix(model_runner): correct position indexing to be 0-based
2025-08-31 18:05:45 +08:00
PeterDing
f5b4840276 fix(model_runner): correct position indexing to be 0-based
- Change position calculation from len(seq) to len(seq) - 1
2025-07-04 14:29:12 +08:00
GeeeekExplorer
38baf0bbe4 remove assert shape 2025-06-27 23:00:30 +08:00
Xingkai Yu
2de882a395 Merge pull request #60 from GeeeekExplorer/warmup 2025-06-27 22:52:11 +08:00
GeeeekExplorer
cb0b3dec3f remove rng state 2025-06-27 22:50:33 +08:00
Xingkai Yu
6802cb2f42 Merge pull request #54 from TonyLianLong/patch-1 2025-06-27 22:44:38 +08:00
GeeeekExplorer
1caeec8dfa same as vllm 2025-06-27 18:50:56 +08:00
GeeeekExplorer
658520b788 warmup and allocate 2025-06-27 01:51:57 +08:00
Long(Tony) Lian
c2ee8b8dff Update pyproject.toml to fix missing files 2025-06-25 17:57:38 -07:00
papadopoulos Aggelos-Michael
cfc4cb6710 docs: add manual download instructions 2025-06-24 23:38:28 +08:00
Xingkai Yu
37eb91f890 Merge pull request #39 from xiaohajiayou/main 2025-06-24 22:51:58 +08:00
xiaohajiayou
054aec852d Fix: Division-by-Zero Risk and Typo 2025-06-24 02:02:33 +08:00
GeeeekExplorer
03cfc13bb3 faster pickle 2025-06-23 00:51:52 +08:00
Xingkai Yu
8162578b60 star history 2025-06-22 15:13:04 +08:00
GeeeekExplorer
cde3fc22c2 simplify 2025-06-21 17:19:15 +08:00
Xingkai Yu
ad4e95fbdc update .gitignore 2025-06-21 07:28:40 +08:00
GeeeekExplorer
801365a611 update bench 2025-06-19 23:28:11 +08:00
Xingkai Yu
fa0078174e Merge pull request #24 from jinghuan-Chen/fix/Release-CUDA-Graphs-resource-before-exit 2025-06-18 17:15:28 +08:00
jinghuan-Chen
ffafaeb133 Release CUDA Graphs resource before exit. 2025-06-18 16:17:31 +08:00
Xingkai Yu
4fc764f175 Merge pull request #22 from cheunglei/use_spawn 2025-06-17 23:53:59 +08:00
cheunglei
b5ace32982 use spawn 2025-06-17 23:49:15 +08:00
GeeeekExplorer
bc0ad5a116 better 2025-06-17 23:33:38 +08:00
GeeeekExplorer
7e42fa6f63 fix 2025-06-15 13:28:29 +08:00
Xingkai Yu
326b121fad Merge pull request #10 from MARD1NO/refine_return_hint_in_schedule 2025-06-15 10:39:51 +08:00
Xingkai Yu
ba96387043 Merge pull request #11 from GeeeekExplorer/tp_dev 2025-06-15 10:37:21 +08:00
GeeeekExplorer
fc778a4da9 better 2025-06-15 10:36:45 +08:00
Xingkai Yu
c1fd4ea3c2 Merge pull request #9 from cheunglei/tp_dev 2025-06-15 10:22:18 +08:00
MARD1NO
98bbbefb68 schedule return bool args 2025-06-15 10:15:05 +08:00
cheunglei
53b3ef2e32 support tensor parallel 2025-06-15 01:31:24 +08:00
GeeeekExplorer
b6136383c9 support fast pickle 2025-06-14 13:36:57 +08:00
GeeeekExplorer
4a8aa090a7 fix 2025-06-14 00:56:07 +08:00
Xingkai Yu
9b59dae751 Merge pull request #4 from cheunglei/main
require xxhash
2025-06-13 23:46:18 +08:00
cheunglei
0ea7414b19 require xxhash 2025-06-13 23:40:07 +08:00
GeeeekExplorer
59aa3ff57c better 2025-06-13 13:07:33 +08:00
GeeeekExplorer
135d1b38a2 release 2025-06-13 09:01:08 +08:00
GeeeekExplorer
98a1551a7d support CUDA_VISIBLE_DEVICES 2025-06-12 23:14:01 +08:00
GeeeekExplorer
ec3c60d96f update bench 2025-06-12 22:54:51 +08:00
GeeeekExplorer
f16adb729e refactor 2025-06-12 09:41:12 +08:00
GeeeekExplorer
fee58d44e4 fix 2025-06-12 01:00:31 +08:00
GeeeekExplorer
08c84ec08d multi file loader 2025-06-12 01:00:09 +08:00
GeeeekExplorer
386290d69e refactor 2025-06-11 21:12:57 +08:00
GeeeekExplorer
b98e1ca305 fix 2025-06-10 21:25:54 +08:00
GeeeekExplorer
a5a4909e6a init commit 2025-06-10 00:27:01 +08:00