-
Notifications
You must be signed in to change notification settings - Fork 1.8k
Pull requests: sgl-project/sglang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat(draft_model): support draft_model for RemoteModelLoader
#6407
opened May 19, 2025 by
DellCurry
Loading…
6 tasks
Add fp8 fused_experts kernel for CPU in sgl-kernel and add UT
#6404
opened May 19, 2025 by
chunyuan-w
•
Draft
Add environment flag for disabling message queue broadcaster
#6403
opened May 19, 2025 by
Fridge003
Loading…
6 tasks
[Misc] Replace
log_info_on_rank0
with RankZeroFilter
#6398
opened May 18, 2025 by
CatherineSue
Loading…
1 of 6 tasks
feat(kernel): fuse silu_and_mul with group quant fp8
high priority
#6394
opened May 18, 2025 by
tanruixiang
Loading…
4 of 6 tasks
[Fix] [Vul] fix the unsafe usage of
shell=True
for subprocess.check_output
#6393
opened May 18, 2025 by
shaoyuyoung
Loading…
Add composed attention backend for two batch overlapping
#6390
opened May 18, 2025 by
fzyzcjy
Loading…
6 tasks
Support DeepSeek EPLB algorithm with static distributions
#6387
opened May 18, 2025 by
fzyzcjy
Loading…
6 tasks
Support loading weights when physical experts are different from logical experts
#6386
opened May 18, 2025 by
fzyzcjy
Loading…
6 tasks
Support dispatching logical to physical experts
#6385
opened May 18, 2025 by
fzyzcjy
Loading…
6 tasks
aiter attention-backend (default enabled on AMD/ROCm)
high priority
#6381
opened May 18, 2025 by
HaiShaw
Loading…
6 tasks done
Disable compiling arch below sm_90 in aarch64 by default
#6380
opened May 18, 2025 by
Qiaolin-Yu
•
Draft
6 tasks
reduce torch.zeros overhead in moe align block size kernel
#6369
opened May 17, 2025 by
BBuf
Loading…
2 of 8 tasks
[VLM] Support chunk prefill for VLM
high priority
visIon-LM
#6355
opened May 16, 2025 by
CatherineSue
Loading…
2 of 6 tasks
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.