Pull requests: sgl-project/sglang
[Feature] DCP: Decode Context Parallelism with A2A and FA3 Backend Support
deepseek
#21637
opened Mar 29, 2026 by
thanhhao98
5 tasks done
fix broken anchor link in doc
documentation
Improvements or additions to documentation
model-gateway
#21636
opened Mar 29, 2026 by
doraeric
1 of 5 tasks
[Model] Add Voxtral (speech-to-text) model support
#21635
opened Mar 29, 2026 by
LiYomi
7 tasks done
[Diffusion][NPU] Add support for MOVA
diffusion
SGLang Diffusion
#21633
opened Mar 29, 2026 by
LLThomas
3 of 5 tasks
feat: Add idle request lifecycle duplicate-rid diagnostic
#21632
opened Mar 29, 2026 by
kunkunzhishan
2 of 5 tasks
[HiCache] Refactoring HiCache Write-Back Kernel
hicache
Hierarchical Caching for SGLang
jit-kernel
#21631
opened Mar 29, 2026 by
huangtingwei9988
Draft
5 tasks
Fix EP-aware slicing for NVFP4 MoE input scales in non-FlashInfer backends
dependencies
Pull requests that update a dependency file
diffusion
SGLang Diffusion
#21630
opened Mar 29, 2026 by
xueliangyang-oeuler
Draft
5 tasks
[AMD] Enable MXFP4 KV cache on MI355X (--kv-cache-dtype fp4_e2m1)
#21627
opened Mar 29, 2026 by
JohnQinAMD
5 tasks
[CI] [FlashInfer v0.6.7] Use offline quantized checkpoint for MXFP8 Gemm tests
#21625
opened Mar 29, 2026 by
zianglih
5 tasks
[HiCache] fix: Clone host indices to avoid memory leak
hicache
Hierarchical Caching for SGLang
run-ci
#21624
opened Mar 29, 2026 by
alphabetc1
5 tasks
[AMD] Fix CI multimodal-gen-test-1-gpu-amd for gen model
amd
jit-kernel
run-ci
#21621
opened Mar 29, 2026 by
yichiche
5 tasks done
fix: Mistral Small 4 fails to start due to config/weight format mismatch
#21620
opened Mar 29, 2026 by
LiYomi
3 tasks done
feat(kv-cache): Add TurboQuant KV cache quantization (WIP)
quant
LLM Quantization
#21617
opened Mar 29, 2026 by
scottgl9
4 tasks
[diffusion] Refactor TeaCache
diffusion
SGLang Diffusion
#21613
opened Mar 28, 2026 by
eitanturok
5 tasks
fix: fix sharded state for ModelOptModelLoader
#21612
opened Mar 28, 2026 by
kunkunzhishan
3 of 5 tasks
[sgl-kernel] support > 1024 experts in moe_align_block_size kernel
sgl-kernel
#21610
opened Mar 28, 2026 by
klshuster
5 tasks done
[KDA] Fuse scaled_dot_kkt + solve_tril + recompute_w_u for KDA
run-ci
#21604
opened Mar 28, 2026 by
yuan-luo
5 tasks
[Feature] Add FP4 KV cache support for SM120 GPUs
blackwell
SM100/SM120
documentation
Improvements or additions to documentation
quant
LLM Quantization
#21601
opened Mar 28, 2026 by
samuellees
feat(speculative): add adaptive speculative decoding for EAGLE topk=1
speculative-decoding
#21599
opened Mar 28, 2026 by
alphabetc1
5 tasks
Add test cases for feature parameters in quantization and data type feature.
npu
#21598
opened Mar 28, 2026 by
liuxianglong17
5 tasks done
[CI] Consolidate Docker release workflows into reusable workflow
#21596
opened Mar 28, 2026 by
Kangyan-Zhou
4 tasks