-
Notifications
You must be signed in to change notification settings - Fork 186
Pull requests: alibaba/rtp-llm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix: switch FC endpoints from VPC to public network
#1000
opened May 13, 2026 by
guoj14
Contributor
Loading…
feat: update rtp-kernel for w4a8-opt and sm103a
#999
opened May 13, 2026 by
Bruce-Lee-LY
Collaborator
Loading…
feat: Complete P2PConnector implementation for high-performance PD Disaggregation
#997
opened May 13, 2026 by
ZhihanYan
Collaborator
Loading…
test: add server_args for server_test
#994
opened May 12, 2026 by
zhangjianning-zjn
Collaborator
Loading…
feat(rocm): support FP8 KV cache with ASM paged-attention and enable non-ASM path
#991
opened May 12, 2026 by
liaocz
Collaborator
Loading…
feat(flexlb): add configurable group routing policy
#988
opened May 10, 2026 by
jianglan89
Collaborator
Loading…
feat(rocm): prefill host-overhead and opt-in kernel optimizations before fullattention
#983
opened May 9, 2026 by
chengshu-lcc
Collaborator
Loading…
fix(model-loader): avoid deepcopy for fp8 scale params
#978
opened May 8, 2026 by
siluzhou
Collaborator
Loading…
feat(mori-ep): Add MoRI Expert Parallelism support for ROCm
#977
opened May 8, 2026 by
jacobwin-ai
Collaborator
Loading…
fix(rocm): apply RoPE for embedding models without KV cache
#973
opened May 7, 2026 by
siluzhou
Collaborator
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.