Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[neuron] fix authorization issue ci/build
#18364 opened May 19, 2025 by liangfu Loading…
Update run_cluster.sh documentation Improvements or additions to documentation
#18361 opened May 19, 2025 by Khalid1G Loading…
[Misc] add xgrammar for arm64 ci/build
#18359 opened May 19, 2025 by prashantgupta24 Loading…
[Misc] Allow AutoWeightsLoader to skip loading weights with specific substr in name ready ONLY add when PR is ready to merge/full CI is needed
#18358 opened May 19, 2025 by Isotr0py Loading…
[Minor] Rename quantization nvfp4 to modelopt_fp4 ready ONLY add when PR is ready to merge/full CI is needed
#18356 opened May 19, 2025 by mgoin Loading…
[Misc] Call ndarray.tobytes() directly instead of ndarray.data.tobytes() multi-modality Related to multi-modality (#4194) ready ONLY add when PR is ready to merge/full CI is needed
#18347 opened May 19, 2025 by lgeiger Loading…
[WIP] [Core][P/D] CPU connector for PD disagg v1
#18332 opened May 19, 2025 by ApostaC Draft
1 of 9 tasks
Intialize io_thread_pool attribute in the beginning. v1
#18331 opened May 19, 2025 by rabi Loading…
[Model]: Fused MoE for nomic-embed-text-v2-moe
#18321 opened May 18, 2025 by Isotr0py Loading…
Update arch overview for v1 codex documentation Improvements or additions to documentation
#18317 opened May 18, 2025 by simon-mo Draft
[Quantization] Add compressed-tensors NVFP4 support quantization ready ONLY add when PR is ready to merge/full CI is needed
#18312 opened May 17, 2025 by dsikka Loading…
[Core] Accelerate startup time v1
#18307 opened May 17, 2025 by jianzs Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.