Pull requests: vllm-project/vllm
[Bugfix] fix adding bias twice in ipex GPTQ quantization
#18363
opened May 19, 2025 by
rand-fly
Update run_cluster.sh
documentation
Improvements or additions to documentation
#18361
opened May 19, 2025 by
Khalid1G
[Misc] Allow AutoWeightsLoader to skip loading weights with specific substr in name
ready
ONLY add when PR is ready to merge/full CI is needed
#18358
opened May 19, 2025 by
Isotr0py
[Minor] Rename quantization nvfp4 to modelopt_fp4
ready
#18356
opened May 19, 2025 by
mgoin
[V1][Metrics] Remove gpu_ prefix from non GPU specific metrics.
v1
#18354
opened May 19, 2025 by
sahelib25
fix:Build torch wheel inline rather than picking from nightly
ci/build
#18351
opened May 19, 2025 by
dilipgb
Remove used KV cache from MooncakeStore to prevent overfill
#18349
opened May 19, 2025 by
gronsti-amd
[Misc] Call ndarray.tobytes() directly instead of ndarray.data.tobytes()
multi-modality
Related to multi-modality (#4194)
ready
#18347
opened May 19, 2025 by
lgeiger
[Core] Add Lora Support to Beam Search
frontend
#18346
opened May 19, 2025 by
alex-jw-brooks
[P/D] Fix minor case in example disagg_prefill_proxy_server.py
#18341
opened May 19, 2025 by
gc-fu
[FEAT][ROCm] Upgrade AITER MLA v1 backend
ci/build
v1
#18338
opened May 19, 2025 by
vllmellm
[V1] Fix general plugins not loaded in engine for multiproc
v1
#18326
opened May 18, 2025 by
sarckk
[Quantization] Add compressed-tensors NVFP4 support
quantization
ready
#18312
opened May 17, 2025 by
dsikka
[Bugfix] Use a different prompt for benchmark_serving.py's test prompt
#18311
opened May 17, 2025 by
tlrmchlsmth
Draft