Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Feat/usage policy tests frontend
#38477 opened Mar 29, 2026 by Csrayz Loading…
[WIP] Add TRITON_MLA_SPARSE backend for SM80 sparse MLA support documentation Improvements or additions to documentation nvidia rocm Related to AMD ROCm v1
#38476 opened Mar 29, 2026 by haosdent Draft
Fix potential infinite loop in SonnetDataset.sample performance Performance-related issues
#38471 opened Mar 29, 2026 by frankie-ys Loading…
1 of 5 tasks
Add platform manual_seed_all API intel-gpu Related to Intel GPU nvidia performance Performance-related issues rocm Related to AMD ROCm speculative-decoding v1
#38468 opened Mar 29, 2026 by yma11 Loading…
[Bugfix] Add CPU profiler summary equivalent to CUDA summary bug Something isn't working cpu Related to CPU backends nvidia
#38466 opened Mar 29, 2026 by NJX-njx Loading…
[Bugfix] Fix limit_mm_per_prompt being ignored for encoder cache profiling bug Something isn't working multi-modality Related to multi-modality (#4194)
#38465 opened Mar 29, 2026 by NJX-njx Loading…
[Logging] Add JIT compilation progress log for FlashInfer nvidia v1
#38462 opened Mar 29, 2026 by WJYuuuu Loading…
3 of 5 tasks
Fixed issues multi-modality Related to multi-modality (#4194)
#38461 opened Mar 29, 2026 by rpathade Draft
[Perf] Batch KV cache swap copies via cuMemcpyBatchAsync ready ONLY add when PR is ready to merge/full CI is needed v1
#38460 opened Mar 29, 2026 by Etelis Loading…
[Docs] Add vLLM CI overview documentation for contributors documentation Improvements or additions to documentation
#38458 opened Mar 29, 2026 by khluu Loading…
3 tasks
[ROCm] [DOC] Update the Documentation to include ROCm Nightly Wheel support documentation Improvements or additions to documentation rocm Related to AMD ROCm
#38457 opened Mar 29, 2026 by tjtanaa Loading…
5 tasks
[CI] Fix online FP8 quantization materializing tensors on CPU bug Something isn't working
#38456 opened Mar 29, 2026 by haosdent Loading…
[ROCm] Add RDNA 3.5/4 device IDs (gfx1150, gfx1151, gfx1201) rocm Related to AMD ROCm
#38455 opened Mar 29, 2026 by dondetir Loading…
[ROCm][Test] Add hybrid block size and RDNA4 backend selection tests rocm Related to AMD ROCm v1
#38454 opened Mar 29, 2026 by dondetir Loading…
[Perf] Fix DBO overlap: capture DeepEP event before yield
#38451 opened Mar 29, 2026 by czhu-cohere Loading…
5 tasks
fix(tokenizer): skip reasoning_effort when None in Mistral tokenizer
#38448 opened Mar 29, 2026 by marioiseli89 Loading…
3 tasks done
ProTip! What’s not been updated in a month: updated:<2026-02-28.