Pull requests: vllm-project/vllm
[Feature]: IndexCache support for DSA models
deepseek
Related to DeepSeek models
#37735
opened Mar 21, 2026 by
chaunceyjiang
Draft
5 tasks
Fix Mamba state corruption from stale CUDA graph block table entries
fb-exported
meta-exported
nvidia
v1
#37728
opened Mar 21, 2026 by
minosfuture
Revert "[Model] Deprecate the score task (this will not affect users)." (#37537)
documentation
Improvements or additions to documentation
frontend
v1
[ROCm][CI] Stabilize ROCm speech-to-text translation test with ROCM_EXTRA_ARGS
ready
ONLY add when PR is ready to merge/full CI is needed
rocm
Related to AMD ROCm
#37723
opened Mar 20, 2026 by
AndreasKaratzas
Draft
[ROCm][CI] Update GSM8K eval config to use fp8-and-mixed models list (MI355)
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
rocm
Related to AMD ROCm
#37721
opened Mar 20, 2026 by
AndreasKaratzas
[Test] Only Run MLA model when user explicitly set for batch invariance
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#37719
opened Mar 20, 2026 by
yewentao256
[Bug] Fix fp8 deepgemm batch invariant
bug
Something isn't working
ready
ONLY add when PR is ready to merge/full CI is needed
#37718
opened Mar 20, 2026 by
yewentao256
[ROCm][CI] Add large_gpu_mark to test_max_tokens_none for ROCm
ready
ONLY add when PR is ready to merge/full CI is needed
rocm
Related to AMD ROCm
#37717
opened Mar 20, 2026 by
AndreasKaratzas
[BugFix] Fix MoRIIOConnector for disaggregated P/D inference
bug
Something isn't working
kv-connector
#37716
opened Mar 20, 2026 by
raviguptaamd
Draft
Readability cleanup for wvSplitK reduces.
rocm
Related to AMD ROCm
#37713
opened Mar 20, 2026 by
amd-hhashemi
5 tasks
Properly enable wvSplitK fp8 path for RDNA
#37712
opened Mar 20, 2026 by
amd-hhashemi
5 tasks
[Bugfix] Fix structured output crash on CPU due to pin_memory=True
bug
Something isn't working
ready
ONLY add when PR is ready to merge/full CI is needed
structured-output
v1
#37706
opened Mar 20, 2026 by
wjhrdy
3 tasks done
[Test] Add more unittests for CUDAGraphWrapper
nvidia
v1
#37702
opened Mar 20, 2026 by
SoluMilken
3 of 5 tasks
[Bugfix] Fix FLA Hopper/TMA misclassification on SM12x desktop Blackwell
bug
Something isn't working
#37700
opened Mar 20, 2026 by
RobTand
3 tasks done
[Bugfix] Respect VLLM_WEIGHT_OFFLOADING_DISABLE_PIN_MEMORY in prefetch offloader
bug
Something isn't working
#37699
opened Mar 20, 2026 by
he-yufeng
[ROCm][Bugfix] fix exception related to trust_remote_code for MiniMax-M2.1-MXFP4
bug
Something isn't working
cpu
Related to CPU backends
rocm
Related to AMD ROCm
#37698
opened Mar 20, 2026 by
hongxiayang
5 tasks
[torch.compile]: Disable Sequence Parallelism (SP) for piecewise compilation
v1
#37696
opened Mar 20, 2026 by
SouthWest7
3 of 5 tasks
[Perf] Use torch compile to fuse pack topk in trtllm moe
nvidia
performance
Performance-related issues
ready
ONLY add when PR is ready to merge/full CI is needed
#37695
opened Mar 20, 2026 by
wzhao18
5 tasks
[cpu][ci] remove soft-fail for Arm CI and add quant model tests
ci/build
cpu
Related to CPU backends
#37691
opened Mar 20, 2026 by
fadara01
2 tasks