-
Notifications
You must be signed in to change notification settings - Fork 148
Pull requests: NVIDIA-NeMo/Automodel
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
ci: Update transformers to latest version 5.8.1
#2223
opened May 13, 2026 by
svcnvidia-nemo-ci
Contributor
Loading…
fix: Remove Qwen3.5 packing known issue marker
docs-only
With great power comes great responsibility.
#2222
opened May 13, 2026 by
HuiyingLi
Contributor
Loading…
chore: missing docstrings, update pyproject
#2219
opened May 12, 2026 by
akoumpa
Contributor
Loading…
3 tasks
fix(dsv4): preserve reference fp32 parameters
community-request
waiting-on-customer
Waiting on the original author to respond
#2216
opened May 12, 2026 by
khazic
Contributor
Loading…
3 tasks done
feat(dllm): add DFlash and LLaDA2 SFT recipes
community-request
#2214
opened May 12, 2026 by
kashif
Loading…
3 tasks
fix: call init_weights() instead of initialize_weights() to restore w…
community-request
waiting-on-customer
Waiting on the original author to respond
#2213
opened May 12, 2026 by
Meiyim
Loading…
fix(vlm): align KD distributed train step
community-request
#2212
opened May 12, 2026 by
khazic
Contributor
Loading…
refactor: move VLM PP media chunking into pipelining
#2210
opened May 12, 2026 by
HuiyingLi
Contributor
Loading…
docs(fern): scaffold Fern docs site mirroring published v0.4.0 sidebar
#2196
opened May 8, 2026 by
lbliii
Loading…
7 tasks
feat(deepseek-v4): add Multi-Token Prediction (MTP) training support
community-request
#2191
opened May 8, 2026 by
khazic
Contributor
Loading…
ci(diffusion): remove local_dir and post process directly on cache
#2182
opened May 7, 2026 by
thomasdhc
Contributor
Loading…
3 tasks
feat(nemotron-v3): add Multi-Token Prediction (MTP) support
#2161
opened May 6, 2026 by
adil-a
Collaborator
Loading…
6 tasks done
fix(gpt_oss): free quantized expert tensors per-layer to reduce peak memory
community-request
waiting-on-customer
Waiting on the original author to respond
#2149
opened May 6, 2026 by
stanley1208
Contributor
Loading…
ci: Update transformers to latest version 5.8.0
#2148
opened May 6, 2026 by
svcnvidia-nemo-ci
Contributor
Loading…
fix(qwen3_5): preserve packed-sample boundaries in GatedDeltaNet
#2147
opened May 6, 2026 by
HuiyingLi
Contributor
Loading…
docs: add bump-dependency skill for shepherding dependency PRs to green
docs-only
With great power comes great responsibility.
documentation
Improvements or additions to documentation
#2130
opened May 5, 2026 by
ko3n1g
Contributor
Loading…
refactor: Remove separate moe_mesh references
community-request
waiting-on-customer
Waiting on the original author to respond
#2123
opened May 4, 2026 by
edjson
Contributor
Loading…
2 of 3 tasks
ci: align CUDA 13.2 / cu130 toolchain for TE 2.14.1 bump
#2121
opened May 4, 2026 by
thomasdhc
Contributor
Loading…
3 tasks
ci: Update transformers to latest version 5.7.0
#2089
opened Apr 29, 2026 by
svcnvidia-nemo-ci
Contributor
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-04-13.