Pull requests: NVIDIA-NeMo/Automodel
docs: use absolute raw.githubusercontent.com URLs for embedded images
#2104
opened May 1, 2026 by
HuiyingLi
Contributor
2 tasks
fix: use config.expert_dim for MoE expert LoRA init
#2102
opened Apr 30, 2026 by
adil-a
Collaborator
4 tasks done
ci: Update transformers to latest version 5.7.0
#2089
opened Apr 29, 2026 by
svcnvidia-nemo-ci
Contributor
feat: automodel researcher agent
#2087
opened Apr 29, 2026 by
krishnakalyan3
Contributor
3 tasks done
feat: add inbatch neg sampling for training
#2077
opened Apr 28, 2026 by
rnyak
Collaborator
2 of 3 tasks
feat: add optimized DeepSeek V4 kernels
dsv4
#2076
opened Apr 28, 2026 by
hemildesai
Contributor
Draft
8 tasks done
fix: add 7-day grace window for model card
#2068
opened Apr 27, 2026 by
akoumpa
Contributor
2 of 3 tasks
fix: nemotron-super-v3-hellaswag checkpoint robustness
#2056
opened Apr 26, 2026 by
adil-a
Collaborator
3 tasks done
ci: add retrieval bi-encoder and cross-encoder nightly tests
#2042
opened Apr 24, 2026 by
oliverholworthy
Contributor
2 of 3 tasks
ci: Update transformers to latest version 5.6.2
#2038
opened Apr 24, 2026 by
svcnvidia-nemo-ci
Contributor
ci: Update transformers to latest version 5.6.0
#2015
opened Apr 23, 2026 by
svcnvidia-nemo-ci
Contributor
fix: llama3_3_nemotron_super_49B_squad checkpoint robustness thresholds
#1950
opened Apr 21, 2026 by
adil-a
Collaborator
2 of 4 tasks
feat: add Context Parallelism support for Gemma4 dense and MoE VLM
community-request
waiting-on-customer
Waiting on the original author to respond
#1914
opened Apr 20, 2026 by
khazic
Contributor
3 tasks done
fix: fp32 master weights for custom MoE models under FSDP2
#1896
opened Apr 17, 2026 by
zpqiu
Contributor
1 of 3 tasks
docs: add embedding + reranker model coverage
docs-only
With great power comes great responsibility.
#1843
opened Apr 14, 2026 by
akoumpa
Contributor
3 tasks
feat: add extract_submodel parameter to build_encoder_backbone
#1838
opened Apr 14, 2026 by
oliverholworthy
Contributor
Draft
2 of 3 tasks
refactor: use config.is_causal=False for bidirectional attention
#1837
opened Apr 14, 2026 by
oliverholworthy
Contributor
Draft
2 of 3 tasks
refactor: Remove separate moe_mesh references
community-request
waiting-on-customer
Waiting on the original author to respond
#1824
opened Apr 14, 2026 by
edjson
Contributor
3 tasks
ci: Update transformers to latest version 5.5.4
#1823
opened Apr 14, 2026 by
svcnvidia-nemo-ci
Contributor
fix: Set CUDA arch list for UCCL EP build to SM90+
r0.4.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#1808
opened Apr 13, 2026 by
thomasdhc
Contributor
3 tasks