Pull requests: NVIDIA-NeMo/Automodel

Pull requests list

docs: use absolute raw.githubusercontent.com URLs for embedded images
#2104 opened May 1, 2026 by HuiyingLi Contributor
2 tasks
fix: use config.expert_dim for MoE expert LoRA init
#2102 opened Apr 30, 2026 by adil-a Collaborator
4 tasks done
ci: Update transformers to latest version 5.7.0
#2089 opened Apr 29, 2026 by svcnvidia-nemo-ci Contributor
feat: automodel researcher agent
#2087 opened Apr 29, 2026 by krishnakalyan3 Contributor
3 tasks done
fix: MoE aux-loss dtype mismatch under activation checkpointing
#2083 opened Apr 28, 2026 by pzelasko Contributor Draft
3 tasks done
feat: add inbatch neg sampling for training
#2077 opened Apr 28, 2026 by rnyak Collaborator
2 of 3 tasks
feat: add optimized DeepSeek V4 kernels [labels: dsv4]
#2076 opened Apr 28, 2026 by hemildesai Contributor Draft
8 tasks done
fix: add 7-day grace window for model card
#2068 opened Apr 27, 2026 by akoumpa Contributor
2 of 3 tasks
fix(deepseek_v4): support DeepSeek-V4-Flash-Base [labels: dsv4]
#2064 opened Apr 27, 2026 by zpqiu Contributor Draft
3 tasks
fix: nemotron-super-v3-hellaswag checkpoint robustness
#2056 opened Apr 26, 2026 by adil-a Collaborator
3 tasks done
ci: add retrieval bi-encoder and cross-encoder nightly tests
#2042 opened Apr 24, 2026 by oliverholworthy Contributor
2 of 3 tasks
ci: Update transformers to latest version 5.6.2
#2038 opened Apr 24, 2026 by svcnvidia-nemo-ci Contributor
ci: Update transformers to latest version 5.6.0
#2015 opened Apr 23, 2026 by svcnvidia-nemo-ci Contributor
fix: llama3_3_nemotron_super_49B_squad checkpoint robustness thresholds
#1950 opened Apr 21, 2026 by adil-a Collaborator
2 of 4 tasks
feat: add Context Parallelism support for Gemma4 dense and MoE VLM [labels: community-request, waiting-on-customer]
#1914 opened Apr 20, 2026 by khazic Contributor
3 tasks done
fix: fp32 master weights for custom MoE models under FSDP2
#1896 opened Apr 17, 2026 by zpqiu Contributor
1 of 3 tasks
fix: lora with gemma4 large models on Spark single GPU
#1866 opened Apr 15, 2026 by athitten Contributor Draft
3 tasks
docs: add embedding + reranker model coverage [labels: docs-only]
#1843 opened Apr 14, 2026 by akoumpa Contributor
3 tasks
ci: add sync-skills workflow
#1841 opened Apr 14, 2026 by ko3n1g Contributor
2 tasks
feat: add extract_submodel parameter to build_encoder_backbone
#1838 opened Apr 14, 2026 by oliverholworthy Contributor Draft
2 of 3 tasks
refactor: Remove separate moe_mesh references [labels: community-request, waiting-on-customer]
#1824 opened Apr 14, 2026 by edjson Contributor
3 tasks
ci: Update transformers to latest version 5.5.4
#1823 opened Apr 14, 2026 by svcnvidia-nemo-ci Contributor
fix: Set CUDA arch list for UCCL EP build to SM90+ [labels: r0.4.0]
#1808 opened Apr 13, 2026 by thomasdhc Contributor
3 tasks