Pull requests: THUDM/slime

Pull requests list

small fix on qwen3-235b-a22b launch script
#1719 opened Mar 12, 2026 by Zhuohao-Li
Add Mooncake Backend for Rollout Data Transfer (label: run-ci-megatron)
#1709 opened Mar 11, 2026 by zxpdemonio
6 tasks done
fix: auto-detect GPUs in qwen3-4b script
#1700 opened Mar 10, 2026 by ailuntz
fix: make ray actor gpu fractions configurable
#1699 opened Mar 10, 2026 by ailuntz
fix: accept unboxed math answers
#1698 opened Mar 10, 2026 by ailuntz
fix: default reward for aborted samples
#1697 opened Mar 10, 2026 by ailuntz
fix: handle missing sglang cuda-graph constant
#1696 opened Mar 10, 2026 by ailuntz
PipelineRL -- keep cache on weight update
#1694 opened Mar 9, 2026 by hari-hm
fix: quote $MOE_LAYER_FREQ
#1689 opened Mar 8, 2026 by lawrence-harmonic
internv3.5 support
#1660 opened Mar 3, 2026 by samaritan1998
fix: normalize rewards per-group when sample counts are unequal
#1655 opened Mar 2, 2026 by dubin555
2 of 3 tasks
feat: Add knowledge distillation example with offline support
#1654 opened Mar 2, 2026 by tourzhao
3 tasks
Refactor code safety checks by removing patterns
#1643 opened Feb 28, 2026 by Rohan5commit
[Feature] Add modular tracking interface with MLflow backend
#1591 opened Feb 17, 2026 by mouad-hpc
4 tasks done