-
Notifications
You must be signed in to change notification settings - Fork 628
Pull requests: THUDM/slime
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix: http_utils. disable system proxy for internal SGLang httpx clients
#1714
opened Mar 12, 2026 by
DongzhuoranZhou
Loading…
Add Mooncake Backend for Rollout Data Transfer
run-ci-megatron
#1709
opened Mar 11, 2026 by
zxpdemonio
Loading…
6 tasks done
[WIP] fix(cp): wrap linear attention CP in custom autograd.Function
#1692
opened Mar 9, 2026 by
lilei199908
Loading…
fix: normalize rewards per-group when sample counts are unequal
#1655
opened Mar 2, 2026 by
dubin555
Loading…
2 of 3 tasks
feat: Add knowledge distillation example with offline support
#1654
opened Mar 2, 2026 by
tourzhao
Loading…
3 tasks
Fix the Rotary Position Embedding (RoPE) parameter passing in the GLM5 mode
#1650
opened Mar 2, 2026 by
hanxdmech-ship-it
Loading…
[WIP] fix transforrmers api change at 5.2.0
run-ci-megatron
#1647
opened Feb 28, 2026 by
UbeCc
Loading…
feat: add --lazy-multimodal-load to defer image process to rollout time
#1623
opened Feb 25, 2026 by
yzlnew
Loading…
fix(r3,vlm): remove orphaned RoutingReplay from decoder rebuild.
#1620
opened Feb 24, 2026 by
yxyOo
Loading…
[Feature] Add configurable arguments for rollout manager actor
#1596
opened Feb 18, 2026 by
TSunny007
Loading…
[Feature] Add curriculum learning example with dynamic multi-task training and online prompt filtering
#1594
opened Feb 18, 2026 by
zhangzx-uiuc
Loading…
[Feature] Add modular tracking interface with MLflow backend
#1591
opened Feb 17, 2026 by
mouad-hpc
Loading…
4 tasks done
Add retries to the Remote Reward Model, do not fail on connection drops or endpoint instability
#1582
opened Feb 12, 2026 by
joyliu-q
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.