Skip to content

Pull requests: THUDM/slime

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

internv3.5 support
#1660 opened Mar 3, 2026 by samaritan1998 Loading…
fix: normalize rewards per-group when sample counts are unequal
#1655 opened Mar 2, 2026 by dubin555 Loading…
2 of 3 tasks
feat: Add knowledge distillation example with offline support
#1654 opened Mar 2, 2026 by tourzhao Loading…
3 tasks
Fix missing packed_seq_params in bshd qkv_format
#1649 opened Mar 1, 2026 by coding-famer Loading…
Refactor code safety checks by removing patterns
#1643 opened Feb 28, 2026 by Rohan5commit Loading…
Autofix/issue 1578 hf2megatron arg suffix
#1636 opened Feb 27, 2026 by yitianlian Loading…
[Feature] Add modular tracking interface with MLflow backend
#1591 opened Feb 17, 2026 by mouad-hpc Loading…
4 tasks done
add rollout_data to rollout_data_postprocess
#1581 opened Feb 12, 2026 by zui-jiang Loading…
fix: Handle IPv6 addresses in weight sync init_method
#1576 opened Feb 11, 2026 by zx3xyy Loading…
Support qwen3-next MTP Training
#1575 opened Feb 11, 2026 by zx3xyy Loading…
ProTip! Updated in the last three days: updated:>2026-03-02.