-
Notifications
You must be signed in to change notification settings - Fork 453
Pull requests: THUDM/slime
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Fix] support converting torch_dist to hf for qwen3vl dense model
#1491
opened Jan 25, 2026 by
p1k0pan
Loading…
fix: use aread() to fully consume HTTP response body
#1488
opened Jan 24, 2026 by
ann-qin-lu
Loading…
4 tasks done
[Fix] Allow default TIS function usage when get_mismatch_metrics is set
#1483
opened Jan 23, 2026 by
eecspan
Loading…
[Fix] Fix some tiny bugs in fault tolerance
run-ci-megatron
#1480
opened Jan 23, 2026 by
yitianlian
Loading…
[Feature] Distribute Prefill on different node
run-ci-megatron
#1478
opened Jan 22, 2026 by
yitianlian
Loading…
fix(rollout): raise error when buffer is insufficient without global dataset
#1474
opened Jan 21, 2026 by
JackXu0
Loading…
perf(rollout): compress loss masks with run-length encoding
#1473
opened Jan 20, 2026 by
JackXu0
Loading…
fix: remove enforced CPU initialization assertion for AMD GPUs
#1471
opened Jan 20, 2026 by
Vivicai1005
Loading…
Fix multi‑turn loss masking, clarify qwen25/qwen3 types, and strengthen mask tests
#1418
opened Jan 14, 2026 by
Daucloud
Loading…
issue #1414 fix: correct reward normalization for unequal group sizes
#1415
opened Jan 14, 2026 by
ccggddmm
Loading…
Add support for per token reward/advantages with a custom_reward_post_process_path
#1389
opened Jan 12, 2026 by
vpj
Loading…
[Feature] Add rollout concurrency argument for full async training
#1310
opened Jan 3, 2026 by
yitianlian
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-12-25.