-
Notifications
You must be signed in to change notification settings - Fork 410
Pull requests: InternLM/xtuner
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Fix][muon] fix assertion err when dim0 size < world_size
#1611
opened Mar 21, 2026 by
nil0x9
Loading…
[Fix] Support fp32 param preservation during FSDP save and load
#1607
opened Mar 20, 2026 by
HAOCHENYE
Loading…
[Refactor] Reduce memory usage in HardPackDataset via shared memory
#1602
opened Mar 19, 2026 by
HAOCHENYE
Loading…
[Fix] Muon optimizer per-expert orthogonalization for MoE models
#1582
opened Mar 13, 2026 by
CyCle1024
Loading…
[Feature] Add Multi-Token Prediction (MTP) module implementation
#1572
opened Mar 12, 2026 by
HAOCHENYE
Loading…
[Refactor] Rename CELossContext to LMHeadLossContext and refactor loss context base class
#1571
opened Mar 12, 2026 by
HAOCHENYE
Loading…
[Feature] Add Multi-Token Prediction (MTP) module implementation
#1570
opened Mar 12, 2026 by
HAOCHENYE
Loading…
[Refactor] Refactor loss context API to support multiple loss types
#1569
opened Mar 12, 2026 by
HAOCHENYE
Loading…
[Feature] Domino EP support and training optimizations for InternS1 Pro VL
blocked
#1528
opened Mar 3, 2026 by
tina-wen
Loading…
[Optimization] Incremental checkpoint save for dcp on torch 2.7.x (ARM CPU optimization)
npu
#1525
opened Mar 3, 2026 by
tina-wen
Loading…
[Feature] Offload optimizer states to CPU to reduce memory
#1524
opened Mar 3, 2026 by
tina-wen
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2026-03-19.