-
Notifications
You must be signed in to change notification settings - Fork 4.8k
Pull requests: jingyaogong/minimind
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[mod & add] fix spo algorithm, add dapo and cispo algorithm in the RLAIF part.
#658
opened Jan 30, 2026 by
vanking20000918
Loading…
[docs] Fix wording in RLHF section of README.md file
#653
opened Jan 27, 2026 by
vanking20000918
Loading…
feat: add merge_lora.py to support merging LoRA weights into base model
#569
opened Dec 5, 2025 by
dyhuachi
Loading…
fix: Loading LoRA parameters which saved from multi-card training
#523
opened Nov 6, 2025 by
yuyu5333
Loading…
增加可选的MLA支持、修复模型内部精度一致,优化代码add mla, fix model dtype, improve codes
#240
opened Feb 28, 2025 by
Zephor5
Loading…
ProTip!
What’s not been updated in a month: updated:<2026-01-12.