Skip to content

Pull requests: hiyouga/LlamaFactory

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix: pin 12 unpinned action(s)
#10325 opened Mar 26, 2026 by dagecko Loading…
ci: add nginx cache config for Ascend NPU CI environment
#10323 opened Mar 26, 2026 by Goalina Loading…
1 of 2 tasks
[WIP] Support huggingface/kernels
#10319 opened Mar 25, 2026 by zheliuyu Draft
2 tasks
fix: add qwen3_5_moe to MoE configuration in moe.py invalid This doesn't seem right
#10307 opened Mar 21, 2026 by majiayu000 Loading…
feat: clearer train_result metrics log through calculate_tps function
#10288 opened Mar 17, 2026 by UmeanNever Loading…
1 of 2 tasks
[V1]support resume training from checkpoint
#10280 opened Mar 13, 2026 by frozenleaves Loading…
[V1]add init on rank0 for fsdp2
#10264 opened Mar 9, 2026 by jiaqiw09 Loading…
1 of 2 tasks
[v1] support ulysses cp for fsdp2
#10262 opened Mar 9, 2026 by sunyi0505 Loading…
2 tasks done
feat: add LightOnOCR-2 integration for LoRA/QLoRA fine-tuning
#10192 opened Feb 16, 2026 by johnlockejrr Loading…
2 tasks
Fix memory leak on MPS by explicitly clearing cache in trainer step
#10190 opened Feb 14, 2026 by asebaq Loading…
1 of 2 tasks
[v1] Add hyperparams and training docs
#10188 opened Feb 13, 2026 by frozenleaves Loading…
[deps] Add libibverbs for RDMA support
#10185 opened Feb 12, 2026 by RossCZ Loading…
1 of 2 tasks
Feature: experimental fine-tuning comparison
#10172 opened Feb 6, 2026 by caterina0718 Loading…
[feat] Add DeepSpeed ZeRO-3 LoRA checkpoint save support
#10124 opened Jan 22, 2026 by kimberlykang Loading…
2 tasks done
[model] support NVIDIA's Audio-Flamingo-3 audio model
#9740 opened Jan 9, 2026 by vovanphuc Loading…
4 tasks done
Add entropy logging for SFT training path
#9717 opened Jan 5, 2026 by pankd Loading…
Support loss_mask in dataset to control loss calculation for specific turns solved This problem has been already solved
#9630 opened Dec 18, 2025 by CjangCjengh Loading…
2 tasks
Add hf_infer script for inference using HuggingFace backend pending This problem is yet to be addressed
#9370 opened Oct 29, 2025 by WinterShiver Loading…
1 of 2 tasks
support pre-tokenized parquet datasets pending This problem is yet to be addressed
#9351 opened Oct 25, 2025 by AbdulmalikDS Loading…
2 of 3 tasks
Implement LoRA for MoE with support for LoRA injection for nn.parameters pending This problem is yet to be addressed
#9337 opened Oct 23, 2025 by Ziheng-Zhang-AUS Loading…
2 tasks done
ProTip! no:milestone will show everything without a milestone.