minimind/trainer
2025-10-23 19:08:42 +08:00
..
train_distill_reason.py [fix] loss-issues-430 2025-10-23 19:08:42 +08:00
train_distillation.py [fix] loss-issues-430 2025-10-23 19:08:42 +08:00
train_dpo.py [fix] loss-issues-430 2025-10-23 19:08:42 +08:00
train_full_sft.py [fix] loss-issues-430 2025-10-23 19:08:42 +08:00
train_grpo.py [fix] loss-issues-430 2025-10-23 19:08:42 +08:00
train_lora.py [fix] loss-issues-430 2025-10-23 19:08:42 +08:00
train_ppo.py [fix] loss-issues-430 2025-10-23 19:08:42 +08:00
train_pretrain.py [fix] loss-issues-430 2025-10-23 19:08:42 +08:00
train_spo.py [fix] loss-issues-430 2025-10-23 19:08:42 +08:00