minimind/trainer
2025-12-09 13:01:38 +08:00
..
train_distill_reason.py [fix] cuda memory #559 2025-12-01 16:17:43 +08:00
train_distillation.py [fix] cuda memory #559 2025-12-01 16:17:43 +08:00
train_dpo.py [fix] cuda memory #559 2025-12-01 16:17:43 +08:00
train_full_sft.py [fix] cuda memory #559 2025-12-01 16:17:43 +08:00
train_grpo.py [fix] cuda memory #559 2025-12-01 16:17:43 +08:00
train_lora.py [feat] release memory 2025-11-27 19:39:49 +08:00
train_ppo.py [fix] cuda memory #559 2025-12-01 16:17:43 +08:00
train_pretrain.py [fix] cuda memory #559 2025-12-01 16:17:43 +08:00
train_spo.py [fix] cuda memory #559 2025-12-01 16:17:43 +08:00
trainer_utils.py [fix] dtype & lr 2025-12-09 13:01:38 +08:00