minimind/trainer
2025-04-29 20:45:39 +08:00
train_distill_reason.py  250426       2025-04-26 10:05:47 +08:00
train_distillation.py    250426       2025-04-26 10:05:47 +08:00
train_dpo.py             fix bugs     2025-04-29 20:45:39 +08:00
train_full_sft.py        fix bugs     2025-04-29 20:45:39 +08:00
train_lora.py            update lora  2025-04-27 15:45:06 +08:00
train_pretrain.py        250426       2025-04-26 10:05:47 +08:00