llms-from-scratch-cn/Codes/ch07/04_preference-tuning-with-dpo/README.md
2024-08-11 23:00:36 +08:00

366 B

Chapter 7: Finetuning to Follow Instructions