kanshan/llms-from-scratch-cn

mirror of https://github.com/datawhalechina/llms-from-scratch-cn.git synced 2026-01-14 01:07:34 +08:00

jwu049 8e5301e4ed add the code of ch07

2024-08-11 23:00:36 +08:00

366 B

Raw Blame History

Chapter 7: Finetuning to Follow Instructions

create-preference-data-ollama.ipynb: A notebook that creates a synthetic dataset for preference finetuning dataset using Llama 3.1 and Ollama
dpo-from-scratch.ipynb: This notebook implements Direct Preference Optimization (DPO) for LLM alignment