llms-from-scratch-cn/Codes/ch05/01_main-chapter-code/README.md
2024-06-10 17:00:23 +08:00

8 lines
578 B
Markdown
Raw Permalink Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

# Chapter 5: 使用未标记数据进行预训练
- [ch05.ipynb](ch05.ipynb) 本章所有代码
- [previous_chapters.py](previous_chapters.py) 在前面章节中的包含 `MultiHeadAttention`的python代码模块在这里的未标记数据模型预训练过程中我们会用到它
- [train.py](train.py) 一个独立的python脚本文件包含我们在[ch05.ipynb](ch05.ipynb)中实现的GPT模型训练部分代码
- [generate.py](generate.py) 一个独立的python脚本文件包含我们在[ch05.ipynb](ch05.ipynb)中实现的GPT模型权重加载和应用代码