llms-from-scratch-cn/ch04/01_main-chapter-code/README.md
2024-02-28 23:31:55 +08:00

502 B

Chapter 4: Implementing a GPT model from Scratch To Generate Text

  • ch04.ipynb contains all the code as it appears in the chapter
  • previous_chapters.py is a Python module that contains the MultiHeadAttention module from the previous chapter, which we import in ch04.ipynb to create the GPT model
  • gpt.py is a standalone Python script file with the code that we implemented thus far, including the GPT model we coded in this chapter