Commit Graph

154 Commits

Author SHA1 Message Date
Ethan-Chen-plus
65cc17a68c
Update README.md 2024-08-15 10:19:06 +08:00
Ethan-Chen-plus
31f25b85cb
Merge pull request #46 from jwu049/main
add the new chapters for part 4
2024-08-14 15:32:51 +08:00
kewei
a9fbbf5bac 0813 2024-08-13 16:55:27 +08:00
jwu049
8e5301e4ed add the code of ch07 2024-08-11 23:00:36 +08:00
jwu049
3076c492ba modify the name for 4.1 and 4.2 2024-08-06 23:17:02 +08:00
jwu049
fcf6ca84ff add the new chapters for part 4 2024-08-06 23:11:45 +08:00
jwu049
9633433eb0 add the new chapters for part 4 2024-08-06 23:07:20 +08:00
Ethan-Chen-plus
d323d5cf26
Merge pull request #45 from 0-yy-0/main
Add MiniCPM to Model_Architecture_Discussions
2024-08-01 16:10:23 +08:00
gaoliye
0a52a81a03 update MiniCpm 2024-07-25 16:49:18 +00:00
gaoliye
2337d6e76f init 2024-07-08 01:58:53 +00:00
gaoliye
aba6c85144 tmp 2024-06-25 09:02:55 +08:00
Ethan-Chen-plus
6f214b20d5
Update README.md 2024-06-13 14:00:07 +08:00
Ethan-Chen-plus
d37b1e27bf
Merge pull request #40 from Ethan-Chen-plus/main
add GLM4 and RWKV2~6
2024-06-10 22:53:11 +08:00
kewei
95f505f8d6 new 2024-06-10 22:52:19 +08:00
Ethan-Chen-plus
980692833b
Merge pull request #39 from Ethan-Chen-plus/main
Add RWKV2~V6 and GLM4
2024-06-10 22:50:47 +08:00
kewei
e2360bc0a2 0610new 2024-06-10 22:49:34 +08:00
kewei
894b20d3c3 0610 2024-06-10 17:00:23 +08:00
Ethan-Chen-plus
099c1c620f
Merge pull request #38 from Ethan-Chen-plus/main
add olmo and gptj
2024-06-05 19:23:02 +08:00
kewei
daade3d142 add gptj 2024-06-05 19:22:12 +08:00
kewei
5293898b58 add olmo 2024-06-05 17:13:49 +08:00
Ethan-Chen-plus
ee25faa084
Merge pull request #36 from Ethan-Chen-plus/main
add more models
2024-06-01 18:26:54 +08:00
kewei
5533175bcc add pangu 2024-06-01 18:22:51 +08:00
kewei
4c86b986e8 add openelm 2024-06-01 17:33:19 +08:00
kewei
7fa3701ca8 0601 2024-06-01 14:52:18 +08:00
Ethan-Chen-plus
b5be67a834
Merge pull request #35 from Ethan-Chen-plus/main
add mamba
2024-05-31 17:45:48 +08:00
kewei
49bd1bccf2 add mamba 2024-05-31 17:07:42 +08:00
kewei
1df8bae480 add mamba 2024-05-31 17:04:42 +08:00
Ethan-Chen-plus
df69f68a53
Merge pull request #34 from Ethan-Chen-plus/main
add rwkv
2024-05-31 16:45:03 +08:00
kewei
2bd03665dd add rwkv 2024-05-31 16:44:23 +08:00
Ethan-Chen-plus
b377e1a60d
Merge pull request #33 from jodie-kang/main
Add the translation of Chapter 6
2024-05-28 20:33:38 +08:00
kjq_glb
7a18d5b868 Add the latest code 2024-05-28 18:20:54 +08:00
kewei
cbc1c27962 0527 2024-05-27 15:17:00 +08:00
kewei
f48e08b34f 0527 2024-05-27 15:11:03 +08:00
kjq_glb
310cdb21f5 Add the translation of Chapter 6 2024-05-22 15:53:12 +08:00
Ethan-Chen-plus
d665982073
Merge pull request #32 from SamanthaTso/main
Chapter 3 translation: 3.3 - 3.7 ("Build a Large Language Model (From Scratch)") (including code)
2024-05-17 11:13:44 +08:00
SamanthaTso
1ef4490dae [Book] Add info of the translator of Ch 3.3-3.7 2024-05-16 22:24:31 +08:00
SamanthaTso
3099672cd0 [Book] correct code and its output 2024-05-16 22:22:49 +08:00
SamanthaTso
a5797fbc55 [Book] update output from code to markdown 2024-05-16 21:45:46 +08:00
SamanthaTso
ae26ef8d1e [book] initial commit for ch3.3-3.7 2024-05-16 21:10:18 +08:00
Ethan-Chen-plus
33303ef91b
Merge pull request #31 from graility/main
5.3
2024-05-16 17:06:54 +08:00
graility
aa1a5c4354
Add files via upload 2024-05-15 16:12:47 +08:00
tan90º
c977dbd21a
Merge pull request #30 from Tangent-90C/main
merge fixed
2024-05-12 21:05:44 +08:00
jianuo
75267fbf8e Merge remote-tracking branch 'origin/main' 2024-05-12 21:03:11 +08:00
jianuo
de1556aefd Remove a redundant file 2024-05-12 20:58:06 +08:00
tan90º
57baf4ceea
Merge pull request #29 from Tangent-90C/main
Update the implementation of the ChatGLM3 model
2024-05-12 20:56:20 +08:00
tan90º
cdd57ed068
Merge branch 'main' into main 2024-05-12 20:55:53 +08:00
jianuo
0a0fafb95a Update the implementation of the ChatGLM3 model 2024-05-12 20:48:39 +08:00
Ethan-Chen-plus
4d81949de8
Merge pull request #27 from 0-yy-0/gly
add: translation of 5.1
2024-05-12 17:48:57 +08:00
Ethan-Chen-plus
10f9c025b1
Merge pull request #26 from prime234/feat-branch
update ch02-2.7
2024-05-12 17:48:35 +08:00
Ethan-Chen-plus
27ee897f83
Merge pull request #28 from Tsumugii24/main
update 2.4 translation
2024-05-12 17:48:20 +08:00