mirror of
https://github.com/datawhalechina/llms-from-scratch-cn.git
synced 2026-05-01 11:58:17 +08:00
| .. | ||
| .keep | ||
| 2.1理解词嵌入.ipynb | ||
| 2.2文本分词(序列化).ipynb | ||
| 2.3将令牌转换为令牌 ID.ipynb | ||
| 2.4添加特殊上下文tokens.ipynb | ||
| 2.5 字节对编码(BPE).ipynb | ||
| 2.6使用滑动窗口进行数据采样.ipynb | ||
| 2.7 构建词符嵌入.ipynb | ||
| 2.8词位置编码.ipynb | ||
| 2.文本数据处理.ipynb | ||