Andrej Karpathy
|
d5b91270a9
|
allow to use fewer documents for training tfidf features to prevent OOMs
|
2021-11-29 15:38:36 -08:00 |
|
Andrej Karpathy
|
aa877c9397
|
when writing features do it safely and atomically
|
2021-11-26 20:00:37 -08:00 |
|
Andrej Karpathy
|
77279e1777
|
sequester all file sytem IO ops only to db.py, so it's not total chaos
|
2021-11-25 13:28:04 -08:00 |
|
Andrej Karpathy
|
cf1bef6f53
|
big new feature: ability to inspect any paper to see the raw tfidf tokens and their weights that summarize the paper, and which powers the SVM recommendation engine. basically a bit of a debugging / insight feature, but a really good sanity check that papers are being properly represented
|
2021-11-21 20:51:01 -08:00 |
|
Andrej Karpathy
|
548ee210df
|
better default parameters, based on qualitative inspection of tfidf features and word vectors
|
2021-11-21 13:46:14 -08:00 |
|
Andrej Karpathy
|
13a1d5ff48
|
sequester gross details about database instantiation in the filesystem away from the scripts
|
2021-11-12 21:12:09 -08:00 |
|
Andrej Karpathy
|
194b7f4b22
|
first leet codes
|
2021-11-12 20:40:19 -08:00 |
|