6bfede2657Updated to see github links
Subramanya N
2022-02-16 21:27:26 -0500
cc4b15efb5updated with github links from paperswithcode
Subramanya N
2022-02-14 05:02:19 -0500
d7a303b410add thumbnails for papers, which apparently ppl like
master
Andrej Karpathy
2022-02-13 18:30:14 -0800
f980c7947alink to arxiv-sanity-lite instead of directly to arxiv
Andrej Karpathy
2022-01-22 16:36:11 -0800
48a7e01aa2we will only send emails to serious users
Andrej Karpathy
2022-01-22 16:33:13 -0800
23b0e109bffix bug in script due to schema change of pids variable earlier
Andrej Karpathy
2021-12-19 11:50:50 -0800
759f7e73e6fix bug in pagination, clean up the approach a bit more
Andrej Karpathy
2021-12-18 23:25:45 -0800
c3cb157c9ffirst version of pagination w00t w00t! it's a bit hacky i think, should be possible to improve this code and make it smaller and cleaner and etc.
Andrej Karpathy
2021-12-18 21:14:44 -0800
4210a73d15remove recommendation by default on / endpoint. this was causing some unintended confusion, e.g. the search function was using time_filter 7 and giving bad results from the main page. also getting recommendations takes time and is slowish, so only do it on actual demand
Andrej Karpathy
2021-12-18 20:21:04 -0800
0f21e69125Merge branch 'ajdintrejic-master'
Andrej Karpathy
2021-12-18 15:06:36 -0800
6e85778b24change the way we recommend papers. we'll do it per tag instead of just mixing everything together. i believe this should produce higher quality results
Andrej Karpathy
2021-12-18 14:53:57 -0800
7a438d9c46Made site a bit more mobile friendly
Ajdin Trejic
2021-12-18 23:30:40 +0100
65d4bb8415some sleep is good for you
Andrej Karpathy
2021-12-10 20:28:34 -0800
cd2a88f375when a user is logged in and hits the default endpoint '/' show recommendations over the last week instead of just totally random recent papers, by default
Andrej Karpathy
2021-12-10 09:15:29 -0800
33b2b018abkind of big changes here: we can now inspect & see the most positive and negative words for a trained svm, to help tune the value C. then there is also the UI for setting value C in the SVM. Finally the value of C I adjusted to default to 0.01 (was 0.1 before) because the results and the weights look more sensible based on manual inspection. We need some dataset of people libraries in order to potentially cross-validate a good value C automatically. For now there are not enough active users of the site that such a thing could be attempted and succeed. Doing my best now just by eyeballing
Andrej Karpathy
2021-12-08 00:17:37 -0800
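Note on the commit above: inspecting the most positive and most negative words of a trained linear SVM amounts to sorting the vocabulary by the learned weight of each term. The following is a minimal sketch, not the repository's code. The function name `top_words` and the plain-list inputs are assumptions made for illustration; in practice the weights would come from a fitted linear model and the vocabulary from the tfidf vectorizer.

```python
def top_words(weights, vocab, k=5):
    """Return the k most positive and k most negative (word, weight) pairs.

    weights: per-term weights of a linear classifier (hypothetical input)
    vocab:   the terms, in the same order as weights
    """
    # sort terms from largest to smallest weight
    pairs = sorted(zip(vocab, weights), key=lambda p: p[1], reverse=True)
    positive = pairs[:k]        # strongest evidence for the tag
    negative = pairs[::-1][:k]  # strongest evidence against the tag
    return positive, negative
```

With a smaller C the model is regularized more strongly, so the weights shrink and the top words tend to look less like noise, which is the kind of manual check the commit describes.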
6be574a0fehah don't crash if a user submits an invalid query, better to return empty results. in the future we may want a helpful error message too
Andrej Karpathy
2021-12-07 22:56:09 -0800
4ec9a75293add a secret GET argument svm_c that changes the C value in the SVM. no UI for this yet :)
Andrej Karpathy
2021-12-07 22:52:13 -0800
6308e0cfa4list out the tags i think
Andrej Karpathy
2021-12-07 22:33:50 -0800
28163bfcb7adding the user account into the email, ty @Randl for the suggestion. I think this should be ok to include in plain text here because it is an email direct to the user
Andrej Karpathy
2021-12-07 22:27:14 -0800
1153a70563print the total number of emails sent
Andrej Karpathy
2021-12-04 00:39:55 -0800
6df5f1d178add requirements.txt ty @Prakyathkantharaju and @e-tornike for help
Andrej Karpathy
2021-12-02 16:08:05 -0800
31e02f267chandle the edge case of zero positives... sigh
Andrej Karpathy
2021-11-30 23:34:47 -0800
fe96775e6efix issues and improve logic of removing tags
Andrej Karpathy
2021-11-30 22:57:59 -0800
9c00c8dacddisallow the special tag null, which can result if user escapes the prompt
Andrej Karpathy
2021-11-30 22:51:16 -0800
a01c637cc8tweak email sending
Andrej Karpathy
2021-11-29 20:45:24 -0800
d5b91270a9allow to use fewer documents for training tfidf features to prevent OOMs
Andrej Karpathy
2021-11-29 15:38:36 -0800
e182dda381make send_emails into a proper script and add a few more widgets
Andrej Karpathy
2021-11-29 09:29:59 -0800
6d7f648384visually distinguish the special 'all' tag
Andrej Karpathy
2021-11-28 08:23:51 -0800
8947dd3fc5tweak search scoring for better results, i think
Andrej Karpathy
2021-11-27 18:09:17 -0800
aff7c89695use the older arxiv-sanity colorscheme for header
Andrej Karpathy
2021-11-27 17:15:40 -0800
a83dbb2138it doesn't align, the ocd is killing me
Andrej Karpathy
2021-11-27 14:53:14 -0800
5990a938a4add few more stats
Andrej Karpathy
2021-11-27 14:47:28 -0800
1769b7b02fminor profile page html/css cleanup
Andrej Karpathy
2021-11-27 14:31:18 -0800
d236a5dff9small tweaks to not do useless work in special cases
Andrej Karpathy
2021-11-27 13:06:00 -0800
a169f0fa61adding an about page as well
Andrej Karpathy
2021-11-27 11:47:17 -0800
6c87055349slight readme tweak
Andrej Karpathy
2021-11-27 11:41:04 -0800
6d4d1eb78dtweak readme
Andrej Karpathy
2021-11-27 11:38:31 -0800
6685b0a05fi think this detail matters...
Andrej Karpathy
2021-11-27 11:23:35 -0800
5eece3f992add email sending script to repo
Andrej Karpathy
2021-11-27 11:19:55 -0800
49995465dfadd ability to store an email for a user for recommendations
Andrej Karpathy
2021-11-27 11:04:36 -0800
970a9e24cfmaintain last active state for users
Andrej Karpathy
2021-11-27 09:37:02 -0800
47fba66712add a new stats page
Andrej Karpathy
2021-11-26 21:57:48 -0800
6a0ff6a9bfclarify even further
Andrej Karpathy
2021-11-26 21:10:22 -0800
b52b64e0f7don't show the +/- controls for tags when a person is not logged in, it's confusing
Andrej Karpathy
2021-11-26 21:09:10 -0800
e4fe77d118show user warning if they are not logged in that things won't work
Andrej Karpathy
2021-11-26 20:57:20 -0800
c3ce3e3f70update the screenshot since the interface changed quite a bit
Andrej Karpathy
2021-11-26 20:33:10 -0800
4538eedb84update the readme
Andrej Karpathy
2021-11-26 20:31:57 -0800
9c88a4eea4use the process exit code to communicate whether any updates successfully made it into the database at all
Andrej Karpathy
2021-11-26 20:19:48 -0800
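Note on the commit above: a script can report through its exit code whether any update reached the database, so that a calling shell script or Makefile can skip later steps when nothing changed. The sketch below assumes that 0 means "at least one update" and 1 means "none"; the commit message does not say which values are used, and `exit_code_for` is a hypothetical helper.

```python
import sys

def exit_code_for(num_updated):
    """Map the number of database updates to a process exit code.

    Assumed convention: 0 if anything was written, 1 otherwise.
    """
    return 0 if num_updated > 0 else 1

# usage at the end of a script, where num_updated counts database writes:
#   sys.exit(exit_code_for(num_updated))
```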
d2582c5b77if the very first API call fails it is very unlikely that there are new papers at all, no need to try multiple pages
Andrej Karpathy
2021-11-26 20:10:32 -0800
aa877c9397when writing features do it safely and atomically
Andrej Karpathy
2021-11-26 20:00:37 -0800
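Note on the commit above: a common way to write a file safely and atomically is to write to a temporary file in the same directory and then rename it over the destination. The sketch below shows that pattern. It is an assumption that the repository does something similar, and `atomic_pickle_dump` is a hypothetical name.

```python
import os
import pickle
import tempfile

def atomic_pickle_dump(obj, path):
    """Pickle obj to path so that readers never see a half-written file."""
    dirname = os.path.dirname(os.path.abspath(path))
    # the temp file lives in the target directory so the rename stays on one filesystem
    fd, tmp_path = tempfile.mkstemp(dir=dirname, suffix=".tmp")
    try:
        with os.fdopen(fd, "wb") as f:
            pickle.dump(obj, f)
            f.flush()
            os.fsync(f.fileno())  # push the bytes to disk before renaming
        os.replace(tmp_path, path)  # atomically replaces the destination
    except BaseException:
        # clean up the temp file if anything failed before the rename
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise
```

A process reading `path` at any moment sees either the old file or the complete new one.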
c3161b2a49do not reveal username since they are kind of secret now
Andrej Karpathy
2021-11-26 17:11:19 -0800
5b5b503a93let things breathe a bit more
Andrej Karpathy
2021-11-26 16:44:21 -0800
c5b5c7833cand i think that's it, we now support user accounts (lite)! sweet.
Andrej Karpathy
2021-11-26 16:38:36 -0800
54157c6e9cok here we go, the beginning of user profiles. for now we assume there is a single hard-coded user root
Andrej Karpathy
2021-11-26 11:07:16 -0800
dc55146e03few notes on some outstanding todos
Andrej Karpathy
2021-11-26 10:28:49 -0800
3e2d1248d8speed up time filtering by 20X by loading all of metas into memory once instead of querying it item by item in a massive loop
Andrej Karpathy
2021-11-26 10:22:52 -0800
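Note on the commit above: the speedup comes from reading all paper metadata into one in-memory dict and filtering it in a single pass, instead of issuing one database lookup per paper inside a loop. The sketch below illustrates the in-memory pass only. The `_time` field (a Unix timestamp) and the function name `recent_pids` are assumptions made for illustration.

```python
import time

def recent_pids(metas, days, now=None):
    """Return ids of papers newer than `days` days.

    metas: dict mapping paper id -> metadata dict, already loaded into memory;
           each metadata dict is assumed to carry a '_time' Unix timestamp
    """
    now = time.time() if now is None else now
    cutoff = now - days * 86400  # seconds in a day
    # one pass over an in-memory dict, no per-item database query
    return [pid for pid, meta in metas.items() if meta["_time"] >= cutoff]
```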
3d5bbb9851optimization: read only metas if that's all that we need to complete the request. that said, i expected this to run much faster and it doesn't. i think the way i'm using sqlites here is not exactly efficient or appropriate and i have to re-think the database schema. e.g. getting random papers over the last 7 days should be almost instantaneous, but currently this request takes like 3 seconds. have to better understand why this is happening
Andrej Karpathy
2021-11-26 09:51:48 -0800
b283c85c72this was bothering me
Andrej Karpathy
2021-11-26 09:36:09 -0800
8c74150337eliminate a whole spurious endpoint of /search and remove code duplication
Andrej Karpathy
2021-11-26 09:31:06 -0800
470bd00563refactor and clean up the code in preparation for some consolidation, coming up
Andrej Karpathy
2021-11-26 09:10:35 -0800
f565eba1c7use jinja templates so there's no copy paste code all over the place
Andrej Karpathy
2021-11-25 14:45:08 -0800
46f556caf6example makefile
Andrej Karpathy
2021-11-25 13:51:52 -0800
1ed6e3f1b0ok now we can sequester all the database files into data/ folder so everything is nice and clean yay
Andrej Karpathy
2021-11-25 13:47:45 -0800
77279e1777sequester all file system IO ops only to db.py, so it's not total chaos
Andrej Karpathy
2021-11-25 13:28:04 -0800
bd11865a49add an option to break out early when we've pulled in all new papers most likely
Andrej Karpathy
2021-11-24 09:15:36 -0800
cf1bef6f53big new feature: ability to inspect any paper to see the raw tfidf tokens and their weights that summarize the paper, and which powers the SVM recommendation engine. basically a bit of a debugging / insight feature, but a really good sanity check that papers are being properly represented
Andrej Karpathy
2021-11-21 20:51:01 -0800
e5798ddb2fmake the header link to root of the site as the case should always be ok cool
Andrej Karpathy
2021-11-21 14:03:17 -0800
548ee210dfbetter default parameters, based on qualitative inspection of tfidf features and word vectors
Andrej Karpathy
2021-11-21 13:46:14 -0800
8be24dae43allow for full sorting, slice and dicing with all options in the UI, and the previous shortcuts. now have a lot more flexibility to slice and dice as desired
Andrej Karpathy
2021-11-16 21:39:43 -0800