Commit Graph

  • d5f7a0b38a
    Merge 8fc8cf5c38 into d7a303b410 Ihnat Vyhuliar 2025-02-28 21:38:34 +0100
  • 8fc8cf5c38 Orm database with pgsql IhnatVyhuliar 2025-02-28 21:01:44 +0100
  • 8085608d62
    Merge pull request #14 from WyszukiwarkaPublikacji/add-vector-db Bartosz Trojan 2025-02-21 16:01:14 +0100
  • 940f603030 Add a vector db kachim2 2025-02-16 21:26:23 +0100
  • 167fc1607d fix requirements.txt pzarczynski 2025-02-15 16:44:19 +0100
  • 09565e6cf4 add .python-version to .gitignore pzarczynski 2025-02-15 16:10:03 +0100
  • d4f955ebc9
    SMILES input handling @btrojan-official Jean Strychaluk 2025-02-10 18:16:49 +0100
  • 9a48d1404d added chemical_formulas_rank template (to implement later) btrojan-official 2025-02-07 18:23:10 +0100
  • c6dd272679 input for SMILES btrojan-official 2025-02-07 18:18:23 +0100
  • 8fabd2f8c1 Added .venv to .gitignore btrojan-official 2025-01-27 19:03:30 +0100
  • 4bb4dd14cd added config for black and isort btrojan-official 2025-01-27 17:46:43 +0100
  • 13d77f609a Added .venv to gitignore btrojan-official 2025-01-27 17:42:34 +0100
  • 9fb1083f56
    Merge f6a0ae171a into d7a303b410 N-McA 2023-06-19 09:23:04 -0700
  • f6a0ae171a correct comment Nat McAleese 2023-06-19 09:22:29 -0700
  • 7b59d87982 remove trailing space Nat McAleese 2023-06-19 09:20:52 -0700
  • ffea30770c Add overflow: hidden to thumbails to improve mobile Nat McAleese 2023-06-19 09:18:17 -0700
  • 1a5c4f9889
    Merge 55090d8b72 into d7a303b410 Hemang Joshi 2023-01-30 10:07:32 +0530
  • 55090d8b72
    added star-history Hemang Joshi 2023-01-30 10:07:08 +0530
  • 909e4f4604
    Merge 36e6cd2e03 into d7a303b410 Martín Baigorria Alonso 2023-01-08 21:37:05 -0500
  • 36e6cd2e03 Minor improvement to pagination mbaigorria 2023-01-08 21:34:26 -0500
  • cbcb95fd3c
    Merge 53e9746b13 into d7a303b410 pbkompasz 2022-11-02 10:21:20 +0200
  • 53e9746b13 Fix overflow Peter Kompasz 2022-11-02 10:19:14 +0200
  • b7317a0065
    Merge 25c76c9d83 into d7a303b410 Pranay Aryal 2022-10-16 16:05:06 -0400
  • 25c76c9d83 spelling error pranayaryal 2022-10-16 16:04:16 -0400
  • 96a06403bd
    Merge f7f35b4a2e into d7a303b410 Ben Rush 2022-05-06 09:53:08 -0700
  • f7f35b4a2e Refactor for performance Ben Rush 2022-05-06 09:49:17 -0700
  • bea33f6d71 Add twitter functionality similar to original site Ben Rush 2022-05-05 20:08:33 -0700
  • ee826a86ee I propose to joint fit and transform methods for the sake of time. Fedor Krasnov 2022-02-28 15:20:29 +0300
  • ffe6cb7851
    Merge 6bfede2657 into d7a303b410 Subramanya N 2022-02-16 21:28:17 -0500
  • 6bfede2657 Updated to see github links Subramanya N 2022-02-16 21:27:26 -0500
  • cc4b15efb5 updated with github links from paperswithcode Subramanya N 2022-02-14 05:02:19 -0500
  • d7a303b410 add thumbnails for papers, which apparently ppl like master Andrej Karpathy 2022-02-13 18:30:14 -0800
  • f980c7947a link to arxiv-sanity-lite instead of directly to arxiv Andrej Karpathy 2022-01-22 16:36:11 -0800
  • 48a7e01aa2 we will only send emails to serious users Andrej Karpathy 2022-01-22 16:33:13 -0800
  • 23b0e109bf fix bug in script due to schema change of pids variable earlier Andrej Karpathy 2021-12-19 11:50:50 -0800
  • 759f7e73e6 fix bug in pagination, clean up the approach a bit more Andrej Karpathy 2021-12-18 23:25:45 -0800
  • c3cb157c9f first version of pagination w00t w00t! it's a bit hacky i think, should be possible to improve this code and make it smaller and cleaner and etc. Andrej Karpathy 2021-12-18 21:14:44 -0800
  • 4210a73d15 remove recommendation by default on / endpoint. this was causing some unintended confusion, e.g. the search function was using time_filter 7 and giving bad results from the main page. also getting recommendations takes time and is slowish, so only do it on actual demand Andrej Karpathy 2021-12-18 20:21:04 -0800
  • 0f21e69125 Merge branch 'ajdintrejic-master' Andrej Karpathy 2021-12-18 15:06:36 -0800
  • 68e4e0f3bd Merge branch 'master' of https://github.com/ajdintrejic/arxiv-sanity-lite into ajdintrejic-master Andrej Karpathy 2021-12-18 15:01:40 -0800
  • 6e85778b24 change the way we recommend papers. we'll do it per tag instead of just mixing everything together. i believe this should produce higher quality results Andrej Karpathy 2021-12-18 14:53:57 -0800
  • 7a438d9c46 Made site a bit more mobile friendly Ajdin Trejic 2021-12-18 23:30:40 +0100
  • 65d4bb8415 some sleep is good for you Andrej Karpathy 2021-12-10 20:28:34 -0800
  • cd2a88f375 when a user is logged in and hits the default endpoint '/' show recommendations over the last week instead of just totally random recent papers, by default Andrej Karpathy 2021-12-10 09:15:29 -0800
  • 33b2b018ab kind of big changes here: we can now inspect & see the most positive and negative words for a trained svm, to help tune the value C. then there is also the UI for setting value C in the SVM. Finally the value of C I adjusted to default to 0.01 (was 0.1 before) because the results and the weights look more sensible based on manual inspection. We need some dataset of people libraries in order to potentially cross-validate a good value C automatically. For now there are not enough active users of the site that such a thing could be attempted and succeed. Doing my best now just by eyeballing Andrej Karpathy 2021-12-08 00:17:37 -0800
  • 6be574a0fe hah don't crash if a user submits an invalid query, better to return empty results. in the future we may want a helpful error message too Andrej Karpathy 2021-12-07 22:56:09 -0800
  • 4ec9a75293 add a secret GET argument svm_c that changes the C value in the SVM. no UI for this yet :) Andrej Karpathy 2021-12-07 22:52:13 -0800
  • 6308e0cfa4 list out the tags i think Andrej Karpathy 2021-12-07 22:33:50 -0800
  • 28163bfcb7 adding the user account into the email, ty @Randl for the suggestion. I think this should be ok to include in plain text here because it is an email direct to the user Andrej Karpathy 2021-12-07 22:27:14 -0800
  • 1153a70563 print the total number of emails sent Andrej Karpathy 2021-12-04 00:39:55 -0800
  • 6df5f1d178 add requirements.txt ty @Prakyathkantharaju and @e-tornike for help Andrej Karpathy 2021-12-02 16:08:05 -0800
  • 65b3f0e4fd added requirements Prakyath 2021-12-01 19:03:00 -0600
  • 31e02f267c handle the edge case of zero positives... sigh Andrej Karpathy 2021-11-30 23:34:47 -0800
  • fe96775e6e fix issues and improve logic of removing tags Andrej Karpathy 2021-11-30 22:57:59 -0800
  • 9c00c8dacd disallow the special tag null, which can result if user escapes the prompt Andrej Karpathy 2021-11-30 22:51:16 -0800
  • a01c637cc8 tweak email sending Andrej Karpathy 2021-11-29 20:45:24 -0800
  • d5b91270a9 allow to use fewer documents for training tfidf features to prevent OOMs Andrej Karpathy 2021-11-29 15:38:36 -0800
  • e182dda381 make send_emails into a proper script and add a few more widgets Andrej Karpathy 2021-11-29 09:29:59 -0800
  • 6d7f648384 visually distinguish the special 'all' tag Andrej Karpathy 2021-11-28 08:23:51 -0800
  • 8947dd3fc5 tweak search scoring for better results, i think Andrej Karpathy 2021-11-27 18:09:17 -0800
  • aff7c89695 use the older arxiv-sanity colorscheme for header Andrej Karpathy 2021-11-27 17:15:40 -0800
  • a83dbb2138 it doesn't align, the ocd is killing me Andrej Karpathy 2021-11-27 14:53:14 -0800
  • 5990a938a4 add few more stats Andrej Karpathy 2021-11-27 14:47:28 -0800
  • 1769b7b02f minor profile page html/css cleanup Andrej Karpathy 2021-11-27 14:31:18 -0800
  • d236a5dff9 small tweaks to not do useless work in special cases Andrej Karpathy 2021-11-27 13:06:00 -0800
  • a169f0fa61 adding an about page as well Andrej Karpathy 2021-11-27 11:47:17 -0800
  • 6c87055349 slight readme tweak Andrej Karpathy 2021-11-27 11:41:04 -0800
  • 6d4d1eb78d tweak readme Andrej Karpathy 2021-11-27 11:38:31 -0800
  • 6685b0a05f i think this detail matters... Andrej Karpathy 2021-11-27 11:23:35 -0800
  • 5eece3f992 add email sending script to repo Andrej Karpathy 2021-11-27 11:19:55 -0800
  • 49995465df add ability to store an email for a user for recommendations Andrej Karpathy 2021-11-27 11:04:36 -0800
  • 970a9e24cf maintain last active state for users Andrej Karpathy 2021-11-27 09:37:02 -0800
  • 47fba66712 add a new stats page Andrej Karpathy 2021-11-26 21:57:48 -0800
  • 6a0ff6a9bf clarify even further Andrej Karpathy 2021-11-26 21:10:22 -0800
  • b52b64e0f7 don't show the +/- controls for tags when a person is not logged in, it's confusing Andrej Karpathy 2021-11-26 21:09:10 -0800
  • e4fe77d118 show user warning if they are not logged in that things won't work Andrej Karpathy 2021-11-26 20:57:20 -0800
  • c3ce3e3f70 update the screenshot since the interface changed quite a bit Andrej Karpathy 2021-11-26 20:33:10 -0800
  • 4538eedb84 update the readme Andrej Karpathy 2021-11-26 20:31:57 -0800
  • 9c88a4eea4 use the process exit code to communicate whether any updates successfully made it into the database at all Andrej Karpathy 2021-11-26 20:19:48 -0800
  • d2582c5b77 if the very first API call fails it is very unlikely that there are new papers at all, no need to try multiple pages Andrej Karpathy 2021-11-26 20:10:32 -0800
  • aa877c9397 when writing features do it safely and atomically Andrej Karpathy 2021-11-26 20:00:37 -0800
  • c3161b2a49 do not reveal username since they are kind of secret now Andrej Karpathy 2021-11-26 17:11:19 -0800
  • 5b5b503a93 let things breathe a bit more Andrej Karpathy 2021-11-26 16:44:21 -0800
  • c5b5c7833c and i think that's it, we now support user accounts (lite)git commit -m 'and i think that\'s it, we now support user accounts litegit status sweet.'! sweet. Andrej Karpathy 2021-11-26 16:38:36 -0800
  • 54157c6e9c ok here we go, the beginning of user profiles. for now we assume there is a single hard-coded user root Andrej Karpathy 2021-11-26 11:07:16 -0800
  • dc55146e03 few notes on some outstanding todos Andrej Karpathy 2021-11-26 10:28:49 -0800
  • 3e2d1248d8 speed up time filtering by 20X by loading all of metas into memory once instead of querying it item by item in a massive loop Andrej Karpathy 2021-11-26 10:22:52 -0800
  • 3d5bbb9851 optimization: read only metas if that's all that we need to complete the request. that said, i expected this to run much faster and it doesn't. i think the way i'm using sqlites here is not exactly efficient or appropriate and i have to re-think the database schema. e.g. getting random papers over the last 7 days should be almost instantaneous, but currently this request takes like 3 seconds. have to better understand why this is happening Andrej Karpathy 2021-11-26 09:51:48 -0800
  • b283c85c72 this was bothering me Andrej Karpathy 2021-11-26 09:36:09 -0800
  • 8c74150337 eliminate a whole spurious endpoint of /search and remove code duplication Andrej Karpathy 2021-11-26 09:31:06 -0800
  • 470bd00563 refactor and clean up the code in preparation for some consolidation, coming up Andrej Karpathy 2021-11-26 09:10:35 -0800
  • f565eba1c7 use jinja templates so there's no copy paste code all over the place Andrej Karpathy 2021-11-25 14:45:08 -0800
  • 46f556caf6 example makefile Andrej Karpathy 2021-11-25 13:51:52 -0800
  • 1ed6e3f1b0 ok now we can sequester all the database files into data/ folder so everything is nice and clean yay Andrej Karpathy 2021-11-25 13:47:45 -0800
  • 77279e1777 sequester all file sytem IO ops only to db.py, so it's not total chaos Andrej Karpathy 2021-11-25 13:28:04 -0800
  • bd11865a49 add an option to break out early when we've pulled in all new papers most likely Andrej Karpathy 2021-11-24 09:15:36 -0800
  • cf1bef6f53 big new feature: ability to inspect any paper to see the raw tfidf tokens and their weights that summarize the paper, and which powers the SVM recommendation engine. basically a bit of a debugging / insight feature, but a really good sanity check that papers are being properly represented Andrej Karpathy 2021-11-21 20:51:01 -0800
  • e5798ddb2f make the header link to root of the site as the case should always be ok cool Andrej Karpathy 2021-11-21 14:03:17 -0800
  • 548ee210df better default parameters, based on qualitative inspection of tfidf features and word vectors Andrej Karpathy 2021-11-21 13:46:14 -0800
  • 8be24dae43 allow for full sorting, slice and dicing with all options in the UI, and the previous shortcuts. now have a lot more flexibility to slice and dice as desired Andrej Karpathy 2021-11-16 21:39:43 -0800