Code for text generation papers searches on ArXiv, with very manual jekyll site creation.

  • By Lynn Cherny
  • Last update: Dec 14, 2022
  • Comments: 0

text-gen-arxiv-papers

This is the raw files for the gh pages site: https://arnicas.github.io/text-gen-arxiv-papers.

Code used is being gradually cleaned up and checked in. Basically I do most of it manually using pandas, since jekyll is pretty bad at what I needed.

The file scrape.py has the search strings and saves a pickle of the latest data from ArXiv.

The file build_pages.py takes the pickle as an argument and processes it. There are required files and directories etc. I'll try to document more and clean it up for re-use.

Github

https://github.com/arnicas/text-gen-arxiv-papers