Go to file
Joe Rayhawk 24fcf41760 translation server submodule: switch over to kanzure's github and the paperbot branch specifically 2013-02-05 01:21:41 -08:00
modules possibly better sciencedirect handling 2013-01-23 20:01:03 -06:00
phenny@7752b56cc2 Add phenny and translation-server as submodules. 2013-02-05 01:12:39 -08:00
translation-server@4d35648672 translation server submodule: switch over to kanzure's github and the paperbot branch specifically 2013-02-05 01:21:41 -08:00
.gitignore initial commit 2013-01-07 22:27:46 -08:00
.gitmodules translation server submodule: switch over to kanzure's github and the paperbot branch specifically 2013-02-05 01:21:41 -08:00
README.md README: also also 2013-01-27 08:03:14 -06:00
requirements.txt explicitly list some python dependencies 2013-01-27 07:59:38 -06:00

README.md

paperbot

Paperbot is an IRC bot that fetches academic papers. It monitors all conversation for links to scholarly content, then fetches the content and posts a public link. This seems to help enhance the quality of discussion and make us less ignorant.

## deets

All content is scraped using zotero/translators. These are javascript scrapers that work on a large number of academic publisher sites and are actively maintained. Paperbot offloads links to zotero/translation-server, which runs the zotero scrapers headlessly in a gecko and xulrunner environment. The scrapers return metadata and a link to the pdf. Then paperbot fetches that particular pdf. Sometimes in IRC someone drops a link straight to a pdf, which paperbot is also happy to compulsively archive.

## TODO

It would be nice to use multiple proxies to resolve a pdf request.

## active demo

say hi to paperbot on irc.freenode.net ##hplusroadmap

## license

BSD.