README: describe another paperbot compulsion

This commit is contained in:
Bryan Bishop 2013-01-27 08:01:05 -06:00
parent dfb9b34c5c
commit 88aff1a06c

View File

@ -6,7 +6,7 @@ Paperbot is an IRC bot that fetches academic papers. It monitors all conversatio
<div id="deets" /> <div id="deets" />
## deets ## deets
All content is scraped using [zotero/translators](https://github.com/zotero/translators). These are javascript scrapers that work on a large number of academic publisher sites and are actively maintained. Paperbot offloads links to [zotero/translation-server](https://github.com/zotero/translation-server), which runs the zotero scrapers headlessly in a gecko and xulrunner environment. All content is scraped using [zotero/translators](https://github.com/zotero/translators). These are javascript scrapers that work on a large number of academic publisher sites and are actively maintained. Paperbot offloads links to [zotero/translation-server](https://github.com/zotero/translation-server), which runs the zotero scrapers headlessly in a gecko and xulrunner environment. The scrapers return metadata and a link to the pdf. Then paperbot fetches that particular pdf. Also, sometimes in IRC someone drops a link straight to a pdf, which paperbot is also happy to compulsively archive.
* [zotero/translators](https://github.com/zotero/translators) * [zotero/translators](https://github.com/zotero/translators)
* [zotero/translation-server](https://github.com/zotero/translation-server) * [zotero/translation-server](https://github.com/zotero/translation-server)