# paperbot

Paperbot is an IRC bot that fetches academic papers. It watches the channel conversation for links to scholarly content, fetches that content, and posts a public link to it. This seems to improve the quality of discussion and make us less ignorant.

<div id="details" />
<div id="deets" />
## deets

All content is scraped using [zotero/translators](https://github.com/zotero/translators). These are JavaScript scrapers that work on a large number of academic publisher sites and are actively maintained. Paperbot offloads links to [zotero/translation-server](https://github.com/zotero/translation-server), which runs the Zotero scrapers headlessly in a Gecko and XULRunner environment. The scrapers return metadata and a link to the PDF, which paperbot then fetches. Sometimes someone in IRC drops a link straight to a PDF, which paperbot is also happy to compulsively archive.

* [zotero/translators](https://github.com/zotero/translators)
* [zotero/translation-server](https://github.com/zotero/translation-server)
* [patched translation-server](https://github.com/kanzure/translation-server)
* [phenny](https://github.com/sbp/phenny)
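
The two paths described above (a link that needs scraping versus a link straight to a PDF) can be sketched roughly as follows. This is only an illustration, not paperbot's actual code: the names `find_links`, `is_direct_pdf`, and `plan_fetch`, the URL regex, and the "ends in `.pdf`" heuristic are all assumptions made for the example. It does no network access and only decides which route each link would take.

```python
import re
from urllib.parse import urlparse

# Loose pattern for http(s) links in a chat line (an assumption for this sketch).
URL_PATTERN = re.compile(r"https?://[^\s<>]+")


def find_links(message):
    """Return every http(s) URL in one chat line, with trailing punctuation stripped."""
    return [url.rstrip(".,;:!?)") for url in URL_PATTERN.findall(message)]


def is_direct_pdf(url):
    """Guess from the URL path alone whether the link points straight at a PDF."""
    return urlparse(url).path.lower().endswith(".pdf")


def plan_fetch(message):
    """Pair each link in the message with the route it would take.

    "direct":    the link already looks like a PDF, so download it as-is.
    "translate": hand the link to the scraper service first, which is expected
                 to return metadata and a PDF link.
    """
    return [
        (url, "direct" if is_direct_pdf(url) else "translate")
        for url in find_links(message)
    ]
```

Calling `plan_fetch` on a chat line yields one `(url, route)` pair per link; a real bot would then either download the PDF directly or send the URL to translation-server, depending on the route.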

<div id="todo" />
## TODO

It would be nice to use multiple proxies to resolve a PDF request.

<div id="demo" />
<div id="channel" />
## active demo

Say hi to paperbot in ##hplusroadmap on irc.freenode.net.

<div id="license" />
## license

BSD.