Commit Graph

183 Commits

Author SHA1 Message Date
Phyks
3d07af0e71 Check output before processing the whole file for ISBN and DOI search 2014-04-30 00:36:49 +02:00
Phyks
91685bc46b Various re.compile 2014-04-29 21:55:35 +02:00
Phyks
7d36f3206f Move server to a new branch 2014-04-28 23:40:37 +02:00
Phyks
46fc17b2da Refactor in progress
TODO :
* Use new API from bibtex-parser
* Split addFile / downloadFile in main to a function in main (interface)
and one in backend
2014-04-28 23:35:06 +02:00
Phyks
cce4fd50bb Beginning of refactor 2014-04-28 22:23:05 +02:00
Phyks
613f082471 Upgrade to latest version of bibtex-parser 2014-04-27 10:06:30 +02:00
Phyks
872d93c6aa Use $EDITOR
* Use $EDITOR to edit the bibtex entries, as `git commit` or `crontab -e` does.
* Bugfixes that prevented the import of an article without associated
  bibtex entry or without doi.
2014-04-27 09:46:43 +02:00
Phyks
49df58bf70 Remove first page of IOP papers + various bugfixes 2014-04-26 23:26:25 +02:00
Phyks
73809b867d No fail if proxy is not accessible 2014-04-26 18:43:25 +02:00
Phyks
0c8e52844a Bugfix with tempfile 2014-04-26 18:40:32 +02:00
Phyks
6e6418fbad Use tempfile when downloading a file URL 2014-04-26 18:27:01 +02:00
Phyks
6e18c16010 Improved doc 2014-04-26 15:32:34 +02:00
Phyks
ffdbbedfbb Updated example params file 2014-04-26 12:35:07 +02:00
Phyks
02e679bc72 Download of papers working
You should pass the url of the pdf file to the script, along with the
`download` parameter. It will try the proxies in the `params.py` file,
until it finds one that allow him to get the pdf file.

TODO : Use pdfparanoia to remove watermarks
2014-04-26 11:52:19 +02:00
Phyks
b9f6e145e9 Bugfixes in main.py
* Forgotten "\n"
* Very beginning of a web interface
2014-04-25 16:53:08 +02:00
Phyks
5396f1e75b Update README 2014-04-25 16:20:04 +02:00
Phyks
49f4fb41ee Deletion working 2014-04-25 15:36:54 +02:00
Phyks
72e2def380 Import should be working \ol/ 2014-04-25 14:22:34 +02:00
Phyks
138dd92439 Check automatically fetched bibtex 2014-04-25 14:13:37 +02:00
Phyks
20a06d1f41 Add new fields in bibtex 2014-04-25 01:37:19 +02:00
Phyks
70e805f58f Manual check of generated bibtex
* Display the bibtex before adding it, and provide interface to edit it.

TODO :
* Add new fields
2014-04-25 01:13:19 +02:00
Phyks
0f17eaafa6 Added ISBN support
Added support for books using the ISBN number.
2014-04-24 22:39:27 +02:00
Phyks
5aed10f4c5 Check if file already exists when importing 2014-04-24 21:19:27 +02:00
Phyks
c304eb2578 Import updated
* Added djvu support
* Nice mask for file renaming

TODO : Append to bibtex index + test if file already exists
2014-04-24 19:38:52 +02:00
Phyks
1420cf37a9 Added extension checking when importing file 2014-04-24 16:23:28 +02:00
Phyks
ea53e0720f Working on PDF import
* Search the PDF file for DOI, manual fallback if not found
* Move the PDF file
* Add its Bibtex entry to the general bibtex file

TODO :
* Better renaming
* Adding to bibtex file
2014-04-24 16:18:56 +02:00
Phyks
93d1fefa26 Started the main code 2014-04-24 00:18:49 +02:00
Phyks
052f39b6f2 Updated README and cleaned repo 2014-04-23 22:27:55 +02:00
Phyks
41d8fb16d9 Clone repo form a3nm 2014-04-23 13:08:10 +02:00
Antoine Amarilli
e19aa9e534 Config file, SOCKS support, multiple servers 2013-05-11 16:10:48 +02:00
Antoine Amarilli
86c2e11a8c remove phenny, tweak some things 2013-05-11 11:57:28 +02:00
Bryan Bishop
960e86327e use a random title if title extraction fails 2013-04-15 01:13:49 -05:00
Bryan Bishop
04644364e2 fix jstor title determination 2013-02-21 17:30:25 -06:00
Bryan Bishop
16c7f4d4db fix jstor pdf urls 2013-02-21 17:13:22 -06:00
Bryan Bishop
14bdf23876 jstor 2013-02-21 17:11:28 -06:00
Bryan Bishop
56f0caf6ae skip translator results if [] or not http 200 2013-02-18 05:17:57 -06:00
Bryan Bishop
0253a0a9db allow unicode in filenames when returning a url 2013-02-16 21:23:39 -06:00
Bryan Bishop
05669229c4 catch PDFNotImplementedErrors 2013-02-11 10:03:28 -06:00
Bryan Bishop
5fbeedd76b remove extra periods from filenames 2013-02-09 19:22:51 -06:00
Bryan Bishop
a8abdb2322 support both jap.aip.org and apl.aip.org 2013-02-09 08:03:29 -06:00
Bryan Bishop
143323b096 README: better description of paperbot's manners 2013-02-09 07:56:08 -06:00
Bryan Bishop
7209fbb620 README: mention pdfparanoia 2013-02-09 07:54:26 -06:00
Bryan Bishop
db58d53c10 pass StringIO to pdfparanoia 2013-02-09 07:45:53 -06:00
Bryan Bishop
53de3f3648 scrub away watermarks in another situation 2013-02-09 07:42:57 -06:00
Bryan Bishop
357e268e96 use pdfparanoia to remove watermarks 2013-02-09 07:41:50 -06:00
Bryan Bishop
bef66e1241 citation_pdf_url is not always available 2013-02-08 15:23:29 -06:00
Bryan Bishop
d400040c10 an even better IEEE fix 2013-02-08 04:45:43 -06:00
Bryan Bishop
c48a377f44 better support for IEEE Xplore 2013-02-08 04:16:10 -06:00
Bryan Bishop
b6977593cd set pdfparanoia as a dependency 2013-02-07 21:05:51 -06:00
Bryan Bishop
0ef3debca8 prevent a catastrophic error in paper retrieval 2013-02-07 03:54:05 -06:00