Commit Graph

24 Commits

Author SHA1 Message Date
Phyks
0866367887 Fixed bool is not iterable if URL is not fetchable 2014-06-30 11:05:03 +02:00
Phyks
7e99f7898a Merge branch 'master' into config_file 2014-06-30 00:37:48 +02:00
Phyks
6be5cb3f16 Updated conf 2014-06-30 00:19:38 +02:00
Phyks
b231b578cc Updated test files
According to https://github.com/Phyks/BMC/issues/7.

Also updated the fetcher file to fix two bugs:
* Whitespace in ISBNs
* If the PDF-to-text (or DjVu-to-text) output was not long enough, the end of
the file was not processed.
2014-06-29 23:02:44 +02:00
Phyks
2e7369e14e Test files added
See https://github.com/Phyks/BMC/issues/7.
2014-06-29 20:36:02 +02:00
François Boulogne
938ceaf9b8 fix import isbntools 2014-06-28 22:35:56 -04:00
Phyks
479aea4101 Some unit tests 2014-05-26 16:50:58 +02:00
Phyks
5a8ea0750f License + Consolidating fetcher.py 2014-05-26 16:12:21 +02:00
Phyks
e3bc6f2d92 Autoconfirm option + flake8 2014-05-17 17:23:56 +02:00
Phyks
9eaf790983 Progress bar for download 2014-05-14 23:07:06 +02:00
Phyks
b4f0e7c0eb Import / Download / Delete working
All bugs should be fixed for the import / download / delete functions.

* Some problems with utf-8 and homogeneize_latex_encoding in
python-bibtexparser are bypassed for now and will be cleaned up properly once
the latest version is available on pip.
* Tweaked the regex for ISBN, which was not case-insensitive and did not handle
space-separated numbers.
* The file entry in the arXiv BibTeX is now deleted to avoid confusion.
2014-05-11 19:29:42 +02:00
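
The ISBN regex tweak described in this commit can be illustrated with a minimal sketch. The pattern and the `find_isbn` helper below are hypothetical and are not taken from `fetcher.py`; they only show one way to make the match case-insensitive and tolerant of spaces or hyphens between digits. The sketch does not validate the checksum or the exact ISBN length.

```python
import re

# Hypothetical pattern (not the one used in fetcher.py): the word "ISBN",
# an optional "-10"/"-13" suffix and colon, then 10 to 13 digits that may be
# separated by single spaces or hyphens. The last character may be an X.
ISBN_RE = re.compile(
    r"isbn(?:[-\s]?1[03])?\s*:?\s*((?:\d[\s-]?){9,12}[\dxX])",
    re.IGNORECASE,  # accept "isbn", "ISBN", "Isbn", ...
)


def find_isbn(text):
    """Return the first ISBN found in text with separators stripped, or None."""
    match = ISBN_RE.search(text)
    if match is None:
        return None
    # Drop the spaces and hyphens that separated the digit groups.
    return re.sub(r"[\s-]", "", match.group(1)).upper()
```

For example, `find_isbn("isbn: 978 0 306 40615 7")` returns `"9780306406157"`, while a text without an identifier yields `None`.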
Phyks
681283a0db Download working 2014-05-09 23:37:17 +02:00
Phyks
787113db66 Bugfixes in fetcher.py + function to find hal id 2014-05-02 00:33:09 +02:00
Phyks
289c7dece4 Functions to handle arXiv metadata 2014-05-02 00:07:49 +02:00
Phyks
980e678883 Function to automatically find the arXiv id 2014-05-01 19:46:04 +02:00
Phyks
741cde352e Flake8 2014-04-30 00:54:15 +02:00
Phyks
3d07af0e71 Check output before processing the whole file for ISBN and DOI search 2014-04-30 00:36:49 +02:00
Phyks
91685bc46b Various re.compile 2014-04-29 21:55:35 +02:00
Phyks
46fc17b2da Refactor in progress
TODO:
* Use the new API from bibtex-parser
* Split addFile / downloadFile in main into one function in main (interface)
and one in the backend
2014-04-28 23:35:06 +02:00
Phyks
cce4fd50bb Beginning of refactor 2014-04-28 22:23:05 +02:00
Phyks
49df58bf70 Remove first page of IOP papers + various bugfixes 2014-04-26 23:26:25 +02:00
Phyks
73809b867d No fail if proxy is not accessible 2014-04-26 18:43:25 +02:00
Phyks
02e679bc72 Download of papers working
You should pass the URL of the PDF file to the script, along with the
`download` parameter. It will try the proxies listed in the `params.py` file
until it finds one that allows it to get the PDF file.

TODO: Use pdfparanoia to remove watermarks
2014-04-26 11:52:19 +02:00
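
The proxy fallback described in this commit can be sketched roughly as follows. This is not the code from the repository: the function names, the use of `urllib.request` with HTTP(S) proxies, and the `%PDF` magic-bytes check are all assumptions made for illustration, and the format of the proxy list in `params.py` is not known from the commit message.

```python
import urllib.request


def fetch_through_proxy(url, proxy):
    """Fetch url through an HTTP(S) proxy; an empty string disables proxying."""
    proxy_map = {"http": proxy, "https": proxy} if proxy else {}
    opener = urllib.request.build_opener(urllib.request.ProxyHandler(proxy_map))
    with opener.open(url, timeout=30) as response:
        return response.read()


def download_pdf(url, proxies, fetch=fetch_through_proxy):
    """Try each proxy in turn; return the first body that looks like a PDF."""
    for proxy in proxies:
        try:
            data = fetch(url, proxy)
        except OSError:
            # Proxy unreachable or request failed (URLError and socket
            # timeouts are OSError subclasses): try the next proxy.
            continue
        # Assumption: a real PDF starts with the "%PDF" magic bytes, so
        # login pages or error pages served by a proxy are skipped.
        if data.startswith(b"%PDF"):
            return data
    return None
```

A call such as `download_pdf(url, ["", "http://proxy.example.org:8080"])` would first try a direct connection and then the (hypothetical) proxy, returning `None` if neither yields a PDF.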
Phyks
93d1fefa26 Started the main code 2014-04-24 00:18:49 +02:00