8cc93fc84c
Update docstrings
2016-02-10 14:41:56 +01:00
b2dbe2b8f9
Refactor bibtex to make better use of bibtexparser
2016-02-10 14:41:56 +01:00
0ae4894360
Fix missing requirements.txt in package
2016-02-10 14:41:56 +01:00
c785e04589
Add a __valid_identifiers__ list to ease fetching of identifiers in papers
See the detailed explanations in README.md.
Also fixed some typos in docstrings.
2016-02-01 17:32:24 +01:00
5f42e7ca6c
Use journal instead of publisher for page tearing
2016-01-30 17:18:32 +01:00
9853469f3c
Remove useless TODOs
2016-01-30 17:03:12 +01:00
6c5b69a23d
Unittests for fetcher
2016-01-30 16:51:30 +01:00
69e9414742
Tearpages ok
2016-01-30 16:28:53 +01:00
a2875fc242
Wrapper around Grobid
2016-01-30 16:28:21 +01:00
f5183a1d11
Add a setup.py file and __init__.py for module and submodules
2016-01-25 17:56:34 +01:00
8a905a9776
References extraction using CERMINE
2016-01-24 22:23:41 +01:00
d4d0e97295
Update acknowledgements
2016-01-24 18:42:36 +01:00
bed4ffff69
Fix ISBN validation and complete unittests
2016-01-24 18:21:52 +01:00
eb576f5fa6
Use a local version of CERMINE if available
2016-01-24 18:21:34 +01:00
3eb251a8d4
Fix citation fetching from arXiv papers
2016-01-20 23:40:07 +01:00
975fd0f38f
Unittests for repositories
2016-01-20 23:18:13 +01:00
609fa6ce4f
doi.py fetcher.py unittests
2016-01-20 22:35:43 +01:00
681ec1e5ac
Unittests for isbn.py
2016-01-20 21:57:35 +01:00
d4e4184385
Unittests in tools.py
2016-01-20 21:42:17 +01:00
7a393a746f
State that it is a WIP
2016-01-20 11:10:06 +01:00
9019833dbb
Add some doc, especially about external dependencies
2016-01-19 18:17:12 +01:00
65967cfa96
Add a statement about common issues with pdfextract
2016-01-19 17:58:24 +01:00
e9d7f3ad78
Add functions to extract references from a PDF file
Add some functions to extract references from a PDF file. They are
basically wrappers around CERMINE, Grobid and pdf-extract. The Grobid
wrapper is not yet written and still needs deeper integration into the toolchain.
2016-01-19 17:45:58 +01:00
962b4adc23
Fix requests.exception import error
2016-01-19 17:45:21 +01:00
dec7257eff
Add a function to look for updated arXiv versions.
2016-01-14 00:04:33 +01:00
c964dcb0c6
Fix sphinx doc generation + error in doi module
2016-01-10 18:35:24 +01:00
7a281528e3
Re-add downloader code
2016-01-10 18:19:30 +01:00
4a89c8c136
Add some functions to tear first pages from a PDF
2016-01-10 17:52:45 +01:00
ba564be738
Add identifiers fetching from papers
2016-01-10 15:12:06 +01:00
f688534e33
Add some bibtex manipulation functions
2016-01-10 14:44:27 +01:00
d606b5e56b
Add a disclaimer about bulk downloading from arXiv
2016-01-07 00:54:28 +01:00
dd98237bfe
Try to setup Sphinx for doc generation
2015-12-28 01:06:31 +01:00
168e37f247
Add a citations fetcher for bibtex files
2015-12-28 00:43:45 +01:00
0d17254f6c
Add a citation fetcher for plaintext, and factor out code shared with the bbl citations fetcher
2015-12-28 00:21:41 +01:00
bd0016cb51
Complete isbn API and fix a typo in arXiv API.
2015-12-27 23:55:57 +01:00
d8b74ae356
Reimport bbl citations parsing and make some minor fixes
2015-12-27 23:46:43 +01:00
97eb5a3ae0
First commit
2015-12-27 19:35:55 +01:00