Commit Graph

38 Commits

SHA1 Message Date
d97537d174 Use default theme in Sphinx 2016-02-10 14:41:56 +01:00
8cc93fc84c Update docstrings 2016-02-10 14:41:56 +01:00
b2dbe2b8f9 Refactor bibtex to make better use of bibtexparser 2016-02-10 14:41:56 +01:00
0ae4894360 Fix missing requirements.txt in package 2016-02-10 14:41:56 +01:00
c785e04589 Add a __valid_identifiers__ list to ease fetching of identifiers in papers

See the detailed explanations in README.md.

Also fixed some typos in docstrings.
2016-02-01 17:32:24 +01:00
5f42e7ca6c Use journal instead of publisher for page tearing 2016-01-30 17:18:32 +01:00
9853469f3c Remove useless TODOs 2016-01-30 17:03:12 +01:00
6c5b69a23d Unittests for fetcher 2016-01-30 16:51:30 +01:00
69e9414742 Tearpages ok 2016-01-30 16:28:53 +01:00
a2875fc242 Wrapper around Grobid 2016-01-30 16:28:21 +01:00
f5183a1d11 Add a setup.py file and __init__.py for module and submodules 2016-01-25 17:56:34 +01:00
8a905a9776 References extraction using CERMINE 2016-01-24 22:23:41 +01:00
d4d0e97295 Update acknowledgements 2016-01-24 18:42:36 +01:00
bed4ffff69 Fix ISBN validation and complete unittests 2016-01-24 18:21:52 +01:00
eb576f5fa6 Use a local version of CERMINE if available 2016-01-24 18:21:34 +01:00
3eb251a8d4 Fix citation fetching from arXiv papers 2016-01-20 23:40:07 +01:00
975fd0f38f Unittests for repositories 2016-01-20 23:18:13 +01:00
609fa6ce4f doi.py fetcher.py unittests 2016-01-20 22:35:43 +01:00
681ec1e5ac Unittests for isbn.py 2016-01-20 21:57:35 +01:00
d4e4184385 Unittests in tools.py 2016-01-20 21:42:17 +01:00
7a393a746f State that it is a WIP 2016-01-20 11:10:06 +01:00
9019833dbb Add some doc, especially about external dependencies 2016-01-19 18:17:12 +01:00
65967cfa96 Add a statement about common issues with pdfextract 2016-01-19 17:58:24 +01:00
e9d7f3ad78 Add functions to extract references from a PDF file
Add some functions to extract references from a PDF file. They are
basically wrappers around Cermine, Grobid and pdf-extract. The Grobid
wrapper still has to be written and embedded more deeply in the toolchain.
2016-01-19 17:45:58 +01:00
962b4adc23 Fix requests.exception import error 2016-01-19 17:45:21 +01:00
dec7257eff Add a function to look for updated arXiv versions. 2016-01-14 00:04:33 +01:00
c964dcb0c6 Fix sphinx doc generation + error in doi module 2016-01-10 18:35:24 +01:00
7a281528e3 Re-add downloader code 2016-01-10 18:19:30 +01:00
4a89c8c136 Add some functions to tear first pages from a PDF 2016-01-10 17:52:45 +01:00
ba564be738 Add identifiers fetching from papers 2016-01-10 15:12:06 +01:00
f688534e33 Add some bibtex manipulation functions 2016-01-10 14:44:27 +01:00
d606b5e56b Add a disclaimer about bulk downloading arXiv 2016-01-07 00:54:28 +01:00
dd98237bfe Try to set up Sphinx for doc generation 2015-12-28 01:06:31 +01:00
168e37f247 Add a citations fetcher for bibtex files 2015-12-28 00:43:45 +01:00
0d17254f6c Add a citation fetcher for plaintext, and factorize code with bbl citations fetcher 2015-12-28 00:21:41 +01:00
bd0016cb51 Complete isbn API and fix a typo in arXiv API. 2015-12-27 23:55:57 +01:00
d8b74ae356 Reimport bbl citations parsing and make some minor fixes 2015-12-27 23:46:43 +01:00
97eb5a3ae0 First commit 2015-12-27 19:35:55 +01:00