BiblioManager
=============

BiblioManager is a simple script to download and store your articles. Read on if you want more info :)

**Note:** This script is currently a work in progress.

Travis build status: [![Build Status](https://travis-ci.org/Phyks/BMC.svg?branch=master)](https://travis-ci.org/Phyks/BMC)

## What is BiblioManager (or what is it **not**)?

I used to have a folder full of poorly named papers and books and wanted something to help me manage it. I don't like Mendeley, Zotero and the like, which are heavy and overkill for my needs. I just want to feed a script PDF files of papers and books, or URLs to PDF files, and have it automatically maintain a BibTeX index of these files, to help me cite them and find them again. Then I want it to give me an easy way to retrieve a file, by author, by title or with some other search method, and to give me the associated BibTeX entry.

This is the goal of BiblioManager. This script can:
* Download or import PDF/DjVu files
* Try to automatically get the metadata of the files (keywords, author, journal, …)
* Store all the metadata in a BibTeX file
* Rename your files to store them in a logical and consistent way according to a user-defined mask
* Help you find them again
* Directly give you the BibTeX entry needed to cite them
* Remove some of the watermarks included in those files (the front page with your IP address from IOP, for instance)

BiblioManager will always use standard formats such as BibTeX, so that you can easily edit, export and manage your library by hand, even if you stop using this software for any reason.


## Current status

It should be almost working and usable now, but it should still be considered **experimental**. It can be **broken** at **any commit** and not repaired for a few days. I will update this section when I have a version that I can consider “stable”.

**Important note:** I use it for personal use, but I don't read articles from many journals. If you find a file that does not work, please file an issue or send me an e-mail with the relevant information. There are alternative ways to get the metadata, for example, and I didn't really know which one was best when writing this code. Please make regular backups if you use this. I cannot be held responsible for any loss of papers.


* Import
    * working: all (file / tags / bibtex modification / bibtex retrieval / remove watermark pages)
* Download
    * working: all
* Delete
    * working: all (by file and by id)
* Edit
    * working: all
* List
    * working
* Search
    * TODO
* Open
    * working: all
* Resync
    * working
* Update
    * working


**Error reporting:** If you have any issue with this script, please report it. If possible, send me the article responsible for the error, or at least give me its reference, so that I can test and debug easily.

## Installation

* Clone this git repository where you want:
```
git clone https://github.com/Phyks/BMC
```
* Install `arxiv2bib`, `PySocks`, `bibtexparser` (https://github.com/sciunto/python-bibtexparser), `PyPDF2` and `isbnlib` _via_ PyPI (or better, in a virtualenv, or using your package manager, according to your preferences)
```
sudo pip install arxiv2bib PySocks bibtexparser PyPDF2 isbnlib
```
(this script should be compatible with Python 2 and Python 3)
* Install `pdftotext` (provided by Xpdf) and `djvulibre` _via_ your package manager or any other way you like
* Install the script _via_ `python setup.py install`.
* Run the script to initialize the conf in `~/.config/bmc/bmc.json`.
* Customize the configuration by editing `~/.config/bmc/bmc.json` according to your needs. Documentation of the available options can be found in `config.py`.
* _Power users:_ Add your custom masks in `~/.config/bmc/masks.py`.

*Note:* To update the script, just run `git pull` in the script dir.

## Usage

### To import an existing PDF / DjVu file

Run `./bmc.py import PATH_TO_FILE [article|book]`. The optional `[article|book]` argument restricts the search to a DOI (article) or an ISBN (book), which speeds up the import.

The script will automatically get the BibTeX entry corresponding to the document, and you will be prompted for confirmation. It will then copy the file to your papers dir, renaming it according to the mask specified in `~/.config/bmc/bmc.json`.
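To give a rough idea of what renaming according to a mask means, here is a minimal sketch. The placeholders (`%a`, `%Y`, `%t`) and the `apply_mask` helper are hypothetical and are not BMC's actual mask syntax; refer to `config.py` for the real options.

```python
import re


def apply_mask(mask, entry):
    """Build a filename stem from a mask and a dict of BibTeX fields.

    The %a / %Y / %t placeholders are made up for this example.
    """
    fields = {
        "a": entry.get("author", "unknown").split(" and ")[0],  # first author
        "Y": entry.get("year", "0000"),
        "t": entry.get("title", "untitled"),
    }
    # Substitute every placeholder in a single pass over the mask.
    name = re.sub(r"%([aYt])", lambda m: fields[m.group(1)], mask)
    # Collapse characters that are awkward in filenames into underscores.
    return re.sub(r"[^\w.-]+", "_", name).strip("_")


entry = {
    "author": "Doe, Jane and Roe, Richard",
    "year": "2014",
    "title": "A Study of Things",
}
print(apply_mask("%a-%Y-%t", entry))  # Doe_Jane-2014-A_Study_of_Things
```

The file extension is left out on purpose, so that the same stem could be reused for a PDF or a DjVu file.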

### To download a PDF / DjVu file

Run `./bmc.py download URL_TO_PDF [article|book]`, where the optional `[article|book]` argument again restricts the search to a DOI (article) or an ISBN (book), which speeds up the import. The `URL_TO_PDF` parameter should be a direct link to the PDF file (that is, the link to the PDF itself, which may sit behind an authentication portal, and not the abstract page shown on many publishers' websites).

The script will try to download the file with each of the proxies specified in `~/.config/bmc/bmc.json` until it manages to get the file or runs out of available proxies.

It will automatically get the BibTeX entry corresponding to the document, and you will be prompted for confirmation. It will then put the file in your papers dir, renaming it according to the mask specified in `~/.config/bmc/bmc.json`.
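The proxy fallback described above can be sketched as follows. This is only an illustration, not BMC's actual code: `download_with_proxies` and `fetch_via_urllib` are hypothetical names, the sketch targets Python 3, and it only covers HTTP(S) proxies (SOCKS proxies would need something like `PySocks`).

```python
import urllib.request


def fetch_via_urllib(url, proxy):
    """Fetch `url` through an HTTP(S) proxy; an empty proxy means a direct connection."""
    proxy_map = {"http": proxy, "https": proxy} if proxy else {}
    opener = urllib.request.build_opener(urllib.request.ProxyHandler(proxy_map))
    with opener.open(url, timeout=30) as response:
        return response.read()


def download_with_proxies(url, proxies, fetch=fetch_via_urllib):
    """Try each proxy in order and return the first successful download."""
    errors = []
    for proxy in proxies:
        try:
            return fetch(url, proxy)
        except Exception as exc:  # a real tool would catch narrower errors
            errors.append((proxy, exc))
    raise RuntimeError("all proxies failed for %s: %r" % (url, errors))
```

A call could look like `download_with_proxies(url, ["", "http://localhost:8080"])`, where the empty string stands for a direct connection and the proxy address is a placeholder.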

### Delete an entry

Run `./bmc.py delete PARAM`, where `PARAM` is either a path to a paper file or an identifier in the BibTeX index. This will remove the corresponding entry from the BibTeX index and remove the file from your papers dir. Although it will prompt you for confirmation, there is no way to recover your file after deletion, so use it with care.

### Search for an entry

TODO

_Note:_ There is currently no search engine implemented. I will first focus on stabilizing the script, and will implement it later. The `search.py` file is not functional as of today and is only there to give a rough idea of what I expect the search engine to be. Ideally, it should understand complex expressions like `(author=foo or title=bar) or year=1111`. In the meantime, you can `grep` the generated `index.bib` file for basic searching.
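If `grep` output is too line-oriented, a few lines of Python can return whole entries instead. This is a naive sketch and not the planned search engine: `search_bibtex` is a hypothetical helper, it assumes every entry starts with `@` at the beginning of a line, and it only does a case-insensitive substring match.

```python
import re

SAMPLE = """@article{doe2014,
  author = {Doe, Jane},
  title = {A Study of Things},
  year = {2014},
}

@book{roe2010,
  author = {Roe, Richard},
  title = {Other Matters},
  year = {2010},
}
"""


def search_bibtex(bibtex_text, term):
    """Return the BibTeX entries (as strings) that contain `term`."""
    # An entry runs from an '@' at the start of a line up to the next one.
    entries = re.findall(r"(?ms)^@.*?(?=^@|\Z)", bibtex_text)
    return [e.strip() for e in entries if term.lower() in e.lower()]


for found in search_bibtex(SAMPLE, "roe"):
    print(found)
```

To search your own library, read the contents of `index.bib` into a string and pass it in place of `SAMPLE`.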


### List all entries

Run `./bmc.py list` to list all the papers in your papers dir.

### Edit entries

Run `./bmc.py edit PARAM`, where `PARAM` is either a path to a paper file or an identifier in the BibTeX index. This will open a text editor to edit the corresponding BibTeX entry.

### Download the latest version for papers from arXiv

Run `./bmc.py update` to look for updated versions of your arXiv papers. You can use the optional `--entries ID` argument (where `ID` is either a BibTeX index identifier or a filename) to check only a limited subset of papers.

### Importing long articles / books without DOI / ISBN

When you import a long article without any DOI or ISBN, the script will process the whole file before finding out that there is no such information. This can take a while for long articles, and you may feel that the script has entered an infinite loop. If you think it is taking too long, you can press `^C` and you will be dropped into manual entry of the BibTeX information.
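The following sketch illustrates why a missing DOI is the slow case: a page-by-page scan can stop at the first match, but it has to read every page when there is none. It is not BMC's actual code; `first_doi` is a hypothetical helper, the pattern is a commonly used approximation of the DOI syntax, and the DOI in the example is made up.

```python
import re

# Approximate DOI pattern: "10.", a registrant code, "/", then a suffix.
DOI_RE = re.compile(r"\b10\.\d{4,9}/[-._;()/:A-Za-z0-9]+")


def first_doi(pages):
    """Return the first DOI-like string found in an iterable of page texts, or None."""
    for text in pages:
        match = DOI_RE.search(text)
        if match:
            # Drop punctuation that usually belongs to the sentence, not the DOI.
            return match.group(0).rstrip(".,;)")
    return None


pages = ["Title page", "See doi:10.1234/example.5678.", "Many more pages"]
print(first_doi(pages))  # 10.1234/example.5678
```

Because `pages` can be a generator, text extraction can be done lazily and stops as soon as a DOI is found.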

### Data storage

All your documents will be stored in the papers dir specified in `~/.config/bmc/bmc.json`. All the BibTeX entries will be added to the `index.bib` file. You should **not** add entries to this file by hand (although you can edit existing entries without any problem), as this will break the synchronization between the documents in the papers dir and the index. If you do so anyway, you can resync the index file with `./bmc.py resync`.

The resync option will check that all BibTeX entries have a corresponding file and that all files have a corresponding BibTeX entry. It will ask you what to do for unmatched entries.
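The consistency check behind a resync can be pictured with the sketch below. It rests on assumptions that are not taken from BMC's code: each entry is assumed to record its document in a `file = {...}` field, the index is assumed to sit in the papers dir under the name `index.bib`, and `find_unmatched` is a hypothetical helper.

```python
import os
import re
import tempfile


def find_unmatched(bibtex_text, papers_dir, index_name="index.bib"):
    """Return (entries_without_file, files_without_entry) as sorted name lists."""
    # Assumption: entries store their document path in a `file = {...}` field.
    indexed = set(
        os.path.basename(path)
        for path in re.findall(r"(?im)^\s*file\s*=\s*\{([^}]*)\}", bibtex_text)
    )
    on_disk = set(name for name in os.listdir(papers_dir) if name != index_name)
    return sorted(indexed - on_disk), sorted(on_disk - indexed)


if __name__ == "__main__":
    papers_dir = tempfile.mkdtemp()
    for name in ("a.pdf", "c.pdf"):
        open(os.path.join(papers_dir, name), "w").close()
    index = "@article{a,\n  file = {a.pdf},\n}\n@article{b,\n  file = {b.pdf},\n}\n"
    print(find_unmatched(index, papers_dir))  # (['b.pdf'], ['c.pdf'])
```

The first list holds entries whose file is missing and the second holds files without an entry; these are the two kinds of mismatch you would then be asked about.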


## Unit tests

Unit tests are available for all the files in `lib/`. You can run them with `nosetests`. Builds are run after each commit on [Travis](https://travis-ci.org/Phyks/BMC).


## License

All the source code I wrote is under a `no-alcohol beer-ware license`. All functions that I didn't write myself remain under their original license, and their origin is specified in the function itself.
```
* --------------------------------------------------------------------------------
* "THE NO-ALCOHOL BEER-WARE LICENSE" (Revision 42):
* Phyks (webmaster@phyks.me) wrote this file. As long as you retain this notice you
* can do whatever you want with this stuff (and you can also do whatever you want
* with this stuff without retaining it, but that's not cool...). If we meet some
* day, and you think this stuff is worth it, you can buy me a <del>beer</del> soda
* in return.
*																		Phyks
* ---------------------------------------------------------------------------------
```


## Inspiration

Here are some sources of inspiration for this project:

* MPC
* http://en.dogeno.us/2010/02/release-a-python-script-for-organizing-scientific-papers-pyrenamepdf-py/
* [Bibsoup](http://openbiblio.net/2012/02/09/bibsoup-beta-released/)
* [Paperbot](https://github.com/kanzure/paperbot)

## Ideas, TODO

A list of ideas and TODOs. Don't hesitate to give feedback on the ones you really want or to propose your own.

80. Search engine
85. Anti-duplicate ?
90. Look for published version in arXiv
95. No DOI for HAL => metadata with SOAP API… don't want to handle it for now :/
200. Webserver interface ? GUI ? (not likely for now…)

## Thanks

* Nathan Grigg for his [arxiv2bib](https://pypi.python.org/pypi/arxiv2bib/1.0.5#downloads) python module
* François Boulogne for his [python-bibtexparser](https://github.com/sciunto/python-bibtexparser) python module and his integration of new requested features
* pyparsing [search parser example](http://pyparsing.wikispaces.com/file/view/searchparser.py)
* François Boulogne (@sciunto) for his (many) contributions to this software !

## Note on test files

* The test files used, provided in `tests/src`, are under a CC-BY license and come from arXiv, HAL, New Journal of Physics and PhysRev.
* The `test_watermark.pdf` file originally had a blank first page, which is supposed to be stripped. For this test, I just duplicated the first page, as the original first page contained personal information.