diff options
author | yvesf <yvesf-git@xapek.org> | 2010-11-26 23:59:59 +0100 |
---|---|---|
committer | yvesf <yvesf-git@xapek.org> | 2010-11-26 23:59:59 +0100 |
commit | 15f17b4fce5a1e4b70709aac569c18d8cf57deba (patch) | |
tree | 89eebcc47aebed71cc86eedfaee8cf894e47c07e /INSTALL | |
parent | 9122cfed447a643a8c383be5558fd82fc3be7cc7 (diff) | |
download | booksearch-15f17b4fce5a1e4b70709aac569c18d8cf57deba.tar.gz booksearch-15f17b4fce5a1e4b70709aac569c18d8cf57deba.zip |
add pdfminer for text extraction;
replaced all ' with "
metadata hacking
Diffstat (limited to 'INSTALL')
-rw-r--r-- | INSTALL | 3 |
1 files changed, 1 insertions, 2 deletions
@@ -3,8 +3,6 @@ TODO clean html consistent schema (_book _metadata) ? /usr/bin/convert ? -check pdfminer for better text-extraction (whitespace) - ------------------------ @@ -21,6 +19,7 @@ Install dependencies pip install whoosh pip install pypdf pip install flask + pip install pdfminer ( Clone ) git clone http://xapek.org/~yvesf/repos/booksearch.git |