author: yvesf <yvesf-git@xapek.org> 2010-11-26 23:59:59 +0100
committer: yvesf <yvesf-git@xapek.org> 2010-11-26 23:59:59 +0100
commit: 15f17b4fce5a1e4b70709aac569c18d8cf57deba
tree: 89eebcc47aebed71cc86eedfaee8cf894e47c07e /INSTALL
parent: 9122cfed447a643a8c383be5558fd82fc3be7cc7
add pdfminer for text extraction;
replaced all ' with " metadata hacking
Diffstat (limited to 'INSTALL')
-rw-r--r-- INSTALL 3
1 files changed, 1 insertions, 2 deletions
diff --git a/INSTALL b/INSTALL
index c2ff89f..9ec902c 100644
--- a/INSTALL
+++ b/INSTALL
@@ -3,8 +3,6 @@ TODO
clean html
consistent schema (_book _metadata)
? /usr/bin/convert ?
-check pdfminer for better text-extraction (whitespace)
-
------------------------
@@ -21,6 +19,7 @@ Install dependencies
pip install whoosh
pip install pypdf
pip install flask
+ pip install pdfminer
( Clone )
git clone http://xapek.org/~yvesf/repos/booksearch.git
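
After installing the four packages listed above, a quick check that they can be imported may save a debugging round. The following is only a sketch and not part of the booksearch repository: the helper name missing_modules is hypothetical, and the import names (whoosh, pyPdf, flask, pdfminer) are assumptions about what each pip package provides, so adjust them to match your installed versions.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names from `names` that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Assumed import names for the packages installed via pip above.
    required = ["whoosh", "pyPdf", "flask", "pdfminer"]
    missing = missing_modules(required)
    if missing:
        print("missing modules: " + ", ".join(missing))
    else:
        print("all dependencies found")
```

The check only locates the modules and does not import them, so it does not confirm that a given version works with booksearch.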