From 15f17b4fce5a1e4b70709aac569c18d8cf57deba Mon Sep 17 00:00:00 2001
From: yvesf
Date: Fri, 26 Nov 2010 23:59:59 +0100
Subject: add pdfminer for text extraction; replaced all ' with " metadata hacking

---
 INSTALL | 3 +--
 1 file changed, 1 insertion(+), 2 deletions(-)

(limited to 'INSTALL')

diff --git a/INSTALL b/INSTALL
index c2ff89f..9ec902c 100644
--- a/INSTALL
+++ b/INSTALL
@@ -3,8 +3,6 @@ TODO
 clean html
 consistent schema (_book _metadata)
 ? /usr/bin/convert ?
-check pdfminer for better text-extraction (whitespace)
-
 
 ------------------------
 
@@ -21,6 +19,7 @@ Install dependencies
  pip install whoosh
  pip install pypdf
  pip install flask
+ pip install pdfminer
 
 ( Clone )
  git clone http://xapek.org/~yvesf/repos/booksearch.git
-- 
cgit v1.2.1