BrickPi Book Paper Reader
A few months ago, we made a Bookreader powered by the BrickPi which had the ability to read pages from your Kindle. There was a huge response to the project. But what everyone really wanted was a bookreader that could read a real paper book.
Here we present to you the BrickPi Bookreader 2 which can read aloud a real book (the voice becomes a bit irritating after a while) and also turn the pages of the book (we are really proud of this).
9. Installing the OCR Engine
The OCR (Optical Character Recognition) engine converts the image file we take of the book into text. We are using Tesseract OCR Engine. It runs well on the Raspberry Pi, it does not require an online connection, and it reliably converts images to text.
First, install Tesseract:
sudo apt-get install tesseract-ocr
Next, test the OCR engine.
Take a good image of a piece of text in a Book and run Tesseract:
tesseract image.jpg o
where image.jpg is the image which was taken by the Raspberry Pi Camera and o is the file in which the text will be saved (Tesseract will make it o.txt so no need to add the extension).
Now, wait a few minutes, the OCR takes a lot of processing power.
When it is done processing, open o.txt. In our experience, the recognition was >90% and works better with larger font size. If the OCR did not detect anything at all, try rotating the image and running Tesseract again.