At CVIT, we have been working on OCR (Optical Character Recognition) technology for the last few years. We currently have OCRs for more than 12 Indian languages, with almost 95% accuracy. Our audiobook application combines OCR and TTS (Text-to-Speech) technologies: the OCR converts page images into text, and the TTS converts that text into speech. We use Google TTS in our application.
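The OCR-then-TTS composition described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not our actual implementation: the names `make_audiobook`, `fake_ocr` and `fake_tts` are hypothetical, and the two stand-in engines simply mark where a real OCR model and a real TTS service would plug in.

```python
from typing import Callable, Iterable, List

# Hypothetical interfaces (assumptions for illustration only):
# an OCR engine maps a page image (raw bytes) to recognized text,
# and a TTS engine maps text to audio (raw bytes).
OcrFn = Callable[[bytes], str]
TtsFn = Callable[[str], bytes]


def make_audiobook(pages: Iterable[bytes], ocr: OcrFn, tts: TtsFn) -> List[bytes]:
    """Run OCR on each page image, then synthesize speech for the recognized text."""
    audio_chunks: List[bytes] = []
    for page in pages:
        text = ocr(page).strip()
        if not text:
            # Nothing was recognized on this page, so there is nothing to read aloud.
            continue
        audio_chunks.append(tts(text))
    return audio_chunks


# Stand-in engines so the sketch runs without any model or network access.
def fake_ocr(image: bytes) -> str:
    # Pretends the "image" bytes already contain the page text.
    return image.decode("utf-8")


def fake_tts(text: str) -> bytes:
    # Pretends to synthesize audio by tagging the text.
    return ("AUDIO:" + text).encode("utf-8")


if __name__ == "__main__":
    chunks = make_audiobook([b"page one", b"   ", b"page two"], fake_ocr, fake_tts)
    print(len(chunks))
```

Keeping the two stages behind plain function interfaces means the OCR for any one language, or the TTS backend, could be swapped without touching the rest of the pipeline.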
Watch the video to see how it works.