10Mar/110
book-expander
As messing around getting some ebooks onto my nice new kindle, i have run across the problem where some books have been poorly OCRed and have lost a few spaces along the way. i.e. "Monty python" may become "Montypython"
To combat this i have written up a little script that will search though and try to identify and correct where this has happened.
Currently i have it all setup and hard-coded for use on ZIP/HTML exports from calibre returning a new zip that can be re imported. If you are interested in using the functionality on just plain text feel free to download it and use the clean_book function.
All the code and bug tracker can be found HERE on GitHub
Leave a comment