Text this: Document image analysis: issues, comparison of methods and remaining problems