There is something north of 8% OCR error rates.. that will hurt model quality!