Unsupervised Pretraining For Semi-Supervised Ocr