grl0-alpha-0_1.tar.gz | Tesseract language files for Ancient Greek texts with Latin bibliographical references (based on scanned pages extracted from Kaibel's edition of Athenaeus, vols. 1, 2 and 3). It contains: grl0.DangAmbigs grl0.freq-dawg grl0.inttemp grl0.normproto grl0.user-words grl0.word-dawg |
grl1-alpha-0_1.tar.gz | As grl0, but with additional pages generated using the TeubnerLSU and Porson fonts and clustered together with the scanned pages |
gr-lat-ocr-train-alpha_0_1.tar.gz | All text pages, images and box maps used to produce the grl0 and grl1 training files. The files can also be downloaded individually. |
color-ocropus-alpha_0_1.tar.gz | Colored image for training OCRopus, the open-source document analysis and OCR system. |
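
After downloading one of the language archives, it may be useful to check that it holds the files listed for it above. The following is a minimal sketch in Python using only the standard library. The function name `missing_language_files` is hypothetical, and the internal layout of the archive is an assumption, so the check compares base names only.

```python
import tarfile
from pathlib import Path

# File suffixes listed for the grl0 archive in the table above.
EXPECTED_SUFFIXES = [
    "DangAmbigs", "freq-dawg", "inttemp",
    "normproto", "user-words", "word-dawg",
]


def missing_language_files(archive_path, lang="grl0"):
    """Return the expected <lang>.<suffix> names that the archive lacks.

    Hypothetical helper: the archive's directory layout is not documented
    here, so only base names of regular files are compared.
    """
    with tarfile.open(archive_path, "r:gz") as tar:
        present = {Path(m.name).name for m in tar.getmembers() if m.isfile()}
    expected = [f"{lang}.{suffix}" for suffix in EXPECTED_SUFFIXES]
    return [name for name in expected if name not in present]
```

Calling `missing_language_files("grl0-alpha-0_1.tar.gz")` returns an empty list when every listed file is present. Pass `lang="grl1"` for the second archive, assuming it follows the same naming pattern.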