Label: jpeg2000+ocr

Content with label jpeg2000+ocr in AQuA (See content from all spaces)
Related Labels: mj2, validation, pdf, characterisation, workflow, jp2k, qa, jpm, xml, hocr, extraction, levenshtein, image, acroform, mets, java, aqua, alto, itext, more » ( - jpeg2000, - ocr )

Page: Check consistency between metadata and content
One line summary Check that the METS, OCR, JPEG2000 masters and the PDFs are consistent \\ Detailed description As shown in the diagram below, check images and ALTO files information defined in METS against the real files stored in separate Zip files. Also ...
Other labels: mets, metadata, jp2k, pdf, jp2, jpx, mj2, jpm
Page: Compare OCR results of the same source material in different formats (TIFF, JP2)
One line summary The intention of this solution was to compare two OCR results where the images that are OCRed have two different formats, one is the original TIFF file, the other one is a JP2 (JPEG 2000) representation of this TIFF file. The goal was to find ...
Other labels: jp2, levenshtein, solution, aqua, quality_assurance
Page: OCR Comparison
One line summary Compare two different OCR results. If the results are not sufficiently close, the source pages may be different indicating possible issues. \\ Detailed description See detailed scenario descriptions below. \\ Solution champion Georg Petz & Sven ...
Other labels: taverna, workflow, comparison, tiff, tesseract, hocr, quality_assurance