Label: jpeg2000

Content with label jpeg2000 in AQuA (See content from all spaces)
Related Labels: mj2, validation, characterisation, pdf, workflow, jp2k, jpm, xml, hocr, ocr, extraction, levenshtein, java, acroform, mets, aqua, quality_assurance, itext, alto, more »

Page: Check consistency between metadata and content
One line summary Check that the METS, OCR, JPEG2000 masters and the PDFs are consistent \\ Detailed description As shown in the diagram below, check images and ALTO files information defined in METS against the real files stored in separate Zip files. Also ...
Other labels: mets, ocr, metadata, jp2k, pdf, jp2, jpx, mj2
Page: Compare OCR results of the same source material in different formats (TIFF, JP2)
One line summary The intention of this solution was to compare two OCR results where the images that are OCRed have two different formats, one is the original TIFF file, the other one is a JP2 (JPEG 2000) representation of this TIFF file. The goal was to find ...
Other labels: ocr, jp2, levenshtein, solution, aqua, quality_assurance
Page: OCR Comparison
One line summary Compare two different OCR results. If the results are not sufficiently close, the source pages may be different indicating possible issues. \\ Detailed description See detailed scenario descriptions below. \\ Solution champion Georg Petz & Sven ...
Other labels: taverna, workflow, ocr, comparison, tiff, tesseract, hocr, quality_assurance
Page: PDF Characterisation Tool
One line summary Java program to characterise PDF files, looking for preservation concerns.                                     &nbsp ...
Other labels: pdf, characterise, pdfbox, api, fonts, issue, acroform, embedded