Label: quality_assurance

Content with label quality_assurance in AQuA (See content from all spaces)
Related Labels: validation, characterisation, identification, workflow, jpeg2000, gimp, hocr, ocr, levenshtein, de-duplication, bit_rot_detection, image, fu-script, format, aqua, fits, solution, tesseract, comparison, more »

Page: AQDC - Document Compare
One line summary Tool that used Apache Tika to parse & compare documents. \\ Detailed description AQDC is a Spring MVC Framework based Web application that wraps Apache Tika to provide a quick analysis of two documents (typically the original and its ...
Other labels: aqua, solution
Page: Compare OCR results of the same source material in different formats (TIFF, JP2)
One line summary The intention of this solution was to compare two OCR results where the images that are OCRed have two different formats, one is the original TIFF file, the other one is a JP2 (JPEG 2000) representation of this TIFF file. The goal was to find ...
Other labels: ocr, jp2, jpeg2000, levenshtein, solution, aqua
Page: EAP Compare Metadata with Requirements
One line summary Tool will ID files as Bad/Substandard/Good/Unprocessed depending on file type and metadata requirements set by content owner           &nbs p;&nbsp ...
Other labels: solution, aqua, characterisation
Page: EAP File Verification
One line summary When media are detected, the tool will identify the selected format and identify valid / invalid / broken files Detailed description Solution for EAP Issue 1 Broken TIFFs \\ Same tool as for Solution 3 \\ \\ \\ Developed ...
Other labels: image, tiff, validation, identification, bit_rot_detection, solution, characterisation
Page: Identify compressed TIFFs and convert them to uncompressed TIFFs
One line summary Given is a list of TIFF images, some of them are compressed as "Group 4 Fax" TIFF images. The compression causes issues in some application contexts, therefore it might be required to remove the compression from a large TIFF images ...
Other labels: fits, format, characterisation, migration, conversion, gimp, fu-script, taverna
Page: Identifying rotated, duplicate images using pHash
One line summary Identifying duplicate images rotated during postprocessing Detailed description Several tools, including XCL Extractor/Comparator, ImageJ and pHash were briefly explored to determine if an image was a rotated copy of another image. The solution ...
Other labels: image, de-duplication, comparison, solution, aqua
Page: java image blocks comparison
One line summary Based on the blog entry: http://mindmeat.blogspot.com/2008/07/javaimagecomparison.html it can be shown that a basic jpg image comparision can be achieved with just a few lines of code.             &nbsp ...
Other labels: de-duplication, solution, aqua
Page: jp2 header analysis
One line summary "image not available" in Google Books                                             &nbsp ...
Other labels: aqua, solution
Page: Newspaper issue dates - solution
One line summary For cataloguing purposes, it is of absolute importance that the issue data metadata is accurate. How can we ensure this? And can we predict where issues may be missing? Detailed description Code was written to extract issue number and publication ...
Other labels: solution, aqua, structural_relationships
Page: OCR Comparison
One line summary Compare two different OCR results. If the results are not sufficiently close, the source pages may be different indicating possible issues. \\ Detailed description See detailed scenario descriptions below. \\ Solution champion Georg Petz & Sven ...
Other labels: taverna, workflow, ocr, comparison, tiff, jpeg2000, tesseract, hocr