Label: quality_assurance

All content with label quality_assurance.
Related Labels: validation, characterisation, identification, workflow, jpeg2000, gimp, ocr, hocr, levenshtein, de-duplication, bit_rot_detection, image, fu-script, format, aqua, fits, solution, tesseract, comparison, more »

Page: AQDC - Document Compare (AQuA)
One line summary Tool that used Apache Tika to parse & compare documents. \\ Detailed description AQDC is a Spring MVC Framework based Web application that wraps Apache Tika to provide a quick analysis of two documents (typically the original and its ...
Other labels: aqua, solution
Page: Compare OCR results of the same source material in different formats (TIFF, JP2) (AQuA)
One line summary The intention of this solution was to compare two OCR results where the images that are OCRed have two different formats, one is the original TIFF file, the other one is a JP2 (JPEG 2000) representation of this TIFF file. The goal was to find ...
Other labels: ocr, jp2, jpeg2000, levenshtein, solution, aqua
Page: EAP Compare Metadata with Requirements (AQuA)
One line summary Tool will ID files as Bad/Substandard/Good/Unprocessed depending on file type and metadata requirements set by content owner           &nbs p;&nbsp ...
Other labels: solution, aqua, characterisation
Page: EAP File Verification (AQuA)
One line summary When media are detected, the tool will identify the selected format and identify valid / invalid / broken files Detailed description Solution for EAP Issue 1 Broken TIFFs \\ Same tool as for Solution 3 \\ \\ \\ Developed ...
Other labels: image, tiff, validation, identification, bit_rot_detection, solution, characterisation
Page: Identify compressed TIFFs and convert them to uncompressed TIFFs (AQuA)
One line summary Given is a list of TIFF images, some of them are compressed as "Group 4 Fax" TIFF images. The compression causes issues in some application contexts, therefore it might be required to remove the compression from a large TIFF images ...
Other labels: fits, format, characterisation, migration, conversion, gimp, fu-script, taverna
Page: Identifying rotated, duplicate images using pHash (AQuA)
One line summary Identifying duplicate images rotated during postprocessing Detailed description Several tools, including XCL Extractor/Comparator, ImageJ and pHash were briefly explored to determine if an image was a rotated copy of another image. The solution ...
Other labels: image, de-duplication, comparison, solution, aqua
Page: java image blocks comparison (AQuA)
One line summary Based on the blog entry: http://mindmeat.blogspot.com/2008/07/javaimagecomparison.html it can be shown that a basic jpg image comparision can be achieved with just a few lines of code.             &nbsp ...
Other labels: de-duplication, solution, aqua
Page: jp2 header analysis (AQuA)
One line summary "image not available" in Google Books                                             &nbsp ...
Other labels: aqua, solution
Page: Newspaper issue dates - solution (AQuA)
One line summary For cataloguing purposes, it is of absolute importance that the issue data metadata is accurate. How can we ensure this? And can we predict where issues may be missing? Detailed description Code was written to extract issue number and publication ...
Other labels: solution, aqua, structural_relationships
Page: OCR Comparison (AQuA)
One line summary Compare two different OCR results. If the results are not sufficiently close, the source pages may be different indicating possible issues. \\ Detailed description See detailed scenario descriptions below. \\ Solution champion Georg Petz & Sven ...
Other labels: taverna, workflow, ocr, comparison, tiff, jpeg2000, tesseract, hocr