Label: aqua

All content with label aqua.
Related Labels: sox, word, 3gpp, jpg, jhears, ocr, fu-script, audio, quality_assurance, gif, mixed_misc, video, obsolescence, wmv, comparison, macromedia, bmp, flvdump, apache, more »

Page: Check consistency between metadata and content (AQuA)
One line summary Check that the METS, OCR, JPEG2000 masters and the PDFs are consistent \\ Detailed description As shown in the diagram below, check images and ALTO files information defined in METS against the real files stored in separate Zip files. Also ...
Other labels: mets, ocr, metadata, jpeg2000, jp2k, pdf, jp2, jpx
Page: Compare OCR results of the same source material in different formats (TIFF, JP2) (AQuA)
One line summary The intention of this solution was to compare two OCR results where the images that are OCRed have two different formats, one is the original TIFF file, the other one is a JP2 (JPEG 2000) representation of this TIFF file. The goal was to find ...
Other labels: ocr, jp2, jpeg2000, levenshtein, solution, quality_assurance
Page: Detect, extract and analyse embedded objects in PDFs (AQuA)
One line summary Detect and identify embedded objects in PDFs, then where appropriate extract and analyse analyse further \\ Detailed description The PDF specification is complex, and PDF files can contain other other objects, embedded at the file or page level ...
Other labels: pdf, objects, bmp, jpg, png, gif, tiff, pdfbox
Page: Diagnosing FLV problems using FLVmeta's flvdump (AQuA)
One line summary Deconstruct the FLV at the top level using flvdump and see if it is valid. Detailed description Used FLVmeta package which contained the flvdump programme, which was able to walk through the FLV file and check container was valid ...
Other labels: flv, flash, macromedia, video, validation, flvdump, bit_rot_detection, solution
Page: Digitised Books (ONB, Google Books) (AQuA)
Basic description 19th century digitised books. Master images in JPEG2000 format with lossy compression. Profile unknown. OCR data in the HTML based HOCR format.                   &nbsp ...
Other labels: dataset, image
Page: Digitised Books (ONB) (AQuA)
Basic description 19th century digitised books. Master images in JPEG2000 format with lossy compression. Profile unknown. OCR data in the HTML based HOCR format. \\ Licensing TBC Institution ONB Collection expert shsdev\\ List of issues Unknown ...
Other labels: dataset, image
Page: EAP Compare Metadata with Requirements (AQuA)
One line summary Tool will ID files as Bad/Substandard/Good/Unprocessed depending on file type and metadata requirements set by content owner           &nbs p;&nbsp ...
Other labels: solution, quality_assurance, characterisation
Page: East London Theatre Archive (AQuA)
Basic description East London Theatre Archive: Collection of play bills, programmes and other media encoded as uncompressed TIFF and PDF 1.6 Licensing Sample images tested are copyright V&A. Publication only allowed through http://www.eltaproject.org/ Institution ...
Other labels: dataset, image
Page: Email Mailbox Collections (AQuA)
Basic description{} An Eudora Mailbox and some sample .pst / .mbox / .msg / .eml files \\ Can we identify the content of these container files without opening them and inspect them manually. Can we generate a report that is useful for further ...
Other labels: dataset, email
Page: Endangered Archives Programme (EAP) (AQuA)
Basic description (Multimedia) Image, Audio and Video collections EAP \\ Licensing AQuA event files cleared for use. Normally Content creators clear copyright. EAP can provide access for research. \\ Institution British Library (on behalf of many institutions ...
Other labels: dataset, audio, image, video