Label: solution

Content with label solution in AQuA (See content from all spaces)
Related Labels: sox, word, mj2, validation, 3gpp, bwf, jpg, identification, ole, jhears, bwfmetaedit, fixity, xml, ocr, levenshtein, broadcast-wave, fu-script, image, audio, more »

Page: Analysis of Lucene Index Word Frequency
One line summary Create a word frequency list from a Lucene index and try to ascertain the subject matter of the collection that the index was created against. Detailed description The solution for AQuA:Characterising Externally Generated Content generated a Lucene index of the collection ...
Other labels: aqua, appraisal_assessment
Page: Apache POI Office Document Analyser
One line summary A utility based on Apache POI that is able to analyse MS Office documents. Detailed description Uses POI to walk through the OLE file structures and look for embedded objects and their properties. \\ \\ \\ \\ \\ Solution champion anjackson Git link ...
Other labels: apache, poi, ms-office, office, ms, ole, word, excel
Page: AQDC - Document Compare
One line summary Tool that used Apache Tika to parse & compare documents. \\ Detailed description AQDC is a Spring MVC Framework based Web application that wraps Apache Tika to provide a quick analysis of two documents (typically the original and its ...
Other labels: aqua, quality_assurance
Page: AQUAdio - characterization of user-generated audio field recordings
One line summary Tool to extract audio properties and metadata from audio files                   Detailed description AQUAdio is a wrapper script around the Open Source getID3() PHPlibrary ...
Other labels: audio, mp3, mp4, wmv, wma, aiff, 3gpp, id3v1
Page: Audio Auditing Script
One line summary{} A script to check a collection of audio recordings for 1) expected files, 2) expected specification, and 3) provenance.                           &nbsp ...
Other labels: audio, auditing, metadata, wav, broadcast-wave, bwf, jhears, audioscout
Page: Characterising Externally Generated Content
One line summary Tool to create a manifest of digital content, including format and SHA256 digest, and index content where possible Detailed description Java code, currently runs as a command line application.  Uses Apache Tika to obtain ...
Other labels: aqua, characterisation, fixity, appraisal_assessment
Page: Check consistency between metadata and content
One line summary Check that the METS, OCR, JPEG2000 masters and the PDFs are consistent \\ Detailed description As shown in the diagram below, check images and ALTO files information defined in METS against the real files stored in separate Zip files. Also ...
Other labels: mets, ocr, metadata, jpeg2000, jp2k, pdf, jp2, jpx
Page: Compare OCR results of the same source material in different formats (TIFF, JP2)
One line summary The intention of this solution was to compare two OCR results where the images that are OCRed have two different formats, one is the original TIFF file, the other one is a JP2 (JPEG 2000) representation of this TIFF file. The goal was to find ...
Other labels: ocr, jp2, jpeg2000, levenshtein, aqua, quality_assurance
Page: Detect, extract and analyse embedded objects in PDFs
One line summary Detect and identify embedded objects in PDFs, then where appropriate extract and analyse analyse further \\ Detailed description The PDF specification is complex, and PDF files can contain other other objects, embedded at the file or page level ...
Other labels: pdf, objects, bmp, jpg, png, gif, tiff, pdfbox
Page: Diagnosing FLV problems using FLVmeta's flvdump
One line summary Deconstruct the FLV at the top level using flvdump and see if it is valid. Detailed description Used FLVmeta package which contained the flvdump programme, which was able to walk through the FLV file and check container was valid ...
Other labels: flv, flash, macromedia, video, validation, flvdump, bit_rot_detection, aqua