Label: aqua

Content with label aqua in AQuA (See content from all spaces)
Related Labels: sox, word, 3gpp, jpg, jhears, ocr, fu-script, audio, quality_assurance, gif, mixed_misc, video, comparison, wmv, obsolescence, macromedia, bmp, flvdump, apache, more »

Page: 19th Century Books (BL)
Basic description 150 000 digitsed books.  Master images in JPEG2000 format, pdf service images and METS metadata. Licensing Sample of approximately 10000 pages available for use under BL licence Institution British Library (BL ...
Other labels: dataset, image
Page: Analysis of Lucene Index Word Frequency
One line summary Create a word frequency list from a Lucene index and try to ascertain the subject matter of the collection that the index was created against. Detailed description The solution for AQuA:Characterising Externally Generated Content generated a Lucene index of the collection ...
Other labels: solution, appraisal_assessment
Page: Apache POI Office Document Analyser
One line summary A utility based on Apache POI that is able to analyse MS Office documents. Detailed description Uses POI to walk through the OLE file structures and look for embedded objects and their properties. \\ \\ \\ \\ \\ Solution champion anjackson Git link ...
Other labels: apache, poi, ms-office, office, ms, ole, word, excel
Page: AQDC - Document Compare
One line summary Tool that used Apache Tika to parse & compare documents. \\ Detailed description AQDC is a Spring MVC Framework based Web application that wraps Apache Tika to provide a quick analysis of two documents (typically the original and its ...
Other labels: solution, quality_assurance
Page: AQUAdio - characterization of user-generated audio field recordings
One line summary Tool to extract audio properties and metadata from audio files                   Detailed description AQUAdio is a wrapper script around the Open Source getID3() PHPlibrary ...
Other labels: audio, mp3, mp4, wmv, wma, aiff, 3gpp, id3v1
Page: Audio Auditing Script
One line summary{} A script to check a collection of audio recordings for 1) expected files, 2) expected specification, and 3) provenance.                           &nbsp ...
Other labels: audio, auditing, metadata, wav, broadcast-wave, bwf, jhears, audioscout
Page: Audio Collection (York)
Basic description York Sound Archive digitised audio.                                           &nbsp ...
Other labels: dataset, audio
Page: BOPCRIS
Basic description British Official Publications: black and white TIFF images                                         &nbsp ...
Other labels: dataset, image
Page: Brightsolid digitisation of British Library newspapers
Basic description Project to digitise items from the British Library newspaper archive. Scanning started in October 2010, and scans from paper are currently running at around 5,000 pages per working day. Scanning from microfilm is due to start ...
Other labels: dataset, image
Page: Characterising Externally Generated Content
One line summary Tool to create a manifest of digital content, including format and SHA256 digest, and index content where possible Detailed description Java code, currently runs as a command line application.  Uses Apache Tika to obtain ...
Other labels: solution, characterisation, fixity, appraisal_assessment