Label: characterisation

All content with label characterisation.
Related Labels: sox, word, 3gpp, xena, jpg, jhears, qa, fu-script, audio, gif, quality_assurance, scenario, video, dependency-analysis, obsolescence, wmv, keyword, bmp, webarchive, more »

Page: Apache PDFBox (The Registry)
Summary Purpose JAVA PDF library for creation, manipulation and content extraction of PDF documents Homepage http://pdfbox.apache.org/ License Apache License v2.0 Debian Package Description Quoted from the website "Apache PDFBox™ is an open source Java ...
Other labels: pdf, too, tool, library
Page: Apache POI Office Document Analyser (AQuA)
One line summary A utility based on Apache POI that is able to analyse MS Office documents. Detailed description Uses POI to walk through the OLE file structures and look for embedded objects and their properties. \\ \\ \\ \\ \\ Solution champion anjackson Git link ...
Other labels: apache, poi, ms-office, office, ms, ole, word, excel
Page: AQUAdio - characterization of user-generated audio field recordings (AQuA)
One line summary Tool to extract audio properties and metadata from audio files                   Detailed description AQUAdio is a wrapper script around the Open Source getID3() PHPlibrary ...
Other labels: audio, mp3, mp4, wmv, wma, aiff, 3gpp, id3v1
Page: Audio Auditing Script (AQuA)
One line summary{} A script to check a collection of audio recordings for 1) expected files, 2) expected specification, and 3) provenance.                           &nbsp ...
Other labels: audio, auditing, metadata, wav, broadcast-wave, bwf, jhears, audioscout
Page: Characterising Externally Generated Content (AQuA)
One line summary Tool to create a manifest of digital content, including format and SHA256 digest, and index content where possible Detailed description Java code, currently runs as a command line application.  Uses Apache Tika to obtain ...
Other labels: solution, aqua, fixity, appraisal_assessment
Page: Dependency Discovery Tool (The Registry)
Summary Purpose The Dependency Discovery Tool searches through binary office files (.doc, .xls and .ppt) and tries to find any documents or files that are linked to the document. Homepage \\ http://sourceforge.net/projects/officeddt Source Code Repository \\ http://sourceforge.net ...
Other labels: tool, dependency-analysis, apache2
Page: Detect, extract and analyse embedded objects in PDFs (AQuA)
One line summary Detect and identify embedded objects in PDFs, then where appropriate extract and analyse analyse further \\ Detailed description The PDF specification is complex, and PDF files can contain other other objects, embedded at the file or page level ...
Other labels: pdf, objects, bmp, jpg, png, gif, tiff, pdfbox
Page: Distinguishing Files with Descriptive Metadata (SPRUCE)
Distinguishing Files with Descriptive Metadata A Java program making use of a custom Apache Tika wrapper to extract file format identification and metadata from a directory of files and present aggregated data for identifying which files have full descriptive metadata ...
Other labels: spruce_london, solution, identification
Page: DROID (The Registry)
Summary Purpose {}D{}igital R{}ecord O{}bject ID{}entification (DROID): Automatic file format identification tool. Homepage http://www.nationalarchives.gov.uk/informationmanagement/projectsandwork/dr oid.htm Source Code Repository http://digitalpreservation.github.com/droid/ License New ...
Other labels: identification, java, tool
Page: EAP Compare Metadata with Requirements (AQuA)
One line summary Tool will ID files as Bad/Substandard/Good/Unprocessed depending on file type and metadata requirements set by content owner           &nbs p;&nbsp ...
Other labels: solution, aqua, quality_assurance