Label: characterisation+spruce_london

Content with label characterisation+spruce_london in SPRUCE
Related Labels: validation, identification, cataloguing, sharepoint, fixity, strategy, xml, scanline, image, planning_management, mixed_misc, solution, video, multi, corruption, integrity, obsolescence, duplication, software

Page: Distinguishing Files with Descriptive Metadata
A Java program that uses a custom Apache Tika wrapper to extract file format identification and metadata from a directory of files, then presents aggregated data for identifying which files have full descriptive metadata ...
Other labels: solution, identification
Page: Extracting and aggregating metadata with Apache Tika
At the Glasgow Mashup, Peter May created a Python wrapper for Apache Tika. Carl Wilson extended this work by creating a Java utility class that wraps Tika, providing simple configuration, two types of call to Tika ...
Other labels: solution
Page: Maintain a list of metadata mappings outside of the script
Detailed description: A PHP script invokes exiftool and returns a PHP array. This array is used to fill in an XML template, which can be edited at will. Outside metadata is contained in a .ini ...
Other labels: solution, metadata, xml, php, exiftool, mapping
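The idea of keeping the mapping outside the script can be sketched as follows. This is an illustrative Python sketch, not the PHP script the page describes: the mapping entries, the template fields, and the helper names `load_mapping` and `fill_template` are all assumptions made for the example, and the extracted metadata is passed in as a plain dictionary rather than read from exiftool.

```python
import configparser
from string import Template
from xml.sax.saxutils import escape

# Hypothetical mapping file content: extracted tag name -> template field name.
MAPPING_INI = """
[mapping]
ImageWidth = width
ImageHeight = height
Make = camera_make
"""

# Hypothetical XML template; it can be edited without touching the code.
XML_TEMPLATE = Template(
    "<record><width>$width</width><height>$height</height>"
    "<cameraMake>$camera_make</cameraMake></record>"
)


def load_mapping(ini_text):
    """Read the tag-to-field mapping from .ini text."""
    parser = configparser.ConfigParser()
    parser.optionxform = str  # keep tag names case-sensitive
    parser.read_string(ini_text)
    return dict(parser["mapping"])


def fill_template(extracted, mapping, template):
    """Fill the template from extracted metadata, XML-escaping each value.

    Tags missing from `extracted` become empty strings.
    """
    fields = {
        field: escape(str(extracted.get(tag, "")))
        for tag, field in mapping.items()
    }
    return template.substitute(fields)
```

Changing which tag feeds which XML field then only requires editing the .ini text, which is the design point the page title names.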
Page: Solving TIFF malformation using exiftool
Detailed description: The issue page http://wiki.opflabs.org/display/SPR/ValidandwellformedTIFF%27swithscanlinecorruption describes the problem as (essentially): TIFF files being unusable, despite being "validated" by tools like JHOVE. Solution ...
Other labels: solution, bit_rot_detection
Page: Using Perl to write scripts for reporting on the content of the collection
Title: Using Perl to write scripts to find duplicates for reporting on the content of the collection. Perl scripts used the metadata extracted with Apache Tika (see the page "Extracting and aggregating metadata with Apache Tika") to help locate duplicates and different ...
Other labels: solution, fixity
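Locating duplicates from already-extracted metadata might look like the sketch below. It is written in Python rather than Perl, and the record layout is an assumption: the keys `path`, `digest`, and `size` and the function name `find_duplicates` are hypothetical, since the excerpt does not say which metadata fields the original scripts compared.

```python
from collections import defaultdict


def find_duplicates(records, keys=("digest", "size")):
    """Group metadata records that share the same values for `keys`.

    Each record is a dict with a "path" entry plus metadata fields.
    Returns one sorted list of paths per group that has more than one member.
    """
    groups = defaultdict(list)
    for record in records:
        signature = tuple(record.get(key) for key in keys)
        groups[signature].append(record["path"])
    return [sorted(paths) for paths in groups.values() if len(paths) > 1]
```

Comparing on a content digest plus size is only one possible choice; a report could equally group on format or title fields by passing a different `keys` tuple.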