Label: solution+spruce_london

Content with label solution+spruce_london in SPRUCE (See content from all spaces)
Related Labels: validation, spruce_glasgow, identification, cataloguing, tool, sharepoint, fixity, server, strategy, xml, ocr, visualisation, scanline, msg, image, planning_management, management, mixed_misc, video, more » ( - solution, - spruce_london )

Page: Distinguishing Files with Descriptive Metadata
Distinguishing Files with Descriptive Metadata A Java program making use of a custom Apache Tika wrapper to extract file format identification and metadata from a directory of files and present aggregated data for identifying which files have full descriptive metadata ...
Other labels: characterisation, identification
Page: Extracting and aggregating metadata with Apache Tika
Extracting and aggregating metadata with Tika At the Glasgow Mashup Peter May created a Python wrapper for Apache Tika. Carl Wilson extended this work, creating a Java utility class that wrapped Tika, providing simple configuration, two types of call to Tika ...
Other labels: characterisation
Page: FFMPEG as Video Transcoder
Title FFMPEG as Video Transcoder. Detailed description FFMPEG http://ffmpeg.org/ is a free software project that provides crossplatform tools and libraries for converting and playing video formats, including the libavcodec audio/video codec library. 28GB seems excessively large ...
Other labels: migration
Page: Maintain a list of metadata mappings outside of the script
Title maintain a list of metadata mappings outside of the script Detailed description A PHP script invoking exiftool which returns a PHP array. This array is used to fill in an XML template, which can be edited at will. Outside metadata is contained in a .ini ...
Other labels: metadata, xml, php, exiftool, mapping, characterisation
Page: Moving records from Sharepoint to Eprints for preservation solution
Title Moving records from Sharepoint to Eprints for preservation Detailed description It is not feasible to come up with a working solution to convert SP to ePrints within 3 days since SP is a very complex CMS. In the end we created a new "view ...
Other labels: sharepoint, eprint, export, excel, strategy, data_capture
Page: National Videogame Archive - Game Preservation & Public Access Solutions
Title National Videogame Archive Game Preservation & Public Access Solutions Detailed description Data Extraction It is extremely important that this is done as soon as possible. Media degrades over time (bit rot http://en.wikipedia.org/wiki/Bitrot, etc.), so ...
Other labels: miscellaneous
Page: NeXus Data Collection ISIS - STFC - solution
Title NeXus Data Collection ISIS STFC solution Detailed description Two parts: 1. techmaurice added Nexus file recognition to TR:Fido 2. GitHub repository set up containing script that runs over ...
Other labels: identification
Page: Solving TIFF malformation using exiftool
Title Solving TIFF malformation using exiftool Detailed description The issue page http://wiki.opflabs.org/display/SPR/ValidandwellformedTIFF%27swithscanlinec orruption describes the problem as (essentially): TIFF files being unusable, despite being "validated" by tools like JHOVE. Solution ...
Other labels: characterisation, bit_rot_detection
Page: Using Perl to write scripts for reporting on the content of the collection
Title Using Perl to write scripts to find duplicates for reporting on the content of the collection. Perl was used to write scripts that used the metadata that was extracted using Apache Tika SPR:Extracting and aggregating metadata with Apache Tika to help locate duplicates and different ...
Other labels: characterisation, fixity