Label: identification

Content with label identification in SPRUCE (See content from all spaces)
Related Labels: parallel, processing, solution, spruce_glasgow, characterisation, multi, obsolescence, spruce_london_2, issue, spruce, spruce_london, process, unknown_file_formats

Page: Distinguishing Files with Descriptive Metadata
Distinguishing Files with Descriptive Metadata A Java program making use of a custom Apache Tika wrapper to extract file format identification and metadata from a directory of files and present aggregated data for identifying which files have full descriptive metadata ...
Other labels: spruce_london, solution, characterisation
Page: File Format Identification and Metadata Extraction using FITS
Title File Format Identification and Metadata Extraction using FITS Detailed description Practicioners who are new to Digital Preservation are often looking for ways to identify file format types in their collections and extract metadata from these files. The best way to get ...
Other labels: spruce_london_2, solution, characterisation
Page: Identification of file formats with incorrect file extensions
Title Identification of file formats with incorrect file extensions Detailed description Electronic documents and image files are deposited on a variety of media, including floppy disc, CDR and memory sticks.  In copying process, file extensions can be lost, or period ...
Other labels: issue, spruce, spruce_glasgow, unknown_file_formats
Page: NeXus Data Collection ISIS - STFC - solution
Title NeXus Data Collection ISIS STFC solution Detailed description Two parts: 1. techmaurice added Nexus file recognition to TR:Fido 2. GitHub repository set up containing script that runs over ...
Other labels: spruce_london, solution
Page: Parallel processing of identification and characterisation jobs
Here's some ideas and suggestions regarding parallel processing, focusing on running identification and characterisation jobs in parallel. Please contribute and comment! techmaurice / 30012012 IMHO we should also give multi/parallel processing more attention. Most (nonJava ...
Other labels: parallel, processing, multi, process, characterisation
Page: Simple preservation actions with few IT resources
Title Taking simple preservation actions that will begin to tackle preservation issues with few resources. Detailed description The collection of London 2012 material is catalgoued on our CALM cataloguing system. However, before putting a programme in place, we are looking for some ...
Other labels: spruce_london_2, issue, obsolescence
Page: Tika Batch File Identification
Title Tika Batch File Identification \\ Detailed description {}Overview:   \\ Group of issues surrounding batch processing of large number of files to identify file formats and therefore hint as to which applications may be useful for rendering the files.   \\ \\ Various ...
Other labels: spruce, spruce_glasgow, solution