Filter Search Results
Content the user has created or edited.
Clear Filter

Showing 51-60 of 264 for characterisation

  • SO23 Pushing additional metadata into NeXus metadata fields Title SO23 Pushing additional metadata into NeXus metadata fields Detailed description SO23 Pushing additional metadata into NeXus metadata fields Solution Champion SP:Responsibilities of the roles described on these pages Erica Yang (STFC) Evaluation Not appl
  • SO27 Analyse huge text files containing information about a web archive using Hadoop Title SO27 Analyse huge text files containing information about a web archive using Hadoop Detailed description Analyse huge text files containing information about a web archive using Hadoop Solution Champion SP:Responsibilities of th
  • JHOVE Summary Purpose Provides functions to perform formatspecific identification, validation, and characterisation of digital objects. Homepage http://hul.harvard.edu/jhove/ Source Code Repository http://sourceforge.net/projects/jhove/ License JHOVE is made available by JSTOR and the President and Fellows of Harvard
  • . User Experiences e.g. links to AQuA/SCAPE/Hackathon issues that use the tool SP:IS25 Web Content Characterisation SP:SO11 The Tika characterisation Tool
  • PDF Tools (by Didier Stevens) Summary Purpose Tools for parsing and analysing PDF documents Homepage http://blog.didierstevens.com/programs/pdftools/ Source Code Repository http://blog.didierstevens.com/programs/pdftools/ License Not specified, public domain Debian Package N/A Description This is a set of Python script
  • Identifying the content of Email Mailboxes {}One line summary{} A single mailbox file (.mbox/.mbx/.pst) can consists of a lot of email messages with or without attachments and we want to identify them. {}Detailed description{} The main focus will be on an Eudora mailbox. Eudora uses an mboxo variation which is one of t
  • jp2StructCheck Summary Purpose A JPEG2000 Structure Checking Tool Homepage https://github.com/bitsgalore/jp2StructCheck License GPL Debian Package Description Checks JPEG2000 image files for the presence of all required toplevel 'boxes' (which acts as a rough wellformedness check), and verifies if the code stream is te
  • Extracting and aggregating metadata with Apache Tika Extracting and aggregating metadata with Tika At the Glasgow Mashup Peter May created a Python wrapper for Apache Tika. Carl Wilson extended this work, creating a Java utility class that wrapped Tika, providing simple configuration, two types of call to Tika (simple
  • DROID Summary Purpose {}D{}igital R{}ecord O{}bject ID{}entification (DROID): Automatic file format identification tool. Homepage http://www.nationalarchives.gov.uk/informationmanagement/projectsandwork/droid.htm Source Code Repository http://digitalpreservation.github.com/droid/ License New BSD License https://raw.git
  • Simple JP2 file structure checker This appears to be the same as TR:jp2StructCheck. Merge them? Summary Purpose Simple JP2 file structure checker Homepage https://github.com/bitsgalore/jp2StructCheck License GNU General Public License v3 Debian Package Description In brief, when jp2StructCheck analyses a file, it firs