Apprasing OST file for restricted data

Skip to end of metadata
Go to start of metadata

Apprasing OST file for restricted data (as different from PPI data).

Detailed description

This includes initial migration to open, non-proprietary mbox format. The first step is to obtain a PST file from an OST file.  Migration to non-proprietary format - a single-platform solution is fine, since this is an occasional use case.  Capture the metadata about the conversion process that could be later incorporated into PREMIS event records.  Output log file details the tool specifics. Enable parsing of the messages within an OST file so that the data can be searched for restricted data (using case-specific key words).  This issue includes several steps and the solutions listed below address beginning tasks in the workflow that would achieve the total solution.  The Solution approaches listed below are other approaches that might also assist with parts of the solution set for this issue.

Issue champion
Kari Smith, Bill LeFurgy

Other interested parties
Any other parties who are also interested in applying Issue Solutions to their Datasets.

Possible Solution approaches

Brief brainstorm of possible approaches to solving the Issue. Each approach should be described in a single sentence as part of a bulleted list. Further detail can go in a dedicated Solution page. - summary metadata module for MUSE, word clouds, graphs, etc.. - A library to read PST files with java, without need for external libraries. - google apps email migration tool -  Email message to XML file extractor for digital preservation created by the Persistent Digital Archives and Library System (PeDALS) research project. - squeak/smalltalk PST parser. - EMCAP NC State Archives email converter - C library and linux utilities for migrating PST/OST - Library and tools to access the Personal Folder File (PFF) and the Offline Folder File (OFF) format.

Analysis of Lucene Index Word Frequency - Lucene/Solr is a good base for creating search/browse and other viz features -

Details of the institutional context to the Issue.

Lessons Learned
Notes on Lessons Learned from tackling this Issue that might be useful to inform digital preservation best practice

Reference to the appropriate Dataset page, by hyperlink. Note that all Issues MUST be linked to at least one Dataset!

OST archive with attachments - MIT IASC
Email archive in OST format (LeFurgy)

"Parsing PST OST file using TIKA

"Converting PST & OST files to MBOX format

chapel_hill chapel_hill Delete
issue issue Delete
appraisal_assessment appraisal_assessment Delete
obsolescence obsolescence Delete
unknown_characteristics unknown_characteristics Delete
Enter labels to add to this page:
Please wait 
Looking for a label? Just start typing.