Parsing OST & PST email files for textual extraction and search

Skip to end of metadata
Go to start of metadata

The issue is that PST/OST [MS Outlook] files cannot be used by many text extraction tools because they generally require MBOX format.

We want to have a solution that parses email messages so that they can be further used by other text extraction and search tools.

Critical to the parsing is to have at minimum:

  • Sender
  • Recipient
  • Date
  • Subject Line
  • Message Body

Issue Champions

"Bill LeFurgy

"Kari Smith

Solution Requirements

Solution does not need to be cross-platform as we anticipate this to be a function or task that will be run periodically and users of the solution will have access to multiple operating systems if necessary.

Labels:
None
Enter labels to add to this page:
Please wait 
Looking for a label? Just start typing.