|One line summary|| We have OLE2 Office documents, which may contain embedded documents, and we want to identify which version of Office created each one.
|Detailed description|| The older binary Office document formats (OLE) are effectively file systems, so format identification gives only superficial information about the object. We can tell that it is an OLE 2.0 Compound Document, but we need to know which kind of document it is and which application created it. An OLE file can also contain embedded sub-objects, so we want to identify those too.
|Issue champion||Mette van Essen|
|Possible approaches|| Use Apache POI (http://poi.apache.org/) to deconstruct the object.
Use doc2x etc. (http://b2xtranslator.sourceforge.net/) to transform the older format documents to the new OOXML formats and examine those.
Use the commercial Aspose library (http://www.aspose.com/) to analyse the object.
|AQuA Solutions||Apache POI Office Document Analyser|
|Collections||MS Word 97-2003 Documents (NANETH)|
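
To illustrate what "deconstructing the object" involves, here is a minimal sketch in Python using only the standard library. It is not Apache POI and not the AQuA solution listed above; it only reads the compound file header, follows the directory chain, and lists the entry names it finds. The function names (`list_ole2_entries`, `guess_applications`) and the `KNOWN_STREAMS` table are assumptions made for this example. The sketch ignores additional DIFAT sectors, so it only suits files up to roughly 7 MB, and it does not determine the Office version, which would need further parsing of the stream contents or the summary information properties.

```python
import struct

OLE2_MAGIC = bytes.fromhex("D0CF11E0A1B11AE1")  # signature at offset 0
ENDOFCHAIN = 0xFFFFFFFE
MAX_REGULAR_SECTOR = 0xFFFFFFFA

# Assumed mapping for this sketch: stream names that conventionally
# hold the main document of each application.
KNOWN_STREAMS = {
    "WordDocument": "Microsoft Word",
    "Workbook": "Microsoft Excel",
    "Book": "Microsoft Excel (older)",
    "PowerPoint Document": "Microsoft PowerPoint",
}


def list_ole2_entries(data):
    """Return a flat list of (name, type) for every directory entry.

    Types: 1 = storage (sub-directory), 2 = stream, 5 = root storage.
    The storage hierarchy is not reconstructed.
    """
    if data[:8] != OLE2_MAGIC:
        raise ValueError("not an OLE2 compound document")
    sector_size = 1 << struct.unpack_from("<H", data, 30)[0]
    first_dir_sector = struct.unpack_from("<I", data, 48)[0]

    def sector(n):
        # Sector 0 starts right after the header, which fills one sector.
        start = (n + 1) * sector_size
        return data[start:start + sector_size]

    # Build the FAT from the 109 DIFAT slots stored in the header.
    fat = []
    for fat_sector in struct.unpack_from("<109I", data, 76):
        if fat_sector > MAX_REGULAR_SECTOR:
            continue  # unused slot
        fat.extend(struct.unpack("<%dI" % (sector_size // 4), sector(fat_sector)))

    # Walk the directory sector chain; each entry is 128 bytes.
    entries, seen, current = [], set(), first_dir_sector
    while current != ENDOFCHAIN and current not in seen:
        seen.add(current)  # guard against loops in damaged files
        block = sector(current)
        for off in range(0, len(block) - 127, 128):
            name_len = struct.unpack_from("<H", block, off + 64)[0]
            entry_type = block[off + 66]
            if entry_type == 0 or name_len < 2:
                continue  # unallocated entry
            name = block[off:off + name_len - 2].decode("utf-16-le")
            entries.append((name, entry_type))
        current = fat[current] if current < len(fat) else ENDOFCHAIN
    return entries


def guess_applications(entries):
    """Return the applications suggested by well-known stream names."""
    return sorted({KNOWN_STREAMS[n] for n, _ in entries if n in KNOWN_STREAMS})
```

A typical call would read the whole file with `open(path, "rb").read()` and pass the bytes to `list_ole2_entries`. Because the listing is flat, streams belonging to embedded sub-objects appear alongside those of the host document, so `guess_applications` may report more than one application for a document with embedded content.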