View Source

h2. {color:#ff0000}Please note that this experiment is inactive{color}


h2. Investigator(s)

Johan van der Knijff

h2. Dataset

* [The Archivist's PDF Cabinet of Horrors|http://www.opf-labs.org/format-corpus/pdfCabinetOfHorrors/] (part of the [OPF Format Corpus|http://www.opf-labs.org/format-corpus/])
* [Adobe Acrobat Engineering PDFs|http://acroeng.adobe.com/wp/]
* [Reduced Govdocs1 dataset|http://www.openplanetsfoundation.org/blogs/2012-07-26-1-million-21000-reducing-govdocs-significantly]

h2. Platform

TBC

h2. Workflow

At this stage this work still focuses on evaluating Apache Preflight, and making sense of its output. The results of this are continuously used to update the information on PDF in the [OPF File Format Risk Registry|http://wiki.opf-labs.org/display/TR/OPF+File+Format+Risk+Registry]. See the following page (and its child pages):

[http://wiki.opf-labs.org/display/TR/Portable+Document+Format]


h2. Requirements and Policies

_Policy statements that relate to this experiment and any evaluation criteria taken from SCAPE metrics_

h2. Evaluations

_Links to results of the experiment using the evaluation template._