Title
PDF files from the Archaeology Data Service's grey literature library collection
Description
A collection of 3000+ PDF and PDF/A files. The files are excavation reports from the Grey Literature library from the Archaeology Data Service. http://archaeologydataservice.ac.uk/archives/view/greylit/ The collection brought is a selection of original PDFs (of all versions) and the PDF/As created from them. The whole collection contains over 20000 files.
Licensing
Covered by the Archaeology Data Service's Terms and conditions of use. available at http://archaeologydataservice.ac.uk/advice/termsOfUseAndAccess
Owner
The collection has multiple creators/owners but is hosted by Archaeology Data Service
Dataset Location
http://archaeologydataservice.ac.uk/archives/view/greylit/
Sample files here.
Dataset Champion
Issues brainstorm
- Batch conversion of PDF to PDF/A
- Batch validation of PDF/A files created
- Defining errors in validation/conversion which are actively a problem for long term preservation.
List of Issues
PDFA Validation tools give different results