Label: aqua+pdf

All content with label aqua+pdf.
Related Labels: sox, word, 3gpp, jpg, jhears, qa, eml, case, server, ocr, business, msg, fu-script, audio, quality_assurance, gif, documents, video, obsolescence, more » ( - aqua, - pdf )

Page: Check consistency between metadata and content (AQuA)
One line summary Check that the METS, OCR, JPEG2000 masters and the PDFs are consistent \\ Detailed description As shown in the diagram below, check images and ALTO files information defined in METS against the real files stored in separate Zip files. Also ...
Other labels: mets, ocr, metadata, jpeg2000, jp2k, jp2, jpx, mj2
Page: Detect, extract and analyse embedded objects in PDFs (AQuA)
One line summary Detect and identify embedded objects in PDFs, then where appropriate extract and analyse analyse further \\ Detailed description The PDF specification is complex, and PDF files can contain other other objects, embedded at the file or page level ...
Other labels: objects, bmp, jpg, png, gif, tiff, pdfbox, jpxfilter
Page: Open Access PDFs (AQuA)
Basic description Open Access research outputs and etheses (sample being used from White Rose Research Online and White Rose eTheses Online)                     &nbsp ...
Other labels: dataset, document
Page: PDF Characterisation Tool (AQuA)
One line summary Java program to characterise PDF files, looking for preservation concerns.                                     &nbsp ...
Other labels: characterise, pdfbox, api, fonts, issue, acroform, embedded, jpeg2000
Page: Visual Analysis of Preflight Output (SPRUCE)
Visual Analysis of Preflight Output Detailed description During the mashup we ran Apache's PDFBox Preflight over 4000 PDFs sourced from ADS and Middlesex. I was curious about how we might ...
Other labels: spruce_london_2, solution, gephi, graphs, visualisation, appraisal_assessment