Label: pdf

All content with label pdf.
Related Labels: jpg, qa, eml, case, server, ocr, business, msg, gif, documents, obsolescence, comparison, bmp, without, pst, api, conversion, matching, characterisation, more »

Page: Analysis of Acrobat Engineering PDFs with Acrobat Preflight and Apache Preflight (The Registry)
About this page This page shows the results of a comparative analysis of a selection of PDFs from the Adobe Acrobat Engineering website http://acroeng.adobe.com/wp/. The current analysis is limited to the following subset of PDFs that are hosted there: 1. all files in the General section of the Font ...
Page: Apache PDFBox (The Registry)
Summary Purpose JAVA PDF library for creation, manipulation and content extraction of PDF documents Homepage http://pdfbox.apache.org/ License Apache License v2.0 Debian Package Description Quoted from the website "Apache PDFBox™ is an open source Java ...
Other labels: characterisation, too, tool, library
Page: Born-digital - migration success (AQuA)
One line summary Checking whether an automated normalisation produces a surrogate of sufficient quality ... Detailed description "sufficient" obviously needs to be defined in terms of significant properties relevant to the context but are there some checks which can be run to determine whether ...
Other labels: qa, comparison, characterise, office, issue
Page: Check consistency between metadata and content (AQuA)
One line summary Check that the METS, OCR, JPEG2000 masters and the PDFs are consistent \\ Detailed description As shown in the diagram below, check images and ALTO files information defined in METS against the real files stored in separate Zip files. Also ...
Other labels: mets, ocr, metadata, jpeg2000, jp2k, jp2, jpx, mj2
Page: Detect, extract and analyse embedded objects in PDFs (AQuA)
One line summary Detect and identify embedded objects in PDFs, then where appropriate extract and analyse analyse further \\ Detailed description The PDF specification is complex, and PDF files can contain other other objects, embedded at the file or page level ...
Other labels: objects, bmp, jpg, png, gif, tiff, pdfbox, jpxfilter
Page: Disassociation of files and metadata (SPRUCE)
Title \\ Disassociation of files and metadata \\ Detailed description Each digitised page on the website must have a tif file, a htm file and a pdf file (plus other derivatives such as jpegs). These must all match each other (i.e. represent the same ...
Other labels: file, management, excel, htm, tif, matching, ocr, tool
Page: Embedded links within the PDF (AQuA)
One line summary Need to identify links embedded within PDFs and check whether they are still live                                  &nbsp ...
Other labels: issue, obsolescence, dependency
Page: Embedded objects in PDFs (AQuA)
One line summary Need to detect embedded objects within PDFs                                          &nbsp ...
Other labels: issue, embedded_objects
Page: Encryption (The Registry)
Description PDF permits the use of encryption as a means of restricting access or (re)use of content. This may range from documents that can only be opened after providing a password, to disabling specific functionality (e.g. printing, copying content). Risks Content ...
Other labels: formatissue
Page: File attachments (The Registry)
Description PDFs may contain file attachments. There are two ways to include an attachment in a PDF: # Pagelevel attachments which use a File Attachment Annotation (section 12.5.6.15 of ISO32000 http://www.adobe.com/content/dam/Adobe/en/devnet/acrobat ...
Other labels: formatissue