Label: embedded_objects+issue

Content with label embedded_objects+issue in Practical Preservation Issues
Related Labels: spruce_glasgow, opf_montpellier, identification, appraisal_assessment, data_capture, preser, qa, hackathon, preservation, database, unknown_file_formats, permissions, preservingpdf, retention, solution, email, android, bit_rot, opf

Page: Extracting embedded objects from docx files
Detailed description: We preserve MS Word documents as docx files. We are reasonably confident that the XML structure preserves the report text and structure well. We are not so confident about ...
Other labels: york_hackathon
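A minimal sketch of one way to approach the docx issue above, not the solution from the linked page. It relies on the fact that a docx file is a ZIP package, and assumes that embedded objects sit under the conventional word/embeddings/ folder that Word normally uses (the package relationships, not the folder name, are authoritative, so other producers may differ). The function names are hypothetical.

```python
import zipfile
from pathlib import Path

# Assumption: Word's conventional folder for embedded objects inside the package.
EMBEDDINGS_PREFIX = "word/embeddings/"


def list_embedded_objects(docx_path):
    """Return the package part names found under word/embeddings/."""
    with zipfile.ZipFile(docx_path) as archive:
        return [
            name
            for name in archive.namelist()
            if name.startswith(EMBEDDINGS_PREFIX) and not name.endswith("/")
        ]


def extract_embedded_objects(docx_path, out_dir):
    """Copy each embedded part into out_dir and return the written paths."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    with zipfile.ZipFile(docx_path) as archive:
        for name in archive.namelist():
            if name.startswith(EMBEDDINGS_PREFIX) and not name.endswith("/"):
                # Keep only the file name so nothing is written outside out_dir.
                target = out / Path(name).name
                target.write_bytes(archive.read(name))
                written.append(target)
    return written
```

The extracted parts are often OLE containers (for example oleObject1.bin) rather than the original file, so a format identification step would still be needed afterwards.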
Page: Web based email "harvesting"
Detailed description: The setting is the collection of private archives, more specifically web-based email. It should be possible to automatically harvest emails from web-based email accounts. The system should scale as the number ...
Other labels: york_hackathon, email, harvesting, data_capture