Web based email "harvesting"

Skip to end of metadata
Go to start of metadata
Web based email "harvesting"
Detailed description The setting is collecting private archives, more specific web based emails. It should be possible to automatically harvest emails from web based email accounts. The system should scale as the number of users increase. The system should be able to harvest only parts of a web based e-mail account. It should be possible for the donors to start a e-mail harvesting (to the library) by passing their credentials (login and password) for their e-mail account to the system. The file format of the harvested e-mails could be HTML or plain text. The attachments of the emails can be in their original format.
Issue champion Claus Jensen
Other interested parties
Any other parties who are also interested in applying Issue Solutions to their Datasets
Possible Solution approaches Brief brainstorm of possible approaches to solving the Issue. Each approach should be described in a single sentence as part of a bulleted list
Context Details of the institutional context to the Issue. (May be expanded at a later date)
Lessons Learned Notes on Lessons Learned from tackling this Issue that might be useful to inform digital preservation best practice
Datasets Web based emails
Solutions Harvest webmail accounts
york_hackathon york_hackathon Delete
email email Delete
issue issue Delete
harvesting harvesting Delete
embedded_objects embedded_objects Delete
data_capture data_capture Delete
Enter labels to add to this page:
Please wait 
Looking for a label? Just start typing.