
Please make any changes to the public agenda page: SCAPE Future Formats First Agenda
![]() | Please make any changes to the public agenda page: SCAPE Future Formats First Agenda |
Learning Outcomes (by the end of the session the attendees will be able to:
- Understand scalable platforms and evaluate the situations in which such environments are required.
- Apply knowledge of existing tools to solve migration and quality control problems.
- Combine and modify tool chains in order to create automated workflows for migration and quality control.
- Implement best practice for discovering and sharing workflows for use and re-use.
- Make use of a scalable environment and apply a number of workflows to automatically perform migration and quality assurance checks on a large number of objects.
- Identify a number of potential problems when working in a scalable environment and propose solutions.
- Understand the potential to use scalable platforms in digital preservation and synthesise new opportunities within your own environments.
Draft Agenda:
Monday 16 September
Time | Session | Facilitator | Learning outcomes |
---|---|---|---|
09.15 - 09.45 | Registration and coffee | ||
09.45 - 09.50 | Welcome and housekeeping | BL | |
09.50 - 10.20 | Building Scalable Environments Understanding the fundamentals of scalability and why it is important:
|
Rainer Schmidt, AIT |
1 |
10.20 - 10.35 | Use case Migrating TIFF to JPEG 2000 at the British Library |
Peter May / Will Palmer, BL | 1 |
10.35 - 11.15 | Practical exercise Experiment with a pre-built environment to migrate TIFF to JPEG 2000. Delegates can bring their own images or use sample files. |
Rainer Schmidt (AIT), Roman Graf (AIT), Matthias Rella (AIT) Dave Tarrant, OPF |
2 |
11.15 - 11.30 | Coffee break | ||
11.30 - 12.45 | Migration and Quality Assurance Exploring migration and quality control tools for images and understanding how these are invoked on a single machine instance. Demonstration and practical exercise: How imageMagik and jpylyzer are run on a single TIFF to JPEG 2000 conversion. This exercise will be carried out in your own local instance and not built to scale. |
Sven Schlarb (ONB) Carl Wilson (OPF) |
2 |
12.45 - 13.30 | Lunch | ||
13.30 - 15.00 | Workflows With the tools explored, we will introduce workflows and look at how these can be used to invoke multiple operations to both migrate content and run quality control checks on the results. Demonstration and practical exercise: Migrate an image using imageMagick then use jpylyzer to check for valid JPEG 2000 image. Again this exercise will be carried out in your own local instance and not built to scale. |
Sven Schlarb (ONB) |
3 |
15:00 - 15.15 | Coffee | |
|
15.15 - 16:30 | How to share your workflow Having built a workflow we look at how to share and discover other workflows. Practical exercise: Describe and upload workflows |
Donal Fellows (UNIMAN) |
4 |
16.30 - 17.00 | Wrap up | Dave Tarrant, OPF Rainer Schmidt, AIT |
|
17.00 | Close | ||
19.30 | Event dinner at The Betjeman Arms![]() |
Tuesday 17 September
Time | Session | Facilitator | Learning outcomes |
---|---|---|---|
09.00 - 09.15 | Coffee, welcome back and overview of agenda for the day | Dave Tarrant, OPF | |
09.15 - 10.15 | Introduction to preservation at scale This session introduces the Hadoop platform introduces its application for executing preservation workflows in a distributed environment. More than just "getting the job done" we look at the tools for monitoring and controlling complex operations at scale and look at how these can be used to identify potential problems. |
Sven Schlarb, ONB Rainer Schmidt, AIT |
5 |
11.00 - 11.15 | Coffee | ||
11.15 - 12.30 | Building Scalable Environments continued Practical exercises Set up the Hadoop test installation running the same scripts as the demonstration cluster. Read and analyse various log files to identify potential problems (e.g. tool versions) |
Rainer/Graf/Rella (AIT) Sven Schlarb (ONB) |
5 6 |
12.30 - 13.30 | Lunch | ||
13.30 - 14.30 | Invited talk: Introduction to the SCAPE repository reference implementation This talk will introduce the SCAPE repository reference implementation as a guide to get you started with using Fedora 4. It will discuss the opportunities and potential for the future for scalability with respect to digital object management systems. |
Matthias Hahn (FIZ) Frank Asseg (FIZ) |
7 |
14.45 - 15.00 | Coffee break | |
|
15.00 - 16.00 | Integrating Taverna and Hadoop This final session recaps the work that has been done to this point and allows attendees to fully integrate a number of workflows (both of their own making as well as existing ones) into scalable preservation platform on-site. |
Sven Schlarb (ONB) Rella, Schmidt Graf (AIT) |
5 |
16.00 - 17.00 | Panel and wrap up | Rainer Schmidt, AIT Dave Tarrant, OPF |
|
17.00 | Close |
Labels:
None