Collection:
this is an example include. edit this to include the datasets you would like in your scenario
Title |
Austrian National Library - Digital Book Collection |
Description | The Austrian Books online project of the Austrian National Library is currently creating a digital book collection from the 16th to the 19th century of about 600.000 books that will be digitised over the coming years. |
Licensing | Sample only available to SCAPE partners. |
Owner | Austrian National Library |
Dataset Location | Austrian National Library |
Issue:
this is an example include. edit this to include the issues you would like in your scenario
Title |
IS1 Digitised TIFFs do not meet storage and access requirements |
Detailed description | A important part of digital preservation is the willingness and financial commitment of a memory institution to preserve the data for the long term. Given the time scales in question any cost saving is to be welcomed. At the BL, as elsewhere ![]() As a side benefit, replacing the TIFF images with alternative representations will facilitate access to the materials - smaller files to manipulate and download and native tool support in browsers and standard OSs. Access metrics also help to obtain the commitment of the memory institution to preserve data. |
Scalability Challenge |
The JISC1 collection is high volume (80TB). There are no specific requirements around performance of migration+QA solutions, although it would be desirable to complete processing within weeks rather than months. |
Issue champion | ![]() |
Other interested parties |
Schlarb Sven![]() |
Possible approaches | Migration from TIFF to JPEG2000 |
Context | |
Lessons Learned | |
Training Needs | |
Datasets | JISC1 19th Century Digitised Newspapers |
Solutions | SO31 Preservation Grade TIFF to JPEG2000 Migration |
Solutions:
this is an example include. edit this to include the solutions you would like in your scenario
Title | SO28 A heuristic measure for detecting undesired influence of lossy JP2 compression on OCR in the absence of ground truth |
Detailed description | This solution provides a workflow for detecting undesired influence of lossy JP2 compression on OCR in absence of ground truth. The workflow takes TIFF image instances as input, applies a defined list of JP2 compression parameter values, applies OCR using an open source OCR engine, evaluates the results and creates a final report. |
Solution Champion |
Schlarb Sven![]() |
Corresponding Issue(s) |
|
myExperiment Link |
http://www.myexperiment.org/workflows/2174.html ![]() |
Tool Registry Link |
|
Evaluation |
|