Corrupted JPEG and JPEG2000 files

Skip to end of metadata
Go to start of metadata
Corrupted JPEG and JPEG2000 files
Detailed description JPEG/JPEG2000 scans are sometimes corrupted. They contain areas which come from other areas. When such an area contains a dark area (edge of scan showing page edges), this is particularly visible. When one area of text is on top of another area of text, it is less visible. The images have also been rotated after the corruption occurred.
Also described here: Shifted Crop Corruption
Issue champion Paul Wheatley
Other interested parties
Any other parties who are also interested in applying Issue Solutions to their Datasets
Possible Solution approaches
  • Put the images through a filter to detect edges
  • Write program to find dark areas
Context Details of the institutional context to the Issue. (May be expanded at a later date)
Lessons Learned Notes on Lessons Learned from tackling this Issue that might be useful to inform digital preservation best practice
Datasets BL 19th Century digitised newspaper collection
Solutions Corrupted JPEG and JPEG2000 files solution
jpeg jpeg Delete
jpeg2000 jpeg2000 Delete
spruce_glasgow spruce_glasgow Delete
spruce spruce Delete
issue issue Delete
bit_rot bit_rot Delete
Enter labels to add to this page:
Please wait 
Looking for a label? Just start typing.