Title | Corrupted JPEG and JPEG2000 files solution |
Detailed description | I tried some edge detection, but the pages full of text and line drawings had too many edges, so the edges of the corrupted areas were no more visible than before. I converted the JPEGs to smaller 1 bit PNGs, so that processing them would be quicker. I wrote a Python2 script to find areas of black. The program would first look for rows which had a higher than average number of black pixels and were contiguous. Within these rows, it would then look for columns which were largely black and contiguous. It reports files which have such areas and also produces mask image files which show where the black areas were found. The mask files will be put into small_newspapers, and the names of any images with black areas will be put into results.txt. |
Solution Champion | ![]() |
Corresponding Issue(s) | |
Tool/code link | SPRUCE/tree/master/black_pixels![]() |
Tool Registry Link![]() |
|
Evaluation |
|
Labels: