Label: duplication+york_hackathon

Content with label duplication+york_hackathon in Practical Preservation Issues
Related Labels: spruce_glasgow, embedded_objects, identification, vector, appraisal_assessment, data_capture, qa, unknown_file_formats, permissions, postscript, solution, convert, email, bit_rot, obsolescence, issue, spruce, harvesting, characterise

Page: Deduplication
Title: Deduplication. Detailed description: Collection owners need a way to easily identify duplicates in a collection. Duplicates are a common and seemingly simple issue, but the fact that it is rarely solved illustrates its complexity. In a collection of several hundred items it may be possible to identify duplicates manually ...
Other labels: issue
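
As an illustration of the Deduplication issue listed above, the following is a minimal sketch of one common approach: grouping files by a checksum of their contents. This is an assumption added for illustration, not a method described on the page itself. The function names `file_digest` and `find_duplicates` are hypothetical, and the sketch only finds exact byte-for-byte duplicates; it will not catch near-duplicates such as the same image saved in two formats.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files do not have to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_duplicates(root):
    """Group files under root by content digest.

    Returns a list of groups; each group is a list of paths whose
    contents are byte-for-byte identical. Unique files are omitted.
    """
    groups = defaultdict(list)
    for path in sorted(Path(root).rglob("*")):
        if path.is_file():
            groups[file_digest(path)].append(path)
    return [paths for paths in groups.values() if len(paths) > 1]
```

Calling `find_duplicates("some/collection")` (a hypothetical path) would return one list per set of identical files. For larger collections, a cheap first pass that groups by file size before hashing is a frequent refinement, since files of different sizes cannot be identical.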