Label: characterisation

Content with label characterisation in SCAPE (See content from all spaces)
Related Labels: planning, hadoop, lsdr, solution, validation, watch, identification, obsolescence, issue, scape, qa, conformance, webarchive, unknown_characteristics, scenario

Page: IS11 PDF files may face preservation risks
Title \\ IS11 PDF files may face preservation risks Detailed description Problem: the PDF document standard contains various features that pose a direct threat to the longterm accessibility of PDF files. Examples are: password protection, external dependencies, nonembedded ...
Other labels: lsdr, issue, watch, obsolescence
Page: IS14 Diverse preservation risks in large archives with millions of objects
Title \\ IS14 Diverse preservation risks in large archives with millions of objects Detailed description While we ingested millions of objects in the past, we expanded our knowledge about the risks of the objects. However, before we could make a decision ...
Other labels: webarchive, identification, issue, watch, obsolescence
Page: IS20 Detect audio files with very bad sound quality
Title \\ IS20 Detect audio files with very bad sound quality Detailed description In a collection of mp3 files (20 Tbytes 360.000 files) we have discovered files with very bad sound quality. Before ingesting everything into our ...
Other labels: lsdr, qa, issue
Page: IS22 Characterise and Validate very large mpeg-1 and mpeg-2 files
Title \\ IS22 Characterise and Validate very large mpeg1 and mpeg2 files Detailed description Collections of very large videofiles (50Gb\ each) are hard to handle when it comes to characterisation and validation. Known characterisation tools do not nessecarily like very ...
Other labels: identification, lsdr, issue, obsolescence
Page: IS24 Characterisation of large amounts of wav audio
Title \\ IS24 Characterisation of large amounts of wav audio Detailed description SB holds large amounts of WAV audio (200Tb \) in different resolutions (ranging from 22Khz 16 bit to 96Khz 24 bit). Different resolutions have been ...
Other labels: lsdr, issue, unknown_characteristics
Page: IS25 Web Content Characterisation
Title \\ IS25 Web Content Characterisation Detailed description \\ The issue with web content is mainly the fact that web archive data is very heterogeneous. Depending on the policy of the institution, data contains text documents in all kinds of text encoding, html content ...
Other labels: identification, webarchive, issue, obsolescence
Page: IS2 Do acquired files conform to an agreed technical profile, are they valid and are they complete?
Title \\ IS2 Do acquired files conform to an agreed technical profile, are they valid and are they complete? Detailed description Some forms of content arrive at the preserving institution and will be preserved "as is" regardless of how the files have been constructed (eg. web archived ...
Other labels: lsdr, qa, issue, conformance
Page: IS3 Large media files are difficult to characterise without mass processing + We cannot identify preservation risks in uncharacterised files
Title \\ IS3 Large media files are difficult to characterise without mass processing We cannot identify preservation risks in uncharacterised files Description At SB, data from broadcasters contain huge media files like MPEG2 transport streams ...
Other labels: lsdr, issue, watch, planning, obsolescence
Page: IS41 Analyse huge text files containing information about a web archive
Title \\ IS41 Analyse huge text files containing information about a web archive \\ Detailed description Some web archive produce information about the content of a web archive on a periodical basis. The result is sometimes stored as huge text files ...
Other labels: issue, hadoop, webarchive, unknown_characteristics
Page: IS42 Detecting Encryption and DRM in Digital Content
Title \\ Detecting Encryption/DRM in Digital Content \\ Detailed description Many file formats make provision for the encryption of content, e.g. password protected PDFs. Outside of formats software exists that will encrypt data at a file, directory, and device level, e.g. ...
Other labels: issue, obsolescence, lsdr