Label: hadoop+webarchive

All content with label hadoop+webarchive.
Related Labels: planning, lsdr, representationinformation, characterisation, microsoft, watch, identification, obsolescence, issue, tool, qa, formatprofile, arc, database, researchdata, unknown_file_formats, unknown_characteristics, dataset, azure

Page: IS41 Analyse huge text files containing information about a web archive (SCAPE)
Title: IS41 Analyse huge text files containing information about a web archive. Detailed description: Some web archives periodically produce information about their content. The result is sometimes stored as huge text files ...
Other labels: issue, characterisation, unknown_characteristics
Page: WCT8 Huge text file analysis using hadoop (SCAPE)
Collection: Issue: Solutions
Other labels: scenario, characterisation