Title
British Library UK Web Domain Dataset: Format Profile
Description MIME type records have been created for the UK Web Domain Dataset, using three sources/tools:
  • the MIME types delivered by the server 
  • Apache Tika
  • DROID 
    All three MIME types are recorded for each resource, along with the year the resource was crawled. These four values are treated as a 'key' for the resource, and the number of resources sharing each key is counted across the entire dataset. The result is output as tab-separated data. For full details, see the Nanite codebase; tag v.0.1.1 was used to create this dataset.
Licensing Open data, public domain.
Owner See above, originating from the British Library
Dataset Location https://github.com/ukwa/opendata/tree/master/datasets/ukwa.ds.2/fmt#uk-web-domain-dataset-1996-2010-format-profile
Collection expert Maureen Pennock (BL)
Issues brainstorm TBC
List of Issues TBC
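
The counting step in the Description can be illustrated with a minimal Python sketch. This is not the Nanite implementation; the function names, the column order (server type, Tika type, DROID type, year, count), and the sample records are assumptions made for illustration only.

```python
import csv
import io
from collections import Counter


def build_format_profile(records):
    """Count resources per (server MIME, Tika MIME, DROID MIME, crawl year) key.

    `records` is any iterable of 4-tuples; this input shape is an assumption.
    """
    counts = Counter()
    for server_type, tika_type, droid_type, year in records:
        counts[(server_type, tika_type, droid_type, year)] += 1
    return counts


def to_tsv(counts):
    """Serialise the counts as tab-separated lines: the four key fields, then the count."""
    buf = io.StringIO()
    writer = csv.writer(buf, delimiter="\t", lineterminator="\n")
    for key in sorted(counts):
        writer.writerow([*key, counts[key]])
    return buf.getvalue()


if __name__ == "__main__":
    # Hypothetical sample records, not taken from the dataset.
    sample = [
        ("text/html", "text/html", "text/html", 2004),
        ("text/html", "text/html", "text/html", 2004),
        ("image/gif", "image/gif", "image/gif", 1999),
    ]
    print(to_tsv(build_format_profile(sample)), end="")
```

Keeping all three identifications in the key, rather than picking one, means rows where the server, Tika, and DROID disagree remain visible in the output.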


Labels: dataset, webarchive, formatprofile, representationinformation, identification, researchdata