compared with
Current by Leïla Medjkoune
on Oct 01, 2014 13:14.

This line was removed.
This word was removed. This word was added.
This line was added.

Changes (3)

View Page History
| *Title* \\ | Internet Memory Web collections \\ |
| *Description* | _The data consists in web content crawled, stored and hosted by the Internet Memory Foundation_ (W)ARC format (approx. 2300TB) \\
Using this content, IM can also use its taskforce (QA team) to provide annotated data such as pairs of annotated snapshots for quality assurance scenarios. \\
1000 annotated paires of web pages (similar/dissimilar) were produced as part of PC.WP3: Quality Assurance Components. \\ |
| *Licensing* | _Web collections_ crawled on behalf of partner institutions will require institutions agreement to be used by SCAPE partners \\ |
| *Owner* | _Internet Memory_ \\ |