Label: dataset

Content with label dataset in Practical Preservation Issues (See content from all spaces)
Related Labels: opf_montpellier, disk_image, email, opf, spruce_london_2, software, spruce, opf_copenhagen, document, web, untagged, mixed, database, image, preservingpdf, audio

Page: ADS Grey Literature Library
Title \\ {}ADS Grey Literature Library{} \\ Description {}This collection is owned by the Archaeology Data Service. It is a library of unpublished fieldwork reports recording the results of archaeological fieldwork in the UK. Currently holds over 11,000 reports and increases by around 200 reports per ...
Other labels: document
Page: BL 19th Century digitised newspaper collection
Title \\ BL 19th Century digitised newspaper collection Description 1,000000\ digitised newspaper pages from the 19th century. TIFF master files cropped TIFFs. ALTO XML and METS XML. Derived JPEG2000. \\ Licensing Access to images via ...
Other labels: image
Page: Computer game disc images
Title \\ Computer game disc images Description The dataset consists of disc image files of computer games. The image file types are dd, iso, img, and sub.\\ Licensing Unknown. Owner The Royal Library of Denmark. Dataset Location The repository for private ...
Other labels: opf, disk_image
Page: Database containing a unique list of Danish words
Title \\ Database containing a unique list of Danish words Description The dataset is based on a web scrape of selected Danish websites, extraction of words from the webpage <body></body> and insertion of unique words into mySQL databqase. The words are enriched with information on the surrounding ...
Other labels: opf, opf_copenhagen, web, document
Page: eTheses
Title \\ eTheses Description White Rose consortium holds thousands of theses and have rarely run preservation actions on the files.&nbsp; Do we know of any issues related to hte PDFs?&nbsp; Can we convert them to PDF ...
Other labels: document
Page: French Web Archives
Title \\ French Web Archives Description Several hundreds Tb of data representing 15 years of harvesting the French web. The harvesting was outsourced at first, and is now done&nbsp;inhouse. It is a mix of large scale harvests of the .fr domain and indepth harvests of a curated ...
Other labels: web, mixed
Page: Ida Roper Herbarium archive
Title \\ Ida Roper Herbarium archive \\ Description The Roper archive consists of approximately 10,000 specimens of English plants. The digital archive donated to Leeds represents the outputs of a Arts and Humanitites Research Board project from 2003, to improve access to the Ida ...
Other labels: mixed, web, document, image, database
Page: Imperial College Exploration Board Adventure 2001 Comprising Overland Pakistan and Biafo Climbing Nick Adlam Alain Hosley James Smyth Tim Harris Nick Saunders
Title \\ Imperial College Exploration Board Adventure 2001 Comprising Overland Pakistan and Biafo Climbing Nick Adlam ,Alain Hosley, James Smyth, Tim Harris, Nick Saunders \\ \\ \\ Description Imperial College created on an Apple Mac report ...
Other labels: opf, preservingpdf
Page: LAVC audio
Title \\ LAVC (Leeds Archive of Vernacular Culture) audio files \\ Description audio files copied from masters and then recopied \\ Licensing content owned by University of Leeds, use only with permission \\ Owner University of Leeds \\ Dataset Location Test files ...
Other labels: audio
Page: Leeds image duplicates and versions
Title \\ Duplicate image files a.k.a find the master or external drive of horrors \\ Description 150G folder of image files, unknown origin, some are duplicates, some are derivatives, many are 3rd or 4th generation copies... \\ 20,000\ files and where does ...
Other labels: image