British Library Sound & Vision

Institutional context
Institution type (e.g. Library, Archive)
Library & archive
Storage and access
Where are collection masters stored (media, number of copies, backup, preservation system)?
Temporary/working storage: networked Satabeast RAID storage with automated tape backup
Final storage: All nodes of the Library's Digital Library System
Where is access to the collection provided from?
Access to master files on temporary/working storage is restricted to staff logins via Library workstations
Low-resolution access copies will be delivered to the public online via British Library Sounds
What technical protocol is used to access files? Local file systems? Windows shares? (SMB/CIFS)
Master files: Windows shares
Access files: HTTP
Describe your existing content workflow.
  1. Digital audio files created by users on handheld / portable devices, using the Audioboo app.  Users may add descriptive metadata, to be embedded in each file.
  2. Audio files uploaded from portable devices to Audioboo at
  3. Audio files FTP'd to BL servers, in batches, by Audioboo.
  4. Catalogue data produced by curator using Excel for later export to Library catalogue.
  5. Audio files manually accessed using a combination of Foobar and MediaCoder Audio Edition; technical and ID3 metadata recorded for FLAC, m4a, mp4, 3gpp, amr files in Excel for later recording in Library catalogue entries.  Non-valid/unreadable files weeded out manually.
  6. Technical metadata and ID3 data for mp3, wma files extracted to csv using mp3tag.  csv data merged with Excel data.  csv files deleted.  Non-valid/unreadable files weeded out manually.
  7. Technical metadata and BEXT data extracted from (user-generated) WAV files using Jhove. 
  8. Jhove data filtered and parsed with Saxon to csv.  csv data merged with Excel data.  Jhove and csv data deleted.
  9. Audio files normalized to WAVE.
  10. Technical metadata recorded for normalized wave files and merged in Excel.
  11. Normalized audio files renamed with UIDs.
  12. md5 checksums generated for normalized / renamed audio files with fsum frontend.
  13. Excel data exported to catalogue-ready text files.  Catalogue entries created.
  14. Normalized audio files and checksums uploaded to DLS.
  15. Original audio files deleted.
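Step 12 above (MD5 checksums for the normalized, renamed files) could be sketched in Python as follows. This is only an illustration using the standard library, not the fsum frontend the workflow actually uses; the function names, the `*.wav` pattern and the `checksums.md5` manifest name are assumptions made for the example.

```python
import hashlib
from pathlib import Path


def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the hex MD5 digest of a file, reading in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def write_manifest(folder, manifest_name="checksums.md5"):
    """Write one '<digest> *<filename>' line per .wav file found in folder.

    The manifest name and the line layout are assumptions for this sketch.
    """
    folder = Path(folder)
    lines = []
    for wav in sorted(folder.glob("*.wav")):
        lines.append(f"{md5_of_file(wav)} *{wav.name}")
    manifest = folder / manifest_name
    manifest.write_text("\n".join(lines) + "\n", encoding="utf-8")
    return manifest
```

Calling `write_manifest` on the folder of renamed files would produce a single manifest that can travel with the audio when it is uploaded, so the checksums can be re-verified after transfer.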
What tools are part of the existing workflow?
What technologies underlie the existing workflow?
What challenges are present in the existing workflow? (technology, organisational, staffing)
Time and complexity
Does the workflow include manual steps?
Where in this content workflow would the prototype solution be deployed?
To replace steps 5-10
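The prototype solution itself is not described here. Purely as an illustration of the kind of automation that could stand in for the manual recording in step 10, the sketch below reads technical metadata from WAVE files with Python's standard `wave` module and writes it to csv, listing any files it cannot read. The field names, function names and `*.wav` pattern are assumptions, and `wave` only handles uncompressed PCM WAVE files.

```python
import csv
import wave
from pathlib import Path

# Column names are assumptions for this sketch, not the Library's catalogue fields.
FIELDS = ["filename", "channels", "sample_rate_hz", "bit_depth", "duration_s"]


def describe_wave(path):
    """Return basic technical metadata for one PCM WAVE file."""
    with wave.open(str(path), "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "filename": Path(path).name,
            "channels": wav.getnchannels(),
            "sample_rate_hz": rate,
            "bit_depth": wav.getsampwidth() * 8,
            "duration_s": round(frames / rate, 3) if rate else 0,
        }


def write_report(folder, report_path):
    """Write one csv row per readable .wav file; return names of unreadable files."""
    unreadable = []
    with open(report_path, "w", newline="", encoding="utf-8") as out:
        writer = csv.DictWriter(out, fieldnames=FIELDS)
        writer.writeheader()
        for path in sorted(Path(folder).glob("*.wav")):
            try:
                writer.writerow(describe_wave(path))
            except (wave.Error, EOFError):
                # Not a valid PCM WAVE file, or truncated: flag it for review.
                unreadable.append(path.name)
    return unreadable
```

The returned list of unreadable files corresponds to the weeding that the existing workflow does by hand, and the csv could be merged with the Excel data in the same way as the existing csv outputs.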
What is the process for changing or enhancing the workflow? What obstacles to change are present?
Written workflow documents and diagrams will need to be rewritten to incorporate improvements.  Python to be installed.  XSLT to be defined for transforming outputs.  No other changes should be necessary.
Who executes the existing workflow?
Production Co-ordinator, Sound & Vision Technical Services
Who administers the existing workflow?
Production Co-ordinator, Sound & Vision Technical Services
What system rights do the workflow executors have? Can they install software? Can they use the web?
Full admin rights
Who is the collection owner or curator? (section/department/team)
British Library Sound & Vision, Unlocking & Integration AV Content
Is there a workflow champion, who is it?
Adam Tovell