

Bolette Jurik (SB)


Danish Radio broadcasts, mp3


SB Hadoop Platform


The workflow is the same as SB Experiment SO4 Audio mp3 to wav Migration and QA Workflow.

  • Migration from mp3 to wav using FFmpeg
  • Validation that the migrated file is a correct file in the wanted format using JHOVE2
  • Extraction and comparison of header information properties of the original and the migrated files using FFprobe
  • Conversion of the mp3 file to wav using mpg321
  • Comparison of the two wav files using xcorrSound waveform-compare (earlier migrationQA)
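The header comparison step can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the helper names (`probe`, `first_audio_stream`, `compare_properties`) and the choice of compared properties are assumptions made for the example. Only the FFprobe options used (`-v error`, `-print_format json`, `-show_format`, `-show_streams`) are standard.

```python
import json
import subprocess

# Properties expected to survive an mp3 -> wav migration. This selection is an
# assumption for the sketch; the experiment's actual property list may differ.
COMPARED_STREAM_KEYS = ("sample_rate", "channels")


def probe(path):
    """Run FFprobe on an audio file and return its JSON report as a dict."""
    result = subprocess.run(
        ["ffprobe", "-v", "error", "-print_format", "json",
         "-show_format", "-show_streams", path],
        check=True, capture_output=True, text=True,
    )
    return json.loads(result.stdout)


def first_audio_stream(info):
    """Pick the first audio stream from a parsed FFprobe report."""
    for stream in info.get("streams", []):
        if stream.get("codec_type") == "audio":
            return stream
    raise ValueError("no audio stream found")


def compare_properties(original_info, migrated_info, keys=COMPARED_STREAM_KEYS):
    """Return {key: (original, migrated)} for every compared property that differs."""
    a = first_audio_stream(original_info)
    b = first_audio_stream(migrated_info)
    return {k: (a.get(k), b.get(k)) for k in keys if a.get(k) != b.get(k)}
```

With FFprobe installed, `compare_properties(probe("in.mp3"), probe("out.wav"))` would return an empty dict when the compared properties agree. Properties that legitimately change in a migration, such as the codec name, are deliberately left out of the comparison.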

The difference is that the workflow is written as a number of Hadoop jobs / Hadoop mappers instead of a Taverna workflow.

The project is available from

In addition, there is now a Taverna workflow combining three of these Hadoop jobs.

In summary, this workflow performs migration, conversion and content comparison. The top left box (a nested workflow) migrates a list of mp3 files to wav files using a Hadoop map-reduce job that calls the command-line tool FFmpeg, and outputs a list of migrated wav files. The top right box converts the same list of mp3 files to wav files using another Hadoop map-reduce job that calls the command-line tool mpg321, and outputs a list of converted wav files. The Taverna workflow then combines the two lists of wav files, so that the bottom box receives a list of pairs of wav files to compare. The bottom box compares the content of each pair using a Hadoop map-reduce job that calls the xcorrSound waveform-compare command-line tool, and outputs the results of the comparisons.
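The pairing step between the two conversion jobs and the comparison job could look something like the sketch below. It assumes that both jobs name each wav file after the base name of its source mp3 and write to different directories. That naming scheme, the function name `pair_wav_lists` and the example paths are all hypothetical.

```python
from pathlib import PurePosixPath


def pair_wav_lists(ffmpeg_wavs, mpg321_wavs):
    """Pair wav files from the two jobs by base name.

    Returns (pairs, unmatched): pairs is a list of (ffmpeg_wav, mpg321_wav)
    tuples, unmatched lists FFmpeg outputs that have no mpg321 counterpart.
    """
    # Index the mpg321 outputs by file name without directory or extension.
    by_stem = {PurePosixPath(p).stem: p for p in mpg321_wavs}
    pairs, unmatched = [], []
    for path in ffmpeg_wavs:
        partner = by_stem.get(PurePosixPath(path).stem)
        if partner is None:
            unmatched.append(path)
        else:
            pairs.append((path, partner))
    return pairs, unmatched


# Hypothetical NFS paths, for illustration only.
pairs, unmatched = pair_wav_lists(
    ["/nfs/ffmpeg/p1_2012.wav", "/nfs/ffmpeg/p3_2012.wav"],
    ["/nfs/mpg321/p1_2012.wav"],
)
```

Reporting unmatched files separately, instead of silently dropping them, makes it visible when one of the two conversion jobs failed for a given input.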


The file containing the list of mp3 files to be migrated is available from HDFS. The mp3 files are stored on NFS, and the resulting wav files are written to NFS. There are a number of reasons for this.

  • The first is that the audio tools we are using were written to read from and write to NFS.
  • Also, at SB, digitally preserved material does not reside on HDFS. In order to migrate from and to HDFS, we would therefore first need to copy the mp3s to HDFS and later copy the wavs from HDFS. These extra copy operations are expensive for large-scale audio collections.
  • Finally, the SB Hadoop Platform is set up using network storage as local storage, which means that we do not exploit the HDFS locality property. Accessing the files on NFS rather than HDFS therefore does not present a large overhead.

The preservation event and log files are all written to HDFS. This means we have a rather complex input/output model, with input from both HDFS and NFS and output to both HDFS and NFS. And this is of course only an experiment! If this workflow is going to be used in production, we need to add the repository connection, so that data can be both retrieved from and written to the repository.

Future Work

What we would like to do next is:

  • Run an experiment using 1TB of mp3 files on the SB Hadoop cluster. This, however, requires some updates to the workflow. For 1TB of input mp3 files, the workflow currently generates approximately 25TB of output and temporary wav files. Our test set-up is not suited for this, so we would like to delete these files along the way. To do so, we would like the Taverna workflow to work on lists of lists of files. We can then limit the size of the data written to e.g. 2TB and delete it before continuing, as the only important outputs of the experiments are the comparison results and the performance measurements. We can also experiment with sending the Hadoop jobs lists of different sizes.
  • Extend the workflow with property comparison. The waveform-compare tool only compares sound waves; it does not look at the header information. A header comparison should be part of the quality assurance of a migration. The reason this is not top priority is that FFprobe property extraction and comparison is very fast, and will probably not affect overall workflow performance much.
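The "lists of lists" idea from the first item can be sketched as a simple batching function. This is only an illustration under stated assumptions: the function name `batch_by_output_budget` is hypothetical, the default expansion factor of 25 is taken from the 1TB to 25TB estimate above, and applying that factor per file is a simplification.

```python
def batch_by_output_budget(files, budget_bytes, expansion=25.0):
    """Split (path, size_in_bytes) entries into batches of paths.

    Each batch is closed once the estimated size of the wav files it would
    produce (input size * expansion) would exceed budget_bytes. A single file
    whose estimate is larger than the budget still gets a batch of its own.
    """
    batches, current, used = [], [], 0.0
    for path, size in files:
        estimated = size * expansion
        if current and used + estimated > budget_bytes:
            batches.append(current)
            current, used = [], 0.0
        current.append(path)
        used += estimated
    if current:
        batches.append(current)
    return batches


# Hypothetical input: three mp3 files of 40 bytes each and a budget of 2000
# bytes, so that two files (2 * 40 * 25 = 2000) fit into one batch.
example = batch_by_output_budget([("a.mp3", 40), ("b.mp3", 40), ("c.mp3", 40)], 2000)
```

Each inner list would then be run through the migration, conversion and comparison jobs, and its wav files deleted before the next list is started.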

Requirements and Policies



Evaluation - SB Experiment mp3 to wav Migration and QA on Hadoop Cluster
