Investigator(s)
Paweł Kominek, Michał Kozak, Aleksander Stroiński, Tomasz Parkoła
Dataset
The medical dataset for this experiment comes from the overall WCPT dataset described here: http://wiki.opf-labs.org/display/SP/WCPT+medical+dataset.
Purpose of this experiment
The main goal of this experiment is to test the performance of the search functionality built into the MDC portal. The statistics should show how many concurrent users the MDC portal can serve.
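A measurement of this kind could be sketched as follows. This is only an illustration and not the harness used in the experiment: the names `timed_call`, `run_load_step` and `fake_search` are hypothetical, and `fake_search` is a stand-in that sleeps instead of sending a real request to the portal. The sketch runs a batch of searches with a given number of concurrent workers and reports failures, mean latency and 95th percentile latency.

```python
import time
from concurrent.futures import ThreadPoolExecutor
from statistics import mean


def timed_call(search_fn, query):
    """Run one search and return (latency_seconds, succeeded)."""
    start = time.perf_counter()
    try:
        search_fn(query)
        ok = True
    except Exception:
        ok = False
    return time.perf_counter() - start, ok


def run_load_step(search_fn, queries, concurrent_users):
    """Issue all queries (non-empty list) using `concurrent_users` threads."""
    with ThreadPoolExecutor(max_workers=concurrent_users) as pool:
        results = list(pool.map(lambda q: timed_call(search_fn, q), queries))
    latencies = sorted(lat for lat, _ in results)
    failures = sum(1 for _, ok in results if not ok)
    # Nearest-rank style 95th percentile, clamped to the last element.
    p95 = latencies[min(len(latencies) - 1, int(0.95 * len(latencies)))]
    return {
        "users": concurrent_users,
        "requests": len(results),
        "failures": failures,
        "mean_s": mean(latencies),
        "p95_s": p95,
    }


def fake_search(query):
    """Hypothetical stand-in for a search request to the portal."""
    time.sleep(0.01)
    return [query]


if __name__ == "__main__":
    # Increase the number of simulated users step by step.
    for users in (1, 5, 10):
        print(run_load_step(fake_search, ["heart"] * 20, users))
```

Replacing `fake_search` with a function that performs a real search request would turn the sketch into a simple load generator. The number of concurrent users at which failures appear or latency rises sharply gives a rough estimate of the portal's capacity.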
Platform
PSNC Hadoop Platform (http://wiki.opf-labs.org/display/SP/PSNC+Hadoop+Platform)
Workflow
The following steps compose the search functionality in the MDC portal:
- Request from the user (search in the MDC portal)
- Search request interpretation and lookup in the Hadoop cluster for the best matches
- Creation of the response, including data retrieval from HBase/HDFS
- Response to the user with the search results
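The steps above can be sketched as a small pipeline. This is a simplified illustration under stated assumptions and not the portal's actual code: `INDEX` and `STORE` are hypothetical in-memory dictionaries standing in for the lookup in the Hadoop cluster and for the data kept in HBase/HDFS, and all function names and sample records are invented for the example.

```python
from dataclasses import dataclass


@dataclass
class SearchRequest:
    terms: list


# Hypothetical in-memory stand-ins for the cluster index and the HBase/HDFS store.
INDEX = {"fracture": ["rec-1", "rec-3"], "heart": ["rec-2"]}
STORE = {"rec-1": "X-ray report", "rec-2": "ECG summary", "rec-3": "Discharge note"}


def interpret(raw_query):
    """Interpret the request: turn the user's text into normalized terms."""
    return SearchRequest(terms=raw_query.lower().split())


def lookup(request):
    """Find matching record ids (a dict lookup here, not a Hadoop job)."""
    ids = []
    for term in request.terms:
        for rec_id in INDEX.get(term, []):
            if rec_id not in ids:
                ids.append(rec_id)
    return ids


def build_response(ids):
    """Create the response by retrieving the data for each match."""
    return [{"id": rec_id, "data": STORE[rec_id]} for rec_id in ids]


def handle_search(raw_query):
    """Full path: user request in, search results out."""
    return build_response(lookup(interpret(raw_query)))
```

Each function corresponds to one stage that contributes to the response time seen by the user, so timing the stages separately would show where the search spends most of its time under load.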
Requirements and Policies
The experiment should test only a single instance of the MDC portal (i.e. no load balancing should be used).
Evaluations
Links to results of the experiment using the evaluation template.