Big Data technologies like Hadoop have become an important part of the enterprise IT ecosystem.
Hadoop
Hadoop is an open-source software framework for storing data and running applications on clusters of commodity hardware. It provides massive storage for any kind of data, enormous processing power and the ability to handle virtually limitless concurrent tasks or jobs.
xFlow Research Inc. worked in collaboration with Dell on Benchmarking the Performance of Hadoop in different Cloud platforms including OpenStack, VMware vCloud and Microsoft Azure. The benchmarking has been done using the open-source benchmarking suite TPCx-HS.
TPC Express Benchmark HS (TPCx-HS) provides an objective measure of hardware, operating system and commercial Apache Hadoop File System API compatible software distributions, thus providing the industry with verifiable performance, price-performance and availability metrics.
The results of this POC has been published in the 8th TPC Technology Conference. It shows how an OpenStack cloud can be optimized to get the performance of TPCx-HS on the Cloud to match as closely as possible that on a Bare-metal configuration.
Click here to access the POC