Case Study

Evaluating Apache Spark and Alluxio for Data Analytics

Evaluating Apache Spark and Alluxio for Data Analytics

Pages 15 Pages

© Copyright Alluxio, Inc. All rights reserved. Alluxio is a trademark of Alluxio, Inc. Evaluating Apache Spark and Alluxio for Data Analytics Benchmarking Recommendations and Results WHITEPAPER What’s Inside 1 / Use Cases 2 / Benchmark Descriptions 3 / Best Practices 4 / Benchmark Results 5 / Conclusion This whitepaper details how to evaluate Alluxio’s data orchestra- tion platform as a distributed cache for Apache Spark in a public cloud or on-premises. We discuss best practices and benchmark- ing results with a combination of standard industry benchmarking suites, such as TPC-DS and HiBench, on cloud storage. This guide serves as a reference for reproducing similar experiments in your own environment as part of a Proof of Concept (PoC) to evaluate the use of Alluxio with Apach

Join for free to read