Case Study
Achieving 10x acceleration of Spark and Hive Jobs on AWS S3 with Alluxio Tiered Storage
© Copyright Alluxio, Inc. All rights reserved. Alluxio is a trademark of Alluxio, Inc. Achieving 10x acceleration of Spark and Hive Jobs on AWS S3 with Alluxio Tiered Storage WHITEPAPER What’s Inside 1 / Introduction 2 / The Bazaarvoice Big Data Architecture 3 / Why Alluxio 4 / Bazaarvoice Architecture with Alluxio 5 / Optimization: Tiered Storage using ZFS 6 / Micro and Real-world Benchmark Results 7 / Conclusion A Digital Marketing Case Study/ 2 The data engineering team at Bazaarvoice, a software-as-a-service digital marketing company based in Austin, Texas, must handle data at massive Internet-scale to serve its customers. The company enables retailers and brands to curate, manage, and understand user-generated content (UGC) like product reviews, shopper questions and answ