Case Study
ETL pipelining and analytics using Spark on AWS EMR
Executive Summary

The ABG analytics team wanted to migrate their on-premise architecture to the AWS cloud. HashedIn, an advanced partner with AWS, helped them with PoCs for shifting their processes to the AWS cloud.

Problem Statement

We helped them resolve two problems:

Data warehouse migration from Teradata to AWS - ABG had an on-premise data center running on Teradata servers, which they wanted to move to the AWS ecosystem so that they could run their analytics at scale.

Root cause analysis optimization using Spark on AWS EMR - They wanted to move the analytics running on an R server to Spark on AWS EMR.

Business Requirements

One of the ABG companies has an on-premise data warehouse. This data warehouse is on a shared infra