Apache Spark is one of the most widely adopted open-source Big Data processing engines. Its high performance and its ease of use for a broad range of users are among the primary reasons for this adoption. Although data partitioning improves the performance of analytics workloads, its application in Apache Spark is very limited because of Spark's layered data abstractions.
- Partial requirement for: M.S., Arizona State University, 2021
- Field of study: Computer Science