Abstract
We focus on three performance indicators: execution time, resource utilization, and scalability. We conducted realistic log file analysis experiments in both frameworks. We proposed a power consumption model and a utilization-based cost estimation. Our experiments confirmed that Spark delivered the best performance.