Performance evaluation of cloud-based log file analysis with Apache Hadoop and Apache Spark
Abstract
We focus on three performance indicators: execution time, resource utilization, and scalability. We conducted realistic log file analysis experiments in both frameworks, Apache Hadoop and Apache Spark. We proposed a power consumption model and a utilization-based cost estimation. Our experiments confirmed that Spark delivered the best performance.
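The abstract does not give the form of the power consumption model or the cost estimation. As an illustration only, the sketch below assumes a common linear utilization-based power model (idle power plus a share of the dynamic range proportional to CPU utilization) and derives an energy cost from sampled utilization. The function names, the default wattages, and the price per kWh are hypothetical and are not taken from the paper.

```python
# Illustrative sketch only: a generic linear power model, NOT the paper's model.
# All names and default values here are assumptions made for the example.

def power_watts(utilization, p_idle=100.0, p_max=200.0):
    """Estimate node power draw (W) from CPU utilization in [0, 1]."""
    if not 0.0 <= utilization <= 1.0:
        raise ValueError("utilization must be in [0, 1]")
    # Idle power plus the utilized share of the dynamic range.
    return p_idle + (p_max - p_idle) * utilization


def energy_kwh(utilization_samples, interval_s, p_idle=100.0, p_max=200.0):
    """Integrate power over fixed-length sampling intervals and convert to kWh."""
    joules = sum(
        power_watts(u, p_idle, p_max) * interval_s for u in utilization_samples
    )
    return joules / 3_600_000.0  # 1 kWh = 3.6e6 J


def energy_cost(utilization_samples, interval_s, price_per_kwh,
                p_idle=100.0, p_max=200.0):
    """Utilization-based cost estimate: energy used times a flat price."""
    return energy_kwh(utilization_samples, interval_s, p_idle, p_max) * price_per_kwh


if __name__ == "__main__":
    # One hour of 1-second samples at 50% utilization (made-up input).
    samples = [0.5] * 3600
    print(energy_kwh(samples, 1.0))
    print(energy_cost(samples, 1.0, price_per_kwh=0.20))
```

Under this assumed model, a job that finishes sooner at similar utilization consumes less energy, which is one way execution time and resource utilization could feed into a cost comparison between the two frameworks.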