参考文献:1.Anderson, D.P.: BOINC: a system for public-resource computing and storage. In: Fifth IEEE/ACM International Workshop on Grid Computing, pp. 4-0 (2004) 2.Cesario, E., De Caria, N., Mastroianni, C., Talia, D.: Distributed data mining using a public resource computing framework. In: Desprez, F., Getov, V., Priol, T., Yahyapour, R. (eds.) Grids, P2P and Services Computing, pp. 33-4. Springer, US (2010)View Article 3.Schlitter, N., Laessig, J., Fischer, S., Mierswa, I.: Distributed data analytics using RapidMiner and BOINC. In: Proceedings of the 4th RapidMiner Community Meeting and Conference (RCOMM 2013), pp. 81-5 (2013) 4.Savasere, A., Omiecinski, E., Navathe, S.: An efficient algorithm for mining association rules in large databases. In: Proceedings of 21st International Conference on Very Large Data Bases, pp. 432-44. Morgan Kaufmann, San Francisco (1995) 5.Li, H., Wang, Y., Zhang, D., Zhang, M., Chang, E.Y.: Pfp: parallel fp-growth for query recommendation. In: RecSys 2008 Proceedings of the 2008 ACM conference on Recommender systems, pp. 107-14 (2008) 6.Cheung, D., Han, J., Ng, V.T., Fu, A. W., Fu, Y., Yongjian, A.W.: A fast distributed algorithm for mining association rules. In: Proceedings of International Conference on PDIS 1996, pp. 31-2 (1996) 7.Agrawal, R., Srikant, R.: Fast discovery of association rules. In: Proceedings of the 20th International Conference on VLDB, pp. 307-28. Santiago, Chile (1994) 8.Han, J., Pei, H., Yin, Y.: Mining frequent patterns without candidate generation. In: Proceedings Conference on the Management of Data, pp. 1-2. Dallas, TX (2000) 9.Zaki, M.J.: Scalable algorithms for association mining. IEEE Trans. Knowl. Data Eng. 12(3), 372-90 (2000)MathSciNet View Article 10.The 5th Annual Rexer Analytics Data Miner Survey. http://?www.?rexeranalytics.?com/?Data-Miner-Survey-Results-2011.?html 11.Encyclopedia Britannica. http://?global.?britannica.?com/?EBchecked/?topic/-056150/?data-mining 12.Barbalace, D., Lucchese, C., Mastroianni, C., Orlando, S., Talia, D.: Mining@HOME: public resource computing for distributed data mining. Concurrency Comput. Pract. Experience 22(5), 658-82 (2010) 13.Frequent Itemset Mining Dataset Repository. http://?fimi.?ua.?ac.?be 14.Saad, M.K., Abed, R.M.: Distributed data mining on grid environment. Am. Acad. Sch. Res. J. Spec. Iss. 4(5), 240-43 (2012) 15. Talia, D., Trunfio, P., Verta, O.: Weka4WS: a WSRF-enabled weka toolkit for distributed data mining on grids. In: Jorge, A.M., Torgo, L., Brazdil, P.B., Camacho, R., Gama, J. (eds.) PKDD 2005. LNCS (LNAI), vol. 3721, pp. 309-20. Springer, Heidelberg (2005) View Article
作者单位:Evgeny Ivashko (14) Alexander Golovin (14)
14. Institute of Applied Mathematical Research, Karelian Research Centre of Russian Academy of Sciences, Petrozavodsk, Russia
丛书名:Parallel Computing Technologies
ISBN:978-3-319-21909-7
刊物类别:Computer Science
刊物主题:Artificial Intelligence and Robotics Computer Communication Networks Software Engineering Data Encryption Database Management Computation by Abstract Devices Algorithm Analysis and Problem Complexity
出版者:Springer Berlin / Heidelberg
ISSN:1611-3349
文摘
The paper describes an approach to association rules mining from big data sets using BOINC–based Enterprise Desktop Grid. An algorithm of data analysis and a native BOINC–based application are developed. Several experiments with the aim of validation and performance evaluation of the algorithm implementation are performed. The results of the experiments show that the approach is promising; it could be used by small and medium businesses, scientific groups and organizations.