基于人工智能技术的大数据分析方法研究进展
详细信息    查看全文 | 推荐本文 |
  • 英文篇名:Progress of big data analytics methods based on artificial intelligence technology
  • 作者:王万良 ; 张兆娟 ; 高楠 ; 赵燕伟
  • 英文作者:WANG Wanliang;ZHANG Zhaojuan;GAO Nan;ZHAO Yanwei;College of Computer Science and Technology,Zhejiang University of Technology;Key laboratory of Special Purpose Equipment and Advanced Manufacturing Technology,Ministry of Education,Zhejiang University of Technology;
  • 关键词:大数据 ; 人工智能 ; 机器学习 ; 深度学习 ; 计算智能
  • 英文关键词:big data;;artificial intelligence;;machine learning;;deep learning;;computational intelligence
  • 中文刊名:JSJJ
  • 英文刊名:Computer Integrated Manufacturing Systems
  • 机构:浙江工业大学计算机科学与技术学院;浙江工业大学特种装备制造与先进加工技术教育部重点实验室;
  • 出版日期:2018-08-20 16:18
  • 出版单位:计算机集成制造系统
  • 年:2019
  • 期:v.25;No.251
  • 基金:国家自然科学基金资助项目(61873240,61572438,61702456)~~
  • 语种:中文;
  • 页:JSJJ201903002
  • 页数:19
  • CN:03
  • ISSN:11-5946/TP
  • 分类号:5-23
摘要
人工智能、大数据、云计算、物联网等信息技术为推动集成制造快速发展提供了关键技术手段。近年来,采用人工智能技术进行大数据分析取得了突破性进展。系统总结了基于人工智能技术的大数据分析方法的最新研究进展。从大数据的聚类、关联分析、分类和预测4个主要的数据挖掘任务出发,分析了大数据环境下机器学习的研究现状;针对深度学习这一热点,总结了基于MapReduce、Spark的分布式深度学习实现,以及面向大数据分析的深度学习算法改进相关研究;从群智能、进化算法两方面梳理了基于计算智能的大数据分析相关研究;针对大数据平台,特别对大数据分析和深度学习集成框架进行了归纳,介绍了大数据机器学习系统和算法库;分析了大数据分析中人工智能技术面临的主要挑战,并提出了进一步的研究方向。
        Artificial intelligence,big data,cloud computing,Internet of things and other information technologies promote the development of integrated manufacturing.Remarkable achievements were achieved in the methods of big data analytics with artificial intelligence technology.The latest research progress of big data analytics methods based on artificial intelligence was summarized comprehensively.A summary of research on machine learning was respectively introduced at first,including big data clustering,correlation analysis,classification and prediction.For the deep learning,a hotspot of research in machine learning,distributed deep learning models based on MapReduce/Spark and other improved deep learning algorithms for big data were discussed especially.The big data analytics based on computational intelligence were discussed from swarm intelligence and evolutionary algorithms two aspects.Furthermore,the engineering implementation of distributed computation platforms for big data were described,including the integrated frameworks for the distributive deep learning,big data machine learning systems and algorithms library.The challenges and the possible research directions of artificial intelligence technologies for big data analytics were put forward.
引文
[1]LABRINIDIS A,JAGADISH H V.Challenges and opportunities with big data[J].Proceedings of the VLDB Endowment,2012,5(12):2032-2033.
    [2]LI Xuelong,GONG Haigang.A survey on big data systems[J].Scientia Sinica Informations,2015,45(1):1-44(in Chinese).[李学龙,龚海刚.大数据系统综述[J].中国科学:信息科学,2015,45(1):1-44.]
    [3]WANG Wanliang.Artificial intelligence:principles and applications[M].3rd ed.Beijing:Higher Education Press,2016(in Chinese).[王万良.人工智能及其应用[M].3版.北京:高等教育出版社,2016.]
    [4]WANG L.Machine learning in big data[J].International Journal of Advances in Applied Sciences,2016,4(4):117-123.
    [5]JAPKOWICZ N,STEFANOWSKI J.A machine learning perspective on big data analysis[M]//Big Data Analysis:New Algorithms for a New Society.Berlin,Germany:Springer-Verlag,2016:1-31.
    [6]GRIMMER J.We are all social scientists now:how big data,machine learning,and causal inference work together[J].Political Science&Politics,2015,48(1):80-83.
    [7]ZHAO W,MA H,HE Q.Parallel k-means clustering based on mapreduce[C]//Proceedings of the 1st International Conference on Cloud Computing.Berlin,Germany:Springer-Verlag,2009:674-679.
    [8]GAO H,JIANG J,SHE L,et al.A new agglomerative hierarchical clustering algorithm implementation based on the MapReduce framework[J].International Journal of Digital Content Technology and its Applications,2010,4(3):95-100.
    [9]HE Y,TAN H,LUO W,et al.Mr-dbscan:an efficient parallel density-based clustering algorithm using mapreduce[C]//Proceedings of the 17th International Conference on Parallel and Distributed Systems(ICPADS).Washington,D.C.,USA:IEEE,2011:473-480.
    [10]YAN W,BRAHMAKSHATRIYA U,XUE Y,et al.pPIC:parallel power iteration clustering for big data[J].Journal of Parallel and Distributed Computing,2013,73(3):352-359.
    [11]ZHAO Y,CHEN Y,LIANG Z,et al.Big data processing with probabilistic latent semantic analysison MapReduce[C]//Proceedings of the 2014International Conference on Cyber Enabled Distributed Computing and Knowledge Discovery(CyberC).Washington,D.C.,USA:IEEE,2014:162-166.
    [12]KIM Y,SHIM K,KIM M S,et al.DBCURE-MR:an efficient density-based clustering algorithm for large data using MapReduce[J].Information Systems,2014,42:15-35.
    [13]HU C,KANG X,LUO N,et al.Parallel clustering of big data of spatio-temporal trajectory[C]//Proceedings of the11th International Conference on Natural Computation.Washington,D.C.,USA:IEEE,2015:769-774.
    [14]BU Fanyu,CHEN Zhikui,LI Peng,et al.A high-order CFSalgorithm for clustering big data[EB/OL].[2017-08-10].https://www.hindawi.com/journals/misy/2016/4356127.
    [15]LIAO Q,YANG F,ZHAO J.An improved parallel K-means clustering algorithm with MapReduce[C]//Proceedings of the15th IEEE International Conference on Communication Technology.Washington,D.C.,USA:IEEE,2013:764-768.
    [16]CUI X,ZHU P,YANG X,et al.Optimized big data kmeans clustering using MapReduce[J].The Journal of Supercomputing,2014,70(3):1249-1259.
    [17]AKTHAR N,AHAMAD M V,KHAN S.Clustering on big data using hadoop MapReduce[C]//Proceedings of the 2015International Conference on Computational Intelligence and Communication Networks.Washington,D.C.,USA:IEEE,2015:789-795.
    [18]XIA Dawen,WANG B,LI Yantao,et al.An efficient MapReduce-based parallel clustering algorithm for distributed traffic subarea division[EB/OL].[2017-08-10].https://www.hindawi.com/journals/ddns/2015/793010/abs.
    [19]NGUYEN C D,NGUYEN D T,PHAM V H.Parallel twophase K-means[C]//Proceedings of the International Conference on Computational Science and Its Applications.Berlin,Germany:Springer-Verlag,2013:224-231.
    [20]AGRAWAL R,IMIELISKI T,SWAMI A.Mining association rules between sets of items in large databases[C]//Proceedings of the ACM Sigmod Record International Conference on Management of Data.New York,N.Y.,USA:ACM,1993:207-216.
    [21]HAN J,PEI J,YIN Y.Mining frequent patterns without candidate generation[C]//Proceedings of the 2000 ACMSIGMOD International Conference on Management of Data.New York,N.Y.,USA:ACM,2000:1-12.
    [22]LI N,ZENG L,HE Q,et al.Parallel implementation of apriori algorithm based on mapreduce[C]//Proceedings of the13th ACIS International Conference on Software Engineering,Artificial Intelligence,Networking and Parallel&Distributed Computing(SNPD).Washington,D.C.,USA:IEEE,2012:236-241.
    [23]EZHILVATHANI A,RAJA K.Implementation of parallel apriori algorithm on hadoop cluster[J].International Journal of Computer Science and Mobile Computing,2013,2(4):513-516.
    [24]ORUGANTI S,DING Q,TABRIZI N.Exploring Hadoop as a platform for distributed association rule mining[C]//Proceedings of the F C 5th International Conferenceon Future Computational Technologies and Applications.New York,N.Y.,USA:IARIA,2013:62-67.
    [25]HAO X F,TAN Y S,WANG J Y.Research and implementation of parallel apriori algorithm on Hadoop platform[J].Computer and Modernization,2013,3(1):1-5.
    [26]XIAO L,HONGWU L,FANGFANG G,et al.A cloud security situational awareness model based on parallel apriori algorithm[J].Applied Mechanics&Materials,2014,556-562:6294-6297.
    [27]QIU H,GU R,YUAN C,et al.Yafim:aparallel frequent itemset mining algorithm with spark[C]//Proceedings of the2014IEEE International Parallel&Distributed Processing Symposium Workshops(IPDPSW).Washington,D.C.,USA:IEEE,2014:1664-1671.
    [28]SHE X Y,ZHANG L.Apriori parallel improved algorithm based on MapReduce distributed architecture[C]//Proceedings of the 6th IEEE International Conference on Instrumentation and Measurement,Computer,Communication and Control.Washington,D.C.,USA:IEEE,2016:517-521.
    [29]ZHOU X,HUANG Y.An improved parallel association rules algorithm based on MapReduce framework for big data[C]//Proceedings of the 11th International Conference on Fuzzy Systemsand Knowledge Discovery(FSKD).Washington,D.C.,USA:IEEE,2014:284-288.
    [30]MI Yunlong,JIANG Lin,MI Chunqiao.Rough association rules algorithm with negation under MapReduce[J].Computer Integrated Manufacturing Systems,2014,20(11):2893-2903(in Chinese).[米允龙,姜麟,米春桥.MapReduce环境下的否定粗糙关联规则算法[J].计算机集成制造系统,2014,20(11):2893-2903.]
    [31]PADILLO F,LUNA J M,VENTURA S.Mining perfectly rare itemsets on big data:an approach based on apriori-inverse and MapReduce[C]//Proceedings of the International Conference on Intelligent Systems Design and Applications.Berlin,Germany:Springer-Verlag,2016:508-518.
    [32]FENG D,ZHU L,ZHANG L.Research on improved apriori algorithm based on MapReduce and Hbase[C]//Proceedings of the Advanced Information Management,Communicates,Electronicand Automation Control Conference(IMCEC).Washington,D.C.,USA:IEEE,2016:887-891.
    [33]SINGH S,GARG R,MISHRA P.Review of apriori based algorithms on mapreduce framework[C]//Proceedings of the International Conference on Communication and Computing.Washington,D.C.,USA:IEEE,2014:593-604.
    [34]ZHOU L,ZHONG Z,CHANG J,et al.Balanced parallel fp-growth with mapreduce[C]//Proceedings of the 2010IEEE Youth Conference on Information Computing and Telecommunications(YC-ICT).Washington,D.C.,USA:IEEE,2010:243-246.
    [35]XIAO T,YUAN C,HUANG Y.PSON:A parallelized SONalgorithm with MapReduce for mining frequent sets[C]//Proceedings of the 4th International Symposium on Parallel Architectures,Algorithms and Programming.Washington,D.C.,USA:IEEE,2011:252-257.
    [36]WANG Jie,DAI Qinghao,ZENG Yu,et al.Parallel frequent pattern growth algorithm optimizationin cloud manufacturing environment[J].Computer Integrated Manufacturing Systems,2012,18(9):2124-2129(in Chinese).[王洁,戴清灏,曾宇,等.云制造环境下并行频繁模式增长算法优化[J].计算机集成制造系统,2012,18(9):2124-2129.]
    [37]WANG L,FENG L,ZHANG J,et al.An efficient algorithm of frequent itemsets mining based onmapreduce[J].Journal of Information&Computational Science,2014,11(8):2809-2816.
    [38]LIU Zhiyong.Parallelizable algorithms research of association rules mining[D].Nanjing:Southeast University,2016(in Chinese).[刘智勇.关联规则挖掘的并行化算法研究[D].南京:东南大学,2016.]
    [39]XIA D,RONG Z,ZHOU Y,et al.A novel parallel algorithm for frequent itemsets mining in massive small files datasets[J].ICIC Express Letters,Part B:Applications,2014,5(2):459-466.
    [40]BIN Z,WENSHENG X.An improved algorithm for highspeed train's maintenance data mining based on MapReduce[C]//Proceedings of the 2015International Conference on Cloud Computing and Big Data(CCBD).Washington,D.C.,USA:IEEE,2015:59-66.
    [41]BECHINI A,MARCELLONI F,SEGATORI A.A MapReduce solution for associative classification of big data[J].Information Sciences,2016,332(C):33-55.
    [42]DOU Meng,WEN Lijie,WANG Jianmin,et al.Parallel algorithm to convert big event log based on MapReduce[J].Computer Integrated Manufacturing Systems,2013,19(8):1784-1793(in Chinese).[窦蒙,闻立杰,王建民,等.基于MapReduce的海量事件日志并行转化算法[J].计算机集成制造系统,2013,19(8):1784-1793.]
    [43]SUN Z Y,TSAI M C,TSAI H P.Mining uncertain sequence data on hadoop platform[C]//Proceedings of the Pacific Asia Conference on Knowledge Discovery and Data Mining.Cham,Switzerland:Springer International Publishing,2014:204-215.
    [44]AGBEHADJI I E,FONG S,MILLHAM R.Wolf search algorithm for numeric association rule mining[C]//Proceedings of the 2016IEEE International Conference on Cloud Computing and Big Data Analysis(ICCCBDA).Washington,D.C.,USA:IEEE,2016:146-151.
    [45]NGUYEN D,NGUYEN L T,VO B,et al.Efficient mining of class association rules with the itemset constraint[J].Knowledge-Based Systems,2016,103(C):73-88.
    [46]LEE D,QUADRIFOGLIO L,TEULADA E S D,et al.Discovering relationships between factors of round-trip car sharing by using association rules approach[J].Procedia Engineering,2016,161:1282-1288.
    [47]WENG J,ZHU J Z,YAN X,et al.Investigation of work zone crash casualty patterns using association rules[J].Accident Analysis&Prevention,2016,92:43-52.
    [48]DEL R S,LPEZ V,BENITEZ J M,et al.On the use of MapReduce for imbalanced big data using Random Forest[J].Information Sciences,2014,285(C):112-137.
    [49]SINGH K,GUNTUKU S C,THAKUR A,et al.Big data analytics framework for peer-to-peer botnet detection using random forests[J].Information Sciences,2014,278(C):488-497.
    [50]LPEZ V,DEL R S,BENITEZ J M,et al.Cost sensitive linguistic fuzzy rule based classification systems under the MapReduce framework for imbalanced big data[J].Fuzzy Sets and Systems,2015,258(C):5-38.
    [51]HUANG G,HUANG G B,SONG S,et al.Trends in extreme learning machines:A review[J].Neural Networks,2015,61(C):32-48.
    [52]KAMAL S,RIPON S H,DEY N,et al.A MapReduce approach to diminish imbalance parametersfor big deoxyribonucleic acid dataset[J].Computer Methods and Programs in Biomedicine,2016,131(C):191-206.
    [53]KUMAR M,RATH N K,RATH S K.Analysis of microarray leukemia data using an efficient MapReduce-based K-nearest-neighbor classifier[J].Journal of Biomedical Informatics,2016,60(C):395-409.
    [54]FERNANDEZ-DELGADO M,CERNADAS E,BARRO S,et al.Do we need hundreds of classifiers to solve real world classification problems[J].Journal of Machine Learning Research,2014,15(1):3133-3181.
    [55]HAFEZ M M,SHEHAB M E,EL FAKHARANY E,et al.Effective selection of machine learning algorithms for big data analytics using apache spark[C]//Proceedings of the International Conference on Advanced Intelligent Systems and Informatics.Berlin,Germany:Springer-Verlag,2016:692-704.
    [56]RUTA D.Automated trading with machine learning on big data[C]//Proceedings of the 2014IEEE International Congress on Big Data.Washington,D.C.,USA:IEEE,2014:824-830.
    [57]SUTHAHARAN S.Big data classification:Problems and challenges in network intrusion prediction with machine learning[J].ACM SIGMETRICS Performance Evaluation Review,2014,41(4):70-73.
    [58]RAMAKRISHNAN R,DRAL P O,RUPP M,et al.Big data meets quantum chemistry approximations:theΔ-machine learning approach[J].Journal of Chemical Theory and Computation,2015,11(5):2087-2096.
    [59]GINSBERG J,MOHEBBI M H,PATEL R S,et al.Detecting influenza epidemics using search engine query data[J].Nature,2009,457(7232):1012-1014.
    [60]BIBAULT J E,GIRAUD P,BURGUN A.Big data and machine learning in radiation oncology:state of the art and future prospects[J].Cancer Letters,2016,382(1):110-117.
    [61]ZHU X,YAO J,ZHU F,et al.Wsisa:Making survival prediction from whole slide histopathological images[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition(CVPR).Washington,D.C.,USA:IEEE,2017:7234-7242.
    [62]SIMMHAN Y,AMAN S,KUMBHARE A,et al.Cloudbased software platform for big data analytics in smart grids[J].Computing in Science&Engineering,2013,15(4):38-47.
    [63]YANG Jungang,ZHANG Jie,QIN Wei,et al.Big data analysis platform for semiconductor manufacturing[J].Computer Integrated Manufacturing Systems,2016,22(12):2900-2910(in Chinese).[杨俊刚,张洁,秦威,等.面向半导体制造的大数据分析平台[J].计算机集成制造系统,2016,22(12):2900-2910.]
    [64]ZHANG Jie,GAO Liang,QIN Wei,et al.Big-Data-driven operational analysis and decision-making methodology in intelligent workshop[J].Computer Integrated Manufacturing Systems,2016,22(5):1220-1228(in Chinese).[张洁,高亮,秦威,等.大数据驱动的智能车间运行分析与决策方法体系[J].计算机集成制造系统,2016,22(5):1220-1228.]
    [65]LYU Youlong,ZHANG Jie.Big-data-based technical framework of smart factory[J].Computer Integrated Manufacturing Systems,2016,22(11):2691-2697(in Chinese).[吕佑龙,张洁.基于大数据的智慧工厂技术框架[J].计算机集成制造系统,2016,22(11):2691-2697.]
    [66]YAO Xifan,ZHOU Jiajun,ZHANG Cunji,et al.Procative manufacturing-a big-data driven emerging manufacturing paradigm[J].Computer Integrated Manufacturing Systems,2017,23(1):172-185(in Chinese).[姚锡凡,周佳军,张存吉,等.主动制造---大数据驱动的新兴制造范式[J].计算机集成制造系统,2017,23(1):172-185.]
    [67]ZHU Xuechu,QIAO Fei.Cycle time prediction method of wafer fabrication system based on industrial big data[J].Computer Integrated Manufacturing Systems,2017,23(10):2172-2179(in Chinese).[朱雪初,乔非.基于工业大数据的晶圆制造系统加工周期预测方法[J].计算机集成制造系统,2017,23(10):2172-2179.]
    [68]WAMBA S F,GUNASEKARAN A,AKTER S,et al.Big data analytics and firm performance:effects of dynamic capabilities[J].Journal of Business Research,2017,70(C):356-365.
    [69]HINTON G E,OSINDERO S,TEH Y W.A fast learning algorithm for deep belief nets[J].Neural Computation,2006,18(7):1527-1554.
    [70]ZHANG C Y,CHEN C P,CHEN D,et al.MapReduce based distributed learning algorithm for restricted boltzmann machine[J].Neurocomputing,2016,198(C):4-11.
    [71]ZHANG K,CHEN X W.Large-scale deep belief nets with mapreduce[J].IEEE access,2014,2:395-403.
    [72]WANG Dewen,SUN Zhiwei.Big data analysis and parallel load forecasting of electric power userside[J].Proceedings of the CSEE,2015,35(3):527-537(in Chinese).[王德文,孙志伟.电力用户侧大数据分析与并行负荷预测[J].中国电机工程学报,2015,35(3):527-537.]
    [73]ZHANG H J,XIAO N F.Parallel implementation of multilayered neural networks based on MapReduce on cloud computing clusters[J].Soft Computing,2016,20(4):1471-1483.
    [74]CAO J,CUI H,SHI H,et al.Big Data:aparallel particle swarm optimization-back-propagation neural network algorithm based on MapReduce[J].PLos One,2016,11(6):e0157551.
    [75]MAO Guojun,HU Dianjun,XIE Songyan.Models and algorithms for classifying big data based on distribued data streams[J].Chinese Journal of Computers,2017,40(1):161-175(in Chinese).[毛国君,胡殿军,谢松燕.基于分布式数据流的大数据分类模型和算法[J].计算机学报,2017,40(1):161-175.]
    [76]HE Jieyue,MA Bei.Based on real-valued conditional restricted boltzman machine and social network for collaborative filtering[J].Chinese Journal of Computers,2016,39(1):183-195(in Chinese).[何洁月,马贝.利用社交关系的实值条件受限玻尔兹曼机协同过滤推荐算法[J].计算机学报,2016,39(1):183-195.]
    [77]LI H,SU P,CHI Z,et al.Image retrieval and classification on deep convolutional SparkNet[C]//Proceedings of the 2016IEEE International Conference on Signal Processing,Communications and Computing(ICSPCC).Washington,D.C.,USA:IEEE,2016:1-6.
    [78]YANG Jiaju.MapReduce-based and deep learning for load analysis and forecasting[D].Nanjing:Southeast University,2016(in Chinese).[杨佳驹.基于MapReduce和深度学习的负荷分析与预测[D].南京:东南大学,2016.]
    [79]ALSHEIKH M A,NIYATO D,LIN S,et al.Mobile big data analytics using deep learning and apache spark[J].IEEEnetwork,2016,30(3):22-29.
    [80]OUYANG X,ZHANG C,ZHOU P,et al.DeepSpace:an online deep learning framework for mobile big data to understand human mobility patterns[EB/OL].(2017-07-09)[2018-03-05].https://arxiv.org/pdf/1610.07009.pdf.
    [81]YAN Y,CHEN M,SADIQ S,et al.Efficient imbalanced multimedia concept retrieval by deep learning on spark clusters[J].International Journal of Multimedia Data Engineering and Management,2017,8(1):1-20.
    [82]YANG Y,ZHAN D C,FAN Y,et al.Deep learning for fixed model reuse[C]//Proceedings of the AAAI.Palo Alto,Cal.,USA:AAAI,2017:2831-2837.
    [83]JIE Z,WEI Y,JIN X,et al.Deep self-taught learning for weakly supervised object localization[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition(CVPR).Washington,D.C.,USA:IEEE,2017:4294-4302.
    [84]IOFFE S,SZEGEDY C.Batch normalization:Accelerating deep network training by reducing internal covariate shift[EB/OL].(2017-07-09)[2018-03-05].https://arxiv.org/pdf/1502.03167.pdf.
    [85]HE K,ZHANG X,REN S,et al.Deep residual learning for image recognition[C]//Proceedings of the 29th IEEE Conference on Computer Vision and Pattern Recognition(CVPR).Washington,D.C.,USA:IEEE,2016:770-778.
    [86]KRANJC J,ORAR,PODPEAN V,et al.ClowdFlows:online workflows for distributed big data mining[J].Future Generation Computer Systems,2017,68(C):38-58.
    [87]MNIH V,KAVUKCUOGLU K,SILVER D,et al.Human level control through deep reinforcement learning[J].Nature,2015,518(7540):529-533.
    [88]XUE H,LIU Y,CAI D,et al.Tracking people in RGBDvideos using deep learning and motion clues[J].Neurocomputing,2016,204(C):70-76.
    [89]ZHAO Peng,ZHOU Zhihua.Distribution-Free One-Pass Learning[EB/OL].(2017-07-09)[2018-03-05].http://www.doc88.com/p-3495610983591.html.
    [90]KE W,CHEN J,JIAO J,et al.SRN:Side-output residual network for object symmetry detection in the wild[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Washington,D.C.,USA:IEEE,2017:302-310.
    [91]ZHOU Y,YE Q,QIU Q,et al.Oriented response networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Washington,D.C.,USA:IEEE,2017:4961-4970.
    [92]FINN C,ABBEEL P,LEVINE S.Model-agnostic metalearning for fast adaptation of deep networks[C]//Proceedings of the 34th International Conference on Machine Learning.New York,N.K.,USA:ACM,2017:1126-1135.
    [93]CHENG S,SHI Y,QIN Q,et al.Swarm intelligence in big data analytics[C]//Proceedings of the International Conference on Intelligent Data Engineering and Automated Learning.Berlin,Germany:Springer-Verlag,2013:417-426.
    [94]MCNABB A W,MONSON C K,SEPPI K D.Parallel-pso using mapreduce[C]//Proceedings of the 2007IEEE Congress on Evolutionary Computation.Washington,D.C.,USA:IEEE,2007:7-14.
    [95]LIANG J,WANG F,DANG C,et al.An efficient rough feature selection algorithm with a multi granulation view[J].International Journal of Approximate Reasoning,2012,53(6):912-926.
    [96]LI X,YAO X.Cooperatively coevolving particle swarms for large scale optimization[J].IEEE Transactions on Evolutionary Computation,2012,16(2):210-224.
    [97]XU Zongben,ZHANG Wei,LIU Lei,et al.The scientificprinciple and prospect for data scienceand big data the 462nd expert speech act of academic forum on Xiangshan science conference[J].Science&Technology for Development,2014,10(1):66-75(in Chinese).[徐宗本,张维,刘雷,等.“数据科学与大数据的科学原理及发展前景”-香山科学会议第462次学术讨论会专家发言摘登[J].科技促进发展,2014,10(1):66-75.]
    [98]GOVINDARAJAN K,SOMASUNDARAM T S,KUMARV S.Continuous clustering in big data learning analytics[C]//Proceedings of the 5th International Conference on Technology for Education.Washington,D.C.,USA:IEEE,2013:61-64.
    [99]GUPTA S L,GOEL S,BAGHEL A S.An approach to handle big data analytics using potential of swarm intelligence[C]//Proceedings of the 3rd International Conference on Computing for Sustainable Global Development.Washington,D.C.,USA:IEEE,2016:3640-3644.
    [100]WANG L,WANG Y,CHANG Q.Feature selection methods for big data bioinformatics:A survey from the search perspective[J].Methods,2016,111(C):21-31.
    [101]CHENG S,ZHANG Q,QIN Q.Big data analytics with swarm intelligence[J].Industrial Management&Data Systems,2016,116(4):646-666.
    [102]FONG S,WONG R,VASILAKOS A V.Accelerated PSOswarm search feature selection for data stream mining big data[J].IEEE Transactions on Services Computing,2016,9(1):33-45.
    [103]CAO B,LI W,ZHAO J,et al.Spark-based parallel cooperative co-evolution particle swarm optimization algorithm[C]//Proceedings of the 2016IEEE International Conference on Web Services.Washington,D.C.,USA:IEEE,2016:570-577.
    [104]WANG Y,LI Y,CHEN Z,et al.Cooperative particle swarm optimization using MapReduce[J].Soft Computing,2017,21(22):6593-6603.
    [105]LI Y,CHEN Z,WANG Y,et al.Quantum-behaved particle swarm optimization using mapreduce[C]//Proceedings of the Bio-Inspired Computing-Theories and Applications.Berlin,Germany:Springer-Verlag,2016:173-178.
    [106]DING W,LIN C T,CHEN S,et al.Multiagent-consensusmapreduce-based attribute reduction using co-evolutionary quantum PSO for big data applications[J].Neurocomputing,2018,272(C):136-153.
    [107]CHENG X,XIAO N.Parallel implementation of dynamic positive and negative feedback aco with iterative mapreduce model[J].Journal of Information&Computational Science,2013,10(8):2359-2370.
    [108]WU Hao,NI Zhiwei,WANG Huiying.MapReduce-based ant colony optimization[J].Computer Integrated Manufacturing Systems,2012,18(7):1503-1509(in Chinese).[吴昊,倪志伟,王会颖.基于MapReduce的蚁群算法[J].计算机集成制造系统,2012,18(7):1503-1509.]
    [109]MA Wenlong,WANG Zheng,ZHAO Yanwei.Optimizing services composition in cloud manufacturing based on improved ant colony algorithm[J].Computer Integrated Manufacturing Sysems,2016,22(1):113-121(in Chinese).[马文龙,王铮,赵燕伟.基于改进蚁群算法的制造云服务组合优化[J].计算机集成制造系统,2016,22(1):113-121.]
    [110]LIN C Y,PAI Y M,TSAI K H,et al.Paralllizing modified cuckoo search on mapreduce architecture[J].Journal of E-lectronic Science and Technology,2013,11(2):115-123.
    [111]XU X,JI Z,YUAN F,et al.A novel parallel approach of cuckoo search using MapReduce[C]//Proceedings of the2014International Conference on Computer,Communications and InformationTechnology(CCIT 2014).Amsterdam,the Netherlands:Atlantis Press,2014:114-117.
    [112]AL-MADI N,ALJARAH I,LUDWIG S A.Parallel glowworm swarm optimization clustering algorithm based on MapReducee[C]//Proceedings of the 2014IEEE Symposium on Swarm Intelligence.Washington,D.C.,USA:IEEE,2014:1-8.
    [113]ZHENG Hongsheng,YU Dongjin,ZHANG Lei.Multi-Qos cloud workflow scheduling based on firely algorithm and dynamic priorities[J].Computer Integrated Manufacturing Systems,2017,23(5):963-971(in Chinese).[郑宏升,俞东进,张蕾.基于萤火虫算法和动态优先级的多QoS云工作流调度[J].计算机集成制造系统,2017,23(5):963-971.]
    [114]LIN K C,ZHANG K Y,HUANG Y H,et al.Feature selection based on an improved cat swarm optimization algorithm for big data classification[J].The Journal of Supercomputing,2016,72(8):3210-3221.
    [115]JIN C,VECCHIOLA C,BUYYA R.MRPGA:an extension of MapReduce for parallelizing genetic algorithms[C]//Proceedings of the 4th International Conference on eScience2008.Washington,D.C.,USA:IEEE,2008:214-221.
    [116]YANG Z,TANG K,YAO X.Large scale evolutionary optimization using cooperative coevolution[J].Information Sciences,2008,178(15):2985-2999.
    [117]OMIDVAR M N,LI X,MEI Y,et al.Cooperative coevolution with differential grouping for large scale optimization[J].IEEE Transactions on Evolutionary Computation,2014,18(3):378-393.
    [118]QIAN Chao,ZHOU Zhihua.Decomposition-based paretooptimization for subset selection[J].Scientia Sinica Informations,2016,46(9):1276-1287(in Chinese).[钱超,周志华.基于分解策略的多目标演化子集选择算法[J].中国科学:信息科学,2016,46(9):1276-1287.]
    [119]GHEYAS I A,SMITH L S.Feature subset selection in large dimensionality domains[J].Pattern Recognition,2010,43(1):5-13.
    [120]BACARDIT J,LLOR X.Large-scale data mining using genetics-based machine learning[J].Wiley Inter Disciplinary Reviews:Data Mining and Knowledge Discovery,2013,3(1):37-61.
    [121]YUAN F,LIAN F,XU X,et al.Decision tree algorithm optimization research based on MapReduce[C]//Proceedings of the 6th IEEE International Conference on Software Engineering and Service Science.Washington,D.C.,USA:IEEE,2015:1010-1013.
    [122]ZHANG Ying,ZHAI Li,WANG Jing.Cloud computing federation data resource server comppsotion in big data background[J].Computer Integrated Manufacturing Systems,2016,22(12):2920-2929(in Chinese).[张影,翟丽丽,王京.大数据背景下的云联盟数据资源服务组合模型[J].计算机集成制造系统,2016,22(12):2920-2929.]
    [123]ZHU Linan,WANG Wanliang,SHEN Guojiang.Resource optimization combinaton method basedon improved differential evolution algorithm forcloud manufaturing[J].Computer Integrated Manufacturing Systems,2017,23(1):203-214(in Chinese).[朱李楠,王万良,沈国江.基于改进差分进化算法的云制造资源优化组合方法[J].计算机集成制造系统,2017,23(1):203-214.]
    [124]MORITZ P,NISHIHARA R,STOICA I,et al.Sparknet:training deep networks in spark[EB/OL].(2017-07-09)[2018-03-05].https://arxiv.org/pdf/1511.06051.pdf.
    [125]Computer Network Information Center,Chinese Academy of Sciences.Depth learning method and system for big data:China,CN106570565 A[P/OL].(2017-04-19)[2017-08-28].https://www.google.com/patents/CN106570565A?cl=en&hl=zh-CN(in Chinese).[中国科学院计算机网络信息中心.一种面向大数据的深度学习方法及系统:中国,CN106570565A[P/OL].(2017-04-19)[2017-08-28].https://www.google.com/patents/CN106570565A?cl=en&hl=zh-CN.]
    [126]KIM H,PARK J,JANG J,et al.DeepSpark:a spark-based distributed deep learning framework for commodity clusters[EB/OL].(2017-07-09)[2018-03-05].https://arxiv.org/pdf/1602.08191.pdf.
    [127]TEAM D.Deeplearning4j:Open-source distributed deep learning for the JVM.[EB/OL].[2017-08-28].https://deeplearning4j.org/.
    [128]LIU S,TANG J,WANG C,et al.Implementing a cloud platform for autonomous driving[EB/OL].(2017-07-09)[2018-03-05].https://arxiv.org/ftp/arxiv/papers/1704/1704.02696.pdf.
    [129]XING E P,HO Q,XIE P,et al.Strategies and principles of distributed machine learning on big data[J].Engineering,2016,2(2):179-195.
    [130]HUANG Yihua.Research progress on big data machine learning system[J].Big Data Research,2015,1(1):28-47(in Chinese).[黄宜华.大数据机器学习系统研究进展[J].大数据,2015,1(1):28-47.]
    [131]LOW Y,BICKSON D,GONZALEZ J,et al.Distributed GraphLab:a framework for machine learning and data mining in the cloud[J].Proceedings of the VLDB Endowment,2012,5(8):716-727.
    [132]XING E P,HO Q,DAI W,et al.Petuum:a new platform for distributed machine learning on big data[J].IEEETransactions on Big Data,2015,1(2):49-67.
    [133]OOI B C,TAN K L,WANG S,et al.SINGA:a distributed deep learning platform[C]//Proceedings of the 23rd ACMinternational conference on Multimedia.New York,N.Y.,USA:ACM,2015:685-688.
    [134]LI M,ANDERSEN D G,PARK J W,et al.Scaling distributed machine learning with the parameter server[C]//Proceedings of the OSDI.Berkeley,Cal.,USA:USENIX Association,2014:583-598.
    [135]JIANG J,YU L,JIANG J,et al.Angel:a new large-scale machine learning system[J].National Science Review,2017:5(2):216-236.
    [136]POP D,IUHASZ G,PETCU D.Distributed platforms and cloud services:enabling machine learning for big data[M]//Data Science and Big Data Computing.Cham,Switzerland:Springer International Publishing,2016:139-159.
    [137]MENG X,BRADLEY J,YAVUZ B,et al.MLlib:machine learning in apache spark[J].The Journal of Machine Learning Research,2016,17(1):1235-1241.
    [138]OWEN S,ANIL R,DUNNING T,et al.Mahout in action[M].New York,N.Y.,USA:Manning Publications,2011.
    [139]DINO K.H2Opersistence framework for column oriented distributed(NoSQL)databases[C]//Proceedings of the 3rd International Symposiu on Sustainable Development.Southampton,UK:Eprints,2012:22-28.
    [140]ZHANG X,YAO F,TIAN Y.Greedy step averaging:aparameter-free stochastic optimization method[EB/OL].(2016-11-14)[2018-03-15].http://pdfs.semanticscholar.org/256d/e07d93d8e4021b444e0c6905997fc64e9d70.pdf.
    [141]GHOTING A,KRISHNAMURTHY R,PEDNAULT E,et al.SystemML:declarative machine learning on MapReduce[C]//Proceedings of the 27th International Conference on Data Engineering(ICDE).Washington,D.C.,USA:IEEE,2011:231-242.
    [142]PEDREGOSA F,VAROQUAUX G,GRAMFORT A,et al.Scikit-learn:machine learning in python[J].Journal of Machine Learning Research,2011,12:2825-2830.
    [143]DEAN J,CORRADO G,MONGA R,et al.Large scale distributed deep networks[C]//Advances in Neural Information Processing Systems.Cambridge,Mass.,USA:MITPress,2012:1223-1231.
    [144]GONG Y J,CHEN W N,ZHAN Z H,et al.Distributed evolutionary algorithms and their models:a survey of the state-of-the-art[J].Applied Soft Computing,2015,34(C):286-300.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700