Research on Online Social Network Mining and Search Technology
  • English title: A survey on online social network mining and search
  • Authors: SHI Lei (石磊); DU Junping (杜军平); ZHOU Yipeng (周亦鹏); YE Hang (叶杭); LAI Jincai (赖金财); HE Yijiang (何奕江)
  • Affiliations: Beijing Key Laboratory of Intelligent Telecommunications Software and Multimedia, Beijing University of Posts and Telecommunications; School of Computer Science and Information Engineering, Beijing Technology and Business University
  • Keywords: social networks; data mining; search; community detection; information transmission
  • Journal code: ZNXT
  • Journal: CAAI Transactions on Intelligent Systems
  • Publication date: 2017-01-11 16:19
  • Publisher: 智能系统学报 (CAAI Transactions on Intelligent Systems)
  • Year: 2016
  • Issue: v.11; No.62
  • Funding: Key Program of the National Natural Science Foundation of China (61532006); Major International Cooperation Project of the National Natural Science Foundation of China (61320106006)
  • Language: Chinese
  • Page: ZNXT201606007
  • Number of pages: 11
  • CN: 06
  • ISSN: 23-1538/TP
  • Classification number: 71-81
Abstract
With the rapid growth of online social networks, traditional data mining and search methods are no longer fully applicable to social networks in the Web 2.0 era. Social networks are characterized by complex social relationships, large data volumes, dynamic updates, and multimodal data, all of which pose great challenges for research on data mining and search. Developing new methods for social network mining and search has therefore become a new task for both academia and industry. This paper comprehensively analyzes the basic state of social network development and its open problems, and reviews the main research on social network structure modeling, information transmission mechanisms, community detection, sentiment analysis, event detection, and search ranking techniques for social networks. Based on this existing work, it analyzes social network mining and search technologies and discusses their prospects.
