在线社会网络挖掘及个性化推荐研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
随着网络及信息技术的飞速发展,在线购物、社交网络等在线社会网络已成为人们生活中必不可少的一部分。然而网络中的信息呈现出爆炸性增长,信息的数量大大超出了人们的处理能力,“网络信息过载”问题日趋严重。为了满足用户越来越高的信息服务要求,可以向用户推荐感兴趣项目,个性化推荐技术应运而生,并引起了国内外学者和用户的广泛关注。利用个性化推荐技术解决在线社会网络下的信息过载问题,可以有效的提高信息利用率及用户浏览体验,具有重要的现实意义和广阔的应用前景。
     目前,针对在线社会网络下的个性化推荐问题,国内外研究学者相继提出了一系列的解决方法,为后续的进一步研究奠定了基础。但是现有研究仍存在着以下不足:
     (1)现有推荐算法普遍只注重用户或资源间的相似度,而忽略了用户兴趣的动态变化,难以快速捕捉到用户兴趣。
     (2)由于在线社会网络的动态性、不可预知性,传统的网络社团挖掘算法具有低效性且难以反映在线社会网络的动态变化情况。
     (3)随着推荐系统中的用户和资源数量的不断增加,现有协同过滤推荐算法普遍存在数据稀疏性问题,难以有针对性的选择用户近邻进而给出有效的个性化预测。
     本文以在线社会网络环境为研究背景,针对当前研究存在的以上问题展开研究,本文的主要工作与创新点如下:
     1.提一种基于记忆效应的协同过滤推荐算法,与传统推荐算法相比有效的提高了推荐精度,解决了用户兴趣变化下的个性化推荐问题。
     该模型以认知心理学对人脑记忆-遗忘规律的研究成果为基础,充分分析用户心理,挖掘用户兴趣偏好,将用户的访问行为抽象为人脑的记忆、遗忘过程,使推荐系统中的原始数据随着系统的运行逐渐进行演化,形成不平衡分布,从而在需要时快速抽取出重要数据,分析在记忆、遗忘过程中不断变化的用户兴趣。算法实验表明,引入记忆效应的协同过滤推荐算法,可以更好的反映用户兴趣变化,提高了推荐精度,为现有的推荐系统的设计提供了一种新思路。
     2.提出一种基于文化的多目标网络社团划分算法,有效克服了动态在线社会网络的不可预知性及传统网络社团划分算法的低效性,解决了动态在线社会网络下的目标团体挖掘问题。
     该模型对在线社会网络的社团进行着重分析,提出基于文化的多目标社团发现算法,有效的解决了细粒度社团发现问题。该方法不需了解整个网络的拓扑结构便可动态的获得社团的划分数目且可直接得到社团的概貌结构。仿真实验表明,算法在收敛性能及解集分布性方面较传统算法均有改善,并应用于动态网络聚类,取得了良好的聚类效果。
     3.提出一种基于在线社会网络社团发现的协同过滤推荐算法,克服了传统协同过滤推荐算法推荐生成速度慢及数据稀疏性问题,有效的解决了在线社会网络下的群体推荐问题。
     针对大数据量环境下传统协同过滤算法的低效性、非实时性及数据稀疏性等问题,将离线网络社团发现及在线推荐相结合,提出基于在线社会网络社团发现的协同过滤推荐算法,使得个性化推荐在同一社团内根据临近用户的喜好相似度进行一致性逼近。
As the rapid development of the network and information technology, the online socialnetwork such as online shopping and social network has become an indispensable part of people'slife. But the information of the network shows explosive growth. The amount of information isbeyond people's handling ability. The problem of network information overload has become moreand more serious. In order to satisfy people’s more and more high information servicerequirements, the interesting projects can be recommended to users. The recommendationtechnology of assisting users’decision arises at the historic moment and calls the domestic andforeign scholars and users' wide concern. To solve the problem of information overload bypersonalized recommendation technology can effectively improve the utilization rate ofinformation and users’browse experience, which has important practical significance and broadapplication prospects.
     At present, on account of Personalized Recommendation issues based on online socialnetwork, the domestic and foreign research scholars have proposed a series of the solution, whichlay a foundation for the further research. However, the existing research still has much insufficientas follows:
     (1) The existing recommendation algorithms only pay attention to the similarity betweenusers and resources, however, they ignore the dynamic changes of users’interests, so it is difficultto quickly capture the user interest
     (2)As the online social network is dynamic and unpredictability, traditional networkcommunity mining algorithms are inefficient, and it is difficult to reflect the dynamic changes ofthe online social network.
     (3) With the uninterrupted increase of users and resource, it is common that existingcollaborative filtering algorithms have the problem of data sparseness. We can't choice users’nearneighbors’nichetargeting and put forward effective personalized forecast.
     In this paper, the online social network environment is the research background. The paperstudies based on the above-mentioned existing problems. The main work and innovation pointsare as follows:
     1. In view of the changes of users’interest, it puts forward a collaborative filtering algorithmbased on memory. Compared with the traditional collaborative filtering algorithms, it effectivelyimproves the precision of the recommendation, and it provides a new idea for existing design ofthe recommend system.
     The algorithm, based on the research achievement of cognitive psychology to the brainmemory-forget rule, fully analyzes users’psychological, excavates users’interest preference,abstracts the user access behavior to the human brain's memory-forgotten process, makes theoriginal data of recommendation system evolve with the operation of the system and formdisequilibrium distribution, then extracts important data quickly when needed, analyzes thechanging user's interests in the process of memory and forgotten. Experimental results show thatcollaborative filter recommendation algorithm with memory effects can reflect the change ofusers’interests commendably. The algorithm improves the precision of the recommendation and provides a new idea for existing design of the recommendation system
     2. In view of unpredictability of online social network and inefficiency of traditional networkcommunity division, the paper gives the optimization problem description of the networkcommunity division and puts forward a multi-objective network community partition algorithmbased on culture, which effectively solve the problem of fine grained community finding.
     The paper analyzes the online social network community. It Introduces the reality and theorysignificance that the online social network community have for personalized recommendation, andthen puts forward a multi-objective community finding algorithm based on culture, whicheffectively solved the fine grain community finding problem. With this method we need not toknow the topology of the whole network, we can get the division of the club number dynamicallyand can get the community’s management structure directly. The simulation results show that thealgorithm is better in the aspects of convergence and solution set distribution than traditionalalgorithms. It is applied to dynamic network clustering and achieves good clustering effect.
     3. Aimed at the traditional collaborative filtering recommendation algorithms’problem ofslow generating speed and data sparseness, this paper puts forward a collaborative filterrecommendation algorithm based on online social network community finding. The algorithmsolves the speed problem of generating recommendation to a certain degree, and it improves thequality of the recommendation.
     In view of the problems of low efficiency、non-timeliness and data sparse in traditionalcollaborative filtering algorithms under the environment of large amount of data, It combinesoff-line network community finding and online recommendation, and proposes a collaborativefiltering recommendation algorithm based on online network community finding, which makespersonalized recommendation in the same community do consistency approximation according tothe similarity degree of near users’interests.
引文
[1] Xu HL,Wu X,Li XD,Yah BP.Comparison study of Internet recommendation system. Journalof Software,2009,20(2):350-362.
    [2] BLOGGER DON.web2.0概念诠释.
    [3] Perugini, S., & Ramakrishnan, N. (2003). Personalizing Interactions with Information Systems.Advances in Computers, 57, 323-382.
    [4] Powell M J D. Approximation Theory and Methods. England: Cambridge UniversityPress,1981.
    [5] Rich E. User Modeling via Stereotypes. Cognitive Science, 1979, 3(4):329~354.
    [6] Armstrong J S. Principles of Forecasting - A Handbook for Researchers and Practitioners. New
    [7] Meadow C T, Bert R. Boyce B R, Kraft D H, et al. Text Information Retrieval Systems (3rded.).Orlando: Academic Press, Inc., 2007.York: Kluwer Academic, 2001.
    [8] Murthi B P S, Sarkar S. The Role of the Management Sciences in Research on Personalization.Management Science, 2003, 49(10):1344~1362.
    [9] Schafer J B, Konstan J A, Riedl J. Recommender Systems in E-Commerce. In: Proceedings ofthe 1st ACM Conference on Electronic Commerce, Denver, United States, ACM Press,1999,158-166.
    [10] Schafer J B, Konstan J A, Riedl J. E-Commerce Recommendation Applications. Data Miningand Knowledge Discovery, 2001, 5(1): 115~153.
    [11] Herlocker J. Understanding and Improving Automated Collaborative Filtering System.
    [Ph.D.Thesis], Minneapolis, Minnesota: University of Minnesota, 2000.
    [12] Rashid A M. Mining Influence in Recommender Systems. [Ph.D. Thesis], Minneapolis,Minnesota:University of Minnesota, 2007.
    [13] Hill W, Stead L, Rosenstein M, et al. Recommending and Evaluating Choices in a VirtualCommunity of Use. In: Proceedings of the ACM SIGCHI Conference on Human Factors inComputing Systems (CHI’95), Denver, Colorado, USA ACM Press, 1995, 194~201.
    [14] Resnick P, Iakovou N, Sushak M, et al. GroupLens: An Open Architecture for CollaborativeFiltering of Netnews. In: Proceedings of the 1994 ACM Conference on Computer SupportedCooperative Work, Chapel Hill, United States, ACM Press, 1994, 175~186.
    [15] Shardanand U, Maes P. Social Information Filtering: Algorithms for Automating‘Word ofMouth’. In: Proceedings of the ACM SIGCHI Conference on Human Factors in ComputingSystems (CHI’95), Denver, Colorado, USA, ACM Press, 1995, 210~217.
    [16] Belkin N, Croft B W. Information Filtering and Information Retrieval: Two Sides of theSame Coin? Communications of the ACM,1992,35(12):29~38.
    [17] Manning C D, Raghavan P, Schütze H. Introduction to Information Retrieval.England:Cambridge University Press, 2008.
    [18] Pirkola A, Hedlund T, Keskustalo H, et al. Dictionary-Based Cross-Language InformationRetrieval: Problems, Methods, and Research Findings. Information Retrieval, 2001,4(2-3):209~230.
    [19] Manning C D, Raghavan P, Schütze H. Introduction to Information Retrieval . England:Cambridge University Press, 2008.]和信息过滤[ Manning C D, Raghavan P, Schütze H.Introduction to Information Retrieval. England: Cambridge University Press, 2008.
    [20] Goldberg D, Nichols D, Oki B M, et al. Using Collaborative Filtering to Weave anInformation Tapestry. Communications of the ACM, 1992, 35(12):61~70.
    [21] D.Boyd,N.Ellison.Social network sites:Definition,history,and scholarship. Journal ofComputer-Mediated Communication,2008,13(1):210-230.
    [22]ZHOU D,MANAVOGLU E,LI J,et a1.Probabilistic models for discovering E-Communities[C]//Proceedings of the 15th International World Wide Web Conference.2006.
    [23] Jamali M,Ester M,TrustWalker:A random walk model for combining trust-based anditem-based recommendation//Proceedings of the 15th ACM SIGKDD International Conference onKnowledge Discovery and Data Mining.Paris,France,2009:397-406.
    [24]ZHANG Y WU Z,CHEN H,et al.Mining target marketing groups from users’web of trust onepinions[C]. 2008.
    [25] D.Boyd, N.Ellison.Social network sites:Definition,history,and scholarship. Journal ofComputer-Mediated Communication,2008,13(1):210-230.
    [26] MILGRAM S. The small world problem [J].Psychology Today,1967,60-457.
    [27] Ehrlich P R,Raven P H.Butterflies and plants: a study in co-evolution[J].Evolution,1964,18(4):586-608.
    [28] Li Z Y,Tong T S.Research on ANN evolutionary design method based on populationsevolution niche genetic algorithm[J].Control and Decision,2003,18(5):607—610.
    [29] Mitchell A Potter.The design and analysis of computational model of cooperativeconvolution[D].Virginia: Geroge Mason University,1997.
    [30] Multi-objective cooperative coevolution of artificial neural networks[J].Neural Networks,2002:1259-1278.
    [31]董红斌,黄厚宽,印桂牛等.协同演化算法研究进展[J].计算机研究与发展,2008,45(3):454-463.
    [32]巩敦卫,孙晓燕.变搜索区域多种群遗传算法[J].控制理论与应用,2006,23(2):256-260.
    [33] Li Bi,YONG Zheng-zheng.A high-efficient parallel genetic algorithm based on multi-levelcompetition[J].Acta Eleetronica Sinica,2002,30(12A):2161-2162(in Chinese).
    [34] Cao X B,Li J L.Reserach on coevolutionary optimization based on ecologicalcooperation[J].Journal of software.2001,12(4):521-528.
    [35] Potter M.The Design and Analysis of a Computational Model of CooperativeCoevolution[D].Fairfax,VA,USA:George Mason University,1997.
    [36] Fan Huiyuan,Wang Shangjin,Xi Guang.Directional Evolution Operator Applied to GeneticAlgorithm[J].Journal of Xi’an Jiaotong University,1999,33(5):45-49(in Chinese)
    [37]陈国良.遗传算法及其应用[M].北京:人民邮电出版社,1996:1-433.
    [38]王文义,秦广军,王若雨.自适应的多种群并行遗传算法研究[J].计算机工程与应用,2006,(15):34-36.
    [39] Guha S,Rastogi R,Shim K.ROCK:A robust clustering algorithm for categoricalattributes[C]//Proceedings of the 15th International Conference on Data Engineering.Sydney,Australia。1999:1-1l
    [40]赵志强等.基于个体适应度梯度的定向进化算法,模式识别与人工智能[J]2010,23(1):30-34.
    [41] CD Rosin,RK Belew.Methods for competitive co-evolution,finding opponents worthbeating [C]//Proc of International Conference on Genetic Algorithms.San Diego:MorganKaufman,1995:256-328.
    [42]苗金凤,王洪国,邵增珍等.基于多级搜索区域的协同进化遗传算法[J].计算机应用研究,2010,27(9):3345-3347.
    [43] J Schafer, J Konstan, J Riedl . Recommender system in e-commerce[C]. In: Proc of ACME-Commerce. New York:ACM Press, 1999 .158-166 .
    [44] Champa Jayawardana, K Priyantha Hewagamage Masashito HIrakawa. A personalizeinformation environment for digital libraries[J]. Information Technology and Libraries, 2001,20(4):185-196 .
    [45] J Konstan, B Miller, D Maltz, et al . GroupLens: Applying collaborative filtering to Usenetnews[J]. Communications of the ACM, 1997, 40(3):77-87.
    [46] Adomavicius G, Tuzhilin A . Toward the next generation of recommender systems: A surveyof the state-of-the-art and possible extensions. IEEE Transactions on Knowledge and DataEngineering, 2005, 17(6): 734-749.
    [47] Deshpande M, Karypis G. Item-Based Top-N Recommendation Algorithms . ACM Trans onInformation Systems, 2004, 22(1): 143-177.
    [48]曾春,刑春晓,周立柱.基于内容过滤的个性化搜索算法[J] .软件学报, 2003, 14(5):999-1004.
    [49] WANG J, VRIES A P, REINDERS M J . Unified relevance models for rating perdiction incollaborative filtering[J] . ACM Transaction on Information Systems, 2008, 26(3):1-40.
    [50] CONNOR M O, HERLOCKER J. Clustering items for collaborativefiltering[A] .Proceedings of the ACM SIGIR Workshop on Recommender Systems[C]. UCBerkeley, 1999. 121-128.
    [51] DENG A L, ZUO Z Y, ZHU Y Y. Collaborative filtering recommendation algorithm based onitem clustering[J]. Mini-Micro Systems, 2004, 25(9):1665-1670.
    [52] Yu K, Schwaighofer A, Tresp V, et al .Probabilistic memory-based collaborative filtering[J].IEEE Transaction on Knowledge and Data Engineering, 2004, 16(1): 56-69.
    [53]张格伟,廖文和,刘长毅,俞烽.知识的记忆-遗忘模型及其在知识管理中的应用[J] .南京航空航天大学学报, 2008, 40(2): 265-270.
    [54] Buchegger S, Boudec J Y L .A Robust Reputation System for Mobile Ad-hocNetworks[C]//Proc of the 2nd Workshop on the Economics of Peer-to-Peer Systems .Lausanne,Switzerland: [s .n .], 2003.
    [55] Jigar P, Luke T W T. A Probabilistic Trust Model for Handling Inaccurate ReputationSource[C]//Proc of the 3rd International Conference on Trust Management. Paris, France:[s .n .],2005:193-209.
    [56]北京师范大学公共课《心理学》编写组.心理学[M] .北京:北京师范大学出版社, 1985.
    [57] Ebbinghaus H. Memory:a contribution to experimental psychology[M] .New York:ColumbiaUniversity, 1913.
    [58]邢春晓,高凤荣,战思南等.适应用户兴趣变化的协同过滤推荐算法[J].计算机研究与发展, 2007, 44 (2).
    [59]黄光球,石昌文,孙周军等.基于记忆原理的Web入侵预警系统[J].系统工程与电子技术,2006, 28 (12).
    [60]张格伟,廖文和,刘长毅等.知识的记忆-遗忘模型及其在知识管理中的应用[J].南京航空航天大学学报, 2008, 40 (2)
    [61]周涛,李华.基于用户情景的协同过滤推荐[J].计算机应用, 2010, 30 (4).
    [62]Herlocker J, Konstan J, Terveen L.Evaluating collaborative filtering recommendersystems.ACM Transactions on Information System, 2004, 22(1): 5-53.
    [63] J.M.Read and M.J.Keeling.Disease evolution on networks:The role of contact structure.TheRoyal Society Proceedings.B,Biological Sciences,270(1516):699–708,2003.
    [64] C.Tantipathananandh,T.Berger-Wolf,and D.Kempe.A framework for communityidentification in dynamic social networks.In KDD’07:Proceedings of the 13th ACM SIGKDDInternational Conference on Knowledge Discovery and Data Mining,pages717–726,2007.
    [65] B.S. Everitt, S. Landau, and M.Leese. Cluster Analysis. E. Arnold, 2001.
    [66] T.N. Bui and C. Jones. Finding good approximate vertex and edge partition is NP-hard.Information Processing Letters, 42(3):328-383,1975.
    [67] T.N. Bui and B.R. Moon. Genetic algorithm and graph partitioning. IEEE Transactions onComputers, 45(7): 841-855, 1966.
    [68] G.N.Demir,A.S.Uyar,and S.G. Multiobjective evolutionary clustering of web user sessions:acase study in web page recommendation. Soft Computing-A Fusion of Foundations,Methodologies and Applications, 2009.
    [69] J.Handl and J.Knowles. sMultiobjective clustering around medoids. In The 2005 IEEECongress on Evolutionary Computation, volume 1,pages 632–639,2005.
    [70] J.Handl and J.Knowles.Multiobjective clustering around medoids. In The 2005 IEEECongress on Evolutionary Computation,volume 1,pages 632–639,2005.
    [71] J.Handl and J.Knowles.An evolutionary approach to multiobjective clustering. IEEETransactions on Evolutionary Computation, 11(1):56–76, 2007.
    [72] T.Y. Berger-Wolf and J. Saia. A framework for analysis of dynamic social networks. InKDD’06: Proceedings of the 12thACM SIGKDD International Conference on KnowledgeDiscovery and Data Mining, pages 523-528, 2006.
    [73] C.-K. Cheng and Y.-C. Wei. An improved two-way partitioning algorithm with stableperformance [VLSI]. IEEE Transactions on Computer-Aided Design of Integrated Circuits andSystems, 10(12):1502-1511, 1991.
    [74] J.Shi and J.Malik.Normalized cuts and image segmentation.In CVPR’97:Proceedings of the1997 Conference on Computer Vision and Pattern Recognition,page 731,1997.
    [75] C.H.Q.Ding,X.He,H.Zha,M.Gu,and H.D. Simon.A min-max cut algorithm for graphpartitioning and data clustering.In ICDM’01: Proceedings of the 2001 IEEE InternationalConference on Data Mining,pages 107–114,2001.
    [76] M.Lipczak and E.Milios.Agglomerative genetic algorithm for clustering in social networks.InGECCO’09:Proceedings of the 11th Annual Conference on Genetic and EvolutionaryComputation,pages 1243–1250,2009.
    [77] G.N.Demir,A.S.Uyar,and S.G. Graph-based sequence clustering through multiobjectiveevolutionary algorithms for web recommender systems.In GECCO’07:Proceedings of the 9thAnnual Conference on Genetic and Evolutionary Computation,pages 1943–1950,2007.
    [78] G.N.Demir,A.S.Uyar,and S.G. Multiobjective evolutionary clustering of web user sessions:acase study in web page recommendation. Soft Computing-A Fusion of Foundations,Methodologies and Applications, 2009.
    [79] P.Rousseeuw.Silhouettes:a graphical aid to the interpretation and validation of clusteranalysis.Journal of Computational and Applied Mathematics,20(1):53–65,1987.
    [80]张景成,藏光明.基于指标的多目标进化算法研究[J].计算机工程,2009,35(23):187-189.
    [81]李密青,郑金华,罗彪.一种基于最小生成树的多目标进化算法[J].计算机研究与发展,2009,46(5):803-813.
    [82]汪文彬,钟声.基于改进拥挤距离的多目标进化算法[J].计算机工程,2009,35(9):211-213.
    [83] Schaffer JD. Multiple objective optimization with vector evaluated genetic algorithms 1995
    [84] Fonseca CM and Fleming PJ. Proceedings of the fifth international conference on geneticalgorithms. San mateo,Califonia,1993:416-423.
    [85] Deb K, Pratap A. Agrawal S. A fast and elitist multi objective genetic algorithm: NSGA-Ⅱ[J].IEEE Transactions On Evolutionary Computation, 2002, 6(2):182-197.
    [86] Zitzler E, Laumanns M, Thiele L. SPEA2: Improving the Strength Pareto EvolutionaryAlgorithm, TIK-Report[R]. Zurich: Swiss Federal Institute of Technology(ETH) Zurich, 2001.
    [87]郭一楠,王辉.文化算法研究综述[J].计算机工程与应用,2009,45(9):41-46.
    [88] Corne D W, Knowles J D, Oates M J. The pareto envelope-based selection algorithm formultiobjective optimization[C]// Schoenauer M, Deb K, Rudolph G, et al. Proceedings of theParallel Problem Solving from Nature VI Conference.Springer,2000:839-848.
    [89] CORNE D W,KNOWLES J D,OATES M J.The pareto envelope-based selectionalgorithm for multi-objective optimization[C]/ /Proc of the 6th International Conference onParallel Problem Solving from Nature. Berlin: Springer, 2000: 839-848.
    [90]杨善学,王宇平.基于Pareto最优和限制精英的多目标进化算法[J].计算机工程与应用,2007,43( 2) : 108- 110.
    [91] Yi Hong, Sam Kwong, Hui Xiong and Qingsheng Ren. Genetic-guided semi-supervisedclustering algorithm with instance-level constraints. In GECCO’08: Procedings of the 10thannualconference on Genetic and evolutionary computation, pages 1381-1388, New York, NY, USA,2008. ACM.
    [92] Y.Lee,R.McKay,X.Nguyen,and D.Kim. The effect of vesicular selection in dynamicenvironments. In Proceedings of the 11th Annual conference on Genetic and evolutionarycomputation,pages 1867–1868.ACM,2009.
    [93] C.M.Fonseca and P.J.Fleming.Genetic algorithms for multiobjective optimization:Formulation, discussion and generalization.In Proceedings of the Fifth International Conferenceon Genetic Algorithms,pages 416–423,1993.
    [94] Zachary W.W. An information flow model for conflict and fission in small goups[J],J.Anthropol.Res, 1977, 33, 452-473.
    [95] Hofmann T. Latent semantic models for collaborative filtering[J]. ACM Trans. InformationSystems, 2004, 22:89-115.
    [96] Sarwar B, Karypis G, Konstan J, et al. Item-based collaborative filtering recommendationalgorithm[C] // Proc of the l0th International World Wide Web Conference. New York : ACMPress, 2001:285-295.
    [97] Leung C W, Chan S C, Chung F. A collaborative filtering framework based on fuzzyassociation rules and multiple-level similarity [J]. Knowledge Information Systems. 2006, 9(4):492-511.
    [98]刘继.基于网络社团分析的协作推荐算法[D].大连理工大学2011.
    [99]苗金凤.协同进化遗传算法在多目标优化中的应用研究[D].山东师范大学2011.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700