Top-K structural diversity search in large networks
详细信息    查看全文
  • 作者:Xin Huang ; Hong Cheng ; Rong-Hua Li ; Lu Qin ; Jeffrey Xu Yu
  • 关键词:Structural diversity ; Disjoint ; set forest ; $$\hbox {A}^*$$ A ?search ; Dynamic graph
  • 刊名:The VLDB Journal
  • 出版年:2015
  • 出版时间:June 2015
  • 年:2015
  • 卷:24
  • 期:3
  • 页码:319-343
  • 全文大小:2,069 KB
  • 参考文献:1.Agrawal, R., Gollapudi, S., Halverson, A., Ieong, S.: Diversifying search results. In: WSDM, pp. 5-4 (2009)
    2.Angel, A., Koudas, N.: Efficient diversity-aware search. In: SIGMOD, pp. 781-92 (2011)
    3.Backstrom, L., Huttenlocher, D.P., Kleinberg, J.M., Lan, X.: Group formation in large social networks: membership, growth, and evolution. In: KDD, pp. 44-4 (2006)
    4.Batagelj, V., Zaversnik, M.: An o (m) algorithm for cores decomposition of networks (2003). arXiv preprint cs/-310049
    5.Chang, K., Hwang, S.: Minimal probing: supporting expensive predicates for top-k queries. In: SIGMOD, pp. 346-57 (2002)
    6.Cheng, J., Ke, Y., Chu, S., ?zsu, M.T.: Efficient core decomposition in massive networks. In: ICDE, pp. 51-2 (2011)
    7.Chiba, N., Nishizeki, T.: Arboricity and subgraph listing algorithms. SIAM J. Comput. 14(1), 210-23 (1985)View Article MATH MathSciNet
    8.Chu, S., Cheng, J.: Triangle listing in massive networks and its applications. In: KDD, pp. 672-80 (2011)
    9.Cormen, T.H., Leiserson, C.E., Rivest, R.L., Stein, C.: Introduction to Algorithms. MIT Press, Cambridge (2009)MATH
    10.Dodds, P.S., Watts, D.J.: Universal behavior in a generalized model of contagion. Phys. Rev. Lett. 92, 218701 (2004)View Article
    11.Dorogovtsev, S.N., Mendes, J.F.F., Samukhin, A.N.: Structure of growing networks with preferential linking. Phys. Rev. Lett. 85(21), 4633 (2000)View Article
    12.Fagin, R.: Combining fuzzy information from multiple systems. J. Comput. Syst. Sci. 58(1), 83-9 (1999)View Article MATH MathSciNet
    13.Fagin, R., Lotem, A., Naor, M.: Optimal aggregation algorithms for middleware. In: PODS, pp. 102-13 (2001)
    14.Huang, X., Cheng, H., Li, R.-H., Qin, L., Yu, J.X.: Top-k structural diversity search in large networks. PVLDB 6(13), 1618-629 (2013)
    15.Ilyas, I., Beskales, G., Soliman, M.: A survey of top-k query processing techniques in relational database systems. ACM Comput. Surv. (CSUR) 40(4), 11 (2008)View Article
    16.Itai, A., Rodeh, M.: Finding a minimum circuit in a graph. SIAM J. Comput. 7(4), 413-23 (1978)View Article MATH MathSciNet
    17.Kwak, H., Lee, C., Park, H., Moon, S.B.: What is twitter, a social network or a news media? In: WWW, pp. 591-00 (2010)
    18.Latapy, M.: Main-memory triangle computations for very large (sparse (power-law)) graphs. Theor. Comput. Sci. 407(1-), 458-73 (2008)View Article MATH MathSciNet
    19.Li, R.-H., Yu, J.X.: Scalable diversified ranking on large graphs. In: ICDM, pp. 1152-157 (2011)
    20.Luo, Y., Lin, X., Wang, W., Zhou, X.: Spark: top-k keyword query in relational databases. In: SIGMOD, pp. 115-26 (2007)
    21.Qin, L., Yu, J.X., Chang, L.: Diversifying top-k results. PVLDB 5(11), 1124-135 (2012)
    22.Romero, D.M., Meeder, B., Kleinberg, J.M.: Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter. In: WWW, pp. 695-04 (2011)
    23.Schank, T.: Algorithmic aspects of triangle-based network analysis. Ph.D. Dissertation, University Karlsruhe (2007)
    24.Schank, T., Wagner, D.: Finding, counting and listing all triangles in large graphs, an experimental study. In: WEA, pp. 606-09 (2005)
    25.Ugander, J., Backstrom, L., Marlow, C., Kleinberg, J.: Structural diversity in social contagion. Proc. Natl. Acad. Sci. 109(16), 5962-966 (2012)View Article
    26.Watts, D.J., Dodds, P.S.: Influentials, networks, and public opinion formation. J. Consum. Res. 34, 441-58 (2007)View Article
    27.Xiao, C., Wang, W., Lin, X., Shang, H.: Top-k set similarity joins. In: ICDE, pp. 916-27 (2009)
    28.Zhang, Y., Callan, J., Minka, T.: Novelty and redundancy detection in adaptive filtering. In: SIGIR, pp. 81-8 (2002)
    29.Zhu, X., Guo, J., Cheng, X., Du, P., Shen, H.: A unified framework for recommending diverse and relevant queries. In: WWW, pp. 37-6 (2011)
  • 作者单位:Xin Huang (1)
    Hong Cheng (1)
    Rong-Hua Li (2)
    Lu Qin (3)
    Jeffrey Xu Yu (1)

    1. Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, New Territories, Hong Kong
    2. Guangdong Province Key Laboratory of Popular High Performance Computers, Shenzhen University, Shenzhen, China
    3. Quantum Computation and Intelligent Systems, Department of Engineering and Information Technology, University of Technology, Sydney, Australia
  • 刊物类别:Computer Science
  • 刊物主题:Database Management
  • 出版者:Springer Berlin / Heidelberg
  • ISSN:0949-877X
文摘
Social contagion depicts a process of information (e.g., fads, opinions, news) diffusion in the online social networks. A recent study reports that in a social contagion process, the probability of contagion is tightly controlled by the number of connected components in an individual’s neighborhood. Such a number is termed structural diversity of an individual, and it is shown to be a key predictor in the social contagion process. Based on this, a fundamental issue in a social network is to find top-\(k\) users with the highest structural diversities. In this paper, we, for the first time, study the top-\(k\) structural diversity search problem in a large network. Specifically, we study two types of structural diversity measures, namely, component-based structural diversity measure and core-based structural diversity measure. For component-based structural diversity, we develop an effective upper bound of structural diversity for pruning the search space. The upper bound can be incrementally refined in the search process. Based on such upper bound, we propose an efficient framework for top-\(k\) structural diversity search. To further speed up the structural diversity evaluation in the search process, several carefully devised search strategies are proposed. We also design efficient techniques to handle frequent updates in dynamic networks and maintain the top-\(k\) results. We further show how the techniques proposed in component-based structural diversity measure can be extended to handle the core-based structural diversity measure. Extensive experimental studies are conducted in real-world large networks and synthetic graphs, and the results demonstrate the efficiency and effectiveness of the proposed methods.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700