Using machine learning techniques for rising star prediction in co-author network
详细信息    查看全文
  • 作者:Ali Daud ; Muhammad Ahmad ; M. S. I. Malik ; Dunren Che
  • 关键词:Group leader ; Classification ; Prediction ; Rising star ; MEMM ; CART
  • 刊名:Scientometrics
  • 出版年:2015
  • 出版时间:February 2015
  • 年:2015
  • 卷:102
  • 期:2
  • 页码:1687-1711
  • 全文大小:1,506 KB
  • 参考文献:1. Bermejo, P., Gamez, J. A., & Puerta, J. M. (2014). Speeding up incremental wrapper feature subset selection with Naive Bayes classifier. / Knowledge-Based Systems, / 55, 140-47. CrossRef
    2. Chen, J., Huang, H., Tian, S., & Qu, Y. (2009). Feature selection for text classification with Na?ve Bayes. / Expert Systems with Applications, / 36(3), 5432-435. CrossRef
    3. Chrysos, G., Dagritzikos, P., Papaefstathiou, I., & Dollas, A. (2013). HC-CART: A parallel system implementation of data mining classification and regression tree (CART) algorithm on a multi-FPGA system. / ACM Transactions on Architecture and Code Optimization, / 9(4), 47. CrossRef
    4. Constantinou, A. C., Fenton, N. E., & Neil, M. (2012). pi-football: A Bayesian network model for forecasting Association Football match outcomes. / Knowledge-Based Systems, / 36, 322-39. CrossRef
    5. Cui, X., Afify, M., Gao, Y., & Zhou, B. (2013). Stereo hidden Markov modeling for noise robust speech recognition. / Computer Speech & Language, / 27(2), 407-19. CrossRef
    6. Cuxac, P., Lamirel, J.-C., & Bonvallot, V. (2013). Efficient supervised and semi-supervised approaches for affiliations disambiguation. / Scientometrics, / 97(1), 47-8. CrossRef
    7. Daud, A., Abbasi, R., & Muhammad, F. (2013). Finding rising stars in social networks. / Database Systems for Advanced Applications (LNCS), / 7825, 13-4.
    8. Daud, A., Li, J., Zhou, L., & Muhammad, F. (2010). Temporal expert finding through generalized time topic modeling. / Knowledge-Based Systems (KBS), / 23(6), 615-25. CrossRef
    9. Fakhari, A., & Moghadam, A. M. E. (2013). Combination of classification and regression in decision tree for multi-labeling image annotation and retrieval. / Applied Soft Computing, / 13(2), 1292-302. CrossRef
    10. Farid, D. M., Zhang, L., Rahman, C. F., Hossain, M. A., & Strachan, R. (2014). Hybrid decision tree and na?ve Bayes classifiers for multi-class classification tasks. / Expert Systems with Applications, / 41(4) Part 2, 1937-946.
    11. Gu, F., Zhang, H., & Zhu, D. (2013). Blind separation of non-stationary sources using continuous density hidden Markov models. / Digital Signal Processing, / 23(5), 1549-564. CrossRef
    12. Guns, R., & Rousseau, R. (2014). Recommending research collaborations using link prediction and random forest classifiers. / Scientometrics,. doi:10.1007/s11192-013-1228-9 .
    13. Huang, S., Yang, B., Yan, S., & Rousseau, R. (2013). Institution name disambiguation for research assessment. / Scientometrics,. doi:10.1007/s11192-013-1214-2 .
    14. Kao, L. J., Chiu, C. C., & Chiu, F. Y. (2013). A Bayesian latent variable model with classification and regression tree approach for behavior and credit scoring. / Knowledge-Based Systems, / 36, 245-52. CrossRef
    15. Li, Z., Fang, H., & Xia, L. (2014). Increasing mapping based hidden Markov model for dynamic process monitoring and diagnosis. / Expert Systems with Applications, / 41(2), 744-51. CrossRef
    16. Li, X. K., Foo, C. S., Tew, K. L., & Ng, S. K. (2009).Searching for rising stars in bibliography networks. In / Proceedings of the 14th international conference on database systems for advanced applications (pp. 288-92).
    17. Loh, W. J. (2011). Classification and regression trees. / Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, / 1(1), 14-3.
    18. López-Cruz, P. L., Larra?aga, P., DeFelipe, J., & Bielza, C. (2014). Bayesian network modeling of the consensus between experts: An application to neuron classification. /
  • 刊物主题:Information Storage and Retrieval; Library Science; Interdisciplinary Studies;
  • 出版者:Springer Netherlands
  • ISSN:1588-2861
文摘
Online bibliographic databases are powerful resources for research in data mining and social network analysis especially co-author networks. Predicting future rising stars is to find brilliant scholars/researchers in co-author networks. In this paper, we propose a solution for rising star prediction by applying machine learning techniques. For classification task, discriminative and generative modeling techniques are considered and two algorithms are chosen for each category. The author, co-authorship and venue based information are incorporated, resulting in eleven features with their mathematical formulations. Extensive experiments are performed to analyze the impact of individual feature, category wise and their combination w.r.t classification accuracy. Then, two ranking lists for top 30 scholars are presented from predicted rising stars. In addition, this concept is demonstrated for prediction of rising stars in database domain. Data from DBLP and Arnetminer databases (1996-000 for wide disciplines) are used for algorithms-experimental analysis.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700