CluCF: a clustering CF algorithm to address data sparsity problem
  • Authors: Chengyuan Yu; Linpeng Huang
  • Keywords: Web service; QoS prediction; Time-aware; Collaborative filtering algorithm
  • Journal: Service Oriented Computing and Applications
  • Publication year: 2017
  • Publication date: March 2017
  • Volume: 11
  • Issue: 1
  • Pages: 33-45
  • Journal category: Computer Science
  • Journal subjects: Computer Systems Organization and Communication Networks; Software Engineering/Programming and Operating Systems; e-Commerce/e-business; Computer Appl. in Administrative Data Processing; Management of
  • Publisher: Springer London
  • ISSN: 1863-2394
Abstract
In QoS-based Web service recommendation, predicting Quality of Service (QoS) for users greatly aids service selection and discovery. Collaborative filtering (CF) is an effective method for Web service selection and recommendation, but data sparsity is an important challenge for CF algorithms. Although model-based algorithms can address the data sparsity problem, their models are often time-consuming to build and update, so these CF algorithms are not suited to highly dynamic, large-scale environments such as Web service recommendation systems. To overcome this drawback, this paper proposes a novel approach, CluCF, which employs user clusters and service clusters to address the data sparsity problem and classifies a new user (or a new service) by location factor to lower the time complexity of updating clusters. Additionally, to improve prediction accuracy, CluCF employs a time factor: a time-aware user-service matrix $M_{u,s}(t_k, d)$ is introduced, together with time-aware similarity measurement and time-aware QoS prediction. Because the QoS performance of Web services is highly related to invocation time due to time-varying factors (e.g., service status, network condition), time-aware similarity measurement and time-aware QoS prediction are more trustworthy than their traditional counterparts. Since similarity measurement and QoS prediction are the two key steps of neighborhood-based CF, time-aware CF will be more accurate than traditional CF. Moreover, the approach systematically combines user-based and item-based methods and employs influence weights to balance the two predicted values automatically. To validate the algorithm, this paper conducts a series of large-scale experiments based on a real-world Web service QoS dataset. Experimental results show that the approach is capable of alleviating the data sparsity problem.
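
The abstract does not give the formulas CluCF uses, so the following is only a minimal sketch of the general idea of blending a user-based and an item-based neighborhood prediction with weights. Everything in it is an assumption made for illustration: the function names, the Pearson similarity, the confidence measure (a similarity-weighted average of neighbor similarities), and the parameter `lam`. The clustering, location, and time-aware steps described above are not modeled.

```python
import numpy as np


def pearson_sim(a, b):
    """Pearson correlation over the entries observed (non-NaN) in both vectors."""
    mask = ~np.isnan(a) & ~np.isnan(b)
    if mask.sum() < 2:
        return 0.0
    x, y = a[mask], b[mask]
    xc, yc = x - x.mean(), y - y.mean()
    denom = np.sqrt((xc ** 2).sum() * (yc ** 2).sum())
    return float((xc * yc).sum() / denom) if denom > 0 else 0.0


def neighbor_predict(matrix, row, col):
    """Mean-centred neighborhood prediction of matrix[row, col] from similar rows.

    Returns (prediction, confidence), or (None, 0.0) when no positively
    correlated neighbor has observed the target column.
    """
    target = matrix[row]
    num = den = sq = 0.0
    for other in range(matrix.shape[0]):
        if other == row or np.isnan(matrix[other, col]):
            continue
        sim = pearson_sim(target, matrix[other])
        if sim <= 0:
            continue
        num += sim * (matrix[other, col] - np.nanmean(matrix[other]))
        den += sim
        sq += sim * sim
    if den == 0:
        return None, 0.0
    # Assumed confidence: similarity-weighted average of neighbor similarities.
    return float(np.nanmean(target) + num / den), sq / den


def hybrid_predict(matrix, user, service, lam=0.5):
    """Blend user-based and item-based predictions (hypothetical weighting)."""
    pred_u, conf_u = neighbor_predict(matrix, user, service)
    pred_i, conf_i = neighbor_predict(matrix.T, service, user)
    if pred_u is None and pred_i is None:
        return float(np.nanmean(matrix))  # fall back to the global mean
    if pred_u is None:
        return pred_i
    if pred_i is None:
        return pred_u
    w_u = conf_u * lam / (conf_u * lam + conf_i * (1 - lam))
    return w_u * pred_u + (1 - w_u) * pred_i


# Rows are users, columns are services, NaN marks an unobserved QoS value.
qos = np.array([[1.0, 2.0, 3.0],
                [1.0, 2.0, np.nan],
                [2.0, 4.0, 6.0]])
print(hybrid_predict(qos, user=1, service=2))
```

In this sketch, `lam` close to 1 leans on the user-based estimate and `lam` close to 0 on the item-based one, while the confidence terms shift weight toward whichever side has more strongly correlated neighbors.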
