Software rejuvenation in cluster computing systems with dependency between nodes
详细信息    查看全文
  • 作者:Menghui Yang (1)
    Geyong Min (2)
    Weikang Yang (3)
    Zituo Li (3)
  • 关键词:Software aging ; Software rejuvenation ; Cluster computing systems ; Stochastic reward net ; 68N01
  • 刊名:Computing
  • 出版年:2014
  • 出版时间:June 2014
  • 年:2014
  • 卷:96
  • 期:6
  • 页码:503-526
  • 全文大小:
  • 参考文献:1. Parnas D (1994) Software aging. In: Proceedings of the 16th international conference on software engineering, pp 279鈥?87
    2. Huang Y, Kintala C, Kolettis N, Fulton N (1995) Software rejuvenation: analysis, module and applications. In: Proceedings of 25th symposium on fault tolerant, computing, pp 381鈥?90
    3. Grottke M, Li L, Vaidyanathan K, Trivedi K (2006) Analysis of software aging in a web server. IEEE Trans Reliab 55(3):411鈥?20 CrossRef
    4. Matias R, Freitas P, (2006) An emperimental study on software aging and rejuevenation in web servers. In: Proceedings of 30th annual international conference on computer software and applications, vol 1, pp 189鈥?96
    5. Grottke M, Nikora A, Trivedi K (2010) An empirical investigation of fault types in space mission system software. In: Proceedings of IEEE conference on dependable systems and networks, pp 447鈥?56
    6. Moorsel A, Wolter K (2006) Analysis of restart mechanisms in software systems. IEEE Trans Softw Eng 32(8):547鈥?58 CrossRef
    7. Alonso J, Torres J, Berral J, Gavalda R (2010) Adaptive on-line software aging prediction based on machine learning. In: Proceedings of international conference on dependable systems and networks, pp 507鈥?16
    8. Dugan J, Trivedi K (1989) Coverage modeling for dependability analysis of fault-tolerant systems. IEEE Trans Comput 38(6):775鈥?87 CrossRef
    9. Gokhale S, Trivedi K (1998) Dependency characterization in path-based approaches to architecture-based software reliability prediction. In: Proceedings of international conference on application-specific software engineering technology, pp 86鈥?9
    10. Popstojanova K, Trivdei K (2000) Failure correlation in software reliability models. IEEE Trans Reliab 49(1):37鈥?8 CrossRef
    11. Fan X, Xu G, Ying R, Zhang H, Jiang L (2003) Performance analysis of software rejuvenation on dispatcher鈥搘orker based cluster system. In: Proceedings of the 4th international conference on parallel and distributed computing, applications and technologies, pp 562鈥?66
    12. Vaidyanathan K, Haarper R, Hunter S, Trivedi K (2001) Analysis and implementation of software rejuvenation in cluster systems. In: Proceedings of joint international conference on measurement and modeling of computer systems, ACM SIGMETRICS, pp 62鈥?1
    13. Bobbio A, Sereno A, Anglano C (2001) Fine grained software degradation models for optimal rejuvenation policies. J Perform Eval 46:45鈥?2 CrossRef
    14. Dohi T, Popstojanova K, Trivedi K (2000) Statistical nonparametric algorithms to estimate the optimal software rejuvenation schedule. In: Proceedings of Pacific rim international symposium dependendable computing, pp 77鈥?4
    15. Grag S, Puliafito A, Telek M, Trivedi K (1998) Analysis of preventive maintenance in transactions based software systems. IEEE Trans Comput 47(1):96鈥?07 CrossRef
    16. Bao Y, Sun X, Trivedi K (2005) A workload-based analysis of software aging and rejuvenation. IEEE Trans Reliab 55(3):541鈥?48 CrossRef
    17. Koutras V, Platis A, Gravvanis G (2009) Optimal server resource reservation policies for priority classes of users under cyclic non-homogeneous markov modeling. Eur J Oper Res 198(2):545鈥?56 CrossRef
    18. Garg S, Moorsel A, Vaidyanathan K, Trivedi K (1998) A methodology for detection and estimation of software aging. In: Proceedings of 9th international symposium on software, reliability engineering, pp 282鈥?92
    19. Vaidyanathan K, Trivedi S (1999) A measurement-based model for estimation of resource exhaustion in operation systems. In: Proceedings of 10th international symposium on software, reliability engineering, pp 84鈥?3
    20. Vaidyanathan K, Trivedi S (2005) A comprehensive model for software rejuvenation. IEEE Trans Dependable Secur Comput 2(2):124鈥?37 CrossRef
    21. Cassidy K, Gross K, Malekpout A (2002) Advanced pattern recognition for detection of complex software aging in online transaxtion processing servers. In: Proceedings of dependable systems and networks, pp 478鈥?82
    22. Gross K, Bhardwaj V, Bickford R (2002) Proactive detective of software aging mechanisms in performance critical computers. In: Proceedings of 27th IEEE annual symposium on software enginerring, pp 17鈥?3
    23. Silva L, Alonso J, Torres J (2009) Using virtualization to improve software rejuvenation. IEEE Trans Comput 58(11):1525鈥?538 CrossRef
    24. Matias R, Barbetta P, Trivedi K, Freitas P (2010) Accelerated degradation tests applied to software aging experiments. IEEE Trans Reliab 59(1):102鈥?14 CrossRef
    25. Avritzer A, Weyuker E (1997) Monitoring smoothly degrading systems for increased dependability. Empir Softw Eng J 2(1):59鈥?7 CrossRef
    26. Liu Y, Trivedi K, Ma Y, Han J, Levendel H (2002) Modeling and analysis of software rejuvenation in cable modem termination systems. In: Proceedings of 13th international symposium on software, reliability engineering, pp 159鈥?70
    27. Tai A, Chau S, Alkalaj L, Hecht H (1997) On-board preventive maintenance: analysis of effectiveness and optimal duty period. In: Proceedings of 3rd international workshop on object oriented real-time dependable systems, pp 40鈥?7
    28. Kourai K, Chiba S (2011) Fast software rejuvenation of virtual machine monitors. IEEE Trans Dependable Secur Comput 8(6):839鈥?51 CrossRef
    29. Wang D, Xie W, Trivedi K (2007) Performance analysis of clustered systems with rejuvenation under varying workload. J Perform Eval 64:247鈥?65 CrossRef
    30. Xie W, Shi Y, Xu G, Mao Y (2002) Smart Platform鈥攁 software infrastructure for smart space. In: Proceedings of 4th IEEE conference on multimodal, interfaces, pp 429鈥?35
  • 作者单位:Menghui Yang (1)
    Geyong Min (2)
    Weikang Yang (3)
    Zituo Li (3)

    1. Electronic Records Management Research Center, School of Information Resource Management, Renmin University of China, Beijing, 100872, China
    2. Department of Computing, University of Bradford, Bradford, BD7 1DP, UK
    3. Research Institute of Information Technology, Tsinghua University, Beijing, 100080, China
  • ISSN:1436-5057
文摘
Software rejuvenation is a preventive and proactive fault management technique that is particularly useful for counteracting the phenomenon of software aging, aimed at cleaning up the system internal state to prevent the occurrence of future failure. The increasing interest in combing software rejuvenation with cluster systems has given rise to a prolific research activity in recent years. However, so far there have been few reports on the dependency between nodes in cluster systems when software rejuvenation is applied. This paper investigates the software rejuvenation policy for cluster computing systems with dependency between nodes, and reconstructs an stochastic reward net model of the software rejuvenation in such cluster systems. Simulation experiments and results reveal that the software rejuvenation strategy can decrease the failure rate and increase the availability of the cluster system. It also shows that the dependency between nodes affects software rejuvenation policy. Based on the theoretic analysis of the software rejuvenation model, a prototype is implemented on the Smart Platform cluster computing system. Performance measurement is carried out on this prototype, and experimental results reveal that software rejuvenation can effectively prevent systems from entering into disabled states, and thereby improving the ability of software fault-tolerance and the availability of cluster computing systems.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700