Computer Engineering and Applications, 2022, Vol. 58, Issue (23): 94-103. DOI: 10.3778/j.issn.1002-8331.2203-0239

• Big Data and Cloud Computing •

Path Discrimination Method for Service Fault Propagation in Cloud Computing Environment

SI Jianzhe, JIANG Ying, LI Rongchen, CHEN Weiwei   

  1. Yunnan Key Lab of Computer Technology Application, Kunming 650500, China
  2. Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China
  • Online: 2022-12-01 Published: 2022-12-01

云计算环境下服务故障传播路径判别方法

姒鉴哲,姜瑛,李荣宸,陈威伟   

  1. 云南省计算机技术应用重点实验室,昆明 650500
  2. 昆明理工大学 信息工程与自动化学院,昆明 650500

Abstract: Services in cloud computing environments are dynamic, complex, and diverse, which makes the interactions among them increasingly complicated. When a service fails, the fault keeps propagating between services and can eventually prevent the system from running normally. Some existing fault propagation analysis methods rely too heavily on historical data, measure only a single fault propagation factor, and cannot model a system structure that changes dynamically. To address these problems, a path discrimination method for service fault propagation in cloud computing environments is proposed. First, a service interaction diagram is established dynamically. Second, the structure of the service interaction diagram is optimized to build a service relationship diagram. Third, the possibility of service failure is calculated by jointly considering service operating and environmental conditions, and the faulty service is determined. Finally, the factors influencing service fault propagation are analyzed, the service fault propagation probability is calculated, and the service fault propagation path is discriminated. Experimental results show that the method can accurately determine the faulty service and effectively discriminate the service fault propagation path.

Key words: cloud computing environment, service interaction diagram, service relationship diagram, service failure, fault propagation path

摘要: 云计算环境下服务的运行具有动态性、复杂性和多样性的特征,使得众多服务间的动态交互关系日益复杂,随着云计算环境下服务故障发生,故障在服务间不断传播,致使系统不能正常运行。当前,部分故障传播影响分析方法存在过多关注历史数据、衡量故障传播因素单一、无法适用动态变更的系统结构等问题。为解决上述问题,提出了一种云计算环境下服务故障传播路径判别方法。动态建立服务交互图;优化服务交互图结构,建立服务关系图;通过综合考虑服务运行、环境状况计算服务故障可能性,确定发生故障服务;分析服务故障传播的影响因素,计算服务故障传播概率,进行服务故障传播路径判别。实验结果表明,该方法能准确确定发生故障服务、有效判别服务故障传播路径。

关键词: 云计算环境, 服务交互图, 服务关系图, 服务故障, 故障传播路径
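
Illustrative sketch: the abstract names four steps (build a service graph, refine it, determine the faulty service, discriminate the propagation path) but gives no formulas, so the following Python is only a minimal sketch under stated assumptions, not the authors' method. It assumes that per-service fault likelihoods and per-edge fault propagation probabilities have already been computed; all function names, service names, and numeric values are hypothetical. The path step substitutes a standard technique: Dijkstra's algorithm on -log(p), which finds the path whose product of propagation probabilities is largest.

```python
import heapq
import math
from collections import defaultdict


def build_relationship_graph(records):
    """Build a directed graph {source: {target: probability}}.

    Each record is (source, target, probability), read as "a fault in
    `source` may propagate to `target` with this probability". The
    probabilities are hypothetical inputs; the abstract does not say how
    they are computed. Repeated pairs keep the largest probability, and
    non-positive probabilities are dropped.
    """
    graph = defaultdict(dict)
    for source, target, prob in records:
        if prob <= 0.0:
            continue
        graph[source][target] = max(prob, graph[source].get(target, 0.0))
    return graph


def pick_faulty_service(fault_likelihood):
    """Return the service with the highest (hypothetical) fault likelihood."""
    return max(fault_likelihood, key=fault_likelihood.get)


def most_likely_paths(graph, source):
    """Most probable propagation path from `source` to each reachable service.

    Maximising a product of probabilities is the same as minimising the
    sum of -log(p). Every such weight is >= 0 because p <= 1, so
    Dijkstra's algorithm applies. Returns {service: (probability, path)}.
    """
    best_cost = {source: 0.0}
    parent = {source: None}
    heap = [(0.0, source)]
    while heap:
        cost, node = heapq.heappop(heap)
        if cost > best_cost[node]:
            continue  # stale heap entry, a cheaper route was already found
        for nxt, prob in graph.get(node, {}).items():
            new_cost = cost - math.log(prob)
            if new_cost < best_cost.get(nxt, math.inf):
                best_cost[nxt] = new_cost
                parent[nxt] = node
                heapq.heappush(heap, (new_cost, nxt))

    result = {}
    for node, cost in best_cost.items():
        path = []
        current = node
        while current is not None:  # walk back to the source
            path.append(current)
            current = parent[current]
        result[node] = (math.exp(-cost), path[::-1])
    return result


if __name__ == "__main__":
    # Hypothetical services and values, for illustration only.
    records = [("A", "B", 0.9), ("B", "C", 0.5), ("A", "C", 0.3), ("C", "D", 0.8)]
    likelihood = {"A": 0.7, "B": 0.2, "C": 0.1, "D": 0.05}
    graph = build_relationship_graph(records)
    faulty = pick_faulty_service(likelihood)
    for service, (prob, path) in most_likely_paths(graph, faulty).items():
        print(service, round(prob, 3), " -> ".join(path))
```

In this toy input the indirect route A -> B -> C (0.9 × 0.5) outweighs the direct edge A -> C (0.3), which is the kind of comparison a path discrimination step has to make. A real system would derive the probabilities from monitored operating and environmental data rather than fixed constants.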