• Title/Summary/Keyword: Fault-Tolerance Service

Fault tolerant architecture for Module-based personal robot (모듈기반 퍼스널 로봇을 위한 Fault Tolerant 구조)

  • Baek, Bum-Hyun;Jee, Dong-Jun;Park, Chan-Jung;Han, Ki-Sung;Lee, Sun-Young;Kim, Kyeong-Tae;Hong, Sung-Won;Park, Hong-Seong
    • Proceedings of the KIEE Conference / 2006.04a / pp.297-299 / 2006
  • Many distributed applications are developed in varied environments, such as different operating systems and software platforms. They therefore exhibit different types of system behavior and status during operation, and each behavior may have different functional and non-functional requirements. Many distributed applications thus need a fault-tolerance solution. A personal robot provides various services and applications, so it needs a fault-tolerant architecture, and that architecture must be flexible to provide dependability. This paper proposes a fault-tolerant architecture for module-based personal robots that provides module fault tolerance and service fault tolerance.

The Implementation of Fault Tolerance Service for QoS in Grid Computing (그리드 컴퓨팅에서 서비스 품질을 위한 결함 포용 서비스의 구현)

  • Lee, Hwa-Min
    • The Journal of Korean Association of Computer Education / v.11 no.3 / pp.81-89 / 2008
  • Resource failures occur more often in grid computing than in traditional parallel computing. Since resource failures fatally affect job execution, a fault tolerance service is essential in computational grids. Grid services are also often expected to meet minimum levels of quality of service (QoS) for desirable operation. However, the Globus Toolkit does not provide a fault tolerance service that supports fault detection and fault management and satisfies QoS requirements. This paper therefore proposes a fault tolerance service that satisfies QoS requirements in computational grids. To provide fault tolerance and satisfy QoS requirements, we expand the definition of failure to cover process failure, processor failure, and network failure. We propose a resource scheduling service, a fault detection service, and a fault management service, and present implementation and experimental results.

OPRoS based Fault Tolerance Support for Reliability of Service Robots (서비스로봇의 신뢰성 향상을 위한 OPRoS 기반 Fault-tolerance 기법)

  • Ahn, Hee-June;Lee, Dong-Su;Ahn, Sang-Chul
    • Journal of Institute of Control, Robotics and Systems / v.16 no.6 / pp.601-607 / 2010
  • For the commercial success of emerging service robots, fault-tolerant technology for system reliability and human safety is crucial. Traditionally, fault tolerance methods have been implemented at the application level. However, from our studies of the common design patterns in fault tolerance, we argue that a framework-based approach offers many benefits in providing reliability for system development. To demonstrate these benefits, we build a framework-based fault-tolerant engine for the OPRoS (Open Platform for Robotic Services) standards. The fault manager in the framework provides a set of fault-tolerant measures for detection, isolation, and recovery. System integrators choose the appropriate fault handling tools by declaring XML configuration descriptors, considering the constraints of the components and the operating environment. By building a fault-tolerant navigation application from non-fault-tolerant components, we demonstrate the usability and benefits of the proposed framework-based approach.

Service Deployment Strategy for Customer Experience and Cost Optimization under Hybrid Network Computing Environment

  • Ning Wang;Huiqing Wang;Xiaoting Wang
    • KSII Transactions on Internet and Information Systems (TIIS) / v.17 no.11 / pp.3030-3049 / 2023
  • With the development and wide application of hybrid network computing modes such as cloud computing, edge computing, and fog computing, handling customer service requests and collaboratively optimizing various computing resources face huge challenges. Considering the characteristics of network environment resources, the optimized deployment of service resources is a feasible solution. In this paper, the optimization goals for deploying service resources are customer experience and service cost. The focus is on the system impact of service deployment on load, fault tolerance, service cost, and quality of service (QoS). The alternate node filtering algorithm (ANF) and the adjustment factor of the cost matrix are proposed to enhance system service performance without changing the minimum total service cost, and a corresponding theoretical proof is provided. In addition, to improve the fault tolerance of the system, the alternate node preference factor and algorithm (ANP) are presented, which can effectively reduce the probability of data copy loss; based on these, an improved cost-efficient replica deployment strategy named ICERD is given. Finally, experiments that simulate the random occurrence of cloud node failures and compare ICERD with representative strategies validate that ICERD not only effectively reduces customer access latency, meets customers' QoS requests, and improves system service quality, but also maintains load balancing across the entire system, reduces service cost, and enhances system fault tolerance, which further confirms the effectiveness and reliability of the ICERD strategy.

State-Monitoring Component-based Fault-tolerance Techniques for OPRoS Framework (상태감시컴포넌트를 사용한 OPRoS 프레임워크의 고장감내 기법)

  • Ahn, Hee-June;Ahn, Sang-Chul
    • Journal of Institute of Control, Robotics and Systems / v.16 no.8 / pp.780-785 / 2010
  • The OPRoS (Open Platform for Robotic Services) framework is proposed as an application runtime environment for service robot systems. For the successful deployment of the OPRoS framework, fault tolerance support is crucial on top of its basic functionalities of lifecycle, thread, and connection management. In our previous work [1] on OPRoS fault tolerance support, we presented a framework-based fault tolerance architecture. In this paper, we extend the architecture with component-based fault tolerance techniques, which can be simpler and more efficient than the pure framework-based approach. This is especially true for fault detection, since most faults and failures can be defined as cases in which the system cannot meet the requirements of the application functions. Specifically, the paper applies two widely used fault detection techniques to the OPRoS framework: the 'bridge component' and 'process model' component techniques. The application details and performance of the proposed techniques are demonstrated with the same application scenario as in [1]. Combining component-based techniques with the framework-based architecture would improve the reliability of robot systems using the OPRoS framework.

Service-Dependability-Case based Self-Adaptation in Service-Oriented Environment (서비스 지향 컴퓨팅 환경에서 서비스 안정성 케이스 기반 자가 적응 방법)

  • Jung, Changhee;Lee, Seok-Won
    • Journal of KIISE / v.42 no.11 / pp.1339-1348 / 2015
  • In a distributed system environment based on a service-oriented architecture, separate systems collaborate to achieve the goals of the entire system by using services provided by other systems. A service quality violation in any one service can cause a runtime system failure in this environment. Existing self-adaptation methods follow a fault tolerance mechanism that responds to a failure after a service quality violation has occurred. In other words, these methods are limited to reactive action. Therefore, a service-dependability-case based self-adaptation mechanism is necessary to preserve the dependability of the self-adaptive system. This paper demonstrates that the service-dependability-case based self-adaptation mechanism preserves the dependability of the self-adaptive system better than QoS (quality of service)-based self-adaptation with fault tolerance. Additionally, this paper suggests a method to present and analyze service dependability using GSN (Goal Structuring Notation), an existing modeling method for presenting assurance cases; an action mechanism adapted using the analysis results of service-dependability cases; a method of leveraging the service-dependability-case based self-adaptation mechanism along the service's life cycle; and a framework architecture that includes the major components and the interactions between them in the control loop of the self-adaptation process.

Design and Implementation of Adaptive Fault-Tolerant Management System over Grid (그리드 환경의 적응형 오류 극복 관리 시스템 설계 및 구현)

  • Kim, Eun-Kyung;Kim, Jeu-Young;Kim, Yoon-Hee
    • The KIPS Transactions:PartA / v.15A no.3 / pp.151-154 / 2008
  • Middleware in a grid computing environment is required to support seamless on-demand services over diverse resource situations in order to meet various user requirements [1]. Grid computing applications therefore need situation-aware middleware services in this environment. In this paper, we propose a semantic middleware architecture that supports dynamic software component reconfiguration based on fault and service ontologies to provide fault tolerance in a grid computing environment. Our middleware includes autonomic management to detect faults, analyze their causes, and plan semantically meaningful strategies to recover from failures using pre-defined fault and service ontology trees. We implemented a reference prototype, the Web-service based Application Execution Environment (Wapee), as a proof of concept, and showed its efficiency in runtime recovery.

A Research to Enhance the Fault Tolerance of the CORBA Based Traffic Information Systems (CORBA 기반 교통정보시스템의 Fault Tolerance 향상을 위한 연구)

  • Seh, Woon-Suk;Ryu, Kwang-Taek;Lee, Eun-Seok
    • The KIPS Transactions:PartD / v.10D no.6 / pp.991-998 / 2003
  • There are many methods, from different viewpoints, to enhance the fault tolerance of CORBA-based real-time systems. Among them, this paper provides a method to enable seamless service when object faults arise in CORBA-based systems that process real-time information. Specifically, it examines a method for efficiently handling object faults that occur in 3-tier architecture environments. Objects can be replicated as a way to enhance fault tolerance against object faults. In addition, this paper shows a method that ultimately enhances fault tolerance and maintains service continuity by providing a way for the systems to continue running until faults in the FT-CORBA based system are recovered.

Fault Tolerance System running on Distributed Multimedia (분산 멀티미디어에서의 결함 허용 시스템)

  • Hong, Sung-Ryong;Ko, Eung-Nam
    • Journal of Digital Contents Society / v.16 no.1 / pp.123-126 / 2015
  • This paper describes a fault tolerance system running on distributed multimedia. We implemented an error manager service so that users participating in distributed multimedia collaborative work can refer to synchronized error objects with the same view as other users. The distributed multimedia environment is based on IP-USN (Internet Protocol - Ubiquitous Sensor Network) and M2M (machine to machine). The system is suitable for detecting, sharing, and recovering from software errors in a distributed multimedia CSCW (Computer Supported Cooperative Work) environment. With the error synchronization system, a group of cooperating users can synchronize error applications.

Web Services System Supporting Fault-Tolerance based on the Quality (품질 기반 장애 극복을 지원하는 웹 서비스 시스템)

  • Lee, Yong-Pyo;Shin, Jae-Dong;Han, Sang-Yong
    • The KIPS Transactions:PartD / v.12D no.6 s.102 / pp.875-880 / 2005
  • Recently, web services are being used to provide distributed computing environments. Web services provide reusable software components, so one web service can be used by many users, and one user can use different web services. In these cases, fault tolerance is important for the reliable use of web services. Existing fault-tolerant methods for web services require some client modification and cannot consider extensible factors such as quality. This study suggests a system architecture and a description language for a system that can mitigate some of these problems.