• Title/Summary/Keyword: fault tolerant computing

Web Services System Supporting Fault-Tolerance based on the Quality (품질 기반 장애 극복을 지원하는 웹 서비스 시스템)

  • Lee, Yong-Pyo;Shin, Jae-Dong;Han, Sang-Yong
    • The KIPS Transactions:PartD / v.12D no.6 s.102 / pp.875-880 / 2005
  • Recently, web services are being used to provide distributed computing environments. Web services provide reusable software components, so one web service can be used by many users, and one user can use different web services. For reliable use of web services in these cases, fault tolerance is important. Existing fault-tolerance methods for web services require some client modification and cannot take extensible factors such as quality into account. This study proposes a system architecture and a description language that address some of these problems.

Operation and performability analysis of modular cells (모듈러 셀의 운영과 수행성 해석)

  • Heo, Gyeon;Jang, Seok-Ho;Jung, Hyun-Ho;Lee, Sang-Moon;Woo, Gwang-Bang;Kim, Hak-Bae
    • Institute of Control, Robotics and Systems (제어로봇시스템학회): Conference Proceedings / 1997.10a / pp.1263-1266 / 1997
  • In fault-tolerant modern manufacturing systems, automated redundant machines prone to unexpected failures are interconnected with other complex subsystems such as AGVs, robots, and computer control systems to produce complete parts. In such configurations, failures, repairs, and reconfigurations should be considered the three basic events to be modeled when computing the performance of manufacturing systems. In this paper, transient analysis is applied to modular cell manufacturing systems from a performability viewpoint, whose modeling advantage is that various performance measures can be evaluated compositely in the context of the application. The hypothetical modular cells are first modeled with a hybrid decomposition method; availability measures are then computed as special cases of performability, and comments on performability modeling and analysis are given.

On the Hardness of Leader Election in Asynchronous Distributed Systems with Crash Failures

  • Park Sung-Hoon;Kim Yoon
    • International Journal of Contents / v.1 no.1 / pp.21-28 / 2005
  • This paper is about the hardness of the Leader Election problem in asynchronous distributed systems in which processes can crash but links are reliable. Recently, the hardness of a problem encountered in such systems has been defined with respect to the difficulty of solving it despite failures: a problem is easy if it can be solved in the presence of failures; otherwise it is hard [9]. It is shown in [9] that problems fall into three classes: F (fault-tolerant), NF (not fault-tolerant), and NFC (NF-completeness). Among those, the class NFC contains the hardest problems to solve. It is also shown in [9] that the construction of a Perfect Failure Detector (problem P) belongs to NFC. In this paper, we show that Leader Election is also an NFC problem by using a general reduction protocol that reduces the Leader Election problem to P. We use a formulation of the Leader Election problem as a prototype to show that it belongs to NFC.

A Recovery Mechanism for Server Failure in Database Systems based on Mobile computing Environments (이동 컴퓨팅 환경에 기반을 둔 데이터베이스 시스템에서 서버의 고장 회복 기법)

  • Jo, Jeong-Ran;Hwang, Bu-Hyeon
    • The Transactions of the Korea Information Processing Society / v.6 no.1 / pp.1-10 / 1999
  • A mobile computing environment is one that supports user mobility through wireless communication technology. Users access the database and get the results they want by running mobile transactions. To run mobile transactions correctly and to maintain the consistency of the database, we need a concurrency control method to schedule transactions, a caching method to manage the cache, and a recovery method to construct a fault-tolerant system. A mobile computing system is based on the existing distributed system, but the recovery methods of existing distributed systems cannot be used directly because of user mobility and the characteristics of wireless media. This paper therefore presents a recovery mechanism for constructing a fault-tolerant mobile computing system. In particular, we develop and analyze a recovery algorithm for server failure, one of the types of failure that can arise in mobile computing environments.

IMMORTAL : Fault Tolerant Distributed Middleware System based on Remote Method Invocation (IMMORTAL : 원격 메쏘드 호출에 기반한 결함허용 분산 미들웨어 시스템)

  • Hyun, Mu-Yong;Kim, Shik;Kim, Myung-Jun;Yamakita, Jiro
    • Journal of KIISE:Information Networking / v.29 no.5 / pp.562-572 / 2002
  • Distributed object technologies have become popular in developing distributed systems. Although middleware platforms such as DSOM, DCOM, CORBA, and Java RMI ease the development of distributed applications, they do not directly improve the reliability and availability of these applications. Because developing fault-tolerance techniques for distributed object paradigms is often complicated and error-prone, there is a great need for a development toolkit that enhances the reliability and availability of distributed objects. In this paper, we propose a fault-tolerant distributed middleware system based on RMI, called IMMORTAL. We use a log-based rollback-recovery mechanism to support reliable distributed computing. Through a series of experiments, we observe that benchmark applications on IMMORTAL tolerate hardware and software failures, and we evaluate its performance and scalability.
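
The log-based rollback-recovery idea mentioned in the abstract above can be illustrated with a minimal sketch. This is not IMMORTAL's implementation: the class `LoggedCounter` and its methods are hypothetical, and the checkpoint and message log are kept in memory here, whereas a real system would write them to stable storage that survives the crash.

```python
import copy


class LoggedCounter:
    """Toy process whose state is rebuilt from a checkpoint plus a message log."""

    def __init__(self):
        self.state = {"total": 0}
        # Assumption: these two fields stand in for stable storage.
        self._checkpoint = copy.deepcopy(self.state)
        self._log = []

    def _apply(self, amount):
        # Deterministic state transition, so replaying the log is safe.
        self.state["total"] += amount

    def deliver(self, amount):
        # Log the message before applying it, so it can be replayed later.
        self._log.append(amount)
        self._apply(amount)

    def checkpoint(self):
        # Save the current state; earlier log entries are no longer needed.
        self._checkpoint = copy.deepcopy(self.state)
        self._log.clear()

    def crash(self):
        # Simulate losing the volatile state.
        self.state = None

    def recover(self):
        # Roll back to the last checkpoint, then replay the logged messages.
        self.state = copy.deepcopy(self._checkpoint)
        for amount in self._log:
            self._apply(amount)


process = LoggedCounter()
process.deliver(5)
process.checkpoint()
process.deliver(3)
process.deliver(4)
process.crash()
process.recover()
print(process.state["total"])
```

Replay only reproduces the pre-crash state because `_apply` is deterministic; handling nondeterministic events is one of the harder parts of real rollback-recovery protocols and is outside this sketch.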

Design and Implementation of Fault-Tolerant IVR for Multi-Channel Service (다중 채널 서비스를 위한 결함허용 IVR 설계 및 구현)

  • Han, Yun-Ki;Koo, Yong-Wan
    • Journal of Internet Computing and Services / v.9 no.3 / pp.103-117 / 2008
  • General customers use the internet, SMS (Short Message Service), ATM (Automated Teller Machine), DM (Direct Mail), telephony, and other channels to access the services of large businesses, small and medium enterprises, securities firms, financial institutions, and banks. In particular, fire insurance and securities companies, financial institutions, and banks must meet real-time constraints by ensuring QoS (Quality of Service). In this paper, we design and implement a system model of a fault-tolerant IVR (Interactive Voice Response) for multi-channel service, based on a CRM (Customer Relationship Management) environment, as the first inbound point of contact for customers. The proposed model can be widely used in large-scale customer response services.

A Byzantine Fault-tolerant Clock Synchronization Scheme in Wireless Sensor Networks (무선 센서 네트워크에서 비잔틴 오류를 허용하는 클럭 동기화 기법)

  • Lim, Hyung-Geun;Nam, Young-Jin;Baek, Jang-Woon;Ko, Seok-Young;Seo, Dae-Wha
    • Journal of KIISE:Computing Practices and Letters / v.14 no.5 / pp.487-491 / 2008
  • This paper proposes a Byzantine fault-tolerant clock synchronization scheme for wireless sensor networks to cope with clock synchronization disturbance attacks by malicious nodes. In the proposed scheme, a node requiring clock synchronization receives 3m+1 clock synchronization messages, not only from its parent nodes but also from its sibling nodes, so that it can tolerate malicious attacks even if up to m of the senders are malicious. The results show that the proposed scheme is 7 times more resilient to clock synchronization disturbance attacks by malicious nodes than existing schemes in terms of synchronization accuracy.
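
To see why 3m+1 readings help against up to m malicious senders, consider a common fault-tolerant averaging technique: sort the readings, discard the m smallest and m largest, and average the rest. The sketch below shows that generic technique only; it is not claimed to be the paper's algorithm, and the function name `fault_tolerant_clock` is hypothetical.

```python
from statistics import mean


def fault_tolerant_clock(readings, m):
    """Estimate a clock value from readings, tolerating up to m bad ones.

    Assumption: at most m of the readings come from malicious nodes, and
    at least 3m+1 readings are available.
    """
    if m < 0:
        raise ValueError("m must be non-negative")
    if len(readings) < 3 * m + 1:
        raise ValueError("need at least 3m+1 readings to tolerate m faults")
    ordered = sorted(readings)
    # Dropping m values from each end removes every extreme outlier that
    # m malicious nodes could have injected.
    trimmed = ordered[m:len(ordered) - m]
    return mean(trimmed)


# Five honest readings near 100 and two wildly wrong ones (m = 2).
readings = [100.0, 101.0, 99.0, 100.0, 1e9, -1e9, 100.5]
estimate = fault_tolerant_clock(readings, m=2)
print(99.0 <= estimate <= 101.0)
```

After trimming, any surviving value lies between two honest readings, so the estimate stays within the range of honest clocks even when the malicious values are arbitrary.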

An Implementation of Fault-Tolerant Message Passing Interface on Parallel Computers (병렬 컴퓨터에서의 결함 허용 메시지 전달 인터페이스 구현)

  • Song, Dae-Ki;Lee, Cheol-Hoon
    • Journal of KIISE:Computing Practices and Letters / v.6 no.3 / pp.319-328 / 2000
  • The Message-Passing Interface (MPI) is a standard interface for parallel programming environments, on which application programs run on the processors of a parallel computer. Processor nodes execute the processes that make up the program by passing messages to one another. During execution, however, if a fault occurs on a processor node or in a process, the result is an inconsistent state, and consequently the whole program has to be stopped. To solve this problem, we propose a fault-tolerant message passing interface (FT-MPI) that adds a fault manager module to MPI. The proposed FT-MPI does not need any hardware support, and each application program based on MPI can run on FT-MPI without any modification. The proposed fault tolerance scheme uses the so-called hot-spare process duplication method, and simulations verify that application programs run despite any fault with less than 5% overhead on execution time.

Design and Evaluation of a Fault-Tolerant Distributed Location Management Method in Mobile Environments (이동 환경에서 결함 포용 분산 위치 관리 방법의 설계 및 평가)

  • Bae, Ihn-Han;Oh, Sun-Jin
    • Journal of Internet Computing and Services / v.1 no.1 / pp.35-46 / 2000
  • One of the main challenges in personal communication services (PCS) is to locate many mobile terminals that may move frequently from place to place. This system operation is called location management. A large amount of network signaling traffic and many database queries are required to achieve this task. Several strategies have been proposed to improve the efficiency of location management. These strategies use location register databases to store the current location of mobile terminals, and are vulnerable to failure of the location registers. In this paper, we propose a fault-tolerant pointer forwarding with distributed home location register (FT-RFDHLR) to tolerate the failure of location registers. The performance of the proposed method is evaluated with an analytical model and compared with pointer forwarding with a single home location register (PFSHLR), pointer forwarding with distributed home location register (PFDHLR), Biaz's bypass forwarding strategy (BFS), and the two-path forwarding strategy (TPFS).

A Dynamic Checkpoint Scheduling Scheme for Fault Tolerant Distributed Computing Systems (결함 내성 분산 시스템에서의 동적 검사점 스케쥴링 기법)

  • Park, Tae-Soon
    • Journal of KIISE:Computer Systems and Theory / v.29 no.2 / pp.75-86 / 2002
  • Selecting the optimal checkpointing interval has been a critical issue in implementing a checkpointing recovery scheme for fault-tolerant distributed systems. This paper presents a new scheme that allows a process to select the proper checkpointing interval dynamically. A process in the system evaluates the cost of checkpointing and possible rollback for each checkpointing interval and selects the proper time interval for the next checkpoint. Unlike other schemes, the overhead incurred by both checkpointing and rollback activities is considered in the cost evaluation, and the current communication pattern is reflected in the selection of the checkpointing interval. Moreover, the proposed scheme requires no extra message communication for the checkpointing interval selection and can easily be incorporated into existing checkpointing coordination schemes.
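
The trade-off behind interval selection can be sketched with a simplified first-order cost model: checkpointing more often costs more up front, while checkpointing less often means more work is lost on rollback. The model below is an assumption for illustration, not the paper's cost function (which also reflects the communication pattern); the function names and parameter values are hypothetical.

```python
import math


def overhead_rate(interval, checkpoint_cost, failure_rate):
    """Expected overhead per unit of time for a given checkpoint interval.

    Assumed model: one checkpoint of cost `checkpoint_cost` per interval,
    plus failures at `failure_rate` that each lose half an interval of
    work on average.
    """
    return checkpoint_cost / interval + failure_rate * interval / 2.0


def pick_interval(candidates, checkpoint_cost, failure_rate):
    """Choose the candidate interval with the lowest expected overhead."""
    return min(
        candidates,
        key=lambda t: overhead_rate(t, checkpoint_cost, failure_rate),
    )


def young_interval(checkpoint_cost, failure_rate):
    """Closed-form minimizer of the model above (Young's approximation)."""
    return math.sqrt(2.0 * checkpoint_cost / failure_rate)


candidates = [5, 10, 20, 40, 80]
best = pick_interval(candidates, checkpoint_cost=2.0, failure_rate=0.01)
print(best, young_interval(2.0, 0.01))
```

A dynamic scheme would re-evaluate these costs at run time as the measured checkpoint cost and failure behavior change, instead of fixing the interval once.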