• Title/Summary/Keyword: Fault-tolerance

Search Result 571, Processing Time 0.023 seconds

Monitoring of Real Time System using Conformance Monitor (Conformance Monitor를 이용한 실시간 시스템의 모니터링)

  • Sim, Jae-Hwan;Kim, Jin-Hyun;Choi, Jin-Young
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2005.11a
    • /
    • pp.1445-1448
    • /
    • 2005
  • 실시간 시스템은 높은 수준의 안정성을 요구하는 시스템이다. 실시간 시스템에서 오류는 잘못된 응답 뿐만 아니라 시간적으로 늦은 응답도 오류를 발생 시킬 수 있다. 따라서 실시간 시스템의 안정성을 보장해 주기 위해서 실시간 태스크의 시간 제약에 대한 모니터링 및 오류발생 시에 오류를 처리해 줄 수 있는 기법이 필요하다. 본 논문은 Timed Conformance Monitor를 통해서 실시간 태스크가 시간의 제약을 만족하는지를 분석하고 또한 분석 결과에 따라 오류를 처리할 수 있는 Fault Handler를 추가하여 실시간 시스템에 대한 Fault Tolerance를 보장해 줄 수 있는 기법을 제시한다.

  • PDF

Simulation and Evaluation of Redistribution Algorithms In Fault-Tolerant Distributed System (결함허용 분산시스템의 재분배 알고리즘의 시뮬레이션과 평가)

  • 최병갑;이천희
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.31B no.8
    • /
    • pp.1-10
    • /
    • 1994
  • In this paper load redistribution algorithm to allow fault-tolerance by redistributing the workload of n failure nodes to the remaining good nodes in distributed systems are investigated. To evaluate the efficiency of the algorithms a simulation model of algorithms is developed using SLAM II simulation language. The job arrival rate service rate failure and repair rate of nodes and communication delay time due to load migraion are used as parameters. The result of the simulation shows that the job arrival rate failure and repair rate of nodes do not affected on the relative efficiency of algorithms. If the communication delay time is greater than average job processing time algorithm B is better. Otherwise algorithm C is superior to the others.

  • PDF

Bi-active Load Balancer for enhancing of scalability and fault-tolerance of Cluster System (확장성과 고장 감내를 위한 효율적인 부하 분산기)

  • Kim, Young-Hwan;Youn, Hee-Yong;Choo, Hyun-Seung
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2002.04a
    • /
    • pp.381-384
    • /
    • 2002
  • This paper describes the motivation, design and performance of bi-active Load balancer in Linux Virtual Server. The goal of bi-active Load balancer is to provide a framework to build highly scalable, fault-tolerant services using a large cluster of commodity servers. The TCP/IP stack of Linux Kernel is extended to support three IP load balancing techniques, which can make parallel services of different kinds of server clusters to appear as a service on a single IP address. Scalability is achieved by transparently adding or removing a node in the cluster. and high availability is provided by detecting node or daemon failures and reconfiguring the system appropriately. Extensive simulation reveals that the proposed approach improves the reply rate about 20% compared to earlier design.

  • PDF

JOB Scheduling for process Control in Hierarchical Computer Network (계층구조 Computer Network에서 공정제어를 위한 JOB Scheduling)

  • Park, Yil
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.5 no.1
    • /
    • pp.83-87
    • /
    • 1980
  • The distributive processing job in a hierarchical computer network, which supervises and controls the complex relations between the variables periodically for raising the fault folerance, can be defined its periodicity and its execution time. All the job may be composed of the subsets in relation of Tree structure. For a processor job set this paper finds out a job scheduling algorithm that has the less loose time between period than that of FCFS.

  • PDF

A Study on the Software Reliability Improvement in Railway Control System (열차제어시스템 소프트웨어의 신뢰성 향상에 대한 고찰)

  • Kim, Jong-Ki;Choi, Kyoo-Hyung;Lee, Jong-Woo
    • Proceedings of the KIEE Conference
    • /
    • 2001.10a
    • /
    • pp.275-277
    • /
    • 2001
  • 열차제어시스템의 소프트웨어는 높은 신뢰성과 안전성이 요구된다. 이를 위해 철도에서는 고장회피(Fault Avoidance) 기법과 고장허용(Fault Tolerance)기법을 사용하고 있다. 그러나 최근 신호설비의 소프트웨어 신뢰성을 좀더 향상시키기 위해 일본, 프랑스, 스웨덴 등에서는 안전에 치명적인 시스템에 사용되고 있는 정형기법(Forma) Method)을 포함하여 많은 연구를 수행중이다. 본고에서는 국외 관련분야를 분석하고, 열차제어시스템 소프트웨어의 신뢰성 향상에 관하여 기술하고자 한다.

  • PDF

Implementation of Distributed Fault-Tolerant Middleware for Dual Channel Ethernet based Virtual Server (이중 채널 이더넷 기반 가상서버를 위한 분산 고장 감내 미들웨어의 구현)

  • 함명호;김진용;최보곤;신현식
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2003.04a
    • /
    • pp.112-114
    • /
    • 2003
  • 긴박한 임무 상황(mission critical)의 시스템 뿐만 아니라, 웹 서버등의 고가용성, 고신뢰성을 위한 가상 서버의 구축도 관심이 되고 있다. 본 연구에서는 가상서버의 고가용성 , 고신뢰성을 보다 향상시키기 위해 가상서버 클러스터의 각 구성 노드들을 이중의 네트웍 채널로 중복 시켜 네트웍 고장에 대한 신뢰성을 향상시켰다. 관리자 노드와 작업노드 풀로 구성되는 시스템의 각 노드와 이중 채널로 구성된 네트웍에 대한 결함검출과 결함복구를 위한 분산 결함 허용 미들웨어를 구현하였고, 적응형 고장 감내 (Adaptive Fault Tolerance) 기법을 사용하여 다양한 임무 상황에서의 자원 효율성을 향상시켰다.

  • PDF

Design and Performance Evaluation of a Fault-Tolerant Input-Buffered ATM Switch based on Multistage Interconnection Networks (다단계 상호연결 네트워크에 기반한 입력버퍼형 오류허용 ATM 스위치의 설계 및 성능 평가)

  • Sin, Won-Cheol;Son, Yu-Ik
    • The KIPS Transactions:PartC
    • /
    • v.8C no.3
    • /
    • pp.319-326
    • /
    • 2001
  • 본 논문에서는 다단계상호연결 네트워크에 기반한 입력버퍼 구조의 ATM 스위치에 관해서 언급한다. 제안된 방법은 HOL 블록킹으로 인해 균일 트래픽(uniform traffic) 하에서 최대 약 58.6%의 처리율을 넘지 못하는 문제를 해결 할 수 있는 방법을 제시하며, 또한 오류허용 기능을 확장시키기 위하여 베이스라인 네트워크에서 버디 연결 매핑 및 제한연결 매핑 특성을 이용한 다중경로를 제공할 수 있는 버퍼 기법에 관하여 언급한다. 시뮬레이션에 의한 성능 평가 결과, 기존 방식과 비교하여 좋은 처리율과 셀 손실율을 보였으며, 더욱이 오류 스위치의 증가에도 불구하고 처리율의 수준은 적정한 셀 지연 범위 내에서 유지될 수 있음을 보여주고 있다.

  • PDF

A Design and Implementation of Fault Tolerance Agent on Distributed Multimedia Environment (분산 멀티미디어 환경에서 결함 허용 에이전트의 설계 및 구현)

  • Go, Eung-Nam;Hwang, Dae-Jun
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.10
    • /
    • pp.2618-2629
    • /
    • 1999
  • In this paper, we describe the design and implementation of the FDRA(Fault Detection Recovery based on Agent) running on distributed multimedia environment. DOORAE is a good example for distributed multimedia and multimedia distance education system among students and teachers during lecture. It has primitive service agents. Service functions are implemented with objected oriented concept. FDRA is a multi-agent system. It has been environment, intelligent agents interact with each other, either collaboratively or non-collaboratively, to achieve their goals. The main idea is to detect an error by using polling method. This system detects an error by polling periodically the process with relation to session. And, it is to classify the type of error s automatically by using learning rules. The merit of this system is to use the same method to recovery it as it creates a session. FDRA is a system that is able to detect an error, to classify an error type, and to recover automatically a software error based on distributed multimedia environment.

  • PDF

Ad hoc Software Rejuvenation for Survivability

  • Khin Mi Mi Aung;Park, Jong-Sou
    • Proceedings of the Korea Institutes of Information Security and Cryptology Conference
    • /
    • 2003.12a
    • /
    • pp.141-145
    • /
    • 2003
  • We propose the model of Software Rejuvenation methodology, which is applicable for survivability. Software rejuvenation is a proactive fault management technique and being used in fault tolerant systems as a cost effective technique for dealing with software faults. Survivability focuses on delivery of essential services and preservation of essential assets, even systems are penetrated and compromised. Thus, our objective is to detect the intrusions in a real time and survive in face of such attacks. As we deterrent against an attack in a system level, the Intrusion tolerance could be maximized at the target environment. We address the optimal time to execute ad hoc software rejuvenation and we compute it by using the semi Markov process. This is one way that could be really frustrated and deterred the attacks, as the attacker can't make their progress. This Software Rejuvenation method can be very effective under the assumption of unknown attacks. In this paper, we compute the optimum time to perform an ad hoc Software Rejuvenation through intrusions.

  • PDF

Reaching Byzantine Agreement underlying VANET

  • Wang, Shu-Ching;Lin, Ya-Jung;Yan, Kuo-Qin
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.7
    • /
    • pp.3351-3368
    • /
    • 2019
  • The Internet of Things (IoT) enables machines and devices in a global network to connect and provide applications. The Vehicular Ad-hoc NETwork (VANET) allows vehicles in the network to communicate with each other as an application of the IoT. The safety and comfort of passengers can be improved through VANET related applications. In order to be able to provide related applications, there must be a reliable VANET topology. As a result of the Byzantine agreement (BA), fault tolerance can be solved in VANET. In order to improve the reliability of the system, even if some components in the system are damaged, a protocol is needed to assist the system to perform normally. Therefore, the BA problem in VANET with multiple impairments is revisited in this research. The proposed protocol allows all normal processing elements (PEs) to reach agreement using the least amount of information exchange. Moreover, the proposed protocol can tolerate the largest number of damaged PEs in VANET.