• Title/Abstract/Keywords: Fault Tolerance System

335 search results, processing time 0.027 s

Fault Diagnosis and Tolerance for Asynchronous Counters with Critical Races Caused by Total Ionizing Dose in Space

  • 곽성우;양정민
    • 한국지능시스템학회논문지 / Vol. 22, No. 1 / pp. 49-55 / 2012
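  • Asynchronous counters, whose values change in response to external inputs without a global clock, are widely used in modern digital systems such as memory for space applications. This paper proposes a fault tolerance scheme for asynchronous counters with critical race faults caused by the accumulation of space radiation. A critical race is a typical fault arising in the design of asynchronous digital circuits; it is nondeterministic in that the next stable state of the circuit is not fixed but turns out to be one of several values. Using corrective control for asynchronous sequential machines, this study presents the design procedure for a state feedback controller that can overcome critical races. The corrective control system for the asynchronous counter is also implemented in VHDL, and experiments illustrate how the proposed control system overcomes critical race faults.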
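The nondeterminism described in this abstract can be illustrated with a small Python sketch. It is not the paper's counter or its corrective controller: the two-variable excitation table `NEXT_STATE` and the helper names are hypothetical, chosen only to show how the final stable state can depend on which state variable switches first.

```python
from itertools import permutations

# Hypothetical excitation table for ONE fixed input value (an assumption,
# not taken from the paper): present state (y1, y2) -> requested next state.
# A state that maps to itself is stable.
NEXT_STATE = {
    (0, 0): (1, 1),  # both variables are asked to change at once: a race
    (0, 1): (1, 1),  # transient state that moves on to the intended state
    (1, 0): (1, 0),  # stable, but NOT the intended state
    (1, 1): (1, 1),  # intended stable state
}


def settle(state, order):
    """Follow transitions until a stable state, flipping one variable at a time.

    `order` lists the state-variable indices from fastest to slowest.
    """
    while NEXT_STATE[state] != state:
        target = NEXT_STATE[state]
        # The fastest variable that still differs from the target flips first.
        idx = next(i for i in order if state[i] != target[i])
        state = tuple(target[i] if i == idx else state[i]
                      for i in range(len(state)))
    return state


def final_states(start):
    """Set of stable states reachable from `start` over all switching orders."""
    return {settle(start, order) for order in permutations(range(len(start)))}


if __name__ == "__main__":
    print(final_states((0, 0)))  # more than one element means a critical race
    print(final_states((0, 1)))  # a single element means no race from here
```

In this toy table the race from `(0, 0)` is critical because one switching order stops at `(1, 0)` and the other reaches `(1, 1)`; a corrective controller of the kind the abstract mentions would have to steer the machine to the intended state regardless of that order.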

Corrective Control of Asynchronous Sequential Circuits with Faults from Total Ionizing Dose Effects in Space

  • 양정민;곽성우
    • 제어로봇시스템학회논문지 / Vol. 17, No. 11 / pp. 1125-1131 / 2011
  • This paper presents a control-theoretic approach to realizing fault tolerance in asynchronous sequential circuits. The asynchronous circuit considered is assumed to operate in a space environment and is subject to faults caused by total ionizing dose (TID) effects. In our setting, TID effects cause permanent changes in the state transition characteristics of the asynchronous circuit. Under a certain condition of reachability redundancy, it is possible to design a corrective controller so that the closed-loop system maintains normal behavior despite occurrences of TID faults. As a case study, the proposed control scheme is applied to an asynchronous arbiter implemented on an FPGA.

An Adaptive Checkpointing Scheme for Fault Tolerance of Real-Time Control Systems

  • 류상문
    • 제어로봇시스템학회논문지 / Vol. 15, No. 6 / pp. 598-603 / 2009
  • Checkpointing is a well-known technique for coping with transient faults in digital systems. This paper proposes an adaptive checkpointing scheme for improving the reliability of real-time control systems. The proposed scheme builds on previous work on the reliability problem of an equidistant checkpointing scheme. To derive the adaptive scheme, conditions are introduced that must be satisfied for an equidistant checkpointing scheme to improve reliability. Numerical data show that the proposed adaptive scheme outperforms the equidistant scheme in terms of reliability.

Advanced Method to Improve the Transparency for Fault-Tolerance in Distributed System

  • 김분희
    • 한국콘텐츠학회:학술대회논문집 / 한국콘텐츠학회 2006년도 추계 종합학술대회 논문집 / pp. 609-611 / 2006
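  • Fault-tolerance techniques, which keep a distributed system operating normally even when a fault or error occurs in any of its components, help increase the usefulness of that system. Among the fault-tolerance techniques for distributed systems, this study adopts the redundancy-based approach, which is strong with respect to timing constraints. The application server, a component of this approach, is handled differently depending on whether its state is deterministic or nondeterministic. Of these handling methods, SAR (Semi-Active Replication) has been shown to be efficient in terms of resource utilization. This paper proposes an infrastructure for addressing the drawbacks of SAR, namely response-time delay and client-side fault tolerance.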

Linux-based ARINC 653 Health Monitor

  • 윤영일;조현우;김형신
    • 대한임베디드공학회논문지 / Vol. 9, No. 3 / pp. 183-191 / 2014
  • Software running on avionic systems is required to be highly reliable and productive. The air transport industry has developed ARINC Specification 653 (ARINC 653) as a standardized software requirement for avionics computers. The document specifies the interface boundary between avionics application software and the core executive software. Dependability in ARINC 653 is provided by spatial and temporal partitioning, while fault tolerance is provided by a health monitoring mechanism. Legacy real-time operating systems are used to support the ARINC 653 health monitor on integrated modular avionics (IMA). However, legacy real-time operating systems are costly, and their kernels are difficult to modify. In this paper, we propose a Linux-based ARINC 653 health monitor. The functionality needed to support the ARINC 653 health monitor is implemented as a Linux kernel module, and its performance is evaluated.

Double-Input DC-DC Converter for Applications with Wide-Input-Voltage-Ranges

  • Hu, Renjun;Zeng, Jun;Liu, Junfeng;Yang, Jinming
    • Journal of Power Electronics / Vol. 18, No. 6 / pp. 1619-1626 / 2018
  • The output power of most renewable energy generation facilities is unstable due to external environmental conditions. In distributed power systems with two or more sources, a stable output can be achieved by complementary power supply among the different input sources. In this paper, a double-input DC-DC converter with a wide input voltage range is proposed for renewable energy generation. This converter has the following advantages: the circuit is simple, the input voltage range is wide, and the fault tolerance is excellent. The operation modes are examined and a steady-state analysis is carried out. Finally, experimental results are presented to verify the correctness of the analysis and the feasibility of the proposed converter.

An Adaptive Fault Tolerant and QoS-Enabled Middleware Support in Distributed Systems

  • 조바니 카가라반;김석수
    • 한국산학기술학회:학술대회논문집 / 한국산학기술학회 2009년도 추계학술발표논문집 / pp. 461-465 / 2009
  • Normally, a distributed computing environment is flexible in controlling complex embedded systems, but the software components are becoming complex as these systems are equipped with several platforms and attached to various electronic devices, sensors, and actuators. These systems require inter-object communication mechanisms to provide fault-tolerant and QoS-enabled middleware service support in a distributed system. Generally, middleware analyzes parameters to ensure the availability and reliability of data dissemination. This paper focuses in particular on designing application middleware for a specific scenario to improve the availability and fault tolerance of data, thus improving the QoS (Quality of Service) of a distributed system. The performance of an adaptive and highly reliable middleware can depend significantly on the selection of vital system parameters.

Analysis of Criteria for Selecting Load Redistribution Algorithm for Fault-Tolerant Distributed System

  • 최병갑
    • 한국시뮬레이션학회논문지 / Vol. 3, No. 1 / pp. 89-98 / 1994
  • In this paper, criteria for selecting an appropriate load redistribution algorithm are devised so that a fault-tolerant distributed system can operate at its optimal efficiency. To present guidelines for selecting redistribution algorithms, simulation models of fault-tolerant systems including redistribution algorithms are developed using SLAM II. The job arrival rate, service rate, failure and repair rates of nodes, and communication delay due to load migration are used as simulation parameters. The simulation results show that the job arrival rate and the failure rate of nodes are not deciding factors in the relative efficiency of the algorithms. Algorithm B shows relatively consistent performance under various environments, although its performance lies between those of the other algorithms. If the communication delay is longer than the average job processing time, algorithm B performs better than the others. If the repair rate is relatively small or the communication delay is longer than the service time, algorithm A performs well. In the opposite environments, algorithm C is superior to the other algorithms.

Foundation Techniques and Cooperation Test of Fault-tolerant Domain Name Servers for Internet Name Resolution

  • 최재원
    • 한국정보통신학회논문지 / Vol. 15, No. 1 / pp. 41-50 / 2011
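  • DNS (Domain Name System) is the collective term for the name resolution mechanism that converts the domain address of a host on the Internet to an IP address, or an IP address to a domain address. This paper studies the construction of a fault-tolerant DNS system in which a secondary DNS server can take over and continue service even if the primary DNS server stops because of an error.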
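The takeover behavior described in this abstract can be sketched from the client's point of view in Python. This is only an illustration under stated assumptions, not the paper's server setup: `primary`, `secondary`, and `RECORDS` are hypothetical in-memory stand-ins for real DNS servers, and no network traffic is involved.

```python
# Hypothetical zone data shared by both stand-in servers (an assumption).
RECORDS = {"www.example.com": "192.0.2.10"}


def primary(name):
    """Stand-in for a primary DNS server that has stopped due to an error."""
    raise ConnectionError("primary DNS server is down")


def secondary(name):
    """Stand-in for a secondary DNS server holding a copy of the zone data."""
    return RECORDS[name]


def resolve_with_failover(name, servers):
    """Query each server in order and return the first successful answer."""
    for server in servers:
        try:
            return server(name)
        except ConnectionError:
            continue  # this server is unreachable, so try the next one
    raise ConnectionError(f"all {len(servers)} servers failed for {name!r}")


if __name__ == "__main__":
    print(resolve_with_failover("www.example.com", [primary, secondary]))
```

The sketch covers only the ordered fallback from the primary to the secondary; keeping the secondary's records in step with the primary is outside its scope.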

The Performance Comparison of Low-Overhead Fault Tolerant Services based on Distributed Object

  • 김식;현무용
    • 정보학연구 / Vol. 9, No. 4 / pp. 25-34 / 2006
  • As application programs become more sophisticated and adopt distributed object technology, object-based distributed design has become widespread because it supports portability and reusability. Approaches to fault-tolerant distributed computing with add-on fault-tolerance facilities are categorized into the active replica mechanism for mission-critical application programs and the passive replica mechanism for non-mission-critical ones. This paper surveys the advantages and drawbacks of several approaches to add-on low-overhead fault-tolerant services and presents experimental results on benchmark models to demonstrate their performance.
