• 제목/요약/키워드: Fault-Tolerant Computer

검색결과 193건 처리시간 0.032초

Dynamic Redundancy-based Fault-Recovery Scheme for Reliable CGRA-based Multi-Core Architecture

  • Kim, Yoonjin;Sohn, Seungyeon
    • JSTS:Journal of Semiconductor Technology and Science
    • /
    • 제15권6호
    • /
    • pp.615-628
    • /
    • 2015
  • CGRA (Coarse-Grained Reconfigurable Architecture) based multi-core architecture can be considered as a suitable solution for the fault-tolerant computing. However, there have been a few research projects based on fault-tolerant CGRA without exploiting the strengths of CGRA as well as their works are limited to single CGRA. Therefore, in this paper, we propose two approaches to enable exploiting the inherent redundancy and reconfigurability of the multi-CGRA for fault-recovery. One is a resilient inter-CGRA fabric that is ring-based sharing fabric (RSF) with minimal interconnection overhead. Another is a novel intra/inter-CGRA reconfiguration technique on RSF for maximizing utilization of the resources when faults occur. Experimental results show that the proposed approaches achieve up to 94% faulty recoverability with reducing area/delay/power by up to 15%/28.6%/31% when compared with completely connected fabric (CCF).

임무지향 컴퓨터를 위한 메시지패싱 고장감내 기법 (A Fault-Tolerant Scheme Based on Message Passing for Mission-Critical Computers)

  • 김태현;배정일;신진범;조길석
    • 한국군사과학기술학회지
    • /
    • 제18권6호
    • /
    • pp.762-770
    • /
    • 2015
  • Fault tolerance is a crucial design for a mission-critical computer such as engagement control computer that has to maintain its operation for long mission time. In recent years, software fault-tolerant design is becoming important in terms of cost-effectiveness and high-efficiency. In this paper, we propose MPCMCC which is a model-based software component to implement fault tolerance in mission-critical computers. MPCMCC is a fault tolerance design that synchronizes shared data between two computers by using the one-way message-passing scheme which is easy to use and more stable than the shared memory scheme. In addition, MPCMCC can be easily reused for future work by employing the model based development methodology. We verified the functions of the software component and analyzed its performance in the simulation environment by using two mission-critical computers. The results show that MPCMCC is a suitable software component for fault tolerance in mission-critical computers.

확장 ASR 기법을 이용한 임무지향 컴퓨터의 설계 및 신뢰도 분석 (The Design and Reliability Analysis of A Mission-Critical Computer Using Extended Active Sparing Redundancy)

  • 신진범;김상하
    • 정보처리학회논문지A
    • /
    • 제16A권4호
    • /
    • pp.235-244
    • /
    • 2009
  • 대공방어용 임무지향 컴퓨터는 장시간의 대공방어 임무를 성공적으로 완수하기 위하여 고장감내 기능이 필수적으로 요구되며, 고장감내 기법으로는 지상 장비인 임무지향 컴퓨터에 적합한 저비용의 기법이 요구된다. 현재 기본형의 임무지향 컴퓨터에는 저비용의 ASR 고장 감내 기법이 적용되어 있으나, ASR 기법은 TMR 기법을 적용한 컴퓨터에 비해 낮은 임무 신뢰도를 제공한다. 그러므로 본 논문에서는 TMR 기법을 적용한 컴퓨터 보다 적은 수량의 프로세서 보드를 사용하여 우수한 신뢰도를 제공하는 확장형 ASR(EASR) 고장 감내 기법을 제안하였으며, 제안된 EASR 기법 을 적용한 임무지향 컴퓨터의 고장 감내 성능을 입증하였다. EASR 기법의 임무지향 컴퓨터는 고장 감내 성능과 저비용성 측면에서 대공방어 시스템의 컴퓨터에 적합하였다.

Real-time Network Middleware Supporting Fault-Tolerant Operations for Personal Robot System

  • Choo, Seong-Ho;Park, Hong-Seong
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2005년도 ICCAS
    • /
    • pp.756-760
    • /
    • 2005
  • By development of a robot technology, personal robot is being developed very actively. Various infra-technologies are accumulated in hardware and software how by internal a lot of research and development efforts, and it is circumstance that actual commodity is announced. But, personal robot is applied to be acting near human, and takes charge of safety and connected directly a lot of works of home security, gas-leakage, fire-warning facilities, or/and etc. So personal robot must do safe and stable action even if any unexpected accidents are happened, important functions are always operated. In this paper, we are wished to show design structures for supporting fault-tolerant operation from our real-time robot middleware viewpoint. Personal robot, in being developed, was designed by module structure to do to interconnect and to interoperate among each module that is mutually implemented by each research facilities or company. Also, each modules can use appreciate network system that is fit for handling and communicating its data. To guarantee this, we have being developed a real-time network middleware, for especially personal robot. Recent our working is to add and to adjust some functions like connection management, distributed routing mechanism, remote object management, and making platform independent robot application execution environment with self-moving of robot application, for fault-tolerant personal robot.

  • PDF

The Design of Fault Tolerant Dual System and Real Time Fault Detection for Countdown Time Generating System

  • Kim, Jeong-Seok;Han, Yoo-Soo
    • 한국컴퓨터정보학회논문지
    • /
    • 제21권10호
    • /
    • pp.125-133
    • /
    • 2016
  • In this paper, we propose a real-time fault monitoring and dual system design of the countdown time-generating system, which is the main component of the mission control system. The countdown time-generating system produces a countdown signal that is distributed to mission control system devices. The stability of the countdown signal is essential for the main launch-related devices because they perform reserved functions based on the countdown time information received from the countdown time-generating system. Therefore, a reliable and fault-tolerant design is required for the countdown time-generating system. To ensure system reliability, component devices should be redundant and faults should be monitored in real time to manage the device changeover from Active mode to Standby mode upon fault detection. In addition, designing different methods for mode changeover based on fault classification is necessary for appropriate changeover. This study presents a real-time fault monitoring and changeover system, which is based on the dual system design of countdown time-generating devices, as well as experiment on real-time fault monitoring and changeover based on fault inputs.

Fault Tolerant Control of DC-Link Voltage Sensor for Three-Phase AC/DC/AC PWM Converters

  • Kim, Soo-Cheol;Nguyen, Thanh Hai;Lee, Dong-Choon;Lee, Kyo-Beum;Kim, Jang-Mok
    • Journal of Power Electronics
    • /
    • 제14권4호
    • /
    • pp.695-703
    • /
    • 2014
  • In this paper, a fault detection scheme for DC-link voltage sensor and its fault tolerant control strategy for three-phase AC/DC/AC PWM converters are proposed, where the Luenberger observer is applied to estimate the DC-link voltage. The Luenberger observer is based on a converter model, which is derived from the voltage equations of a grid-side converter and the power balance on a DC link. A fault of the voltage sensor is detected by comparing the measured value of the DC-link voltage with the estimated one. When a sensor fault is detected, a fault tolerant control strategy is performed, where the estimated DC-link voltage is used for the feedback control. The estimation error from the observer is about 1.5 V, which is sufficiently accurate for feedback control. In addition, it is shown that the observer performance is robust to parameter variations of the converter. The validity of the proposed method has been verified by simulation and experimental results.

결함 포용 정적 Shuffle-Exchange 네트워크 (Fault Tolerant Static Shuffle-Exchange Network)

  • Choi Hong In
    • 한국정보과학회논문지:시스템및이론
    • /
    • 제30권3_4호
    • /
    • pp.160-167
    • /
    • 2003
  • 정적 shuffle-exchange 네트워크는 여러 응용 알고리듬에 적용되고 현재 많이 사용되는 다중 단계 네트워크에 비해 적은 하드웨어를 사용하는 등 많은 장점이 있으나 아직까지 어떤 병렬처리 컴퓨터에도 채택된 없었다. 그 이유 중에 하나는 결함 내성 기능이 없었기 때문이다. 본 논문에서는 다중 결함 포용 정적 shuffle-exchange network를 소개한다. 본 논문에서 제시되는 결함 포용 정적shuffle-exchange 네트워크는 k 결함을 제어하기 위해서 최소 2k의 추가 처리 요소들과 각 처리 요소들은 최대 4k의 추가 shuffle 링크를 필요로 한다. k 결함 내성을 가진 정적 shuffle-exchange 네트워크를 m개의 동일한 모듈로 분리하여 네트워크의 신뢰성을 증가시키는 것을 보였다.

고장 극복 (Fault Tolerant) Ethernet 구현 방안의 비교 분석 (A Comparison of Fault Tolerant Ethernet Implementation Approaches)

  • 김세목;고윤민;최한석;민정현;;이동호;이종명
    • 한국정보전자통신기술학회논문지
    • /
    • 제1권2호
    • /
    • pp.13-20
    • /
    • 2008
  • 최근 대부분의 첨단 임무 (mission critical) 시스템이 컴퓨터 노드 연결에 의한 Ethernet 네트워크 기반으로 구성되고 있어, 이에 따른 시스템 신뢰성 측면이 더욱 중요한 요소로 대두되고 있다. 본 논문은 첨단 임무 시스템의 신뢰도를 높이기 위하여 일반화되고 있는 고장 극복(Fault Tolerant) Ethernet의 개념을 이중화 관점에서 정립하고, 이를 위한 핵심 사항인 노드 이중화 구현 방식을 비교하여 제시한다. 고장 극복개념을 위한 기본 구성인 Ethernet 이중화를 독립형 이중화와 통합형 이중화로 구분하여 분석하였다. 또한 노드 이중화 구현 방식은 기존의 소프트웨어 접근 방식과 하드웨어 접근 방식은 물론, 최근 본 연구팀의 일부가 제안한 복합 (hybrid) 방식도 함께 비교 대상으로 하였다. 실시간이 요구되는 첨단 임무 시스템의 경우는 하드웨어 접근 방식 또는 복합 방식이 적합하고, Commercial-Off-The-Shelf (COTS) 사용이 요구되는 시스템의 경우는 소프트웨어 접근 방식이나 복합 방식의 선택으로 귀결된다.

  • PDF

발전용 보일러 시스템의 이상허용 및 과도상태의 유연한 제어에 관한 연구 (A Study on the Fault-Tolerant and Bumpless Switching Control for Boiler Systems in the Power Plant)

  • 권오규;이영삼
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 1998년도 추계학술대회 논문집 학회본부 C
    • /
    • pp.1037-1040
    • /
    • 1998
  • In this research a fault-tolerant and bumpless switching control is proposed for boiler systems used in the power plants. Firstly, three operating points are selected to control the nonlinear boiler through the full operational range, and the $H_{\infty}$ loop shaping controller and the model-based predictive controller(MBPC) are designed. To prevent the windup and bump problems which are caused by the actuator saturation and the controller switching, an anti-windup and bumpless transfer technique is adopted to the $H_{\infty}$ loop shaping controller. Also the constrained gain-scheduling technique is applied to MBPC to achieve the same objective. Secondly, the fault-tolerant control technique is proposed to continue the control action without stopping the boiler operation even in case of some faults. Through various simulation studies, the performances of the proposed control techniques are demonstrated.

  • PDF

발전용 보일러 시스템의 이상허용 및 과도상태의 유연한 제어에 관한 연구 (A Study on the Fault-Tolerant and Bumpless Switching Control for Boiler Systems in the Power Plant)

  • 권오규;이영삼
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 1998년도 추계학술대회 논문집 학회본부 B
    • /
    • pp.689-692
    • /
    • 1998
  • In this research a fault-tolerant and bumpless switching control is proposed for boiler systems used in the power plants. Firstly, three operating points are selected to control the nonlinear boiler through the full operational range, and the $H_{\infty}$ loop shaping controller and the model-based predictive controller(MBPC) are designed. To prevent the windup and bump problems which are caused by the actuator saturation and the controller switching, an anti-windup and bump less transfer technique is adopted to the $H_{\infty}$ loop shaping controller. Also the constrained gain-scheduling technique is applied to MBPC to achieve the same objective. Secondly, the fault-tolerant control technique is proposed to continue the control action without stopping the boiler operation even in case of some faults. Through various simulation studies, the performances of the proposed control techniques are demonstrated.

  • PDF