• Title/Summary/Keyword: Fault tolerance

Search Result 571, Processing Time 0.028 seconds

A New Artificial Immune System Based on the Principle of Antibody Diversity And Antigen Presenting Cell (Antibody Diversity 원리와 Antigen Presenting Cell을 구현한 새로운 인공 면역 시스템)

  • 이상형;김은태;박민용
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.41 no.4
    • /
    • pp.51-58
    • /
    • 2004
  • This paper proposes a new artificial immune approach to on-line hardware test which is the most indispensable technique for fault tolerant hardware. A novel algorithm of generating tolerance conditions is suggested based on the principle of the antibody diversity. Tolerance conditions in artificial immune system correspond to the antibody in biological immune system. In addition, antigen presenting cell (APC) is realized by Quine-McCluskey method in this algorithm and tolerance conditions are generated through GA (Genetic Algorithm). The suggested method is applied to the on-line monitoring of a typical FSM (a decade counter) and its effectiveness is demonstrated by the computer simulation.

Multi-Agent System for Fault Tolerance in Wireless Sensor Networks

  • Lee, HwaMin;Min, Se Dong;Choi, Min-Hyung;Lee, DaeWon
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.10 no.3
    • /
    • pp.1321-1332
    • /
    • 2016
  • Wireless sensor networks (WSN) are self-organized networks that typically consist of thousands of low-cost, low-powered sensor nodes. The reliability and availability of WSNs can be affected by faults, including those from radio interference, battery exhaustion, hardware and software failures, communication link errors, malicious attacks, and so on. Thus, we propose a novel multi-agent fault tolerant system for wireless sensor networks. Since a major requirement of WSNs is to reduce energy consumption, we use multi-agent and mobile agent configurations to manage WSNs that provide energy-efficient services. Mobile agent architecture have inherent advantages in that they provide energy awareness, scalability, reliability, and extensibility. Our multi-agent system consists of a resource manager, a fault tolerance manager and a load balancing manager, and we also propose fault-tolerant protocols that use multi-agent and mobile agent setups.

Fault Tolerance Operation and Characteristics Analysis of Asymmetric Six-phase Permanent Magnet Synchronous Motor According to Switch Open (비대칭 6상 영구자석 동기 전동기의 스위치 개방에 따른 특성 분석 및 고장허용운전)

  • Jun, So-Young;Hwang, Seon-Hwan;Park, Jong-Won
    • Journal of the Korean Society of Industry Convergence
    • /
    • v.25 no.6_2
    • /
    • pp.1003-1008
    • /
    • 2022
  • This paper proposes a method related to fault tolerance operation and characteristic analysis of asymmetric 6-phase permanent magnet synchronous motor. In general, motor drive systems with multi-phase structures can be continuously operated despite a reduction of power and speed by using a phase changeover or control techniques according to the failures. As a result, it is widely used in industrial fields such as aviation and defense, which require high efficiency and high reliability. In this paper, the second order ripple of the electrical fundamental freuqnecy occurs in the dq-axis currents of the synchronous coordinate system through mathematical analysis according to the switch open of the dual 3-phase inverter. Therefore, the fault tolerant operation method is presented by applying the fault detection method with a constant cycle for continuous operations. The effectiveness of the proposed fault tolerance operation method is verified through the several experiments.

Garbage Collection Protocol of Fault Tolerance Information in Multi-agent Environments (멀티에이전트 환경에서 결함 포용 정보의 쓰레기 처리 기법)

  • 이대원;정광식;이화민;신상철;이영준;유헌창;이원규
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.31 no.3_4
    • /
    • pp.204-212
    • /
    • 2004
  • Existing distributed systems have higher probability of failures occurrence than stand-alone system, so many fault tolerant techniques have been developed. Because of insufficient storage resulting from the increased fault tolerance information stored, the performance of system has been degraded. To avoid performance degradation, it needs delete useless fault tolerance information. In this paper, we propose a garbage collection algorithm for fault tolerance information. And we define and design the garbage collection agent for garbage collection of fault tolerance information, the information agent for management of fault tolerant data, and the facilitator agent for communication between agents. Also, we propose the garbage collection algorithm using the garbage collection agent. For rollback recovery, we use independent checkpointing protocol and sender based pessimistic message logging protocol. In our proposed garbage collection algorithm, the garbage collection, information, and facilitator agent is created with process, and the information agent constructs domain knowledge with its checkpoints and non-determistic events. And the garbage collection agent decides garbage collection time, and it deletes useless fault tolerance information in cooperation with the information and facilitator agent. For propriety of proposed garbage collection technique using agents, we compare domain knowledge of system that performs garbage collection after rollback recovery and domain knowledge of system that doesn't perform garbage collection.

Fault-Free Process for IT System with TRM(Technical Reference Model) based Fault Check Point and Event Rule Engine (기술분류체계 기반의 장애 점검포인트와 이벤트 룰엔진을 적용한 무장애체계 구현)

  • Hyun, Byeong-Tag;Kim, Tae-Woo;Um, Chang-Sup;Seo, Jong-Hyen
    • Information Systems Review
    • /
    • v.12 no.3
    • /
    • pp.1-17
    • /
    • 2010
  • IT Systems based on Global Single Instance (GSI) can manage a corporation's internal information, resources and assets effectively and raise business efficiency through consolidation of their business process and productivity. But, It has also dangerous factor that IT system fault failure can cause a state of paralysis of a business itself, followed by huge loss of money. Many of studies have been conducted about fault-tolerance based on using redundant component. The concept of fault tolerance is rather simple but, designing and adopting fault-tolerance system is not easy due to uncertainty of a type and frequency of faults. So, Operational fault management that working after developed IT system is important more and more along with technical fault management. This study proposes the fault management process that including a pre-estimation method using TRM (Technical Reference Model) check point and event rule engine. And also proposes a effect of fault-free process through built fault management system to representative company of Hi-tech industry. After adopting fault-free process, a number of failure decreased by 46%, a failure time decreased by 56% and the Opportunity loss costs decreased by 77%.

A Fault-Tolerant Duplex Microcontroller Architecture (결함내성형 이중 마이크로콘트롤러 구조)

  • Kim, Byung-Jin;Baek, Seung-Soo;Lee, In-Hwan;Lim, Dong-Jin
    • The Transactions of the Korean Institute of Electrical Engineers D
    • /
    • v.51 no.4
    • /
    • pp.144-151
    • /
    • 2002
  • This paper presents a fault-tolerant duplex architecture to build a high-reliability microcontroller using commercial VLSI processors. The architecture supports fail-silence under all single-failure situations and facilitates recovery from transient failures. The paper implements the duplex architecture using two Motorola MC68360 processors and evaluates its fault tolerance in a real application environment.

HM System Design for Fault Tolerance on the IMA System (IMA 시스템에서의 Fault Tolerance를 위한 HM 시스템 설계)

  • Ko, Young-Kwan;Lee, Seung-Hoon;Park, Se-Young;Ban, Chang-Bong;Kang, Dai-Il;Jeong, Jae-Yeop;Lee, Cheol-Hoon
    • The Journal of the Korea Contents Association
    • /
    • v.12 no.8
    • /
    • pp.77-86
    • /
    • 2012
  • Avionics system are composed of multiple electronic device that performs important missions the number of electronic devices for the performance of aircraft has been gradually increasing. As a result, cost of development, maintenance and operating have increased. To solve this problem, technology paradigm of avionics has been shifting from federated avionics systems that manage to each system independently to IMA(Integrated Modular Avionics) systems. Unlike federated systems, fault tolerance becomes an essential technology in IMA systems. Because each aircraft features integrated in the an IMA system, a fault can jeopardize the entire system. In this paper, we define faults which can occur on the ARINC 653 based IMA system first, and design the ARINC 653 compliant HM(health Monitoring) system for the system can continue to operate be normal when occur a fault.

Reliability Analysis for Train Control System by Software Fault Tolerance Techniques (소프트웨어 결함허용 기법에 의한 열차제어시스템 신뢰도 분석)

  • Suh, Seog-Chul;Lee, Jong-Woo
    • Journal of the Korean Society for Railway
    • /
    • v.12 no.6
    • /
    • pp.1043-1048
    • /
    • 2009
  • PES (Programmable Electronic System) is used by software development for the train control system. PES has been widely used in real world and consists of hardware, firmware and application software. The PES are easily apply to many applications because its implementation has high flexibility. Many safety critical functions are realized through software in safety critical system. Normally, it is difficult to detect failures for PES system because the PES is too sophisticated to identify sources of the failure. So, the reliability analysis is needed by using software fault tolerance techniques. Currently, there are the recovery block, distributed recovery block, N-version programming, N self-checking programming in fault tolerance techniques. In this paper, the models of recovery block and N-version programming in software fault tolerance techniques are suggested by using the Markov model. Also, the reliability in the train control system is analyzed through changing time. The fault occupancy rates of the program, adjustment test and voter are stationary. So, the relation between time and reliability is presented by using Matlab program. In the result of reliability, the reliability of recovery block is more high than N-version programming in case of the same number of substitution block.

A Fault Tolerance Mechanism with Dynamic Detection Period in Multiple Gigabit Server NICs (다중 Gigabit Server NICs에서 동적 검출 주기를 적용한 결함 허용 메커니즘)

  • 이진영;이시진
    • Journal of Internet Computing and Services
    • /
    • v.3 no.5
    • /
    • pp.31-39
    • /
    • 2002
  • A rapid growth of internet and sudden increase of multimedia data demands for high-speed transfer media and if optimizec usage from the interface system. To achieve this level of network bandwidth, multiple NICs for support of high-speed network bandwidth have been developed and studied. Furthermore, the use of multiple NICs can provide high-speed LAN environment without large network environment modification, supports backward compatibility of current system and reduce overhead. However. if system failure is caused by SPOF(Single Point of Failure) fault of large-capacity multiple NICs, incredible loss will be met because it services large capacity of multimedia data, Therefore, to prevent loss coming from faults, we describe 'Fault tolerance of multiple NICs', which use the fault prevention mechanism. Considering inefficiency of availability and serviceability that is occurred with existing TMR, Primary-Standby approach and Watchdog time mechanism, we propose and design the efficient fault tolerance mechanism, which minimize down time as changing of detection period dynamically. Consequently, the fault tolerance mechanism proposed for reducing overhead time when the fault is occurred, should minimize system downtime overall.

  • PDF

Enhanced Cube Network for the High Reliability (고 신뢰성 큐브 네트웍)

  • Mun Youngsong
    • Journal of Internet Computing and Services
    • /
    • v.4 no.6
    • /
    • pp.25-31
    • /
    • 2003
  • Multistage Interconnection networks (MIN) for the high performance computing and communications must be efficient and reliable. While a number of fault tolerance schemes have been developed, some of them are not efficient enough with respect to all evaluation measures or overheads of others are too significant. In this paper we develop a new efficient fault tolerant MIN which displays high reliability and fault tolerance capability using a simple structure. Structure and reliabilities of Enhanced Cube Network are evaluated and compared with previous designs to show the effectiveness of new design.

  • PDF