• Title/Summary/Keyword: Software fault tolerance

Search Result 89, Processing Time 0.026 seconds

A Study on Software Based Fault-Tolerance Techniques for Flight Control Computer (비행조종컴퓨터 소프트웨어 기반 고장허용 설계 기법 연구)

  • Yoon, Hyung-Sik;Kim, Yeon-Gyun
    • Journal of the Korean Society for Aeronautical & Space Sciences
    • /
    • v.44 no.3
    • /
    • pp.256-265
    • /
    • 2016
  • Software based fault tolerance techniques are designed to allow a system to tolerate software faults in the system. Fault tolerance techniques are divided into two groups : software based fault tolerance techniques and hardware based fault tolerance techniques. We need a proper design method according to characteristics of the system. In this paper, the concepts of software based fault tolerance techniques for Dual Flight Control Computer are described. For software based fault tolerance design, we classified software failure, designed a way for failure detection and the way of recovery. Eventually the effectiveness of software based fault tolerance techniques was verified through the Software Test Environment(STE).

Fault Location Identification Using Software Fault Tolerance Technique (소프트웨어 Fault Tolerance를 이용한 고장점 표정)

  • Kim Wonha;Jang Yong-Won;Han Seung-Soo
    • The Transactions of the Korean Institute of Electrical Engineers A
    • /
    • v.54 no.2
    • /
    • pp.73-78
    • /
    • 2005
  • The management of technological systems will become increasingly complex. Safe and reliable software operation is a significant requirement for many types of system. So, with software fault tolerance, we want to prevent failures by tolerating faults whose occurrences are known when errors are detected. This paper presents a fault location algorithm for single-phase-to-ground faults on the teed circuit of a parallel transmission line using software fault tolerance technique. To find the fault location of transmission line, we have to solve the 3rd order transmission line equation. A significant improvement in the identification of the fault location was accomplished using the N-Version Programming (NVP) design paradigm. The delivered new algorithm has been tested with the simulation data obtained from the versatile EMTP simulator.

Fault Location Algorithm using Software Fault Tolerance (Software Fault Tolerance를 이용한 송전선로의 고장점 표정 알고리즘)

  • Jang, Yong-Won;Han, Seung-Su;Kim, Won-Ha
    • Proceedings of the KIEE Conference
    • /
    • 2003.11c
    • /
    • pp.875-877
    • /
    • 2003
  • This paper use fault location algorithm for single-phase-to-ground faults on the teed circuit of a parallel transmission line that use only local end voltage and current information. When Newton-Raphson iteration method is used, the Initial value may cause error or cause not suitable result. Suggested new calculation model uses NVP methodology, which is one of the fault tolerance technology to solve this problem. EMTP simulation result has shown effectiveness of the algorithm under various conditions.

  • PDF

A Fault-Tolerant Scheme Based on Message Passing for Mission-Critical Computers (임무지향 컴퓨터를 위한 메시지패싱 고장감내 기법)

  • Kim, Taehyon;Bae, Jungil;Shin, Jinbeom;Cho, Kilseok
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • v.18 no.6
    • /
    • pp.762-770
    • /
    • 2015
  • Fault tolerance is a crucial design for a mission-critical computer such as engagement control computer that has to maintain its operation for long mission time. In recent years, software fault-tolerant design is becoming important in terms of cost-effectiveness and high-efficiency. In this paper, we propose MPCMCC which is a model-based software component to implement fault tolerance in mission-critical computers. MPCMCC is a fault tolerance design that synchronizes shared data between two computers by using the one-way message-passing scheme which is easy to use and more stable than the shared memory scheme. In addition, MPCMCC can be easily reused for future work by employing the model based development methodology. We verified the functions of the software component and analyzed its performance in the simulation environment by using two mission-critical computers. The results show that MPCMCC is a suitable software component for fault tolerance in mission-critical computers.

Optimal Software Release Using Time and Cost Benefits via Fuzzy Multi-Criteria and Fault Tolerance

  • Srivastava, Praveen Ranjan
    • Journal of Information Processing Systems
    • /
    • v.8 no.1
    • /
    • pp.21-54
    • /
    • 2012
  • As we know every software development process is pretty large and consists of different modules. This raises the idea of prioritizing different software modules so that important modules can be tested by preference. In the software testing process, it is not possible to test each and every module regressively, which is due to time and cost constraints. To deal with these constraints, this paper proposes an approach that is based on the fuzzy multi-criteria approach for prioritizing several software modules and calculates optimal time and cost for software testing by using fuzzy logic and the fault tolerance approach.

Reliability Analysis for Train Control System by Software Fault Tolerance Techniques (소프트웨어 결함허용 기법에 의한 열차제어시스템 신뢰도 분석)

  • Suh, Seog-Chul;Lee, Jong-Woo
    • Journal of the Korean Society for Railway
    • /
    • v.12 no.6
    • /
    • pp.1043-1048
    • /
    • 2009
  • PES (Programmable Electronic System) is used by software development for the train control system. PES has been widely used in real world and consists of hardware, firmware and application software. The PES are easily apply to many applications because its implementation has high flexibility. Many safety critical functions are realized through software in safety critical system. Normally, it is difficult to detect failures for PES system because the PES is too sophisticated to identify sources of the failure. So, the reliability analysis is needed by using software fault tolerance techniques. Currently, there are the recovery block, distributed recovery block, N-version programming, N self-checking programming in fault tolerance techniques. In this paper, the models of recovery block and N-version programming in software fault tolerance techniques are suggested by using the Markov model. Also, the reliability in the train control system is analyzed through changing time. The fault occupancy rates of the program, adjustment test and voter are stationary. So, the relation between time and reliability is presented by using Matlab program. In the result of reliability, the reliability of recovery block is more high than N-version programming in case of the same number of substitution block.

OPRoS based Fault Tolerance Support for Reliability of Service Robots (서비스로봇의 신뢰성 향상을 위한 OPRoS 기반 Fault-tolerance 기법)

  • Ahn, Hee-June;Lee, Dong-Su;Ahn, Sang-Chul
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.16 no.6
    • /
    • pp.601-607
    • /
    • 2010
  • For commercial success of emerging service robots, the fault tolerant technology for system reliability and human safety is crucial. Traditionally fault tolerance methods have been implemented in application level. However, from our studies on the common design patterns in fault tolerance, we argue that a framework-based approach provides many benefits in providing reliability for system development. To demonstrate the benefits, we build a framework-based fault tolerant engine for OPRoS (Open Platform for Robotic Services) standards. The fault manager in framework provides a set of fault tolerant measures of detection, isolation, and recovery. The system integrators choose the appropriate fault handling tools by declaring XML configuration descriptors, considering the constraints of components and operating environment. By building a fault tolerant navigation application from the non-faulttolerant components, we demonstrate the usability and benefits of the proposed framework-based approach.

Fault tolerant architecture for Module-based personal robot (모듈기반 퍼스널 로봇을 위한 Fault Tolerant 구조)

  • Baek, Bum-Hyun;Jee, Dong-Jun;Park, Chan-Jung;Han, Ki-Sung;Lee, Sun-Young;Kim, Kyeong-Tae;Hong, Sung-Won;Park, Hong-Seong
    • Proceedings of the KIEE Conference
    • /
    • 2006.04a
    • /
    • pp.297-299
    • /
    • 2006
  • Many distributed applications is developed in various environment such as operating system, software platform. So, they exhibit different types of system behavior, status, during the course of their operation. Each such behavior may have different functional and non-functional requirements. Therefore, many distributed application need to fault-tolerance solution. Personal robot provide various service or application. Because personal robot has many application or service, it need to fault-tolerance architecture. A flexible architecture is required to provide dependability. In this paper, it is suggested a fault-tolerant architecture for module-based personal robot with module fault-tolerance, service fault-tolerance.

  • PDF

System Reliability (시스템 信賴性)

  • 김동주
    • The Magazine of the IEIE
    • /
    • v.5 no.1
    • /
    • pp.31-37
    • /
    • 1978
  • The value of a system is highly dependent upon its reliability, Reliability means not merely correctness but means fault tolerance of the system. This paper emphasizes software fault tolerance in design stage especially in case of computer controlled system. The general method of fault tolerance design especially including dual computer system and its advantage and disadvantage was introduced. Finally for example of fault tolerance design we would like to present our GTK-500 EPABX.

  • PDF

Multi-Agent System for Fault Tolerance in Wireless Sensor Networks

  • Lee, HwaMin;Min, Se Dong;Choi, Min-Hyung;Lee, DaeWon
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.10 no.3
    • /
    • pp.1321-1332
    • /
    • 2016
  • Wireless sensor networks (WSN) are self-organized networks that typically consist of thousands of low-cost, low-powered sensor nodes. The reliability and availability of WSNs can be affected by faults, including those from radio interference, battery exhaustion, hardware and software failures, communication link errors, malicious attacks, and so on. Thus, we propose a novel multi-agent fault tolerant system for wireless sensor networks. Since a major requirement of WSNs is to reduce energy consumption, we use multi-agent and mobile agent configurations to manage WSNs that provide energy-efficient services. Mobile agent architecture have inherent advantages in that they provide energy awareness, scalability, reliability, and extensibility. Our multi-agent system consists of a resource manager, a fault tolerance manager and a load balancing manager, and we also propose fault-tolerant protocols that use multi-agent and mobile agent setups.