• Title/Summary/Keyword: Fault Tolerance System

Implementation and Performance Analysis of a Fault-tolerant Mini-MAP System (결함 허용 Mini-MAP 시스템의 구현 및 성능해석)

  • 문홍주;박홍성;권욱현
    • Journal of the Korean Institute of Telematics and Electronics B / v.32B no.3 / pp.1-10 / 1995
  • In this paper, a fault-tolerant Mini-MAP system with high reliability is proposed. For fault tolerance, the LLC sublayer, MAC sublayer, and physical layer of the Mini-MAP system are dualized. Fault detection, replacement of the failed network, and network management are the three major functions required for the dualization; they are performed by the ESM (Error Supervisory Machine), EMM (Error Management Machine), and NMM (Network Management Machine) of the proposed system, respectively. The ring maintenance function of the MAC sublayer is used to detect faults. In the proposed system, data are received from both of the dualized networks and transmitted on the selected one of the two. We analyze the reliability and the MTTF (Mean Time To Failure) of the proposed fault-tolerant Mini-MAP system and show that it outperforms a general Mini-MAP system.
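
As a rough illustration of why a dualized network can improve reliability and MTTF, the sketch below compares a single network with a two-network parallel arrangement. It is not the paper's model: it assumes a constant failure rate `lam` (exponential lifetimes), independent failures, and perfect fault detection and switchover, and all function names are hypothetical.

```python
import math


def simplex_reliability(lam: float, t: float) -> float:
    """Reliability of one network at time t, assuming a constant failure rate lam."""
    return math.exp(-lam * t)


def duplex_reliability(lam: float, t: float) -> float:
    """Reliability of two independent networks in parallel.

    Assumption: the system survives as long as at least one network works,
    and switchover is perfect (no coverage loss).
    """
    r = simplex_reliability(lam, t)
    return 1.0 - (1.0 - r) ** 2


def simplex_mttf(lam: float) -> float:
    """MTTF of one network: integral of exp(-lam * t) over t >= 0."""
    return 1.0 / lam


def duplex_mttf(lam: float) -> float:
    """MTTF of the parallel pair: integral of 2*exp(-lam*t) - exp(-2*lam*t)."""
    return 1.5 / lam
```

Under these idealized assumptions the dualized system has 1.5 times the MTTF of a single network; imperfect fault detection would reduce that gain.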

Design of Scheduler Considering Real-Time Characteristic and Fault-Tolerant in Embedded System (임베디드 시스템에서 실시간성과 결함허용을 보장하는 스케줄러 설계)

  • Jeon, Tae-Gun;Kim, Chang-Soo
    • Journal of Korea Multimedia Society / v.14 no.1 / pp.76-84 / 2011
  • Embedded systems must guarantee real-time task response, with requirements that depend on the application field. In real-time systems, a task can also become faulty for various reasons. In this paper, we therefore design a task scheduler for single-processor embedded systems that guarantees the deadlines of periodic tasks and provides fault tolerance for faulty tasks. To provide real-time behavior, we classify tasks as periodic or aperiodic, schedule periodic tasks with RMS (Rate Monotonic Scheduling), and guarantee the execution of aperiodic tasks by managing the surplus time obtained from analyzing the execution times of the periodic tasks. To provide fault tolerance, we manage backup time and re-execute a faulty task to restore its state.
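
The RMS step mentioned in the abstract can be illustrated with the classical Liu and Layland utilization bound. This is a minimal sketch, not the paper's scheduler: tasks are assumed to be `(execution_time, period)` pairs with deadlines equal to periods, the test is sufficient but not necessary, and all function names are hypothetical.

```python
from typing import List, Tuple

Task = Tuple[float, float]  # (worst-case execution time, period)


def total_utilization(tasks: List[Task]) -> float:
    """Sum of C_i / T_i over all periodic tasks."""
    return sum(c / t for c, t in tasks)


def rms_utilization_bound(n: int) -> float:
    """Liu and Layland bound n * (2^(1/n) - 1) for n periodic tasks."""
    return n * (2 ** (1.0 / n) - 1)


def rms_schedulable(tasks: List[Task]) -> bool:
    """Sufficient (not necessary) RMS schedulability test."""
    if not tasks:
        return True
    return total_utilization(tasks) <= rms_utilization_bound(len(tasks))


def rms_priority_order(tasks: List[Task]) -> List[Task]:
    """Rate monotonic priority: the shorter the period, the higher the priority."""
    return sorted(tasks, key=lambda task: task[1])


def surplus_fraction(tasks: List[Task]) -> float:
    """Fraction of processor time left over after the periodic tasks."""
    return 1.0 - total_utilization(tasks)
```

For example, the task set `[(1, 4), (1, 5), (2, 10)]` has utilization 0.65, which is below the three-task bound of about 0.78, so it passes the test. The remaining 35% of processor time is the kind of surplus that could, under the assumptions above, be budgeted for aperiodic tasks or for re-executing a faulty task.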

A Study on System's Reliability Evaluation Using DFT Algorithm (동적 결함 트리 (Dynamic Fault Tree) 알고리즘을 이용한 시스템의 신뢰도 평가에 관한 연구)

  • 김진수;양성현;이기서
    • Proceedings of the KSR Conference / 1998.11a / pp.280-287 / 1998
  • In this paper, a Dynamic Fault Tree (DFT) algorithm is presented. The algorithm provides a concise representation of dynamic fault-tolerant system structures with redundancy, dynamic redundancy management, and complex fault and error recovery techniques. It allows the modeler to define a dynamic fault tree model that captures both the system structure and its dynamic behavior, combining the relative advantages of fault tree and Markov models. The algorithm is applied to TMR and Dual-Duplex systems with dynamic behavior, and the results show that it captures the dynamic behavior of these systems, including fault and error recovery techniques, sequence-dependent failures, and the use of dynamic spares. For analyzing such systems, the DFT algorithm is more effective than the Markov and fault tree analysis models.

Design of a Fault-tolerant Embedded Controller for Railway Signaling Systems

  • Cho, Yong-Gee;Lim, Jae-Sik
    • Institute of Control, Robotics and Systems: Conference Proceedings / 2002.10a / pp.68.4-68 / 2002
  • This report presents an implementation of a set of reusable software components for a fault-tolerant embedded controller for railway signalling systems. These components can be used in real-time applications without reprogramming the application. The library runs under the VxWorks operating system and is oriented toward real-time embedded systems. It includes fault detection, fault containment, checkpointing, and recovery components. The library supports high-speed response to faults occurring in application software. A garbage collector, together with the VxWorks watchdog, detects dead tasks and removes useless resources to avoid overflow. Control flow...

Realtime Monitoring System using AJAX + XML (AJAX+XML 기반의 모니터링 시스템)

  • Choi, Yun Jeong;Park, Seung Soo
    • Journal of Korea Society of Digital Industry and Information Management / v.5 no.4 / pp.39-49 / 2009
  • With the rapid development of computing environments, information processing and analysis systems have become an active research area. From the viewpoint of data preparation, processing, and analysis in knowledge technology, the goal of an automated information system is to achieve high reliability and confidence while minimizing intervention by human administrators. In addition, such a system is expected to handle problems and abnormal errors effectively through fault detection and fault tolerance. In this paper, we design a monitoring system as follows. Monitoring information produced by various systems has unstructured forms and characteristics, so the system crawls informative data according to conditions and gathering rules. To present the monitoring information requested by the administrator, running status, such as connected/closed status, can be checked dynamically and systematically in real time. The proposed system can easily collect and process monitoring information from various types of servers, and it supports objective judgement and analysis by the administrator of the target information system. We implement a semi-real-time monitoring system using AJAX for dynamic browsing of web information and XML and XPath for information processing. We apply the system to an SMS server to check its running status, and the results show that it has high utility and reliability.

A New Immunotronic Approach to Hardware Fault Detection Using Symbiotic Evolution (공생 진화를 이용한 Immunotronic 접근 방식의 하드웨어 오류 검출)

  • Lee, Sang-Hyung;Kim, Eun-Tai;Lee, Hee-Jin;Park, Mignon
    • Journal of the Institute of Electronics Engineers of Korea CI / v.42 no.5 / pp.59-68 / 2005
  • A novel immunotronic approach to fault detection in hardware based on symbiotic evolution is proposed in this paper. In the immunotronic system, the generation of tolerance conditions corresponds to the generation of antibodies in the biological immune system. In this paper, the principle of antibody diversity, one of the most important concepts in the biological immune system, is employed and realized through symbiotic evolution. Symbiotic evolution imitates the generation of antibodies in the biological immune system more closely than the traditional GA does. It is demonstrated that the suggested method outperforms previous immunotronic methods with less running time. The suggested method is applied to fault detection in a decade counter (a typical example of a finite state machine) and in MCNC finite state machines, and its effectiveness is demonstrated by computer simulation.

A Fault Management Architecture Using Backup FA in Hierarchical Local Registration Mobile IP (계층적 지역 등록 Mobile IP에서 백업 FA를 이용한 장애 관리 구조)

  • 임기운;홍충선;이대영
    • The Journal of Korean Institute of Communications and Information Sciences / v.26 no.11C / pp.1-9 / 2001
  • The Mobile IP protocol allows IP hosts to move between different networks without changing their IP addresses. Mobile IP systems supporting local registration were introduced to reduce the number of times a home registration with the remotely located Home Agent is needed. The local registration Mobile IP protocol improves performance by processing the MN's registration requests at a local agent. However, the local registration approach should also consider other aspects of Mobile IP systems, such as FA fault tolerance. In this paper, we briefly review previous protocols that support FA fault tolerance in hierarchical local registration Mobile IP systems and propose a fault tolerance protocol with a backup FA in hierarchical local registration Mobile IP to enhance the efficiency of such systems against Foreign Agent failures.

A Performance Estimation Simulation of Grouping Method for Performance Elevation under Mobile Database Environment (모바일 데이터베이스 환경하에서의 성능 향상을 위한 군집화 기법의 성능 평가 시뮬레이션)

  • 신성욱;정동원;백두권
    • Journal of the Korea Society for Simulation / v.12 no.2 / pp.55-62 / 2003
  • The explosive increase of wireless networks and the advancement of mobile devices have led to the expansion of the mobile environment. As the mobile environment develops, the need for mobile databases is increasing sharply, along with related problems. Current mobile database systems are based on a centralized method in which a synchronization server manages multiple mobile database management systems in order to synchronize them. This architecture gives rise to several problems, such as managing synchronization between mobile databases and managing transactions. Furthermore, current mobile database management systems do not consider any solution for fault tolerance. To solve these problems, this paper proposes a mobile agent-based mobile database management system. The proposed system provides high confidence and efficiency by enhancing network efficiency and fault tolerance through mobile grouping.

Reliability Analysis and Fault Tolerance Strategy of TMR Real-time Control Systems (TMR 실시간 제어 시스템의 내고장성 기법 및 신뢰도 해석)

  • Kwak, Seong-Woo;You, Kwan-Ho
    • Journal of Institute of Control, Robotics and Systems / v.10 no.8 / pp.748-754 / 2004
  • In this paper, we propose the Triple Modular Redundancy (TMR) control system equipped with a checkpoint strategy. In this system, faults in a single processor are masked and faults in two or more processors are detected at each checkpoint time. When faults are detected, the rollback recovery is activated to recover from faults. The conventional TMR control system cannot overcome faults in two or more processors. The proposed system can effectively cope with correlated and independent faults in two or more processors. We develop a reliability model for this TMR control system under correlated and independent transient faults, and derive the reliability equation. Then we investigate the number of checkpoints that maximizes the reliability.
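
The combination of majority voting and rollback described above can be sketched as follows. This is an illustrative assumption-laden example, not the authors' implementation: the three replicas are simulated by calling a hypothetical `compute(replica, state)` function, a fault in two or more replicas is assumed to show up as a lack of any two agreeing outputs, and rollback is modelled as re-executing the interval from the saved checkpoint state.

```python
from collections import Counter
from typing import Callable, Hashable, Optional, Sequence


def majority_vote(outputs: Sequence[Hashable]) -> Optional[Hashable]:
    """Return the value produced by at least two of three replicas, else None."""
    value, count = Counter(outputs).most_common(1)[0]
    return value if count >= 2 else None


def run_interval(
    compute: Callable[[int, Hashable], Hashable],
    checkpoint: Hashable,
    max_retries: int = 3,
) -> Hashable:
    """Run one checkpoint interval on three replicas.

    A single faulty replica is masked by the vote. If no two replicas agree,
    the interval is rolled back and re-executed from the saved checkpoint.
    """
    for _ in range(max_retries + 1):
        outputs = [compute(replica, checkpoint) for replica in range(3)]
        voted = majority_vote(outputs)
        if voted is not None:
            return voted  # the voted value becomes the next checkpoint state
    raise RuntimeError("no majority after rollback retries")
```

Note that this simple voter cannot detect two replicas that fail with the same wrong value. Each retry also costs one interval of recomputation, so the interval length trades rollback cost against checkpoint overhead; that trade-off is consistent with the abstract's search for the number of checkpoints that maximizes reliability.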

Error Recovery Technique for Improving Reliability of Embedded Systems

  • Son, Sunghoon
    • Journal of the Korea Society of Computer and Information / v.22 no.6 / pp.1-8 / 2017
  • In this paper, we propose a fault tolerance technique that enables embedded systems to run without interruption even when the operating system or tasks fail. To improve reliability, the proposed scheme runs the embedded system as a virtual machine on a virtual machine monitor. It also prepares a standby virtual machine in which periodic backups of the embedded system are saved. When an error occurs in the main virtual machine, the standby virtual machine takes over the role of the main virtual machine and continues its operation. In particular, these backups and switches between virtual machines are performed with minor performance degradation by manipulating page table entries in the virtual machine monitor. Through performance evaluation, we show that the proposed scheme makes embedded systems robust against errors without significantly degrading system performance.