• Title/Summary/Keyword: Fault Management

Search Result 671, Processing Time 0.041 seconds

Design of a Fault Tolerant System Employing Fault Detection Bus (고장 검출 버스를 이용한 고장 감내 시스템 설계)

  • 정우석;송광석;이광선;신진욱;박동선
    • Proceedings of the IEEK Conference
    • /
    • 1999.06a
    • /
    • pp.168-171
    • /
    • 1999
  • A fault-tolerant system should have a high availability and high reliability to maintain a given system stable against sudden faults in the system. In this paper, we propose a new types of fault tolerant system based on a fault detection bus. The fault detection bus is designed and implemented to detect any errors by comparing event-output signals from two processor modules. It employs the hot standby sparing fault detection method〔1〕 to provide continuity of services even if a system fault occurs. The prototype fault tolerant system is currently being implemented on a management system with two processor modules.

  • PDF

Bootstrap-Based Fault Identification Method (붓스트랩을 활용한 이상원인변수의 탐지 기법)

  • Kang, Ji-Hoon;Kim, Seoung-Bum
    • Journal of Korean Society for Quality Management
    • /
    • v.39 no.2
    • /
    • pp.234-243
    • /
    • 2011
  • Multivariate control charts are widely used to monitor the performance of a multivariate process over time to maintain control of the process. Although existing multivariate control charts provide control limits to monitor the process and detect any extraordinary events, it is a challenge to identify the causes of an out-of-control alarm when the number of process variables is large. Several fault identification methods have been developed to address this issue. However, these methods require a normality assumption of the process data. In the present study, we propose a bootstrapped-based $T^2$ decomposition technique that does not require any distributional assumption. A simulation study was conducted to examine the properties of the proposed fault identification method under various scenarios and compare it with the existing parametric $T^2$ decomposition method. The simulation results showed that the proposed method produced better results than the existing one, especially in nonnormal situations.

Fault Prediction of a Telecommunications Network using Association Rules Mining based on Voice of the Customer (VOC 기반 연관규칙 마이닝을 이용한 통신선로설비의 장애 예측)

  • Na, Gijoo;Han, Insup;Cho, Namwook
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.11 no.4
    • /
    • pp.13-24
    • /
    • 2015
  • Customer complaints handling helps organizations to retain existing customers and attract new customers, as well. As Voice of the Customer (VOC) is one of the main sources of customer complaints, many organizations utilize VOC to enhance customer satisfaction. Effective management of VOC has been proved as one of the best ways to maintain organization's brand image and reputation. In spite of its importance, little has been reported on the utilization of VOC to detect faults in a telecommunication industry. In this paper, association rule mining based on VOC is used to identify root fault causes of a telecommunications network. To do that, VOC of a Communication Service Provider has been collected first. Then, association rule mining has also been conducted with various support and confidence levels. As a result, root fault causes of the telecommunications network can be identified. It is expected that this study can be used as a basis for decisions about customer satisfaction management such as preventive maintenances or reduction of the customer maintenance cost.

Rule-based network fault self-recovery system (규칙 기반의 네트워크 장애 자기 복구 시스템)

  • Lee, Jae-Wook;Ahn, Seong-Jin;Chung, Jin-Wook
    • Journal of the Korean Society for Industrial and Applied Mathematics
    • /
    • v.10 no.1
    • /
    • pp.83-93
    • /
    • 2006
  • This paper introduces rule-based reasoning (RBR) based self-recovery system for network fault in ubiquitous computing. This system is fault management system for fault recovery of rule-based for self-recovery in ubiquitous computing environment. We proposed rules of network fault recovery applied the system as a distinguished reason of network fault. And, in this paper, the network fault self-recovery system proved the rules that applied each situatpion through the simulation.

  • PDF

A Study on System's Reliability Evaluation Using DFT Algorithm (동적 결함 트리 (Dynamic Fault Tree) 알고리즘을 이용한 시스템의 신뢰도 평가에 관한 연구)

  • 김진수;양성현;이기서
    • Proceedings of the KSR Conference
    • /
    • 1998.11a
    • /
    • pp.280-287
    • /
    • 1998
  • In this paper, Dynamic Fault Tree algorithm(DFT algorithm) is presented. This new algorithm provides a concise representation of dynamic fault tolerance system structure with redundancy, dynamic redundancy management and complex fault & error recovery techniques. And it allows the modeler to define a dynamic fault tree model with the relative advantages of both fault tree and Markov models that captures the system structure and dynamic behavior. This algorithm applies to TMR and Dual-Duplex systems with the dynamic behavior and show that this algorithm captured the dynamic behavior in these systems with fault & error recovery technique, sequence-dependent failures and the use dynamic spare. The DFT algorithm for solving the problems of the systems is more effective than the Markov and Fault tree analysis model.

  • PDF

One-class Classification based Fault Classification for Semiconductor Process Cyclic Signal (단일 클래스 분류기법을 이용한 반도체 공정 주기 신호의 이상분류)

  • Cho, Min-Young;Baek, Jun-Geol
    • IE interfaces
    • /
    • v.25 no.2
    • /
    • pp.170-177
    • /
    • 2012
  • Process control is essential to operate the semiconductor process efficiently. This paper consider fault classification of semiconductor based cyclic signal for process control. In general, process signal usually take the different pattern depending on some different cause of fault. If faults can be classified by cause of faults, it could improve the process control through a definite and rapid diagnosis. One of the most important thing is a finding definite diagnosis in fault classification, even-though it is classified several times. This paper proposes the method that one-class classifier classify fault causes as each classes. Hotelling T2 chart, kNNDD(k-Nearest Neighbor Data Description), Distance based Novelty Detection are used to perform the one-class classifier. PCA(Principal Component Analysis) is also used to reduce the data dimension because the length of process signal is too long generally. In experiment, it generates the data based real signal patterns from semiconductor process. The objective of this experiment is to compare between the proposed method and SVM(Support Vector Machine). Most of the experiments' results show that proposed method using Distance based Novelty Detection has a good performance in classification and diagnosis problems.

The Implementation of Fault Tolerance Service for QoS in Grid Computing (그리드 컴퓨팅에서 서비스 품질을 위한 결함 포용 서비스의 구현)

  • Lee, Hwa- Min
    • The Journal of Korean Association of Computer Education
    • /
    • v.11 no.3
    • /
    • pp.81-89
    • /
    • 2008
  • The failure occurrence of resources in the grid computing is higher than in a tradition parallel computing. Since the failure of resources affects job execution fatally, fault tolerance service is essential in computational grids. And grid services are often expected to meet some minimum levels of quality of service (QoS) for desirable operation. However Globus toolkit does not provide fault tolerance service that supports fault detection service and management service and satisfies QoS requirement. Thus this paper proposes fault tolerance service to satisfy QoS requirement in computational grids. In order to provide fault tolerance service and satisfy QoS requirements, we expand the definition of failure, such as process failure, processor failure, and network failure. And we propose resource scheduling service, fault detection service and fault management service and show implement and experiment results.

  • PDF

Distributed System Architecture Modeling of a Performance Monitoring and Reporting Tool (분산 시스템의 성능 모니터링과 레포팅 툴의 아키텍처 모델링)

  • Kim, Ki;Choi, Eun-Mi
    • Journal of the Korea Society for Simulation
    • /
    • v.12 no.3
    • /
    • pp.69-81
    • /
    • 2003
  • To manage a cluster of distributed server systems, a number of management aspects should be considered in terms of configuration management, fault management, performance management, and user management. System performance monitoring and reporting take an important role for performance and fault management. In this paper, we present distributed system architecture modeling of a performance monitoring and reporting tool. Modeling architecture of four subsystems are introduced: node agent, data collection, performance management & report, and DB schema. The performance-related information collected from distributed servers are categorized into performance counters, event data for system status changes, service quality, and system configuration data. In order to analyze those performance information, we use a number of ways to evaluate data corelation. By using some results from a real site of a company and from simulation of artificial workload, we show the example of performance collection and analysis. Since our report tool detects system fault or node component failure and analyzes performances through resource usage and service quality, we are able to provide information for server load balancing, in short term view, and the cause of system faults and decision for system scale-out and scale-up, in long term view.

  • PDF

A Study on Improvement of Restoration Ability by Fault Simulation Training (모의사고 훈련을 통한 급전원의 고장복구 능력향상에 관한 연구)

  • Kim, T.W.;Lee, B.S.;Lee, W.S.
    • Proceedings of the KIEE Conference
    • /
    • 2011.07a
    • /
    • pp.149-151
    • /
    • 2011
  • This paper describe that restoration ability raising method of power system operator when Power System fault happen. This paper introduce score management method about simulation training when supposed fault happened and essencial fault have to be trained for power system operator.

  • PDF

Motion Sensor Fault Detection and Failsafe Logic for Vehic1e Stability Control Systems (VSCs)

  • Yi, Kyongsu;Min, Kyongchan
    • Journal of Mechanical Science and Technology
    • /
    • v.18 no.11
    • /
    • pp.1961-1968
    • /
    • 2004
  • The design of a reliable and failsafe control system requires that sensor failures be detected and identified within acceptable time limit so that system malfunction can be prevented. This paper presents a model-based approach to sensor fault detection with applications to vehicle stability control systems. The effectiveness of the proposed method is illustrated through test data-based evaluation. Vehicle test data-based evaluation results show that the proposed fault management scheme can be used for the design of a failsafe VSCs.