• Title/Summary/Keyword: 결함허용정보

Search Result 165, Processing Time 0.031 seconds

A Study on Analysis of NVP Reliability Using Genetic Algorithms (GA를 이용한 NVP 신뢰도 분석에 관한 연구)

  • Sin, Gyeong-Ae;Han, Pan-Am
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.2
    • /
    • pp.326-334
    • /
    • 1999
  • There are the fault tolerance technology and the fault avoidance technology to analyze and evaluate the performance of computer system. To improve the relibility of software The N-Version Programming (NVP) technology is known to be the most objective and quantitive. However, when discrete probability distribution is used as estimation model, the values of it's component reliability should be same. In this paper, to resolve this problem, we adapted the genetic algorithms to NVP technology and implement the optimized simulate. and the results were analyzed and estimated. Through this study, we could optimize the reliability of each component and estimate the optimum count in the system reliability.

  • PDF

An Availability Model for Active/Active Cluster Systems (Active/Active 클러스터 시스템의 가용도 모델)

  • Park, Kie-Jin;Kim, Sung-Soo
    • The KIPS Transactions:PartC
    • /
    • v.8C no.2
    • /
    • pp.173-181
    • /
    • 2001
  • 하드웨어 기술의 발전으로 인해 컴퓨터 하드웨어의 결함 발생률은 상수 값이거나 점차 작아지는 경향이 있다. 반면에 하드웨어에 탑재된 소프트웨어의 복잡성 및 크기는 이전에는 상상할 수 없을 정도로 방대해져가고 있기 때문에, 소프트웨어의 결함 발생으로 인한 컴퓨터 시스템의 장애 발생 가능성은 점차 더 높아지고 있다. 본 논문에서는 Active/Active 클러스터 시스템의 가용도 개선을 위해서 소프트웨어적인 결함 발생을 미연에 방지할 수 있는 능동적 결함허용 기법인 소프트웨어 재활(rejuvenation) 방법에 대하여 연구하였다. 소프트웨어 재활 과정 및 여분서버로 작업전이(switchover) 과정을 semi-Markov 프로세스로 모델링 한 후, 수학적 분석을 통해 구한 Active/Active 클러스터 시스템의 bud형 상태 확률을 이용하여, 다양한 운영 조건하의 가용도 및 손실비용을 계산하였으며, 이를 통하여 소프트웨어 재활을 통한 Active/Active 클러스터 시스템의 가용도 개선 가능성을 확인하였다.

  • PDF

Fault-Tolerant Algorithm using Multi-Connectivity of Communication Networks (통신망의 다중연결성을 이용한 결함허용 알고리즘)

  • Moon, Yun-Ho;Kim, Byung-Ki
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.27 no.1
    • /
    • pp.53-60
    • /
    • 2000
  • The purpose of this paper is to propose new recovery algorithm for case of a system element raises communication obstacle due to faults in networks, Also we are simulate the algorithm using adjacency matrix. We recover one faulty node per each excution of proposed algorithm so that we can be reconstruct the faulty system gradually to communicatable network. For that, this paper propose a new recovery algorithm named MATRECO which connect the recovery process is simulated by use of adjacency matrix.

  • PDF

Fault-Free Process for IT System with TRM(Technical Reference Model) based Fault Check Point and Event Rule Engine (기술분류체계 기반의 장애 점검포인트와 이벤트 룰엔진을 적용한 무장애체계 구현)

  • Hyun, Byeong-Tag;Kim, Tae-Woo;Um, Chang-Sup;Seo, Jong-Hyen
    • Information Systems Review
    • /
    • v.12 no.3
    • /
    • pp.1-17
    • /
    • 2010
  • IT Systems based on Global Single Instance (GSI) can manage a corporation's internal information, resources and assets effectively and raise business efficiency through consolidation of their business process and productivity. But, It has also dangerous factor that IT system fault failure can cause a state of paralysis of a business itself, followed by huge loss of money. Many of studies have been conducted about fault-tolerance based on using redundant component. The concept of fault tolerance is rather simple but, designing and adopting fault-tolerance system is not easy due to uncertainty of a type and frequency of faults. So, Operational fault management that working after developed IT system is important more and more along with technical fault management. This study proposes the fault management process that including a pre-estimation method using TRM (Technical Reference Model) check point and event rule engine. And also proposes a effect of fault-free process through built fault management system to representative company of Hi-tech industry. After adopting fault-free process, a number of failure decreased by 46%, a failure time decreased by 56% and the Opportunity loss costs decreased by 77%.

Availability Analysis of Multiplex Systems using Software Rejuvenation Method (소프트웨어 재활 기법을 적용한 다중계 시스템의 가용도 분석)

  • Park, Kie-Jin;Kim, Sung-Soo;Kim, Jai-Hoon
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.27 no.8
    • /
    • pp.730-740
    • /
    • 2000
  • The software rejuvenation method for highly available multiplex systems uses a pro-active fault-tolerant approach to handle system failures. The software rejuvenation prevents failures from occurring, while the previous methods recover from failures after happening. Especially, since the software aging proceeds fast in the software used for the multimedia mobile computing due to the loss of communications or data, the preventive method from failures using software rejuvenation can be used for the multimedia mobile computing. In this paper, according to the operational parameters such as rejuvenation period, rejuvenation time, failure rate and repair rate of the servers, number of running servers, duration of running time, and type of running modes, we calculate steady-state probabilities, downtime, availability, and cost of the multiplex systems using software rejuvenation method. We validate the closed-form solutions of the mathematical model by experiments based on various operational parameters and find that the software rejuvenation method can be adopted as preventive fault-tolerant technique. The failure rate and unstable rate of the servers are essential factors for the decision making of the rejuvenation policies.

  • PDF

Analysis of Available Performance Satisfying Waiting Time Deadline for (n, k)-way Systems (대기시간 데드라인 조건을 고려한(n, k)-way 시스템의 가용 성능 분석)

  • 박기진;김성수
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.30 no.9
    • /
    • pp.445-453
    • /
    • 2003
  • As cluster systems used for high performance computing consist of large number of running servers, one has to solve the low availability problems occurred by the high chance of the server failures. To handle the problems, it is necessary to have the precise definition of available performance of cluster systems that represents availability and performability of the systems simultaneously. Previous research results that mention availability issues lack for concerning system performance such as waiting time and response time in their availability definition. In this paper, we propose a new availability metric for (n, k)-way cluster systems which compose of n primary servers and k backup servers. With the metric, the change of system performance according to arrival rates is captured and the waiting time of a request can be kept below to a certain level. Using various system operating parameters, we calculate availability and downtime of cluster systems along with waiting tine deadline.

Implementation of Distributed Fault-Tolerant Middleware for Dual Channel Ethernet based Virtual Server (이중 채널 이더넷 기반 가상서버를 위한 분산 고장 감내 미들웨어의 구현)

  • 함명호;김진용;최보곤;신현식
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2003.04a
    • /
    • pp.112-114
    • /
    • 2003
  • 긴박한 임무 상황(mission critical)의 시스템 뿐만 아니라, 웹 서버등의 고가용성, 고신뢰성을 위한 가상 서버의 구축도 관심이 되고 있다. 본 연구에서는 가상서버의 고가용성 , 고신뢰성을 보다 향상시키기 위해 가상서버 클러스터의 각 구성 노드들을 이중의 네트웍 채널로 중복 시켜 네트웍 고장에 대한 신뢰성을 향상시켰다. 관리자 노드와 작업노드 풀로 구성되는 시스템의 각 노드와 이중 채널로 구성된 네트웍에 대한 결함검출과 결함복구를 위한 분산 결함 허용 미들웨어를 구현하였고, 적응형 고장 감내 (Adaptive Fault Tolerance) 기법을 사용하여 다양한 임무 상황에서의 자원 효율성을 향상시켰다.

  • PDF

TFT-LCD Defect Detection Using Double-Self Quotient Image (이중 SQI를 이용한 TFT-LCD 결함 검출)

  • Park, Woon-Ik;Lee, Kyu-Bong;Kim, Se-Yoon;Park, Kil-Houm
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.14 no.6
    • /
    • pp.604-608
    • /
    • 2008
  • The TFT-LCD image allows non-uniform illumination variation and that is one of main difficulties of finding defect region. The SQI (self quotient image) has the HPF (high pass filter) shape and is used to reduce low frequency-lightness component. In this paper, we proposed the TFT-LCD defect-enhancement algorithm using characteristics of the SQI, that is the SQI has low-frequency flattening effect and maintains local variation. The proposed method has superior flattening effect and defect-enhancement effect compared with previous the TFT-LCD image preprocessing.

The design for controllabel self-checking checker (제어 가능한 자체검사 특성 검사기 설계)

  • 양성현;이기서
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.23 no.5
    • /
    • pp.1149-1159
    • /
    • 1998
  • This paper presents the Controllable Self-Checking(CSC) Checker at which can be used the Fault-Tolerant System with the redundancy. According to the critical level of output(of system), especially, it can be instructed the time if it has to check the output or not. We adop the deterministic test, performed on-line, to detect the faults with a minimal test set. The results show the Parity 2-rail checker(P-TRC) which is designed much simpler than the checker has the higher fault coverage than the existent checker.

  • PDF

Design and Implementation of job Migration on a Grid Computing Environment (그리드 컴퓨팅 환경에서의 작업 마이그레이션의 설계 및 구현)

  • Kim Young-Gyun;Cho Kum Won;Song Young-Duk;Go Soon-Heum;Na Jeong-su;Oh Gi1-Ho
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2005.07a
    • /
    • pp.577-579
    • /
    • 2005
  • 본 논문에서는 Cactus와 Globus를 사용하는 그리드 컴퓨팅 환경에서 작업 마이그레이션(Job Migration)에 대해 연구 하였다. 그리드 컴퓨팅은 고속의 네트워크로 연결된 다중의 사이트에 분산되어 있는 연산 자원들을 활용하는 것으로서, 연산 자원들의 효율적인 이용이 중요하다. 연산자원의 효율적인 이용의 한 방법으로서 작업 마이그레이션은 이동 에이전트, 부하 균등화, 결함 허용 등을 위해 사용될 수 있다. 본 논문에는 한 사이트에서 실행중인 연산 작업이 중단된 경우, 유휴한 다른 사이트의 연산자원으로 이동한 후 체크포인팅 파일을 이용하여 중단된 지점부터 복구하여 연산을 계속 수행하도록 하는 연구를 수행하였다. K*Grid 환경에서 연산시간을 효과적으로 단축함을 실험으로 확인하였다. 보다 동적인 그리드 컴퓨팅에서 결함허용, 연산자원의 효율적인 이용 방법으로 사용될 수 있다.

  • PDF