• Title/Summary/Keyword: Fault-tolerance

Search Result 570, Processing Time 0.033 seconds

Reliability Analysis of Interleaved Memory with a Scrubbing Technique (인터리빙 구조를 갖는 메모리의 스크러빙 기법 적용에 따른 신뢰도 해석)

  • Ryu, Sang-Moon
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.20 no.4
    • /
    • pp.443-448
    • /
    • 2014
  • Soft errors in memory devices that caused by radiation are the main threat from a reliability point of view. This threat can be commonly overcome with the combination of SEC (Single-Error Correction) codes and scrubbing technique. The interleaving architecture can give memory devices the ability of tolerating these soft errors, especially against multiple-bit soft errors. And the interleaving distance plays a key role in building the tolerance against multiple-bit soft errors. This paper proposes a reliability model of an interleaved memory device which suffers from multiple-bit soft errors and are protected by a combination of SEC code and scrubbing. The proposed model shows how the interleaving distance works to improve the reliability and can be used to make a decision in determining optimal scrubbing technique to meet the demands in reliability.

Design Properties of Distributed Real-Time Systems (실시간 분산처리 시스템 개발을 위한 설계 방안 연구)

  • Park, Dong-Won;Ahn, Sung-Og
    • The Journal of Engineering Research
    • /
    • v.2 no.1
    • /
    • pp.73-82
    • /
    • 1997
  • In this paper, we examine design trends in the development of distributed real-time system. Many real-time systems developed over the last ten years share common characteristics including use of communications, imprecise computations, the object-oriented paradigm, multiprocessor node hardware, dynamic time-driven scheduling and the inclusion of fault tolerance mechanisms. By examining the major design decisions made in previous systems, the best attributes of these systems can be coalesced together to form the "ideal" real-time system. This paper examines such a system and the advantages and disadvantages of the design decisions involved.

  • PDF

A study on data sharing system based on threshold quorum consensus for fairness in permissioned blockchain (허가된 블록체인에서의 공정성을 보장하는 임계값 쿼럼 합의 기반의 데이터 공유 시스템에 관한 연구)

  • Ra, Gyeongjin;Lee, Imyeong
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2021.11a
    • /
    • pp.334-336
    • /
    • 2021
  • 허가형 블록체인 기반 데이터 공유 시스템은 분산 환경에서 신뢰 수준을 구축하고 일관된 메시지를 기록 및 공유함으로써 서비스의 상호 운용성을 가능하게 한다. 그러나 허가형 블록체인은 종종 탈중앙화, 보안 및 상호 운용성과 충돌한다. 이는 중앙 집중식 시스템으로 돌아가거나 데이터의 독점 및 남용 및 오용으로 이어질 수 있다. 따라서 CAP (Consistency, Availability, Partition tolerance)에 이론 검증에 따라 메시지 공유, 비잔틴 내결함성 및 메시지 일관성을 고려하고 적용해야 한다. 기존의 PBFT(Practical Byzantine Fault Tolerance) 합의 알고리즘는 노드의 증가시, 장애내성을 갖기위해 계산되어야 할 합의 처리시간이 증가하며, DPOS(Delegated Proof of Stake) 알고리즘은 보상, 리더 선출의 공정성 문제 등에 따라 허가형 블록체인에서의 적합한 방식이 연구되고 있다. 본 논문에서는 서비스의 상호 운용성과 과제에 대해 논의하고 허가된 블록체인의 합의 개선을 통한 데이터 공유 시스템을 제안한다.

Implementation and Performance Measuring of Erasure Coding of Distributed File System (분산 파일시스템의 소거 코딩 구현 및 성능 비교)

  • Kim, Cheiyol;Kim, Youngchul;Kim, Dongoh;Kim, Hongyeon;Kim, Youngkyun;Seo, Daewha
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.41 no.11
    • /
    • pp.1515-1527
    • /
    • 2016
  • With the growth of big data, machine learning, and cloud computing, the importance of storage that can store large amounts of unstructured data is growing recently. So the commodity hardware based distributed file systems such as MAHA-FS, GlusterFS, and Ceph file system have received a lot of attention because of their scale-out and low-cost property. For the data fault tolerance, most of these file systems uses replication in the beginning. But as storage size is growing to tens or hundreds of petabytes, the low space efficiency of the replication has been considered as a problem. This paper applied erasure coding data fault tolerance policy to MAHA-FS for high space efficiency and introduces VDelta technique to solve data consistency problem. In this paper, we compares the performance of two file systems, MAHA-FS and GlusterFS. They have different IO processing architecture, the former is server centric and the latter is client centric architecture. We found the erasure coding performance of MAHA-FS is better than GlusterFS.

Adaptive Consensus Bound PBFT Algorithm Design for Eliminating Interface Factors of Blockchain Consensus (블록체인 합의 방해요인 제거를 위한 Adaptive Consensus Bound PBFT 알고리즘 설계)

  • Kim, Hyoungdae;Yun, Jusik;Goh, Yunyeong;Chung, Jong-Moon
    • Journal of Internet Computing and Services
    • /
    • v.21 no.1
    • /
    • pp.17-31
    • /
    • 2020
  • With the rapid development of block chain technology, attempts have been made to put the block chain technology into practical use in various fields such as finance and logistics, and also in the public sector where data integrity is very important. Defense Operations In addition, strengthening security and ensuring complete integrity of the command communication network is crucial for operational operation under the network-centered operational environment (NCOE). For this purpose, it is necessary to construct a command communication network applying the block chain network. However, the block chain technology up to now can not solve the security issues such as the 51% attack. In particular, the Practical Byzantine fault tolerance (PBFT) algorithm which is now widely used in blockchain, does not have a penalty factor for nodes that behave maliciously, and there is a problem of failure to make a consensus even if malicious nodes are more than 33% of all nodes. In this paper, we propose a Adaptive Consensus Bound PBFT (ACB-PBFT) algorithm that incorporates a penalty mechanism for anomalous behavior by combining the Trust model to improve the security of the PBFT, which is the main agreement algorithm of the blockchain.

Doughnut: An improved P2P Pastry Overlay Network with Efficient Locality and Caching (Doughnut: 효율적인 지역성 및 캐슁을 사용하는 향상된 P2P Pastry 오버레이 네트워크)

  • Kim, Myung-Won;Kwak, Hu-Keun;Chung, Kyu-Sik
    • The KIPS Transactions:PartC
    • /
    • v.16C no.2
    • /
    • pp.245-256
    • /
    • 2009
  • Pastry overlay network is one of structured P2Ps using DHT(Distributed Hash Table). To reduce the number of messages among nodes, Rosary and LAR have been proposed by exploiting spatial locality and caching, respectively, in the Pastry. Rosary consists of Inter-Pastry and Intra-Pastry. A root node is assigned as a representative in each Intra-Pastry and it has the responsibility of Inter-Pastry and Intra-Pastry routing. Therefore, Rosary has several disadvantages; 1) low fault tolerance in case of root node failure 2) routing hop count increases because of the use of root nodes compared to the existing structured P2Ps, and 3) the communication load is concentrated in some specific areas. LAR has inefficient problems in that caching is not distributed among nodes in Intra-Pastry and caching is used by only nodes in the Intra-Pastry. In this paper, we propose an improved Pastry called Doughnut to overcome the above problems of Rosary and LAR. By dividing nodes with the local characteristics, the Doughnut consists of Inter-Pastry and Intra-Pastry, and all nodes have the responsibility of Inter-Pastry and Intra-Pastry routing. This results in that all nodes perform the role of the existing root node. This solves the problems of the reducing of fault-tolerance, the increasing of routing hop count, and the not-distributed communication load. Also Doughnut can use cache effectively because it guarantees the even cache distribution in local(Intra-Pastry) and the cache contents in local can be used in the other local. The proposed algorithm is implemented using simulator and the experimental results show the effectiveness of the proposed method compared to the existing method.

Fault Tolerant of IED in IEC61850 based Substation Automation System (IEC 61850기반 변전소 자동화 시스템에서 IED의 결함 극복 방법)

  • Lee, Dong-Wook;Jin, Yong-Woo;Lee, Duck-Su;Lim, Seong-Il;Lee, Seung-Jae
    • Proceedings of the KIEE Conference
    • /
    • 2005.07a
    • /
    • pp.152-154
    • /
    • 2005
  • This paper proposes new methods which are used for trouble detecting and tolerance of IED in IEC61850 based Substation Automation System. There are three point devices for construction of this system: Trouble Detection IED, Backup IED and Engineering Unit. Trouble Detection IED tests trouble of Bay IED and sends the result to Engineering Unit. And then Engineering Unit is able to control the system. When fault occurs in Bay IED, Backup IED will be substitute the troubled Bay IED. If this proposed scheme can be used in this system, the reliability of the Power System would be greatly improved.

  • PDF

An implementation of reliable data sharing multi-stack system in virtualized environment (신뢰성 있는 멀티스택 기반의 가상화된 데이터 동시공유 시스템의 구현)

  • Han, Kyujong;Jeon, Dongwoon;Kim, Doohyun
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.11 no.5
    • /
    • pp.259-265
    • /
    • 2016
  • In this paper, we present an architecture for the fault isolation by applying virtualization-based multi-stack technologies. We propose the simultaneous sharing and switching mechanism using virtualied serial communications. Each guest OS has its own virtual serial device. The distribution module provides communications between the guest OS's through the virtual serial devices and simultaneously detect the liveness of the guest OS. The suggested mechanism has been implemented in VirtualBox and shows satisfactory performance in transmission speed and data sharing capability with virtual RS232.

A Study on the Reliability Management Program during Spacecraft Development (인공위성 개발과정에서의 시스템 신뢰성 관리 프로그램)

  • Yoo, Seung-Woo;Jin, Young-Kwon;Lee, Jong-Hee
    • Journal of Applied Reliability
    • /
    • v.8 no.3
    • /
    • pp.145-154
    • /
    • 2008
  • 인공위성은 지상 시스템과는 달리 극한 발사환경 및 우주환경에 노출되고, 운용 중 수리가 불가능하며, 임무 실패에 따른 위험비용이 매우 크기 때문에 고신뢰성, 고비용의 부품, 장비 및 시스템을 채택하여 개념설계단계에서부터 신뢰성을 관리한다. 신뢰성 관리의 주요 목적은 설계변경 및 개발 사이클의 반복을 최소화하고, 개발비용을 절감하며, 시스템의 신뢰성을 향상시키기 위한 것으로서, 대부분의 우주시스템 개발 프로젝트에서는 제품보증(Product Assurance) 활동의 일환으로 수행된다. 제품보증 업무는 크게 품질보증, 신뢰성 관리, EEE 부품관리, 재료 및 공정(Material & Process) 관리, 오염관리, 소프트웨어 품질보증 등으로 구분되지만, 포괄적인 의미의 신뢰성 관리 범주에 이를 모두 포함시키기도 한다. 인공 위성의 개발과정에서 기술성능지표 중 하나로서 중점 관리되고 있는 시스템 신뢰성 관리의 세부기법, 프로세스, 개선 필요사항 등에 대하여 살펴보고자 한다.

  • PDF

A Study on Design and Reliability Assessment for Embedded Hot-Standby Sparing FT System Using Self-Checking Logic (자기검사회로를 이용한 대기이중계구조 결함허용제어기의 설계 및 신뢰도평가에 관한 연구)

  • Lee, Jae-Ho;Lee, Kang-Mi;Kim, Young-Kyu;Shin, Duc-Ko
    • Journal of the Korean Society for Railway
    • /
    • v.9 no.6 s.37
    • /
    • pp.725-731
    • /
    • 2006
  • Hot Standby sparing system detecting faults by using software, and being tolerant any faults by using Hardware Redundancy is difficult to perform quantitative reliability prediction and to detect real time faults. Therefore, this paper designs Hot Standby sparing system using hardware basis self checking logic in order to overcome this problem. It also performs failure mode analysis of Hot Standby sparing system with designed self checking logic by using FMEA (Failure Mode Effect Analysis), and identifies reliability assessment of the controller designed by quantifying the numbers of failure development by using FTA (Fault Tree Analysis)