• 제목/요약/키워드: Recovery probability

검색결과 126건 처리시간 0.021초

A Multi-objective Optimization Approach to Workflow Scheduling in Clouds Considering Fault Recovery

  • Xu, Heyang;Yang, Bo;Qi, Weiwei;Ahene, Emmanuel
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제10권3호
    • /
    • pp.976-995
    • /
    • 2016
  • Workflow scheduling is one of the challenging problems in cloud computing, especially when service reliability is considered. To improve cloud service reliability, fault tolerance techniques such as fault recovery can be employed. Practically, fault recovery has impact on the performance of workflow scheduling. Such impact deserves detailed research. Only few research works on workflow scheduling consider fault recovery and its impact. In this paper, we investigate the problem of workflow scheduling in clouds, considering the probability that cloud resources may fail during execution. We formulate this problem as a multi-objective optimization model. The first optimization objective is to minimize the overall completion time and the second one is to minimize the overall execution cost. Based on the proposed optimization model, we develop a heuristic-based algorithm called Min-min based time and cost tradeoff (MTCT). We perform extensive simulations with four different real world scientific workflows to verify the validity of the proposed model and evaluate the performance of our algorithm. The results show that, as expected, fault recovery has significant impact on the two performance criteria, and the proposed MTCT algorithm is useful for real life workflow scheduling when both of the two optimization objectives are considered.

RM 스케줄링된 실시간 태스크에서의 최적 체크 포인터 구간 선정 (Determination of Optimal Checkpoint Interval for RM Scheduled Real-time Tasks)

  • 곽성우;정용주
    • 전기학회논문지
    • /
    • 제56권6호
    • /
    • pp.1122-1129
    • /
    • 2007
  • For a system with multiple real-time tasks of different deadlines, it is very difficult to find the optimal checkpoint interval because of the complexity in considering the scheduling of tasks. In this paper, we determine the optimal checkpoint interval for multiple real-time tasks that are scheduled by RM(Rate Monotonic) algorithm. Faults are assumed to occur with Poisson distribution. Checkpoints are inserted in the execution of task with equal distance in the same task, but different distances in other tasks. When faults occur, rollback to the latest checkpoint and re-execute task after the checkpoint. We derive the equation of maximum slack time for each task, and determine the number of re-executable checkpoint intervals for fault recovery. The equation to check the schedulibility of tasks is also derived. Based on these equations, we find the probability of all tasks executed within their deadlines successfully. Checkpoint intervals which make the probability maximum is the optimal.

확률모형을 이용한 정보보호 투자 포트폴리오 분석 (Probabilistic Modeling for Evaluation of Information Security Investment Portfolios)

  • 양원석;김태성;박현민
    • 한국경영과학회지
    • /
    • 제34권3호
    • /
    • pp.155-163
    • /
    • 2009
  • We develop a probability model to evaluate information security investment portfolios. We assume that organizations install portfolios of information security countermeasures to mitigate the damage such as loss of the transaction being processed, damage of hardware and data, etc. A queueing model and Its expected value analysis are used to derive the lost cost of transactions being processed, the replacement cost of hardwares, and the recovery cost of data. The net present value for each portfolio is derived and organizations can select the optimal information security investment portfolio by comparing portfolios.

Nonuniform Encoding and Hybrid Decoding Schemes for Equal Error Protection of Rateless Codes

  • Lim, Hyung Taek;Joo, Eon Kyeong
    • ETRI Journal
    • /
    • 제34권5호
    • /
    • pp.719-726
    • /
    • 2012
  • Messages are generally selected with the same probability in the encoding scheme of rateless codes for equal error protection. In addition, a belief propagation (BP) decoding scheme is generally used because of the low computational complexity. However, the probability of recovering a new message by BP decoding is reduced if both the recovered and unrecovered messages are selected uniformly. Thus, more codeword symbols than expected are required for the perfect recovery of message symbols. Therefore, a new encoding scheme with a nonuniform selection of messages is proposed in this paper. In addition, a BP-Gaussian elimination hybrid decoding scheme that complements the drawback of the BP decoding scheme is proposed. The performances of the proposed schemes are analyzed and compared with those of the conventional schemes.

A Quantitative Model of System-Man Interaction Based on Discrete Function Theory

  • Kim, Man-Cheol;Seong, Poong-Hyun
    • Nuclear Engineering and Technology
    • /
    • 제36권5호
    • /
    • pp.430-449
    • /
    • 2004
  • A quantitative model for a control system that integrates human operators, systems, and their interactions is developed based on discrete functions. After identifying the major entities and the key factors that are important to each entity in the control system, a quantitative analysis to estimate the recovery failure probability from an abnormal state is performed. A numerical analysis based on assumed values of related variables shows that this model produces reasonable results. The concept of 'relative sensitivity' is introduced to identify the major factors affecting the reliability of the control system. The analysis shows that the hardware factor and the design factor of the instrumentation system have the highest relative sensitivities in this model. T도 probability of human operators performing incorrect actions, along with factors related to human operators, are also found to have high relative sensitivities. This model is applied to an analysis of the TMI-2 nuclear power plant accident and systematically explains how the accident took place.

시스템 오류 발생률 분석 (An Analysis of System Error Rate)

  • 성순용
    • 한국정보통신학회논문지
    • /
    • 제13권3호
    • /
    • pp.475-481
    • /
    • 2009
  • 교착상태의 발생 주기 및 확률은 교착상태를 다루는 알고리즘 설계 시 많은 영향을 미친다. 그러나 프로세스나 자원의 성격, 자원 요구나 반환 연산 방식, 프로세스 개수 등의 성질이 교착상태 발생에 어떻게 영향을 미치는지 분석하는 게 쉽지 않아 이 분야에 대한 연구가 매우 부족하다. 이 논문은 자원 할당 상태를 (a,b)t로 표현하는 상태 모델을 이용하여 상태의 개수를 획기적으로 감소시켰다. 또한 시스템 분석에 있어서 자원의 오류 발생 비율과 복구 비율이 미치는 영향도 함께 포함할 수 있도록 설계하였다. 그 결과 교착상태의 평균 발생 주기, 요구연산이 보류되거나 교착상태를 유발할 확률, 사이클의 길이가 2인 교착상태가 발생할 확률 등과 같은 각종 수식을 구하였다.

A Heuristic Buffer Management and Retransmission Control Scheme for Tree-Based Reliable Multicast

  • Baek, Jin-Suk;Paris, Jehan-Francois
    • ETRI Journal
    • /
    • 제27권1호
    • /
    • pp.1-12
    • /
    • 2005
  • We propose a heuristic buffer management scheme that uses both positive and negative acknowledgments to provide scalability and reliability. Under our scheme, most receiver nodes only send negative acknowledgments to their repair nodes to request packet retransmissions while some representative nodes also send positive acknowledgments to indicate which packets can be discarded from the repair node's buffer. Our scheme provides scalability because it significantly reduces the number of feedbacks sent by the receiver nodes. In addition, it provides fast recovery of transmission errors since the packets requested from the receiver nodes are almost always available in their buffers. Our scheme also reduces the number of additional retransmissions from the original sender node or upstream repair nodes. These features satisfy the original goal of treebased protocols since most packet retransmissions are performed within a local group.

  • PDF

지진 및 기초의 세굴을 고려한 교량시스템의 동적거동분석 (Dynamic Behavior Analysis of Bridges under the Combined Effect of Earthquake and Scour)

  • 김상효;최성욱;이상우;김호상
    • 한국지진공학회:학술대회논문집
    • /
    • 한국지진공학회 2002년도 춘계 학술발표회 논문집
    • /
    • pp.187-194
    • /
    • 2002
  • Bridge dynamic behaviors and the failure of the foundation are examined in this study under seismic excitations including the local scour effect. The simplified mechanical model, which can consider the effect of various influence elements, is proposed to simulate the bridge motions. The scour depths around the foundations are estimated by the CSU equation recommended by the HEC-18 and the local scour effect upon global bridge motions is then considered by applying various foundation stiffness based upon the reduced embedded depths. From the simulation results, it is found that seismic responses of a bridge with the same scour depth for both foundations increase due to the local scour effect. The bridge scour is found to be significant under weak and moderate seismic intensity. The recovery durations of the foundation stiffness after local scour are found to be critical in the estimation of the probability of foundation failure under earthquakes. Therefore, the safety of the whole bridge system should be conducted with the consideration of the scour effect upon the foundations and the recovery duration of stiffness should be determined rationally.

  • PDF

원자력발전소의 노심냉각회복 조치에 대한 운전원 조치시간 평가 (An Evaluation of Operator's Action Time for Core Cooling Recovery Operation in Nuclear Power Plant)

  • 배연경
    • 한국안전학회지
    • /
    • 제27권5호
    • /
    • pp.229-234
    • /
    • 2012
  • Operator's action time is evaluated from MAAP4 analysis used in conventional probabilistic safety assessment(PSA) of a nuclear power plant. MAAP4 code which was developed for severe accident analysis is too conservative to perform a realistic PSA. A best-estimate code such as RELAP5/MOD3, MARS has been used to reduce the conservatism of thermal hydraulic analysis. In this study, operator's action time of core cooling recovery operation is evaluated by using the MARS code, which its Fussell-Vessely(F-V) value was evaluated as highly important in a small break loss of coolant(SBLOCA) event and loss of component cooling water(LOCCW) event in previous PSA. The main conclusions were elicited : (1) MARS analysis provides larger time window for operator's action time than MAAP4 analysis and gives the more realistic time window in PSA (2) Sufficient operator's action time can reduce human error probability and core damage frequency in PSA.

에러 회복 기능을 포함하는 Ethernet 전송 프로토콜에 관한 연구 (A Study on the Transmission Protocol Including Error Recovery Strategy for Ethernet.)

  • 박성래;신우철;이상배;박민용
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 1988년도 전기.전자공학 학술대회 논문집
    • /
    • pp.261-264
    • /
    • 1988
  • In this paper, a transmission protocol including error recovery strategy on the data link layer for Ethernet using CSMA/CD media accessing method was proposed. So when considering the actual transmission error probability on the channel, it's performance was analyzed through a simulation. Performming the simmulation, the required parameters was taken as those given by Ethernet controller interface board.

  • PDF