• 제목/요약/키워드: fault-proneness

검색결과 12건 처리시간 0.024초

소프트웨어 신뢰성 예측을 위한 객체지향 척도 분석 (Analysis of Object-Oriented Metrics to Predict Software Reliability)

  • 이양규
    • 한국신뢰성학회지:신뢰성응용연구
    • /
    • 제16권1호
    • /
    • pp.48-55
    • /
    • 2016
  • Purpose: The purpose of this study is to identify the object-oriented metrics which have strong impact on the reliability and fault-proneness of software products. The reliability and fault-proneness of software product is closely related to the design properties of class diagrams such as coupling between objects and depth of inheritance tree. Methods: This study has empirically validated the object-oriented metrics to determine which metrics are the best to predict fault-proneness. We have tested the metrics using logistic regressions and artificial neural networks. The results are then compared and validated by ROC curves. Results: The artificial neural network models show better results in sensitivity, specificity and correctness than logistic regression models. Among object-oriented metrics, several metrics can estimate the fault-proneness better. The metrics are CBO (coupling between objects), DIT (depth of inheritance), LCOM (lack of cohesive methods), RFC (response for class). In addition to the object-oriented metrics, LOC (lines of code) metric has also proven to be a good factor for determining fault-proneness of software products. Conclusion: In order to develop fault-free and reliable software products on time and within budget, assuring quality of initial phases of software development processes is crucial. Since object-oriented metrics can be measured in the early phases, it is important to make sure the key metrics of software design as good as possible.

Empirical assessment of design patterns' fault-proneness at different granularity levels

  • Mohammed, Mawal A.;Elish, Mahmoud O.
    • Advances in Computational Design
    • /
    • 제2권4호
    • /
    • pp.293-311
    • /
    • 2017
  • There are several claimed benefits for the impact of design patterns (DPs) on software quality. However, the association between design patterns and fault-proneness has been a controversial issue. In this work, we evaluate the fault-proneness of design patterns at four levels: the design level, category level, pattern level, and role level. We used five subject systems in our empirical study. As a result, we found that, at the design level, the classes that participate in the design patterns are less fault-prone than the non-participant classes. At the category level, we found that the classes that participate in the behavioral and structural categories are less fault-prone than the non-participant classes. In addition, we found that the classes that participate in the structural design patterns are less fault-prone than the classes that participate in the other categories. At the pattern level, we found that only five patterns show significant associations with fault-proneness: builder, factory method, adapter, composite, and decorator. All of these patterns except for builder show that the classes that participate in each one of them are less fault-prone than the non-participant classes in that pattern. The classes that participate in the builder design pattern were more fault-prone than the non-participant classes and the classes that participate in several patterns: the adapter, the composite, and the decorator design patterns. At the role level, the most significant differences were between the classes that participate in some roles and the non-participant classes. Only three pairs of design pattern roles show significant differences. These roles are concrete-product vs. concrete-creator, adapter vs. adaptee, and adapter vs. client. The results recommend the use of design patterns because they are less fault-prone in general except for the builder design pattern, which should be applied with care and addressed with more test cases.

대형 소프트웨어 시스템의 결함경향성 예측을 위한 혼성 메트릭 모델 (Hybrid metrics model to predict fault-proneness of large software systems)

  • 홍의석
    • 컴퓨터교육학회논문지
    • /
    • 제8권5호
    • /
    • pp.129-137
    • /
    • 2005
  • 설계 명세를 이용하여 결함경향성이 많은 부분을 예측하는 위험도 예측 모델은 대형 통신 시스템 같이 결과 산물이 매우 큰 시스템의 개발비용을 낮추는데 중요한 역할을 하고 있다. 복잡도 메트릭에 기반한 많은 위험도 예측 모델들이 제안되었지만 그들 대부분은 모델 훈련을 위한 훈련 데이터 집합을 필요로 하고, 설계 개체들을 위험 그룹과 비위험 그룹으로 나누는 기능만 지닌 분류 모델들이었다. 본 논문에서는 두가지 형태의 검증된 혼성 메트릭들을 사용하는 새로운 예측 모델 HMM을 제안한다. HMM의 장점은 설계 개체의 위험도를 정량화함으로써 모델 훈련을 위한 훈련 데이터 집합이 필요 없다는 것과 개체 간에 위험도 비교가 가능하다는 것이다. HMM의 유용성을 보이기 위해 여러 내부 특성들과 예측 정확도 비교를 통해 잘 알려진 예측 모델인 역전파 신경망 모델(BPM)과 HMM을 비교하였다.

  • PDF

대표적인 클러스터링 알고리즘을 사용한 비감독형 결함 예측 모델 (Unsupervised Learning Model for Fault Prediction Using Representative Clustering Algorithms)

  • 홍의석;박미경
    • 정보처리학회논문지:소프트웨어 및 데이터공학
    • /
    • 제3권2호
    • /
    • pp.57-64
    • /
    • 2014
  • 입력 모듈의 결함경향성을 결정하는 결함 예측 모델 연구들은 대부분 훈련 데이터 집합을 사용하는 감독형 모델에 관련된 것들이었다. 하지만 과거 데이터 집합이 없거나 데이터 집합이 있더라도 현재 프로젝트와 성격이 다른 경우는 비감독형 모델이 필요하며, 이들에 관한 연구들은 모델 구축의 어려움 때문에 극소수 존재한다. 본 논문에서는 기존 비감독형 모델 연구들에서 사용하지 않은 대표적인 클러스터링 알고리즘인 EM, DBSCAN을 사용한 비감독형 모델들을 제작하여, 기존 연구들에서 사용한 K-means 모델과 성능을 비교하였다. 그 결과 오류율 면에서 EM이 K-means보다 약간 나은 성능을 보였으며, DBSCAN은 두 모델에 떨어지는 성능을 보였다.

Software Fault Prediction at Design Phase

  • Singh, Pradeep;Verma, Shrish;Vyas, O.P.
    • Journal of Electrical Engineering and Technology
    • /
    • 제9권5호
    • /
    • pp.1739-1745
    • /
    • 2014
  • Prediction of fault-prone modules continues to attract researcher's interest due to its significant impact on software development cost. The most important goal of such techniques is to correctly identify the modules where faults are most likely to present in early phases of software development lifecycle. Various software metrics related to modules level fault data have been successfully used for prediction of fault-prone modules. Goal of this research is to predict the faulty modules at design phase using design metrics of modules and faults related to modules. We have analyzed the effect of pre-processing and different machine learning schemes on eleven projects from NASA Metrics Data Program which offers design metrics and its related faults. Using seven machine learning and four preprocessing techniques we confirmed that models built from design metrics are surprisingly good at fault proneness prediction. The result shows that we should choose Naïve Bayes or Voting feature intervals with discretization for different data sets as they outperformed out of 28 schemes. Naive Bayes and Voting feature intervals has performed AUC > 0.7 on average of eleven projects. Our proposed framework is effective and can predict an acceptable level of fault at design phases.

Fault Prediction Using Statistical and Machine Learning Methods for Improving Software Quality

  • Malhotra, Ruchika;Jain, Ankita
    • Journal of Information Processing Systems
    • /
    • 제8권2호
    • /
    • pp.241-262
    • /
    • 2012
  • An understanding of quality attributes is relevant for the software organization to deliver high software reliability. An empirical assessment of metrics to predict the quality attributes is essential in order to gain insight about the quality of software in the early phases of software development and to ensure corrective actions. In this paper, we predict a model to estimate fault proneness using Object Oriented CK metrics and QMOOD metrics. We apply one statistical method and six machine learning methods to predict the models. The proposed models are validated using dataset collected from Open Source software. The results are analyzed using Area Under the Curve (AUC) obtained from Receiver Operating Characteristics (ROC) analysis. The results show that the model predicted using the random forest and bagging methods outperformed all the other models. Hence, based on these results it is reasonable to claim that quality models have a significant relevance with Object Oriented metrics and that machine learning methods have a comparable performance with statistical methods.

Identifying SDC-Causing Instructions Based on Random Forests Algorithm

  • Liu, LiPing;Ci, LinLin;Liu, Wei;Yang, Hui
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제13권3호
    • /
    • pp.1566-1582
    • /
    • 2019
  • Silent Data Corruptions (SDCs) is a serious reliability issue in many domains of computer system. The identification and protection of the program instructions that cause SDCs is one of the research hotspots in computer reliability field at present. A lot of solutions have already been proposed to solve this problem. However, many of them are hard to be applied widely due to time-consuming and expensive costs. This paper proposes an intelligent approach named SDCPredictor to identify the instructions that cause SDCs. SDCPredictor identifies SDC-causing Instructions depending on analyzing the static and dynamic features of instructions rather than fault injections. The experimental results demonstrate that SDCPredictor is highly accurate in predicting the SDCs proneness. It can achieve higher fault coverage than previous similar techniques in a moderate time cost.

결함 심각도에 기반한 소프트웨어 품질 예측 (Software Quality Prediction based on Defect Severity)

  • 홍의석
    • 한국컴퓨터정보학회논문지
    • /
    • 제20권5호
    • /
    • pp.73-81
    • /
    • 2015
  • 소프트웨어 결함 예측 연구들의 대부분은 입력 개체의 결함 유무를 예측하는 이진 분류 모델들에 관한 것들이다. 하지만 모든 결함들이 같은 심각도를 갖지는 않으므로 예측 모델이 입력 개체의 결함경향성을 몇 개의 심각도 범주로 분류할 수 있다면 훨씬 유용하게 사용될 수 있다. 본 논문에서는 전통적인 복잡도와 크기 메트릭들을 입력으로 하는 심각도 기반 결함 예측 모델을 제안하였다. 학습 알고리즘은 많이 사용되는 네 개의 기계학습 기법들을 사용하였으며, 모델 구조는 삼진 분류 모델로 하였다. 모델 성능 평가를 위해 실험 데이터는 두 개의 NASA 공개 데이터 집합을 사용하였고, 평가 측정치는 Accuracy를 이용하였다. 평가 실험 결과는 역전파 신경망 모델이 두 데이터 집합에 대해 각각 81%와 88% 정도의 Accuracy 값으로 가장 좋은 성능을 보였다.

Analyzing Machine Learning Techniques for Fault Prediction Using Web Applications

  • Malhotra, Ruchika;Sharma, Anjali
    • Journal of Information Processing Systems
    • /
    • 제14권3호
    • /
    • pp.751-770
    • /
    • 2018
  • Web applications are indispensable in the software industry and continuously evolve either meeting a newer criteria and/or including new functionalities. However, despite assuring quality via testing, what hinders a straightforward development is the presence of defects. Several factors contribute to defects and are often minimized at high expense in terms of man-hours. Thus, detection of fault proneness in early phases of software development is important. Therefore, a fault prediction model for identifying fault-prone classes in a web application is highly desired. In this work, we compare 14 machine learning techniques to analyse the relationship between object oriented metrics and fault prediction in web applications. The study is carried out using various releases of Apache Click and Apache Rave datasets. En-route to the predictive analysis, the input basis set for each release is first optimized using filter based correlation feature selection (CFS) method. It is found that the LCOM3, WMC, NPM and DAM metrics are the most significant predictors. The statistical analysis of these metrics also finds good conformity with the CFS evaluation and affirms the role of these metrics in the defect prediction of web applications. The overall predictive ability of different fault prediction models is first ranked using Friedman technique and then statistically compared using Nemenyi post-hoc analysis. The results not only upholds the predictive capability of machine learning models for faulty classes using web applications, but also finds that ensemble algorithms are most appropriate for defect prediction in Apache datasets. Further, we also derive a consensus between the metrics selected by the CFS technique and the statistical analysis of the datasets.

Assessment of geological hazards in landslide risk using the analysis process method

  • Peixi Guo;Seyyed Behnam Beheshti;Maryam Shokravi;Amir Behshad
    • Steel and Composite Structures
    • /
    • 제47권4호
    • /
    • pp.451-454
    • /
    • 2023
  • Landslides are one of the natural disasters that cause a lot of financial and human losses every year It will be all over the world. China, especially. The Mainland China can be divided into 12 zones, including 4 high susceptibility zones, 7 medium susceptibility zones and 1 low susceptibility zone, according to landslide proneness. Climate and physiography are always at risk of landslides. The purpose of this research is to prepare a landslide hazard map using the Hierarchical Analysis Process method. In the GIS environment, it is in a part of China watershed. In order to prepare a landslide hazard map, first with Field studies, a distribution map of landslides in the area and then a map of factors affecting landslides were prepared. In the next stage, the factors are prioritized using expert opinion and hierarchical analysis process and nine factors including height, slope, slope direction, geological units, land use, distance from Waterway, distance from the road, distance from the fault and rainfall map were selected as effective factors. Then Landslide risk zoning in the region was done using the hierarchical analysis process model. The results showed that the three factors of geological units, distance from the road and slope are the most important have had an effect on the occurrence of landslides in the region, while the two factors of fault and rainfall have the least effect The landslide occurred in the region.