• Title/Summary/Keyword: Evaluation metric

Search Result 297, Processing Time 0.029 seconds

Development of Metric Analysis Module for Railway Signaling Software (열차제어시스템 소프트웨어 Metric 분석 자동화도구 개발)

  • Hwang, Jong-Gyu;Jo, Hyun-Jeong;Jeong, Eui-Jeong;Kim, Yong-Gyu
    • Proceedings of the KSR Conference
    • /
    • 2008.11b
    • /
    • pp.1257-1263
    • /
    • 2008
  • Recent advances in embedded system technology have brought more dependence on automating train control. While much efforts have been reported to improve electronic hardware's safety, not so much systematic approaches to evaluate software's safety, especially for the vital software running on board train controllers. In this paper, we have developed a software testing tool to evaluate train control system software safety, expecially "Metric Analysis" module. We have reviewed requirements in the international standards and surveyed available tools in the market. From this, we identified the S/W metric analysis module is required for software evaluation. So we have developed S/W metric analysis module for railway signaling systems.

  • PDF

Risk Evaluation of Failure Cause for FMEA under a Weibull Time Delay Model (와이블 지연시간 모형 하에서의 FMEA를 위한 고장원인의 위험평가)

  • Kwon, Hyuck Moo;Lee, Min Koo;Hong, Sung Hoon
    • Journal of the Korean Society of Safety
    • /
    • v.33 no.3
    • /
    • pp.83-91
    • /
    • 2018
  • This paper suggests a weibull time delay model to evaluate failure risks in FMEA(failure modes and effects analysis). Assuming three types of loss functions for delayed time in failure cause detection, the risk of each failure cause is evaluated as its occurring frequency and expected loss. Since the closed form solution of the risk metric cannot be obtained, a statistical computer software R program is used for numerical calculation. When the occurrence and detection times have a common shape parameter, though, some simple results of mathematical derivation are also available. As an enormous quantity of field data becomes available under recent progress of data acquisition system, the proposed risk metric will provide a more practical and reasonable tool for evaluating the risks of failure causes in FMEA.

Evaluation of Video Quality Based on Objectively Estimated Metric

  • Koumaras Harilaos;Kourtis Anastasios;Martakos Drakoulis
    • Journal of Communications and Networks
    • /
    • v.7 no.3
    • /
    • pp.235-242
    • /
    • 2005
  • Multimedia applications and especially encoded video services, are expected to playa major role in the 3rd generation (3G) and beyond mobile communication systems. Given that future service providers are expected to provide video applications at various price and quality levels, quick and economically affordable methods for preparing/encoding the offering media at various qualities are necessary to be developed. This paper presents a method for objective evaluation of the perceived quality of MPEG­4 video content, based on a quantification of subjective assessments. Showing that subjectively derived perceived quality of service (PQoS) vs. bit rate curves can be successfully approximated by a group of exponential functions, the proposed method exploits a simple objective metric, which is obtained from the mean frame rate vs. bit rate curves of an encoded clip. The validity of this metric is assessed by comparing subjectively derived PQoS results to the corresponding ones, which come from the proposed objective method, showing that the proposed technique provides satisfactory PQoS estimation.

Framework for evaluating code generation ability of large language models

  • Sangyeop Yeo;Yu-Seung Ma;Sang Cheol Kim;Hyungkook Jun;Taeho Kim
    • ETRI Journal
    • /
    • v.46 no.1
    • /
    • pp.106-117
    • /
    • 2024
  • Large language models (LLMs) have revolutionized various applications in natural language processing and exhibited proficiency in generating programming code. We propose a framework for evaluating the code generation ability of LLMs and introduce a new metric, pass-ratio@n, which captures the granularity of accuracy according to the pass rate of test cases. The framework is intended to be fully automatic to handle the repetitive work involved in generating prompts, conducting inferences, and executing the generated codes. A preliminary evaluation focusing on the prompt detail, problem publication date, and difficulty level demonstrates the successful integration of our framework with the LeetCode coding platform and highlights the applicability of the pass-ratio@n metric.

A Study on the Evaluation Method of Aircraft Noise (국내 항공기 소음 평가방법에 관한 실험적 연구)

  • Lee, Tai-Kang;Song, Kook-Gon;Kim, Hang;Jang, Gil-Soo;Kim, Sun-Woo
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2007.11a
    • /
    • pp.421-424
    • /
    • 2007
  • Currently domestic criteria for the aircraft noise is being adapted WECPNL(weighted equivalent continuous perceived noise level), while internationally preferred method is $L_{dn}$ which is based on from $L_{eq}$ and can also evaluate environmental noise. WECPNL used in domestic as an evaluation metric is only for the aircraft noise. It is, therefore, not adequate for the evaluation of residents' injury, moreover, it is very difficult to measure the aircraft noise by WECPNL due to the complicated calculating procedures as long as automatic measuring system is not used. Accordingly, this study aims to propose alternative evaluation metric for the aircraft noise. To achieve this purpose, WECPNL, $L_{eq}$, $L_{dn}$, other metrics and criteria were compared and analyzed.

  • PDF

A Selection Process of COTS Component And Quality Evaluation Techniques (상용컴포넌트 선정 프로세스 및 품질 평가 기법)

  • Oh, Kie-Sung
    • Journal of Information Technology Services
    • /
    • v.2 no.1
    • /
    • pp.123-133
    • /
    • 2003
  • Because of rapid evolution of software technique, numerous software professionals have been concerned with component based development methodologies. However, it is hard to find out a systematic technique for the selection of COTS (Commercial Off The Shelf) component in consumer position. Up to date, the major of component quality evaluation is object-oriented metric based evaluation methodology. But this paper present four step process and evaluation criteria based on MCDM (Multiple Criteria Decision Making) technique for optimal COTS component selection in consumer position. Weconsidered funtionality, efficiency, usability based on ISO/IEC 9126 for quality measurement and executed practical analysis about commercial EJB component in internet. This paper show that the proposed selection technique is applicable to optimal COTS component selection.

Image Quality Assessment Considering both Computing Speed and Robustness to Distortions (계산 속도와 왜곡 강인성을 동시 고려한 이미지 품질 평가)

  • Kim, Suk-Won;Hong, Seongwoo;Jin, Jeong-Chan;Kim, Young-Jin
    • Journal of KIISE
    • /
    • v.44 no.9
    • /
    • pp.992-1004
    • /
    • 2017
  • To assess image quality accurately, an image quality assessment (IQA) metric is required to reflect the human visual system (HVS) properly. In other words, the structure, color, and contrast ratio of the image should be evaluated in consideration of various factors. In addition, as mobile embedded devices such as smartphone become popular, a fast computing speed is important. In this paper, the proposed IQA metric combines color similarity, gradient similarity, and phase similarity synergistically to satisfy the HVS and is designed by using optimized pooling and quantization for fast computation. The proposed IQA metric is compared against existing 13 methods using 4 kinds of evaluation methods. The experimental results show that the proposed IQA metric ranks the first on 3 evaluation methods and the first on the remaining method, next to VSI which is the most remarkable IQA metric. Its computing speed is on average about 20% faster than VSI's. In addition, we find that the proposed IQA metric has a bigger amount of correlation with the HVS than existing IQA metrics.

Scanline Based Metric for Evaluating the Accuracy of Automatic Fracture Survey Methods (자동 균열 조사기법의 정확도 평가를 위한 조사선 기반의 지표 제안)

  • Kim, Jineon;Song, Jae-Joon
    • Tunnel and Underground Space
    • /
    • v.29 no.4
    • /
    • pp.230-242
    • /
    • 2019
  • While various automatic rock fracture survey methods have been researched, the evaluation of the accuracy of these methods raises issues due to the absence of a metric which fully expresses the similarity between automatic and manual fracture maps. Therefore, this paper proposes a geometry similarity metric which is especially designed to determine the overall similarity of fracture maps and to evaluate the accuracy of rock fracture survey methods by a single number. The proposed metric, Scanline Intersection Similarity (SIS), is derived by conducting a large number of scanline surveys upon two fracture maps using Python code. By comparing the frequency of intersections over a large number of scanlines, SIS is able to express the overall similarity between two fracture maps. The proposed metric was compared with Intersection Over Union (IoU) which is a widely used evaluation metric in computer vision. Results showed that IoU is inappropriate for evaluating the geometry similarity of fracture maps because it is overly sensitive to minor geometry differences of thin elongated objects. The proposed metric, on the other hand, reflected macro-geometry differences rather than micro-geometry differences, showing good agreement with human perception. The metric was further applied to evaluate the accuracy of a deep learning-based automatic fracture surveying method which resulted as 0.674 (SIS). However, the proposed metric is currently limited to 2D fracture maps and requires comparison with rock joint parameters such as RQD.

Evaluation of Long-term Stability of Interior Orientation Parameters of a Non-metric Camera (비측량용 카메라 내부표정요소의 장기간 안정성 평가)

  • Jeong, Soo
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.29 no.3
    • /
    • pp.283-291
    • /
    • 2011
  • In case of metric cameras, not only fiducial marks but also various parameters related to camera lens are provided to users for the interior orientation process. The parameters have been acquired through precise camera calibration in laboratory by camera maker. But, in case of non-metric cameras, the interior orientation parameters should be determined in person by users through camera calibration with great number of control points. The interior orientation parameters of metric cameras are practically used for long time. But in case of non-metric cameras, the long-term stability of the interior orientation parameters have not been established. Generally, the interior orientation parameters of non-metric cameras are determined in every photogrammetric work. It's been an obstacle to use the non-metric camera in photogrammetric project because so many control points are required to get the interior orientation parameters. In this study, camera calibrations and photogrammetric observations using a non-metric camera have been implemented 25 times periodically for 6 months and the results have been analyzed. As a result, long-them stability of the interior orientation parameters of a non-metric camera is analyzed.