• Title/Summary/Keyword: Automatic validation

Search Result 180, Processing Time 0.035 seconds

Compositional Feature Selection and Its Effects on Bandgap Prediction by Machine Learning (기계학습을 이용한 밴드갭 예측과 소재의 조성기반 특성인자의 효과)

  • Chunghee Nam
    • Korean Journal of Materials Research
    • /
    • v.33 no.4
    • /
    • pp.164-174
    • /
    • 2023
  • The bandgap characteristics of semiconductor materials are an important factor when utilizing semiconductor materials for various applications. In this study, based on data provided by AFLOW (Automatic-FLOW for Materials Discovery), the bandgap of a semiconductor material was predicted using only the material's compositional features. The compositional features were generated using the python module of 'Pymatgen' and 'Matminer'. Pearson's correlation coefficients (PCC) between the compositional features were calculated and those with a correlation coefficient value larger than 0.95 were removed in order to avoid overfitting. The bandgap prediction performance was compared using the metrics of R2 score and root-mean-squared error. By predicting the bandgap with randomforest and xgboost as representatives of the ensemble algorithm, it was found that xgboost gave better results after cross-validation and hyper-parameter tuning. To investigate the effect of compositional feature selection on the bandgap prediction of the machine learning model, the prediction performance was studied according to the number of features based on feature importance methods. It was found that there were no significant changes in prediction performance beyond the appropriate feature. Furthermore, artificial neural networks were employed to compare the prediction performance by adjusting the number of features guided by the PCC values, resulting in the best R2 score of 0.811. By comparing and analyzing the bandgap distribution and prediction performance according to the material group containing specific elements (F, N, Yb, Eu, Zn, B, Si, Ge, Fe Al), various information for material design was obtained.

Experimental and numerical validation of guided wave based on time-reversal for evaluating grouting defects of multi-interface sleeve

  • Jiahe Liu;Li Tang;Dongsheng Li;Wei Shen
    • Smart Structures and Systems
    • /
    • v.33 no.1
    • /
    • pp.41-53
    • /
    • 2024
  • Grouting sleeves are an essential connecting component of prefabricated components, and the quality of grouting has a significant influence on structural integrity and seismic performance. The embedded grouting sleeve (EGS)'s grouting defects are highly undetectable and random, and no effective monitoring method exists. This paper proposes an ultrasonic guided wave method and provides a set of guidelines for selecting the optimal frequency and suitable period for the EGS. The optimal frequency was determined by considering the group velocity, wave structure, and wave attenuation of the selected mode. Guided waves are prone to multi-modality, modal conversion, energy leakage, and dispersion in the EGS, which is a multi-layer structure. Therefore, a time-reversal (TR)-based multi-mode focusing and dispersion automatic compensation technology is introduced to eliminate the multi-mode phase difference in the EGS. First, the influence of defects on guided waves is analyzed according to the TR coefficient. Second, two major types of damage indicators, namely, the time domain and the wavelet packet energy, are constructed according to the influence method. The constructed wavelet packet energy indicator is more sensitive to the changes of defecting than the conventional time-domain similarity indicator. Both numerical and experimental results show that the proposed method is feasible and beneficial for the detection and quantitative estimation of the grouting defects of the EGS.

Development of an AIDA(Automatic Incident Detection Algorithm) for Uninterrupted Flow Based on the Concept of Short-term Displaced Flow (연속류도로 단기 적체 교통량 개념 기반 돌발상황 자동감지 알고리즘 개발)

  • Lee, Kyu-Soon;Shin, Chi-Hyun
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.15 no.2
    • /
    • pp.13-23
    • /
    • 2016
  • Many traffic centers are highly hesitant in employing existing Automatic Incident Detection Algorithms due to high false alarm rate, low detection rate, and enormous effort taken in maintaining algorithm parameters, together with complex algorithm structure and filtering/smoothing process. Concerns grow over the situation particularly in Freeway Incident Management Area This study proposes a new algorithm and introduces a novel concept, the Displaced Flow Index (DiFI) which is similar to a product of relative speed and relative occupancy for every execution period. The algorithm structure is very simple, also easy to understand with minimum parameters, and could use raw data without any additional pre-processing. To evaluate the performance of the DiFI algorithm, validation test on the algorithm has been conducted using detector data taken from Naebu Expressway in Seoul and following transferability tests with Gyeongbu Expressway detector data. Performance test has utilized many indices such as DR, FAR, MTTD (Mean Time To Detect), CR (Classification Rate), CI (Composite Index) and PI (Performance Index). It was found that the DR is up to 100%, the MTTD is a little over 1.0 minutes, and the FAR is as low as 2.99%. This newly designed algorithm seems promising and outperformed SAO and most popular AIDAs such as APID and DELOS, and showed the best performance in every category.

SWAT model calibration/validation using SWAT-CUP I: analysis for uncertainties of objective functions (SWAT-CUP을 이용한 SWAT 모형 검·보정 I: 목적함수에 따른 불확실성 분석)

  • Yu, Jisoo;Noh, Joonwoo;Cho, Younghyun
    • Journal of Korea Water Resources Association
    • /
    • v.53 no.1
    • /
    • pp.45-56
    • /
    • 2020
  • This study aims to quantify the uncertainty that can be induced by the objective function when calibrating SWAT parameters using SWAT-CUP. SWAT model was constructed to estimate runoff in Naesenong-cheon, which is the one of mid-watershed in Nakdong River basin, and then automatic calibration was performed using eight objective functions (R2, bR2, NS, MNS, KGE, PBIAS, RSR, and SSQR). The optimum parameter sets obtained from each objective function showed different ranges, and thus the corresponding hydrologic characteristics of simulated data were also derived differently. This is because each objective function is sensitive to specific hydrologic signatures and evaluates model performance in an unique way. In other words, one objective function might be sensitive to the residual of the extreme value, so that well produce the peak value, whereas ignores the average or low flow residuals. Therefore, the hydrological similarity between the simulated and measured values was evaluated in order to select the optimum objective function. The hydrologic signatures, which include not only the magnitude, but also the ratio of the inclining and declining time in hydrograph, were defined to consider the timing of the flow occurrence, the response of watershed, and the increasing and decreasing trend. The results of evaluation were quantified by scoring method, and hence the optimal objective functions for SWAT parameter calibration were determined as MNS (342.48) and SSQR (346.45) with the highest total scores.

Prediction of Greenhouse Strawberry Production Using Machine Learning Algorithm (머신러닝 알고리즘을 이용한 온실 딸기 생산량 예측)

  • Kim, Na-eun;Han, Hee-sun;Arulmozhi, Elanchezhian;Moon, Byeong-eun;Choi, Yung-Woo;Kim, Hyeon-tae
    • Journal of Bio-Environment Control
    • /
    • v.31 no.1
    • /
    • pp.1-7
    • /
    • 2022
  • Strawberry is a stand-out cultivating fruit in Korea. The optimum production of strawberry is highly dependent on growing environment. Smart farm technology, and automatic monitoring and control system maintain a favorable environment for strawberry growth in greenhouses, as well as play an important role to improve production. Moreover, physiological parameters of strawberry plant and it is surrounding environment may allow to give an idea on production of strawberry. Therefore, this study intends to build a machine learning model to predict strawberry's yield, cultivated in greenhouse. The environmental parameter like as temperature, humidity and CO2 and physiological parameters such as length of leaves, number of flowers and fruits and chlorophyll content of 'Seolhyang' (widely growing strawberry cultivar in Korea) were collected from three strawberry greenhouses located in Sacheon of Gyeongsangnam-do during the period of 2019-2020. A predictive model, Lasso regression was designed and validated through 5-fold cross-validation. The current study found that performance of the Lasso regression model is good to predict the number of flowers and fruits, when the MAPE value are 0.511 and 0.488, respectively during the model validation. Overall, the present study demonstrates that using AI based regression model may be convenient for farms and agricultural companies to predict yield of crops with fewer input attributes.

A Study on the Automatic Detection of Railroad Power Lines Using LiDAR Data and RANSAC Algorithm (LiDAR 데이터와 RANSAC 알고리즘을 이용한 철도 전력선 자동탐지에 관한 연구)

  • Jeon, Wang Gyu;Choi, Byoung Gil
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.31 no.4
    • /
    • pp.331-339
    • /
    • 2013
  • LiDAR has been one of the widely used and important technologies for 3D modeling of ground surface and objects because of its ability to provide dense and accurate range measurement. The objective of this research is to develop a method for automatic detection and modeling of railroad power lines using high density LiDAR data and RANSAC algorithms. For detecting railroad power lines, multi-echoes properties of laser data and shape knowledge of railroad power lines were employed. Cuboid analysis for detecting seed line segments, tracking lines, connecting and labeling are the main processes. For modeling railroad power lines, iterative RANSAC and least square adjustment were carried out to estimate the lines parameters. The validation of the result is very challenging due to the difficulties in determining the actual references on the ground surface. Standard deviations of 8cm and 5cm for x-y and z coordinates, respectively are satisfactory outcomes. In case of completeness, the result of visual inspection shows that all the lines are detected and modeled well as compare with the original point clouds. The overall processes are fully automated and the methods manage any state of railroad wires efficiently.

Sensitivity Analysis of the High-Resolution WISE-WRF Model with the Use of Surface Roughness Length in Seoul Metropolitan Areas (서울지역의 고해상도 WISE-WRF 모델의 지표면 거칠기 길이 개선에 따른 민감도 분석)

  • Jee, Joon-Bum;Jang, Min;Yi, Chaeyeon;Zo, Il-Sung;Kim, Bu-Yo;Park, Moon-Soo;Choi, Young-Jean
    • Atmosphere
    • /
    • v.26 no.1
    • /
    • pp.111-126
    • /
    • 2016
  • In the numerical weather model, surface properties can be defined by various parameters such as terrain height, landuse, surface albedo, soil moisture, surface emissivity, roughness length and so on. And these parameters need to be improved in the Seoul metropolitan area that established high-rise and complex buildings by urbanization at a recent time. The surface roughness length map is developed from digital elevation model (DEM) and it is implemented to the high-resolution numerical weather (WISE-WRF) model. Simulated results from WISE-WRF model are analyzed the relationship between meteorological variables to changes in the surface roughness length. Friction speed and wind speed are improved with various surface roughness in urban, these variables affected to temperature and relative humidity and hence the surface roughness length will affect to the precipitation and Planetary Boundary Layer (PBL) height. When surface variables by the WISE-WRF model are validated with Automatic Weather System (AWS) observations, NEW experiment is able to simulate more accurate than ORG experiment in temperature and wind speed. Especially, wind speed is overestimated over $2.5m\;s^{-1}$ on some AWS stations in Seoul and surrounding area but it improved with positive correlation and Root Mean Square Error (RMSE) below $2.5m\;s^{-1}$ in whole area. There are close relationship between surface roughness length and wind speed, and the change of surface variables lead to the change of location and duration of precipitation. As a result, the accuracy of WISE-WRF model is improved with the new surface roughness length retrieved from DEM, and its surface roughness length is important role in the high-resolution WISE-WRF model. By the way, the result in this study need various validation from retrieved the surface roughness length to numerical weather model simulations with observation data.

Automated Vehicle Research by Recognizing Maneuvering Modes using LSTM Model (LSTM 모델 기반 주행 모드 인식을 통한 자율 주행에 관한 연구)

  • Kim, Eunhui;Oh, Alice
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.16 no.4
    • /
    • pp.153-163
    • /
    • 2017
  • This research is based on the previous research that personally preferred safe distance, rotating angle and speed are differentiated. Thus, we use machine learning model for recognizing maneuvering modes trained per personal or per similar driving pattern groups, and we evaluate automatic driving according to maneuvering modes. By utilizing driving knowledge, we subdivided 8 kinds of longitudinal modes and 4 kinds of lateral modes, and by combining the longitudinal and lateral modes, we build 21 kinds of maneuvering modes. we train the labeled data set per time stamp through RNN, LSTM and Bi-LSTM models by the trips of drivers, which are supervised deep learning models, and evaluate the maneuvering modes of automatic driving for the test data set. The evaluation dataset is aggregated of living trips of 3,000 populations by VTTI in USA for 3 years and we use 1500 trips of 22 people and training, validation and test dataset ratio is 80%, 10% and 10%, respectively. For recognizing longitudinal 8 kinds of maneuvering modes, RNN achieves better accuracy compared to LSTM, Bi-LSTM. However, Bi-LSTM improves the accuracy in recognizing 21 kinds of longitudinal and lateral maneuvering modes in comparison with RNN and LSTM as 1.54% and 0.47%, respectively.

The Analysis and Implication of Link Integrity of Korean Public Institution Website from a Web Usability Perspective (웹 사용성 관점에서 공공기관 웹 사이트의 링크 유효성 분석 및 개선 과제)

  • Moon, Hyun Ju;Kim, Suk Il
    • 재활복지
    • /
    • v.17 no.4
    • /
    • pp.291-309
    • /
    • 2013
  • Web usability is utilized as a standard for constructing and managing website to be useful by people of all abilities and disabilities. Link integrity is one of main criteria for web usability evaluation. Links that are found to be broken or pointing to irrelevant information lead gradual degradation of link integrity over time. This study has conducted investigation for link integrity of Korean public institution website. Among 49 website that are accessible by an automatic link checker software, 91.8% of the website has more than one broken links. Also average number of broken links is 50.5 that is 0.33% of the number of links. 29 websites consist of internal broken links. And 22 website do not provide any information on the missing page. Broken links enlower website confidence by reducing web usability as well as they prohibit web search engine not to provide information on the website. Therefore, web developers and managers need to be aware of importance of link integrity and prohibit web usability degradation by performing link validation as a web management task or periodically.

Validation on the algorithm of estimation of collision risk among ships based on AIS data of actual ships' collision accident (선박충돌사고 AIS 데이터 기반 선박 충돌위험도 추정 알고리즘 검증에 관한 연구)

  • Son, Nam-Sun;Kim, Sun-Young
    • Proceedings of the Korean Institute of Navigation and Port Research Conference
    • /
    • 2010.10a
    • /
    • pp.180-181
    • /
    • 2010
  • An estimation algorithm of collision risk among multiple ships has been developed in order to reduce human error and prevent collision accidents. The algorithm is designed to calculate the collision risk among ships based on Fuzzy theory by using AIS data as traffic information. In this paper, to validate the algorithm, the AIS data of actual collision accident, which occurred between a product carrier and a cargo carrier in Busan harbor in 2009 are collected. The replay simulation is carried out on the actual AIS data and the collision risk is calculated in real time. In this paper, the features of the estimation algorithm of collision risk and the results of replay simulation based on AIS data of actual collision accident are discussed.

  • PDF