• Title/Summary/Keyword: total prediction error

Search Result 235, Processing Time 0.025 seconds

Preliminary Construction Cost Prediction Model Based on Module for Modernized Hanok (초기 기획단계의 신한옥 공사비 예측 모델 - 모듈(칸) 기반의 목공사 개략 물량 산출 중심으로 -)

  • Kang, Seunghee;Jung, Youngsoo
    • Korean Journal of Construction Engineering and Management
    • /
    • v.21 no.3
    • /
    • pp.48-56
    • /
    • 2020
  • Prediction of construction cost in the planning stage that provides basic information for feasibility study, budgeting, and planning is an important factor for successful project execution. In this study, a prediction model was developed for the purpose of improving the accuracy of estimating the construction cost of Hanok in the planning stage. The cost of this model is estimated by two methods. First, the cost of wood work, which accounts for the largest portion of the total construction cost, is estimated by calculating the approximate quantity under various conditions (structure type, roof type, plane type, etc.). Second, the cost of the rest work sections except the wood work is estimated by using the unit cost model. The predictive model was verified by two case projects, and the error rate of total construction cost was -4%(case 1) and -6%(case 2). These results showed an error rate in the range that can be applied to practice in the planning stage.

Spectral Weighted-Sum-of-Gray-Gases Modeling of Narrow Band for Prediction of Radiative Heat Transfer Induced from Liquid Engine Plume (액체 엔진 플룸 복사 열전달 예측을 위한 파장별 회체가스 중합법의 좁은밴드 적용)

  • Ko, Ju-Yong;kim, In-Sun
    • Aerospace Engineering and Technology
    • /
    • v.8 no.1
    • /
    • pp.17-25
    • /
    • 2009
  • The precise calculation of gas absorption coefficient in the radiative transfer equation is very important to the prediction of radiative heat transfer induced from liquid engine plume in view of base insulation design. For this purpose, the WNB model for gas absorption coefficient is described with the selection of important parameters and then the calculated results are compared with those of SNB model for validation. Total emissivity, narrow band averaged intensity and total intensity are calculated and compared to the results of SNB model. As results, the total emissivity and the total intensity are well matched within 3.1% and roughly 5 % error, respectively. Moreover, the gas modeling database is constructed with estimation of the combustion gas composition of $CO_2$ and $H_2O$ for liquid engine plume.

  • PDF

Effect of input variable characteristics on the performance of an ensemble machine learning model for algal bloom prediction (앙상블 머신러닝 모형을 이용한 하천 녹조발생 예측모형의 입력변수 특성에 따른 성능 영향)

  • Kang, Byeong-Koo;Park, Jungsu
    • Journal of Korean Society of Water and Wastewater
    • /
    • v.35 no.6
    • /
    • pp.417-424
    • /
    • 2021
  • Algal bloom is an ongoing issue in the management of freshwater systems for drinking water supply, and the chlorophyll-a concentration is commonly used to represent the status of algal bloom. Thus, the prediction of chlorophyll-a concentration is essential for the proper management of water quality. However, the chlorophyll-a concentration is affected by various water quality and environmental factors, so the prediction of its concentration is not an easy task. In recent years, many advanced machine learning algorithms have increasingly been used for the development of surrogate models to prediction the chlorophyll-a concentration in freshwater systems such as rivers or reservoirs. This study used a light gradient boosting machine(LightGBM), a gradient boosting decision tree algorithm, to develop an ensemble machine learning model to predict chlorophyll-a concentration. The field water quality data observed at Daecheong Lake, obtained from the real-time water information system in Korea, were used for the development of the model. The data include temperature, pH, electric conductivity, dissolved oxygen, total organic carbon, total nitrogen, total phosphorus, and chlorophyll-a. First, a LightGBM model was developed to predict the chlorophyll-a concentration by using the other seven items as independent input variables. Second, the time-lagged values of all the input variables were added as input variables to understand the effect of time lag of input variables on model performance. The time lag (i) ranges from 1 to 50 days. The model performance was evaluated using three indices, root mean squared error-observation standard deviation ration (RSR), Nash-Sutcliffe coefficient of efficiency (NSE) and mean absolute error (MAE). The model showed the best performance by adding a dataset with a one-day time lag (i=1) where RSR, NSE, and MAE were 0.359, 0.871 and 1.510, respectively. The improvement of model performance was observed when a dataset with a time lag up of about 15 days (i=15) was added.

Evaluation of UM-LDAPS Prediction Model for Solar Irradiance by using Ground Observation at Fine Temporal Resolution (고해상도 일사량 관측 자료를 이용한 UM-LDAPS 예보 모형 성능평가)

  • Kim, Chang Ki;Kim, Hyun-Goo;Kang, Yong-Heack;Kim, Jin-Young
    • Journal of the Korean Solar Energy Society
    • /
    • v.40 no.5
    • /
    • pp.13-22
    • /
    • 2020
  • Day ahead forecast is necessary for the electricity market to stabilize the electricity penetration. Numerical weather prediction is usually employed to produce the solar irradiance as well as electric power forecast for longer than 12 hours forecast horizon. Korea Meteorological Administration operates the UM-LDAPS model to produce the 36 hours forecast of hourly total irradiance 4 times a day. This study interpolates the hourly total irradiance into 15 minute instantaneous irradiance and then compare them with observed solar irradiance at four ground stations at 1 minute resolution. Numerical weather prediction model employed here was produced at 00 UTC or 18 UTC from January to December, 2018. To compare the statistical model for the forecast horizon less than 3 hours, smart persistent model is used as a reference model. Relative root mean square error of 15 minute instantaneous irradiance are averaged over all ground stations as being 18.4% and 19.6% initialized at 18 and 00 UTC, respectively. Numerical weather prediction is better than smart persistent model at 1 hour after simulation began.

Estimating the unconfined compression strength of low plastic clayey soils using gene-expression programming

  • Muhammad Naqeeb Nawaz;Song-Hun Chong;Muhammad Muneeb Nawaz;Safeer Haider;Waqas Hassan;Jin-Seop Kim
    • Geomechanics and Engineering
    • /
    • v.33 no.1
    • /
    • pp.1-9
    • /
    • 2023
  • The unconfined compression strength (UCS) of soils is commonly used either before or during the construction of geo-structures. In the pre-design stage, UCS as a mechanical property is obtained through a laboratory test that requires cumbersome procedures and high costs from in-situ sampling and sample preparation. As an alternative way, the empirical model established from limited testing cases is used to economically estimate the UCS. However, many parameters affecting the 1D soil compression response hinder employing the traditional statistical analysis. In this study, gene expression programming (GEP) is adopted to develop a prediction model of UCS with common affecting soil properties. A total of 79 undisturbed soil samples are collected, of which 54 samples are utilized for the generation of a predictive model and 25 samples are used to validate the proposed model. Experimental studies are conducted to measure the unconfined compression strength and basic soil index properties. A performance assessment of the prediction model is carried out using statistical checks including the correlation coefficient (R), the root mean square error (RMSE), the mean absolute error (MAE), the relatively squared error (RSE), and external criteria checks. The prediction model has achieved excellent accuracy with values of R, RMSE, MAE, and RSE of 0.98, 10.01, 7.94, and 0.03, respectively for the training data and 0.92, 19.82, 14.56, and 0.15, respectively for the testing data. From the sensitivity analysis and parametric study, the liquid limit and fine content are found to be the most sensitive parameters whereas the sand content is the least critical parameter.

A Basic Study to Predict Solar Insolation using Meteorological Observation Data in Korea (국내 기상 측정결과를 이용한 일사량 예측 방법 기초 연구)

  • Hwangbo, Seong;Kim, Hayang;Kim, Jeongbae
    • Journal of Institute of Convergence Technology
    • /
    • v.4 no.2
    • /
    • pp.27-33
    • /
    • 2014
  • To well design the solar energy system using solar energy, the correlation to calculate solar irradiation is basically needed. So, this study was performed to reveal the relationships between the solar irradiation and four meteorological observation data(dry bulb temperature, relative humidity, sunshine duration, and cloud cover) which are different from previous other researches. And then, we finally proposed the first order non-linear correlation from the measured solar irradiation using four meteorological observation data with MINITAB. To show the deviation of the solar irradiation between measured and calculated, this study compared using the daily total solar irradiance and the maximum peak value. From those results, the calculation error was estimated about maximum 25.4% for the daily total solar irradiance. The error of the solar irradiation between measured and calculated was made from the curve fitting error. So, solar irradiation prediction correlation with higher accuracy can be obtained using 2nd or higher order terms with four meteorological observation data.

Artificial Neural Network Prediction of Normalized Polarity Parameter for Various Solvents with Diverse Chemical Structures

  • Habibi-Yangjeh, Aziz
    • Bulletin of the Korean Chemical Society
    • /
    • v.28 no.9
    • /
    • pp.1472-1476
    • /
    • 2007
  • Artificial neural networks (ANNs) are successfully developed for the modeling and prediction of normalized polarity parameter (ETN) of 216 various solvents with diverse chemical structures using a quantitative-structure property relationship. ANN with architecture 5-9-1 is generated using five molecular descriptors appearing in the multi-parameter linear regression (MLR) model. The most positive charge of a hydrogen atom (q+), total charge in molecule (qt), molecular volume of solvent (Vm), dipole moment (μ) and polarizability term (πI) are input descriptors and its output is ETN. It is found that properly selected and trained neural network with 192 solvents could fairly represent the dependence of normalized polarity parameter on molecular descriptors. For evaluation of the predictive power of the generated ANN, an optimized network is applied for prediction of the ETN values of 24 solvents in the prediction set, which are not used in the optimization procedure. Correlation coefficient (R) and root mean square error (RMSE) of 0.903 and 0.0887 for prediction set by MLR model should be compared with the values of 0.985 and 0.0375 by ANN model. These improvements are due to the fact that the ETN of solvents shows non-linear correlations with the molecular descriptors.

Development of ensemble machine learning model considering the characteristics of input variables and the interpretation of model performance using explainable artificial intelligence (수질자료의 특성을 고려한 앙상블 머신러닝 모형 구축 및 설명가능한 인공지능을 이용한 모형결과 해석에 대한 연구)

  • Park, Jungsu
    • Journal of Korean Society of Water and Wastewater
    • /
    • v.36 no.4
    • /
    • pp.239-248
    • /
    • 2022
  • The prediction of algal bloom is an important field of study in algal bloom management, and chlorophyll-a concentration(Chl-a) is commonly used to represent the status of algal bloom. In, recent years advanced machine learning algorithms are increasingly used for the prediction of algal bloom. In this study, XGBoost(XGB), an ensemble machine learning algorithm, was used to develop a model to predict Chl-a in a reservoir. The daily observation of water quality data and climate data was used for the training and testing of the model. In the first step of the study, the input variables were clustered into two groups(low and high value groups) based on the observed value of water temperature(TEMP), total organic carbon concentration(TOC), total nitrogen concentration(TN) and total phosphorus concentration(TP). For each of the four water quality items, two XGB models were developed using only the data in each clustered group(Model 1). The results were compared to the prediction of an XGB model developed by using the entire data before clustering(Model 2). The model performance was evaluated using three indices including root mean squared error-observation standard deviation ratio(RSR). The model performance was improved using Model 1 for TEMP, TN, TP as the RSR of each model was 0.503, 0.477 and 0.493, respectively, while the RSR of Model 2 was 0.521. On the other hand, Model 2 shows better performance than Model 1 for TOC, where the RSR was 0.532. Explainable artificial intelligence(XAI) is an ongoing field of research in machine learning study. Shapley value analysis, a novel XAI algorithm, was also used for the quantitative interpretation of the XGB model performance developed in this study.

Development of a soil total carbon prediction model using a multiple regression analysis method

  • Jun-Hyuk, Yoo;Jwa-Kyoung, Sung;Deogratius, Luyima;Taek-Keun, Oh;Jaesung, Cho
    • Korean Journal of Agricultural Science
    • /
    • v.48 no.4
    • /
    • pp.891-897
    • /
    • 2021
  • There is a need for a technology that can quickly and accurately analyze soil carbon contents. Existing soil carbon analysis methods are cumbersome in terms of professional manpower requirements, time, and cost. It is against this background that the present study leverages the soil physical properties of color and water content levels to develop a model capable of predicting the carbon content of soil sample. To predict the total carbon content of soil, the RGB values, water content of the soil, and lux levels were analyzed and used as statistical data. However, when R, G, and B with high correlations were all included in a multiple regression analysis as independent variables, a high level of multicollinearity was noted and G was thus excluded from the model. The estimates showed that the estimation coefficients for all independent variables were statistically significant at a significance level of 1%. The elastic values of R and B for the soil carbon content, which are of major interest in this study, were -2.90 and 1.47, respectively, showing that a 1% increase in the R value was correlated with a 2.90% decrease in the carbon content, whereas a 1% increase in the B value tallied with a 1.47% increase in the carbon content. Coefficient of determination (R2), root mean square error (RMSE), and mean absolute percentage error (MAPE) methods were used for regression verification, and calibration samples showed higher accuracy than the validation samples in terms of R2 and MAPE.

Predicting Human Errors in Landing Situations of Aircraft by Using SHERPA (SHERPA기법을 이용한 항공기 착륙상황에서 발생 가능한 인적오류 예측)

  • Choi, Jae-Rim;Han, Hyeok Jae;Ham, Dong-Han
    • Journal of the Korean Society for Aviation and Aeronautics
    • /
    • v.29 no.2
    • /
    • pp.14-24
    • /
    • 2021
  • This study aims to examine probable human errors when landing an airplane by the use of SHERPA(systematic human error reduction and prediction approach) and propose methods for preventing the predictive human errors. It has been reported that human errors are concerned with a lot of accidents or incidents of an airplane. It is significant to predict presumable human errors, particularly in the operation mode of human-automation interaction, and attempt to reduce the likelihood of predicted human error. By referring to task procedures and interviewing domain experts, we analyzed airplane landing task by using HTA(hierarchical task analysis) method. In total, 6 sub-tasks and 19 operations were identified from the task analysis. SHERPA method was used for predicting probable human error types for each task. As a result, we identified 31 human errors and predicted their occurrence probability and criticality. Based on them, we suggested a set of methods for minimizing the probability of the predicted human errors. From this study, it can be said that SHERPA can be effectively used for predicting probable human error types in the context of human-automation interaction needed for navigating an airplane.