• Title/Summary/Keyword: random coefficient models

Search Result 98, Processing Time 0.026 seconds

Comparison Analysis of Machine Learning for Concrete Crack Depths Prediction Using Thermal Image and Environmental Parameters (열화상 이미지와 환경변수를 이용한 콘크리트 균열 깊이 예측 머신 러닝 분석)

  • Kim, Jihyung;Jang, Arum;Park, Min Jae;Ju, Young K.
    • Journal of Korean Association for Spatial Structures
    • /
    • v.21 no.2
    • /
    • pp.99-110
    • /
    • 2021
  • This study presents the estimation of crack depth by analyzing temperatures extracted from thermal images and environmental parameters such as air temperature, air humidity, illumination. The statistics of all acquired features and the correlation coefficient among thermal images and environmental parameters are presented. The concrete crack depths were predicted by four different machine learning models: Multi-Layer Perceptron (MLP), Random Forest (RF), Gradient Boosting (GB), and AdaBoost (AB). The machine learning algorithms are validated by the coefficient of determination, accuracy, and Mean Absolute Percentage Error (MAPE). The AB model had a great performance among the four models due to the non-linearity of features and weak learner aggregation with weights on misclassified data. The maximum depth 11 of the base estimator in the AB model is efficient with high performance with 97.6% of accuracy and 0.07% of MAPE. Feature importances, permutation importance, and partial dependence are analyzed in the AB model. The results show that the marginal effect of air humidity, crack depth, and crack temperature in order is higher than that of the others.

Prediction of Residual Resistance Coefficient of Low-Speed Full Ships Using Hull Form Variables and Machine Learning Approaches (선형변수 기계학습 기법을 활용한 저속비대선의 잉여저항계수 추정)

  • Kim, Yoo-Chul;Yang, Kyung-Kyu;Kim, Myung-Soo;Lee, Young-Yeon;Kim, Kwang-Soo
    • Journal of the Society of Naval Architects of Korea
    • /
    • v.57 no.6
    • /
    • pp.312-321
    • /
    • 2020
  • In this study, machine learning techniques were applied to predict the residual resistance coefficient (Cr) of low-speed full ships. The used machine learning methods are Ridge regression, support vector regression, random forest, neural network and their ensemble model. 19 hull form variables were used as input variables for machine learning methods. The hull form variables and Cr data obtained from 139 hull forms of KRISO database were used in analysis. 80 % of the total data were used as training models and the rest as validation. Some non-linear models showed the overfitted results and the ensemble model showed better results than others.

Simultaneous Determination of Reference Free-Stream Temperature and Convective Heat Transfer Coefficients (자유흐름 온도와 대류열전달 계수를 동시에 측정할 수 있는 실험 방법에 대한 연구)

  • Jeong, Gi-Ho;Song, Ki-Bum;Kim, Kui-Soon
    • Transactions of the Korean Society of Mechanical Engineers B
    • /
    • v.26 no.12
    • /
    • pp.1707-1714
    • /
    • 2002
  • This paper deals with the development of a new method that can obtain heat transfer coefficient and reference free stream temperature simultaneously, The method is based on transient heat transfer experiments using two narrow-band TLCs. The method is validated through error analysis in terms of the random uncertainties in the measured temperatures. It is found that the errors could be reduced more than 2 times less. The general method described in this paper is applicable to many heat transfer models with unknown free stream temperature.

Predicting Oxynitrification layer using AI-based Varying Coefficient Regression model (AI 기반의 Varying Coefficient Regression 모델을 이용한 산질화층 예측)

  • Hye Jung Park;Joo Yong Shim;Kyong Jun An;Chang Ha Hwang;Je Hyun Han
    • Journal of the Korean Society for Heat Treatment
    • /
    • v.36 no.6
    • /
    • pp.374-381
    • /
    • 2023
  • This study develops and evaluates a deep learning model for predicting oxide and nitride layers based on plasma process data. We introduce a novel deep learning-based Varying Coefficient Regressor (VCR) by adapting the VCR, which previously relied on an existing unique function. This model is employed to forecast the oxide and nitride layers within the plasma. Through comparative experiments, the proposed VCR-based model exhibits superior performance compared to Long Short-Term Memory, Random Forest, and other methods, showcasing its excellence in predicting time series data. This study indicates the potential for advancing prediction models through deep learning in the domain of plasma processing and highlights its application prospects in industrial settings.

Posterior Inference in Single-Index Models

  • Park, Chun-Gun;Yang, Wan-Yeon;Kim, Yeong-Hwa
    • Communications for Statistical Applications and Methods
    • /
    • v.11 no.1
    • /
    • pp.161-168
    • /
    • 2004
  • A single-index model is useful in fields which employ multidimensional regression models. Many methods have been developed in parametric and nonparametric approaches. In this paper, posterior inference is considered and a wavelet series is thought of as a function approximated to a true function in the single-index model. The posterior inference needs a prior distribution for each parameter estimated. A prior distribution of each coefficient of the wavelet series is proposed as a hierarchical distribution. A direction $\beta$ is assumed with a unit vector and affects estimate of the true function. Because of the constraint of the direction, a transformation, a spherical polar coordinate $\theta$, of the direction is required. Since the posterior distribution of the direction is unknown, we apply a Metropolis-Hastings algorithm to generate random samples of the direction. Through a Monte Carlo simulation we investigate estimates of the true function and the direction.

Effects of Overdispersion on Testing for Serial Dependence in the Time Series of Counts Data

  • Kim, Hee-Young;Park, You-Sung
    • Communications for Statistical Applications and Methods
    • /
    • v.17 no.6
    • /
    • pp.829-843
    • /
    • 2010
  • To test for the serial dependence in time series of counts data, Jung and Tremayne (2003) evaluated the size and power of several tests under the class of INARMA models based on binomial thinning operations for Poisson marginal distributions. The overdispersion phenomenon(i.e., a variance greater than the expectation) is common in the real world. Overdispersed count data can be modeled by using alternative thinning operations such as random coefficient thinning, iterated thinning, and quasi-binomial thinning. Such thinning operations can lead to time series models of counts with negative binomial or generalized Poisson marginal distributions. This paper examines whether the test statistics used by Jung and Tremayne (2003) on serial dependence in time series of counts data are affected by overdispersion.

Predictive Model of Optimal Continuous Positive Airway Pressure for Obstructive Sleep Apnea Patients with Obesity by Using Machine Learning (비만 폐쇄수면무호흡 환자에서 기계학습을 통한 적정양압 예측모형)

  • Kim, Seung Soo;Yang, Kwang Ik
    • Journal of Sleep Medicine
    • /
    • v.15 no.2
    • /
    • pp.48-54
    • /
    • 2018
  • Objectives: The aim of this study was to develop a predicting model for the optimal continuous positive airway pressure (CPAP) for obstructive sleep apnea (OSA) patient with obesity by using a machine learning. Methods: We retrospectively investigated the medical records of 162 OSA patients who had obesity [body mass index (BMI) ≥ 25] and undertaken successful CPAP titration study. We divided the data to a training set (90%) and a test set (10%), randomly. We made a random forest model and a least absolute shrinkage and selection operator (lasso) regression model to predict the optimal pressure by using the training set, and then applied our models and previous reported equations to the test set. To compare the fitness of each models, we used a correlation coefficient (CC) and a mean absolute error (MAE). Results: The random forest model showed the best performance {CC 0.78 [95% confidence interval (CI) 0.43-0.93], MAE 1.20}. The lasso regression model also showed the improved result [CC 0.78 (95% CI 0.42-0.93), MAE 1.26] compared to the Hoffstein equation [CC 0.68 (95% CI 0.23-0.89), MAE 1.34] and the Choi's equation [CC 0.72 (95% CI 0.30-0.90), MAE 1.40]. Conclusions: Our random forest model and lasso model ($26.213+0.084{\times}BMI+0.004{\times}$apnea-hypopnea index+$0.004{\times}oxygen$ desaturation index-$0.215{\times}mean$ oxygen saturation) showed the improved performance compared to the previous reported equations. The further study for other subgroup or phenotype of OSA is required.

Short-term Construction Investment Forecasting Model in Korea (건설투자(建設投資)의 단기예측모형(短期豫測模型) 비교(比較))

  • Kim, Kwan-young;Lee, Chang-soo
    • KDI Journal of Economic Policy
    • /
    • v.14 no.1
    • /
    • pp.121-145
    • /
    • 1992
  • This paper examines characteristics of time series data related to the construction investment(stationarity and time series components such as secular trend, cyclical fluctuation, seasonal variation, and random change) and surveys predictibility, fitness, and explicability of independent variables of various models to build a short-term construction investment forecasting model suitable for current economic circumstances. Unit root test, autocorrelation coefficient and spectral density function analysis show that related time series data do not have unit roots, fluctuate cyclically, and are largely explicated by lagged variables. Moreover it is very important for the short-term construction investment forecasting to grasp time lag relation between construction investment series and leading indicators such as building construction permits and value of construction orders received. In chapter 3, we explicate 7 forecasting models; Univariate time series model (ARIMA and multiplicative linear trend model), multivariate time series model using leading indicators (1st order autoregressive model, vector autoregressive model and error correction model) and multivariate time series model using National Accounts data (simple reduced form model disconnected from simultaneous macroeconomic model and VAR model). These models are examined by 4 statistical tools that are average absolute error, root mean square error, adjusted coefficient of determination, and Durbin-Watson statistic. This analysis proves two facts. First, multivariate models are more suitable than univariate models in the point that forecasting error of multivariate models tend to decrease in contrast to the case of latter. Second, VAR model is superior than any other multivariate models; average absolute prediction error and root mean square error of VAR model are quitely low and adjusted coefficient of determination is higher. This conclusion is reasonable when we consider current construction investment has sustained overheating growth more than secular trend.

  • PDF

A study of glass and carbon fibers in FRAC utilizing machine learning approach

  • Ankita Upadhya;M. S. Thakur;Nitisha Sharma;Fadi H. Almohammed;Parveen Sihag
    • Advances in materials Research
    • /
    • v.13 no.1
    • /
    • pp.63-86
    • /
    • 2024
  • Asphalt concrete (AC), is a mixture of bitumen and aggregates, which is very sensitive in the design of flexible pavement. In this study, the Marshall stability of the glass and carbon fiber bituminous concrete was predicted by using Artificial Neural Network (ANN), Support Vector Machine (SVM), Random Forest (RF), and M5P Tree machine learning algorithms. To predict the Marshall stability, nine inputs parameters i.e., Bitumen, Glass and Carbon fibers mixed in 100:0, 75:25, 50:50, 25:75, 0:100 percentage (designated as 100GF:0CF, 75GF:25CF, 50GF:50 CF, 25GF:75CF, 0GF:100CF), Bitumen grade (VG), Fiber length (FL), and Fiber diameter (FD) were utilized from the experimental and literary data. Seven statistical indices i.e., coefficient of correlation (CC), mean absolute error (MAE), root mean squared error (RMSE), relative absolute error (RAE), root relative squared error (RRSE), Scattering index (SI), and BIAS were applied to assess the effectiveness of the developed models. According to the performance evaluation results, Artificial neural network (ANN) was outperforming among other models with CC values as 0.9147 and 0.8648, MAE values as 1.3757 and 1.978, RMSE values as 1.843 and 2.6951, RAE values as 39.88 and 49.31, RRSE values as 40.62 and 50.50, SI values as 0.1379 and 0.2027 and BIAS value as -0.1 290 and -0.2357 in training and testing stage respectively. The Taylor diagram (testing stage) also confirmed that the ANN-based model outperforms the other models. Results of sensitivity analysis showed that the fiber length is the most influential in all nine input parameters whereas the fiber combination of 25GF:75CF was the most effective among all the fiber mixes in Marshall stability.

A Development of Traffic Accident Estimation Model by Random Parameter Negative Binomial Model: Focus on Multilane Rural Highway (확률모수를 이용한 교통사고예측모형 개발: 지방부 다차로 도로를 중심으로)

  • Lim, Joon Beom;Lee, Soo Beom;Kim, Joon-Ki;Kim, Jeong Hyun
    • Journal of Korean Society of Transportation
    • /
    • v.32 no.6
    • /
    • pp.662-674
    • /
    • 2014
  • In this study, accident frequency prediction models were constructed by collecting variables such as geometric structures, safety facilities, traffic volume and weather conditions, land use, highway design-satisfaction criteria along 780km (4,372 sections) of 4 lane-highways over 8 areas. As for models, a fixed parameter model and a random parameter model were employed. In the random parameter model, some influences were reversed as the range was expressed based on specific probability in the case of no fixed coefficients. In the fixed parameter model, the influences of independent variables on accident frequency were interpreted by using one coefficient, but in the random parameter model, more various interpretations were took place. In particular, curve radius, securement of shoulder lane, vertical grade design criteria satisfaction showed both positive and negative influence, according to specific probability. This means that there could be a reverse effect depending on the behavioral characteristics of drivers and the characteristics of highway sections. Rather, they influence the increase of accident frequency through the all sections.