• Title/Summary/Keyword: coefficient of determination (R-square)

Search Result 172, Processing Time 0.041 seconds

Note on Use of $R^2$ for No-intercept Model

  • Do, Jong-Doo;Kim, Tae-Yoon
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.2
    • /
    • pp.661-668
    • /
    • 2006
  • There have been some controversies on the use of the coefficient of determination for linear no-intercept model. One definition of the coefficient of determination, $R^2={\sum}\;{\widehat{y^2}}\;/\;{\sum}\;y^2$, is being widely accepted only for linear no-intercept models though Kvalseth (1985) demonstrated some possible pitfalls in using such $R^2$. Main objective of this note is to report that $R^2$ is not a desirable measure of fit for the no-intercept linear model. In fact it is found that mean square error(MSE) could replace $R^2$ efficiently in most cases where selection of no-intercept model is at issue.

  • PDF

Comparison of models for estimating surplus productions and methods for estimating their parameters (잉여생산량을 추정하는 모델과 파라미터 추정방법의 비교)

  • Kwon, Youjung;Zhang, Chang Ik;Pyo, Hee Dong;Seo, Young Il
    • Journal of the Korean Society of Fisheries and Ocean Technology
    • /
    • v.49 no.1
    • /
    • pp.18-28
    • /
    • 2013
  • It was compared the estimated parameters by the surplus production from three different models, i.e., three types (Schaefer, Gulland, and Schnute) of the traditional surplus production models, a stock production model incorporating covariates (ASPIC) model and a maximum entropy (ME) model. We also evaluated the performance of models in the estimation of their parameters. The maximum sustainable yield (MSY) of small yellow croaker (Pseudosciaena polyactis) in Korean waters ranged from 35,061 metric tons (mt) by Gulland model to 44,844mt by ME model, and fishing effort at MSY ($f_{MSY}$) ranged from 262,188hauls by Schnute model to 355,200hauls by ME model. The lowest root mean square error (RMSE) for small yellow croaker was obtained from the Gulland surplus production model, while the highest RMSE was from Schnute model. However, the highest coefficient of determination ($R^2$) was from the ME model, but the ASPIC model yielded the lowest coefficient. On the other hand, the MSY of Kapenta (Limnothrissa miodon) ranged from 16,880 mt by ASPIC model to 25,373mt by ME model, and $f_{MSY}$, from 94,580hauls by ASPIC model to 225,490hauls by Schnute model. In this case, both the lowest root mean square error (RMSE) and the highest coefficient of determination ($R^2$) were obtained from the ME model, which showed relatively better fits of data to the model, indicating that the ME model is statistically more stable and robust than other models. Moreover, the ME model could provide additional ecologically useful parameters such as, biomass at MSY ($B_{MSY}$), carrying capacity of the population (K), catchability coefficient (q) and the intrinsic rate of population growth (r).

Nondestructive Prediction of Fatty Acid Composition in Sesame Seeds by Near Infrared Reflectance Spectroscopy

  • Kim, Kwan-Su;Park, Si-Hyung;Choung, Myoung-Gun;Kim, Sun-Lim
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.51 no.spc1
    • /
    • pp.304-309
    • /
    • 2006
  • Near infrared reflectance spectroscopy (NIRS) was used to develop a rapid and nondestructive method for the determination of fatty acid composition in sesame (Sesamum indicum L.) seed oil. A total of ninety-three samples of intact seeds were scanned in the reflectance mode of a scanning monochromator, and reference values for fatty acid composition were measured by gas-liquid chromatography. Calibration equations were developed using modified partial least square regression with internal cross validation (n=63). The equations obtained had low standard errors of cross-validation and moderate $R^2$ (coefficient of determination in calibration). Prediction of an external validation set (n=30) showed significant correlation between reference values and NIRS estimated values based on the SEP (standard error of prediction), $r^2$ (coefficient of determination in prediction) and the ratio of standard deviation (SD) of reference data to SEP. The models developed in this study had relatively higher values (more than 2.0) of SD/SEP(C) for oleic and linoleic acid, having good correlation between reference and NIRS estimate. The results indicated that NIRS, a nondestructive screening method could be used to rapidly determine fatty acid composition in sesame seeds in the breeding programs for high quality sesame oil.

The analysis of oat chemical properties using visible-near infrared spectroscopy

  • Jang, Hyeon Jun;Choi, Chang Hyun;Choi, Tae Hyun;Kim, Jong Hun;Kwon, Gi Hyeon;Oh, Seung Il;Kim, Hoon;Kim, Yong Joo
    • Korean Journal of Agricultural Science
    • /
    • v.43 no.5
    • /
    • pp.715-722
    • /
    • 2016
  • Rapid determination of food quality is important in food distribution. In this study, the chemical properties of oats were analyzed using visible-near infrared (VIS-NIR) spectroscopy. The objective of this study was to develop and validate a predictive model of oat quality by VIS-NIR spectroscopy. A total of 200 oat samples were collected from domestic and import markets. Reflectance spectra, moisture, protein, fat, Fe, and K of oat samples were measured. Reflectance spectra were measured in the wavelength range of 400 - 2,500 nm at 2 nm intervals. The reflectance spectrum of an oat sample was measured after sample cell and reflectance plate spectrum measurement. Preprocessing methods such as normalization and $1^{st}$ and $2^{nd}$ derivations were used to minimize the spectroscopic noise. The partial-least-square (PLS) models were developed to predict chemical properties of oats using a commercial software package, Unscrambler. The PLS models showed the possibility to predict moisture, protein, and fat content of oat samples. The coefficient of determination ($R^2$) of moisture, protein, and fat was greater than 0.89. However, it was hard to predict Fe and K concentrations due to their low concentrations in the oat samples. The coefficient of determinations of Fe and K were 0.57 and 0.77, respectively. In future studies, the stability and practicability of these models should be improved by using a high accuracy spectrophotometer and by performing calibrations with a wider range of oat chemicals.

Development of Prediction Model of Chloride Diffusion Coefficient using Machine Learning (기계학습을 이용한 염화물 확산계수 예측모델 개발)

  • Kim, Hyun-Su
    • Journal of Korean Association for Spatial Structures
    • /
    • v.23 no.3
    • /
    • pp.87-94
    • /
    • 2023
  • Chloride is one of the most common threats to reinforced concrete (RC) durability. Alkaline environment of concrete makes a passive layer on the surface of reinforcement bars that prevents the bar from corrosion. However, when the chloride concentration amount at the reinforcement bar reaches a certain level, deterioration of the passive protection layer occurs, causing corrosion and ultimately reducing the structure's safety and durability. Therefore, understanding the chloride diffusion and its prediction are important to evaluate the safety and durability of RC structure. In this study, the chloride diffusion coefficient is predicted by machine learning techniques. Various machine learning techniques such as multiple linear regression, decision tree, random forest, support vector machine, artificial neural networks, extreme gradient boosting annd k-nearest neighbor were used and accuracy of there models were compared. In order to evaluate the accuracy, root mean square error (RMSE), mean square error (MSE), mean absolute error (MAE) and coefficient of determination (R2) were used as prediction performance indices. The k-fold cross-validation procedure was used to estimate the performance of machine learning models when making predictions on data not used during training. Grid search was applied to hyperparameter optimization. It has been shown from numerical simulation that ensemble learning methods such as random forest and extreme gradient boosting successfully predicted the chloride diffusion coefficient and artificial neural networks also provided accurate result.

Fuzzy logic approach for estimating bond behavior of lightweight concrete

  • Arslan, Mehmet E.;Durmus, Ahmet
    • Computers and Concrete
    • /
    • v.14 no.3
    • /
    • pp.233-245
    • /
    • 2014
  • In this paper, a rule based Mamdani type fuzzy logic model for prediction of slippage at maximum tensile strength and slippage at rupture of structural lightweight concretes were discussed. In the model steel rebar diameters and development lengths were used as inputs. The FL model and experimental results, the coefficient of determination R2, the Root Mean Square Error were used as evaluation criteria for comparison. It was concluded that FL was practical method for predicting slippage at maximum tensile strength and slippage at rupture of structural lightweight concretes.

Development of On-line Sorting System for Detection of Infected Seed Potatoes Using Visible Near-Infrared Transmittance Spectral Technique (가시광 및 근적외선 투과분광법을 이용한 감염 씨감자 온라인 선별시스템 개발)

  • Kim, Dae Yong;Mo, Changyeun;Kang, Jun-Soon;Cho, Byoung-Kwan
    • Journal of the Korean Society for Nondestructive Testing
    • /
    • v.35 no.1
    • /
    • pp.1-11
    • /
    • 2015
  • In this study, an online seed potato sorting system using a visible and near infrared (40 1100 nm) transmittance spectral technique and statistical model was evaluated for the nondestructive determination of infected and sound seed potatoes. Seed potatoes that had been artificially infected with Pectobacterium atrosepticum, which is known to cause a soil borne disease infection, were prepared for the experiments. After acquiring transmittance spectra from sound and infected seed potatoes, a determination algorithm for detecting infected seed potatoes was developed using the partial least square discriminant analysis method. The coefficient of determination($R^2_p$) of the prediction model was 0.943, and the classification accuracy was above 99% (n = 80) for discriminating diseased seed potatoes from sound ones. This online sorting system has good potential for developing a technique to detect agricultural products that are infected and contaminated by pathogens.

A Comparative Study on the Infinite NHPP Software Reliability Model Following Chi-Square Distribution with Lifetime Distribution Dependent on Degrees of Freedom (수명분포가 자유도에 의존한 카이제곱분포를 따르는 무한고장 NHPP 소프트웨어 신뢰성 모형에 관한 비교연구)

  • Kim, Hee-Cheul;Kim, Jae-Wook
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.10 no.5
    • /
    • pp.372-379
    • /
    • 2017
  • Software reliability factor during the software development process is elementary. Case of the infinite failure NHPP for identifying software failure, the occurrence rates per fault (hazard function) have the characteristic point that is constant, increases and decreases. In this paper, we propose a reliability model using the chi - square distribution which depends on the degree of freedom that represents the application efficiency of software reliability. Algorithm to estimate the parameters used to the maximum likelihood estimator and bisection method, a model selection based on the mean square error (MSE) and coefficient of determination($R^2$), for the sake of the efficient model, were employed. For the reliability model using the proposed degree of freedom of the chi - square distribution, the failure analysis using the actual failure interval data was applied. Fault data analysis is compared with the intensity function using the degree of freedom of the chi - square distribution. For the insurance about the reliability of a data, the Laplace trend test was employed. In this study, the chi-square distribution model depends on the degree of freedom, is also efficient about reliability because have the coefficient of determination is 90% or more, in the ground of the basic model, can used as a applied model. From this paper, the software development designer must be applied life distribution by the applied basic knowledge of the software to confirm failure modes which may be applied.

Prediction of ham weight with the autofom in Korea (오토폼을 이용한 돼지 뒷다리 중량예측 연구)

  • Bae, Jin-Gyu;Lee, Young-Kyu;Park, Beom-Young;Lym, Hyo-Seon;Jung, Bong-Su
    • Korean Journal of Veterinary Service
    • /
    • v.39 no.1
    • /
    • pp.7-12
    • /
    • 2016
  • The Autofom is a equipment for predicting the amount of pig carcasses meat using the 16 ultrasonic sensors to measure in real time and it was established in Dodram LPC in Gyeonggi Province of Korea for the first time. This study was carried out to validate the reliability of Autofom statistically and to establish guideline for developing a analytic formula through comparing the measurement between Autofom and dissection. The ham parts of sixty-six pig carcasses were measured with Autofom and by two experimental performers. The weight means and standard deviations of ham parts including bone by measurements with Autofom and dissection were $10.69{\pm}0.81kg$ and $10.77{\pm}0.94kg$, respectively a strong positive correlation (P<0.01) was identified, with a coefficient of determination ($R^2$) of 0.82. The weight means and standard deviations of lean ham parts by measurements with Autofom and dissection were $7.41{\pm}0.58kg$ and $7.42{\pm}0.89kg$, respectively a strong positive correlation (P<0.01) was identified, with a coefficient of determination ($R^2$) of 0.72. The root mean square errors of two groups were 0.40 and 0.50, respectively.

A Study on the Safe Blasting Design by Statistical Analysis of Ground Vibration for Vibration Controlled Blasting in Urban Area (II) (도심지 미진동 제어발파에서 진동분석을 통한 안전 발파설계에 관한 연구(II) - 진동측정 자료의 통계적 분석을 위주로 -)

  • 김영환;안명석;박종남;강대우;이창우
    • Explosives and Blasting
    • /
    • v.18 no.2
    • /
    • pp.7-13
    • /
    • 2000
  • Abstract The characteristics of bed rock in the study area was classified by means of the crack coefficient estimated from the seismic velocities of in-situ and intact rocks. Various statistical methods were investigated in order to minimize the possible errors in estimating the predictive equation of blasting vibration and to enhance the determination coefficient $R^2$, for more reliable estimation. The determination coefficient showed the highest in the analysis for those groups using weighting function with the number of samples. The analysis for the weighting function employed with standard coefficient and variance also enhanced the determination coefficients significantly compared to the others, but the reliability was slightly lower than results obtained former method. Therefore the most reliable predictive equation of blasting vibration was found to be obtained from a regression analysis of the mean vibration level using the weighting of same distance groups within 15m with the same explosive charge weight per delay. The coefficients, K and n 317.4 and -1.66, respectively, when using the square root scaling, and 209.9 and -1.66, respectively, when using the cube root scaling. The analysis also showed that the square root scaling may be used in the distance less than 31m form the blast source, and the cube root scaling in the distance more than 31m for safe design.

  • PDF