• Title/Summary/Keyword: non-linear regression

Search Result 621, Processing Time 0.028 seconds

Validity for Use of Non-HDL Cholesterol Rather than LDL Cholesterol

  • Kwon, Se-Young;Na, Young-Ak
    • Korean Journal of Clinical Laboratory Science
    • /
    • v.45 no.2
    • /
    • pp.54-59
    • /
    • 2013
  • NonHDL cholesterol values have been suggested as a risk marker for cardiovascular disease. NonHDL cholesterol values were calculated, using a very simple measurement [nonHDL cholesterol=serum total cholesterol-HDL cholesterol]. This formula is very useful as a screening tool for identifying dyslipoproteinemias, risk assessment, and assessing the results of hypolipidemic therapy. The data from the 2009 Korean National Health and Nutrition Examination Survey were used. Analysis was done for 1,992 subjects with lipid panels (Cholesterol, HDL, LDLdirect and Triglycerides) results. We studied the relationship between nonHDL cholesterol and LDL cholesterol. As a result, nonHDL cholesterol values were plotted against the LDL direct and calculated values. The linear regression equation for nonHDL cholesterol and direct LDL cholesterol was $nonHDLchol=23.60+1.03{\times}LDLdirect$ (p<0.0001, $r^2=0.80$) in all subjects. The subjects were classified into triglyceride values. When triglycerides are below 400 mg/dL, the linear fit to LDL direct is found to be $[nonHDLchol=17.34+1.07{\times}LDLdirect]$ (p<0.0001, $r^2=0.88$) and to the Friedewald LDL calculation is $[nonHDLchol=23.10+1.02{\times}LDLcalc]$ (p<0.0001, $r^2=0.82$). For triglycerides above 400 mg/dL, the linear fit equation is $[nonHDLchol=87.57+0.92{\times}LDLdirect]$ (p<0.0001, $r^2=0.50$) and to the LDL calculated, it is $[nonHDLchol=142.70+0.50{\times}LDLcalc]$ (p<0.0001, $r^2=0.32$). This study provides examples of the utility of nonHDL cholesterol concentrations in clinical medicine.

  • PDF

Predicting the Soluble Solids of Apples by Near Infrared Spectroscopy (I) - Multiple Linear Regression Models - (근적외선을 이용한 사과의 당도예측 (I) - 다중회귀모델 -)

  • ;W. R. Hruschka;J. A. Abbott;;B. S. Park
    • Journal of Biosystems Engineering
    • /
    • v.23 no.6
    • /
    • pp.561-570
    • /
    • 1998
  • The MLR(Multiple Linear Regression) models to estimate soluble solids content non-destructively were presented to make a selection of optimal photosensor utilized to measure the soluble solids content of apples. Visible and NIR absorbance in the 400 to 2498 nanometer(nm) wavelength region, soluble solids content(sugar content), hardness, and weight were measured for 400 apples(gala). Spectrophotometer with fiber optic probe was utilized for spectrum measurement and digital refractometer was used for soluble solids content. Correlation between absorbance spectrum and soluble solids content was analyzed to pick out the optimal wavelengths and to develop corresponding prediction model by means of MLR. For the coefficient of determination($R^2$) to be over 0.92, the MLR models out of the original absorbance were built based on 7 wavelengths of 992, 904, 1096, 1032, 880, 824, 1048nm, and the ones of the second derivative absorbance based on 5 wavelengths of 784, 1056, 992, 808, 872nm. The best model of the second derivative absorbance spectrum had $R^2$=0.91, bias= -0.02bx, SEP=0.28bx for unknown samples.

  • PDF

Prediction of Solvent Effects on Rate Constant of [2+2] Cycloaddition Reaction of Diethyl Azodicarboxylate with Ethyl Vinyl Ether Using Artificial Neural Networks

  • Habibi-Yangjeh, Aziz;Nooshyar, Mahdi
    • Bulletin of the Korean Chemical Society
    • /
    • v.26 no.1
    • /
    • pp.139-145
    • /
    • 2005
  • Artificial neural networks (ANNs), for a first time, were successfully developed for the modeling and prediction of solvent effects on rate constant of [2+2] cycloaddition reaction of diethyl azodicarboxylate with ethyl vinyl ether in various solvents with diverse chemical structures using quantitative structure-activity relationship. The most positive charge of hydrogen atom (q$^+$), dipole moment ($\mu$), the Hildebrand solubility parameter (${\delta}_H^2$) and total charges in molecule (q$_t$) are inputs and output of ANN is log k$_2$ . For evaluation of the predictive power of the generated ANN, the optimized network with 68 various solvents as training set was used to predict log k$_2$ of the reaction in 16 solvents in the prediction set. The results obtained using ANN was compared with the experimental values as well as with those obtained using multi-parameter linear regression (MLR) model and showed superiority of the ANN model over the regression model. Mean square error (MSE) of 0.0806 for the prediction set by MLR model should be compared with the value of 0.0275 for ANN model. These improvements are due to the fact that the reaction rate constant shows non-linear correlations with the descriptors.

Estimation of Soil Organic Carbon Stock in South Korea

  • Thi, Tuyet-May Do;Le, Xuan-Hien;Van, Linh Nguyen;Yeon, Minho;Lee, Giha
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2022.05a
    • /
    • pp.159-159
    • /
    • 2022
  • Soil represents a substantial component within the global carbon cycle and small changes in the SOC stock may result in large changes of atmospheric CO2 particularly over tens to hundreds of years. In this study, we aim to (i) evaluate the SOC stock in the topsoil 0 - 15 cm from soil physical and chemical characteristics and (ii) find the correlation of SOC and soil organic matter (SOM) for national-scale in South Korea. First of all, based on the characteristics of the soil to calculate the soil hydraulic properties, SOC stock is the SOC mass per unit area for a given depth. It depends on bulk density (BD-g/cm3), SOC content (%), the depth of topsoil (cm), and gravel content (%). Due to insufficient data on BD observation, we establish a correlation between BD and SOC content, sand content, clay content parameter. Next, we present linear and non-linear regression models of BD and the interrelationship between SOC and SOM using a linear regression model and determine the conversion factor for them, comparing with Van Bemmelen 1890's factor value for the country scale. The results obtained, helps managers come up with suitable solutions to conserve land resources.

  • PDF

Estimating Moisture Content of Cucumber Seedling Using Hyperspectral Imagery

  • Kang, Jeong-Gyun;Ryu, Chan-Seok;Kim, Seong-Heon;Kang, Ye-Seong;Sarkar, Tapash Kumar;Kang, Dong-Hyeon;Kim, Dong Eok;Ku, Yang-Gyu
    • Journal of Biosystems Engineering
    • /
    • v.41 no.3
    • /
    • pp.273-280
    • /
    • 2016
  • Purpose: This experiment was conducted to detect water stress in terms of the moisture content of cucumber seedlings under water stress condition using a hyperspectral image acquisition system, linear regression analysis, and partial least square regression (PLSR) to achieve a non-destructive measurement procedure. Methods: Changes in the reflectance spectrum of cucumber seedlings under water stress were measured using hyperspectral imaging techniques. A model for estimating moisture content of cucumber seedlings was constructed through a linear regression analysis that used the moisture content of cucumber seedlings and a normalized difference vegetation index (NDVI). A model using PLSR that used the moisture content of cucumber seedlings and reflectance spectrum was also created. Results: In the early stages of water stress, cucumber seedlings recovered completely when sub-irrigation was applied. However, the seedlings suffering from initial wilting did not recover when more than 42 h passed without irrigation. The reflectance spectrum of seedlings under water stress decreased gradually, but increased when irrigation was provided, except for the seedlings that had permanently wilted. From the results of the linear regression analysis using the NDVI, the model excluding wilted seedlings with less than 20% (n=97) moisture content showed a precision ($R^2$ and $R^2_{\alpha}$) of 0.573 and 0.568, respectively, and accuracy (RE) of 4.138% and 4.138%, which was higher than that for models including all seedlings (n=100). For PLS regression analysis using the reflectance spectrum, both models were found to have strong precision ($R^2$) with a rating of 0.822, but accuracy (RMSE and RE) was higher in the model excluding wilted seedlings as 5.544% and 13.65% respectively. Conclusions: The estimation model of the moisture content of cucumber seedlings showed better results in the PLSR analysis using reflectance spectrum than the linear regression analysis using NDVI.

An Application of Support Vector Machines to Personal Credit Scoring: Focusing on Financial Institutions in China (Support Vector Machines을 이용한 개인신용평가 : 중국 금융기관을 중심으로)

  • Ding, Xuan-Ze;Lee, Young-Chan
    • Journal of Industrial Convergence
    • /
    • v.16 no.4
    • /
    • pp.33-46
    • /
    • 2018
  • Personal credit scoring is an effective tool for banks to properly guide decision profitably on granting loans. Recently, many classification algorithms and models are used in personal credit scoring. Personal credit scoring technology is usually divided into statistical method and non-statistical method. Statistical method includes linear regression, discriminate analysis, logistic regression, and decision tree, etc. Non-statistical method includes linear programming, neural network, genetic algorithm and support vector machine, etc. But for the development of the credit scoring model, there is no consistent conclusion to be drawn regarding which method is the best. In this paper, we will compare the performance of the most common scoring techniques such as logistic regression, neural network, and support vector machines using personal credit data of the financial institution in China. Specifically, we build three models respectively, classify the customers and compare analysis results. According to the results, support vector machine has better performance than logistic regression and neural networks.

Modeling of Flow-Accelerated Corrosion using Machine Learning: Comparison between Random Forest and Non-linear Regression (기계학습을 이용한 유동가속부식 모델링: 랜덤 포레스트와 비선형 회귀분석과의 비교)

  • Lee, Gyeong-Geun;Lee, Eun Hee;Kim, Sung-Woo;Kim, Kyung-Mo;Kim, Dong-Jin
    • Corrosion Science and Technology
    • /
    • v.18 no.2
    • /
    • pp.61-71
    • /
    • 2019
  • Flow-Accelerated Corrosion (FAC) is a phenomenon in which a protective coating on a metal surface is dissolved by a flow of fluid in a metal pipe, leading to continuous wall-thinning. Recently, many countries have developed computer codes to manage FAC in power plants, and the FAC prediction model in these computer codes plays an important role in predictive performance. Herein, the FAC prediction model was developed by applying a machine learning method and the conventional nonlinear regression method. The random forest, a widely used machine learning technique in predictive modeling led to easy calculation of FAC tendency for five input variables: flow rate, temperature, pH, Cr content, and dissolved oxygen concentration. However, the model showed significant errors in some input conditions, and it was difficult to obtain proper regression results without using additional data points. In contrast, nonlinear regression analysis predicted robust estimation even with relatively insufficient data by assuming an empirical equation and the model showed better predictive power when the interaction between DO and pH was considered. The comparative analysis of this study is believed to provide important insights for developing a more sophisticated FAC prediction model.

Software Cost Estimation Model Based on Use Case Points by using Regression Model (회귀분석을 이용한 UCP 기반 소프트웨어 개발 노력 추정 모델)

  • Park, Ju-Seok;Yang, Hea-Sool
    • The Journal of the Korea Contents Association
    • /
    • v.9 no.8
    • /
    • pp.147-157
    • /
    • 2009
  • Recently, there has been continued research on UCP from the development effort estimation method to a software development project applying object oriented development methodology. Current research proposes a linear model estimating the developmenteffort by multiplying a constant to AUCP which applies technical and environmental factors. However, the fact that a non-linear regression model is more appropriate as the software size increases, the development period increases exponentially. In addition, in the UCP calculation process the occurrence of FP errors due to the application of TCF and EF, it is unrealistic to estimate the size with AUCP. This paper presents the issue of current research based on UCP without considering problems of the research, for example, TCF and EF and expresses the models (linear, logarithmic, polynomial, power and exponential type) estimating the development effort directly from UUCP. Consequently, the exponential model within non-linear models exhibit more accurate results than the current linear model. Therefore, after calculating the UUCP of the developing software system, using the proposed model to estimate the development effort, it is possible to estimate the direct cost required in development.

Development of a Observational Settlement Analysis Method Using Outliers (이상치를 이용한 관측적 침하예측기법의 개발)

  • 우철웅;장병욱
    • Magazine of the Korean Society of Agricultural Engineers
    • /
    • v.45 no.5
    • /
    • pp.140-150
    • /
    • 2003
  • Observational methods such as the Asaoka's method and the hyperbolic method are widely applied on the settlement analysis using observed settlement. The most unreliable aspects in those methods is arose from the subjective discretion of initial non-linearity on linear regression. The initial non-linearity is inevitable due to the settlement behaviour itself. Therefore an objective method is essential to achieve more reliable results on settlement analysis. It was found that the initial non-linear data are statistical outliers. New automation algorithms of the hyperbolic and the Asaoka's method were developed based on outlier detection method. The methods are a successive detection of outliers and a searching method of suitable hyperbolic range for the Asaoka's and the hyperbolic method respectively. Applicability of the algorithms was verified through case studies.

Multi-variate Fuzzy Polynomial Regression using Shape Preserving Operations

  • Hong, Dug-Hun;Do, Hae-Young
    • Journal of the Korean Data and Information Science Society
    • /
    • v.14 no.1
    • /
    • pp.131-141
    • /
    • 2003
  • In this paper, we prove that multi-variate fuzzy polynomials are universal approximators for multi-variate fuzzy functions which are the extension principle of continuous real-valued function under $T_W-based$ fuzzy arithmetic operations for a distance measure that Buckley et al.(1999) used. We also consider a class of fuzzy polynomial regression model. A mixed non-linear programming approach is used to derive the satisfying solution.

  • PDF