• Title/Summary/Keyword: Regression coefficient

Search Result 3,571, Processing Time 0.037 seconds

Robust varying coefficient model using L1 regularization

  • Hwang, Changha;Bae, Jongsik;Shim, Jooyong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.4
    • /
    • pp.1059-1066
    • /
    • 2016
  • In this paper we propose a robust version of varying coefficient models, which is based on the regularized regression with L1 regularization. We use the iteratively reweighted least squares procedure to solve L1 regularized objective function of varying coefficient model in locally weighted regression form. It provides the efficient computation of coefficient function estimates and the variable selection for given value of smoothing variable. We present the generalized cross validation function and Akaike information type criterion for the model selection. Applications of the proposed model are illustrated through the artificial examples and the real example of predicting the effect of the input variables and the smoothing variable on the output.

Analysis of Longitudinal Dispersion Coefficient : Part II. Development of New Dispersion Coefficient Equation (종확산계수에 관한 연구 : II. 새로운 종확산계수 추정식 개발)

  • 서일원;정태성
    • Water for future
    • /
    • v.28 no.4
    • /
    • pp.195-204
    • /
    • 1995
  • New dispersion coefficient equation which can be used to estimate dispersion coefficient by using only hydraulic data easily obtained in natural streams has been developed. Dimensional analysis was performed to select physically meaningful parameters, One-Step Huber method, which is one of the nonlinear multi-regression method, was applied to derive a regression equation of dispersion coefficient. 59 measured hydraulic data which were collected in 26 streams in the United States and were analyzed in the Part I of this study, were used in developing new dispersion coefficient equation. Among 59 measured data sets, 35 data sets were used in deriving regression equation, and 24 data sets are used for verification. The new dispersion coefficient equation, which has been developed in this study was proven to be superior in explaining dispersion characteristics of natural streams more precisely compared to existing dispersion coefficient equations.

  • PDF

Kernel Regression with Correlation Coefficient Weighted Distance (상관계수 가중법을 이용한 커널회귀 방법)

  • Shin, Ho-Cheol;Park, Moon-Ghu;Lee, Jae-Yong;You, Skin
    • Proceedings of the KIEE Conference
    • /
    • 2006.10c
    • /
    • pp.588-590
    • /
    • 2006
  • Recently, many on-line approaches to instrument channel surveillance (drift monitoring and fault detection) have been reported worldwide. On-line monitoring (OLM) method evaluates instrument channel performance by assessing its consistency with other plant indications through parametric or non-parametric models. The heart of an OLM system is the model giving an estimate of the true process parameter value against individual measurements. This model gives process parameter estimate calculated as a function of other plant measurements which can be used to identify small sensor drifts that would require the sensor to be manually calibrated or replaced. This paper describes an improvement of auto-associative kernel regression by introducing a correlation coefficient weighting on kernel distances. The prediction performance of the developed method is compared with conventional auto-associative kernel regression.

  • PDF

Feature selection in the semivarying coefficient LS-SVR

  • Hwang, Changha;Shim, Jooyong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.2
    • /
    • pp.461-471
    • /
    • 2017
  • In this paper we propose a feature selection method identifying important features in the semivarying coefficient model. One important issue in semivarying coefficient model is how to estimate the parametric and nonparametric components. Another issue is how to identify important features in the varying and the constant effects. We propose a feature selection method able to address this issue using generalized cross validation functions of the varying coefficient least squares support vector regression (LS-SVR) and the linear LS-SVR. Numerical studies indicate that the proposed method is quite effective in identifying important features in the varying and the constant effects in the semivarying coefficient model.

A Study on Impacts of Selection Attribute of Jeju Local Folklore Food on Customers' Behaviors -Focusing on Customer Satisfaction, Re-visit, and Word of Mouth of Jeju Tourists- (제주 향토음식 선택속성이 고객행동에 미치는 영향 -제주방문 관광객의 고객만족, 재방문, 구전을 중심으로-)

  • Yang, Tai-Seok;Oh, Myung-Cheol
    • Journal of the Korean Society of Food Science and Nutrition
    • /
    • v.38 no.5
    • /
    • pp.636-643
    • /
    • 2009
  • This research was to find out what impacts do selection attributes of Jeju local folklore food by Jeju tourists provide on their behaviors. Multiple regression analysis was carried out using statistics package of SPSS+/WIN 12.0 to find out impacts of selection attribute factors of Jeju local folklore food on customers' satisfaction, re-visit, and intention by word of mouth. As the results, for factors with statistically meaningful impacts at the level of meaningfulness (p<0.05); level of satisfaction showed regression coefficient of 0.476 and t value of 5.198 in essential factors; auxiliary factors showed regression coefficient of 0.232 and t value of 2.808; and sensual (five senses) factors showed regression coefficient of 0.165 and t-value of 2.013. Also, for re-visit, essential factors showed impacts with regression coefficient of 0.413 and t-value of 3.540; factors of menu composition showed regression coefficient of 0.228 and t-value of 3.118; and auxiliary factors showed regression coefficient of 0.218 and t-value of 2.643. In positive word of mouth factors, auxiliary factors showed impacts with regression coefficient of 0.273 and t-value of 2.555; sensual (five senses) factors showed regression coefficient of 0.264 and t-value of 2.238; essential factor showed regression coefficient of 0.237 and t-value of 2.230 and factors of menu composition showed regression coefficient of 0.161 and t-value of 2.167. Therefore, in customer behaviors (customer satisfaction, re-visit, and positive word of mouth) regarding Jeju local folklore food by tourists who visited Jeju, local folklore and cultures did not impact on customer behaviors; also, it can suggested this thesis is meaningful as a study proving that the best marketing is focus on essential substances of food as indicated in existing researches.

Proposal for the Estimation Model of Coefficient of Permeability of Soil Layer using Linear Regression Analysis (단순회귀분석에 의한 토층의 투수계수산정모델 제안)

  • Lee, Moon-Se;Ryu, Je-Cheon;Lim, Heui-Dae;Park, Joo-Whan;Kim, Kyeong-Su
    • The Journal of Engineering Geology
    • /
    • v.18 no.1
    • /
    • pp.27-36
    • /
    • 2008
  • To derive easily the coefficient of permeability from several other soil properties, the estimation model of coefficient of permeability was proposed using linear regression analysis. The coefficient of permeability is one of the major factors to evaluate the soil characteristics. The study area is located in Kangwon-do Pyeongchang-gun Jinbu-Myeon. Soil samples of 45 spots were taken from the study area and various soil tests were carried out in laboratory. After selecting the soil factor influenced by the coefficient of permeability through the correlation analysis, the estimation model of coefficient of permeability was developed using the linear regression analysis between the selected soil factor and the coefficient of permeability from permeability test. Also, the estimation model of coefficient of permeability was compared with the results from permeability test and empirical equation, and the suitability of proposed model was proved. As the result of correlation analysis between various soil factors and the coefficient of permeability using SPSS(statistical package for the social sciences), the largest influence factor of coefficient of permeability were the effective grain size, porosity and dry unit weight. The coefficient of permeability calculated from the proposed model was similar to that resulted from permeability test. Therefore, the proposed model can be used in case of estimating the coefficient of permeability at the same soil condition like study area.

Regression Analysis on Physical Status of Korean Middle and High School Boys (중.고등학생(中.高等學生)의 체격(體格)에 관(關)한 회귀분석(回歸分析))

  • Song, Dal-Hyo
    • Journal of Preventive Medicine and Public Health
    • /
    • v.7 no.2
    • /
    • pp.299-304
    • /
    • 1974
  • The physical status (standing height, body weight, chest girth, sitting height, length of leg, length of thigh, thigh girth, length of crus, length of arm, brachial length, antebrachial girth and skinfold thickness) of 360 healthy middle and high school boys aged between 12 and 17 years in Taegu area was measured and evaluated by means of dispersion. For regression equation and coefficient ofidetermination of each status against standing height were computed. The growth progress of physical status had a tendency to be exponential and, generally, between 13 and 14 years of age the fastest progress was observed. The regression coefficient of body weight against standing height (0.90) was largest and that of skinfold thickness against standing height (0.09) was smallest. In general, the dimension of the regression coefficient was accordant with the dimension of respective physical status. Except in length of thigh and skinfold thickness, coefficient of determination of each physical status against standing height was almost 1 and the regression line could express the relation between standing height and each physical status very satisfactorily. But the regression curve was more desirable for the elucidation of the relation between standing height and skinfold thickness.

  • PDF

Improvement and Validation of an Overlay Design Equation in Seoul (서울형 포장설계식 개선 및 검증)

  • Kim, Won Jae;Park, Chang Kyu;Son, Tran Thai;Phuc, Le Van;Lee, Hyun Jong
    • International Journal of Highway Engineering
    • /
    • v.19 no.5
    • /
    • pp.49-58
    • /
    • 2017
  • PURPOSES : The objective of this study is to develop a simple regression model in designing the asphalt concrete (AC) overlay thickness using the Mechanistic-empirical pavement design guide (MEPDG) program. METHODS : To establish the AC overlay design equation, multiple regression analyses were performed based on the synthetic database for AC thickness design, which was generated using the MEPDG program. The climate in Seoul city, a modified Hirsh model for determining dynamic modulus of asphalt material, and a new damaged master curve approach were used in this study. Meanwhile, the proposed rutting model developed in Seoul city was then used to calibrate the rutting model in the MEPDG program. The AC overlay design equation is a function of the total AC thickness, the ratio of AC overlay thickness and existing AC thickness, the ratio of existing AC modulus and AC overlay modulus, the subgrade condition, and the annual average daily truck traffic (AADTT). RESULTS : The regression model was verified by comparing the predicted AC thickness, the AADTT from the model and the MEPDG. The regression model shows a correlation coefficient of 0.98 in determining the AC thickness and 0.97 in determining AADTT. In addition, the data in Seoul city was used to validate the regression model. The result shows that correlation coefficient between the predicted and measured AADTT is 0.64. This indicates that the current model is more accuracy than the previous study which showed a correlation coefficient of 0.427. CONCLUSIONS:The high correlation coefficient values indicate that the regression equations can predict the AC thickness accurately.

Prediction of random-regression coefficient for daily milk yield after 305 days in milk by using the regression-coefficient estimates from the first 305 days

  • Yamazaki, Takeshi;Takeda, Hisato;Hagiya, Koichi;Yamaguchi, Satoshi;Sasaki, Osamu
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.31 no.10
    • /
    • pp.1542-1549
    • /
    • 2018
  • Objective: Because lactation periods in dairy cows lengthen with increasing total milk production, it is important to predict individual productivities after 305 days in milk (DIM) to determine the optimal lactation period. We therefore examined whether the random regression (RR) coefficient from 306 to 450 DIM (M2) can be predicted from those during the first 305 DIM (M1) by using a RR model. Methods: We analyzed test-day milk records from 85,690 Holstein cows in their first lactations and 131,727 cows in their later (second to fifth) lactations. Data in M1 and M2 were analyzed separately by using different single-trait RR animal models. We then performed a multiple regression analysis of the RR coefficients of M2 on those of M1 during the first and later lactations. Results: The first-order Legendre polynomials were practical covariates of RR for the milk yields of M2. All RR coefficients for the additive genetic (AG) effect and the intercept for the permanent environmental (PE) effect of M2 had moderate to strong correlations with the intercept for the AG effect of M1. The coefficients of determination for multiple regression of the combined intercepts for the AG and PE effects of M2 on the coefficients for the AG effect of M1 were moderate to high. The daily milk yields of M2 predicted by using the RR coefficients for the AG effect of M1 were highly correlated with those obtained by using the coefficients of M2. Conclusion: Milk production after 305 DIM can be predicted by using the RR coefficient estimates of the AG effect during the first 305 DIM.