• Title/Summary/Keyword: Regression

Evaluating Variable Selection Techniques for Multivariate Linear Regression (다중선형회귀모형에서의 변수선택기법 평가)

  • Ryu, Nahyeon;Kim, Hyungseok;Kang, Pilsung
    • Journal of Korean Institute of Industrial Engineers / v.42 no.5 / pp.314-326 / 2016
  • The purpose of variable selection techniques is to select a subset of relevant variables for a particular learning algorithm in order to improve both the accuracy and the efficiency of the prediction model. We conduct an empirical analysis to evaluate and compare seven well-known variable selection techniques for the multiple linear regression model, which is one of the most commonly used regression models in practice. The variable selection techniques we apply are forward selection, backward elimination, stepwise selection, genetic algorithm (GA), ridge regression, lasso (Least Absolute Shrinkage and Selection Operator), and elastic net. Based on experiments with 49 regression data sets, we find that GA results in the lowest error rates, while lasso reduces the number of variables most significantly. In terms of computational efficiency, forward selection, backward elimination, and lasso require less time than the other techniques.
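
The three shrinkage methods named in this abstract are usually written as penalized least squares problems. The notation below (response $y$, design matrix $X$, coefficient vector $\beta$, tuning parameters $\lambda, \lambda_1, \lambda_2 \ge 0$) is the common textbook form and is an assumption here; the abstract does not state which parameterization the authors used.

$$
\begin{aligned}
\hat{\beta}^{\text{ridge}} &= \arg\min_{\beta} \; \|y - X\beta\|_2^2 + \lambda \|\beta\|_2^2 \\
\hat{\beta}^{\text{lasso}} &= \arg\min_{\beta} \; \|y - X\beta\|_2^2 + \lambda \|\beta\|_1 \\
\hat{\beta}^{\text{enet}} &= \arg\min_{\beta} \; \|y - X\beta\|_2^2 + \lambda_1 \|\beta\|_1 + \lambda_2 \|\beta\|_2^2
\end{aligned}
$$

The $\ell_1$ penalty can set coefficients exactly to zero, which is why lasso and elastic net drop variables, whereas the ridge penalty only shrinks coefficients toward zero.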

ON THEIL'S METHOD IN FUZZY LINEAR REGRESSION MODELS

  • Choi, Seung Hoe;Jung, Hye-Young;Lee, Woo-Joo;Yoon, Jin Hee
    • Communications of the Korean Mathematical Society / v.31 no.1 / pp.185-198 / 2016
  • Regression analysis is a method for analyzing a regression model in order to explain the statistical relationship between explanatory and response variables. This paper proposes a fuzzy regression analysis that applies Theil's method, which is not sensitive to outliers. The method uses medians of the rates of increment based on randomly chosen pairs of each component of the ${\alpha}$-level sets of fuzzy data in order to estimate the coefficients of the fuzzy regression model. An example and two simulation results are given to show that the fuzzy Theil's estimator is more robust than the fuzzy least squares estimator.
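
The median-of-slopes idea behind Theil's method can be illustrated on ordinary (crisp) data. The following is a minimal sketch, not the paper's fuzzy estimator: it uses all pairs of points rather than randomly chosen pairs, ignores ${\alpha}$-level sets entirely, and the function name `theil_line` and the intercept rule (median of residual offsets) are assumptions made for illustration.

```python
from itertools import combinations
from statistics import median


def theil_line(xs, ys):
    """Return (slope, intercept) of a median-of-pairwise-slopes line."""
    # Slope ("rate of increment") for every pair of points with distinct x.
    slopes = [
        (y2 - y1) / (x2 - x1)
        for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2)
        if x2 != x1
    ]
    slope = median(slopes)
    # Assumed intercept rule: median of y - slope * x over all points.
    intercept = median(y - slope * x for x, y in zip(xs, ys))
    return slope, intercept


# Four points on y = 2x plus one gross outlier at x = 5.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 6, 8, 100]
slope, intercept = theil_line(xs, ys)
```

In this toy data set six of the ten pairwise slopes equal 2, so the median is not moved by the outlier, whereas a least squares fit would be pulled strongly toward it.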

The Study on Solid Fuel Regression Rate of Swirl Hybrid Rocket (선회류 하이브리드 로켓의 고체 연료 후퇴율에 관한 연구)

  • Park JongWon;Park JooHyuk;Lee ChoongWon;Yoon MyungWon
    • Proceedings of the Korean Society of Propulsion Engineers Conference / v.y2005m4 / pp.53-56 / 2005
  • Hybrid rockets have many advantages compared with solid and liquid rockets. In this study, a swirl flow hybrid motor was designed and manufactured, and methods of improving the regression rate were considered. Thrust was calculated from the combustion chamber pressure, and the regression rate was measured at low oxidizer flow rates. Several problems in operating the hybrid rocket, and their solutions, are presented.

Geographically weighted kernel logistic regression for small area proportion estimation

  • Shim, Jooyong;Hwang, Changha
    • Journal of the Korean Data and Information Science Society / v.27 no.2 / pp.531-538 / 2016
  • In this paper we deal with small area estimation for the case in which the response variables take binary values. Mixed effects models, which treat the spatial effects as random effects, have been extensively studied for small area estimation. However, when the spatial information of each area is given specifically as coordinates, it is popular to use geographically weighted logistic regression to incorporate the spatial information by assuming that the regression parameters vary spatially across areas. In this paper, we relax the linearity assumption and propose a geographically weighted kernel logistic regression for estimating small area proportions by using the basic principle of kernel machines. Numerical studies have been carried out to compare the performance of the proposed method with that of other methods in estimating small area proportions.

Censored varying coefficient regression model using Buckley-James method

  • Shim, Jooyong;Seok, Kyungha
    • Journal of the Korean Data and Information Science Society / v.28 no.5 / pp.1167-1177 / 2017
  • Censored regression using the pseudo-response variable proposed by Buckley and James has been one of the most well-known models. Recently, the varying coefficient regression model has received a great deal of attention as an important tool for modeling. In this paper we propose a censored varying coefficient regression model using the Buckley-James method to handle situations where the regression coefficients of the model are not constant but change as the smoothing variables change. By using the formulation of the least squares support vector machine (LS-SVM), the coefficient estimators of the proposed model can be easily obtained from simple linear equations. Furthermore, a generalized cross validation function can be easily derived. We evaluate the proposed method and demonstrate its adequacy through simulated data sets and real data sets.

Application of Logit Model in Qualitative Dependent Variables (로짓모형을 이용한 질적 종속변수의 분석)

  • Lee, Kil-Soon;Yu, Wann
    • Journal of Families and Better Life / v.10 no.1 s.19 / pp.131-138 / 1992
  • Regression analysis has become a standard statistical tool in the behavioral sciences. Because of its widespread popularity, regression has often been misused. Such is the case when the dependent variable is a qualitative measure rather than a continuous, interval measure. Regression estimates with a qualitative dependent variable do not meet the assumptions underlying regression, which can lead to serious errors in standard statistical inference. The logit model is recommended as an alternative to the regression model for qualitative dependent variables. Researchers can employ this model to measure the relationship between independent variables and qualitative dependent variables without assuming that the logit model was derived from probabilistic choice theory. Coefficients in the logit model are typically estimated by maximum likelihood, in contrast to the ordinary regression model, which is estimated by least squares. Goodness of fit in the logit model is based on the likelihood ratio statistic, and the t-statistic is used for testing the null hypothesis.
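
For a binary dependent variable, the logit model and the quantities mentioned in this abstract are commonly written as follows. The notation ($y_i \in \{0, 1\}$, covariate vector $x_i$, coefficient vector $\beta$, and $\hat{\beta}_0$ for the fit of the restricted model) is assumed for illustration and is not taken from the paper.

$$
\begin{aligned}
P(y_i = 1 \mid x_i) &= \frac{\exp(x_i^{\top}\beta)}{1 + \exp(x_i^{\top}\beta)} \\
\ell(\beta) &= \sum_{i=1}^{n} \left[ y_i \, x_i^{\top}\beta - \log\!\left(1 + \exp(x_i^{\top}\beta)\right) \right] \\
LR &= -2\left[\ell(\hat{\beta}_0) - \ell(\hat{\beta})\right]
\end{aligned}
$$

Maximum likelihood estimation chooses $\hat{\beta}$ to maximize $\ell(\beta)$, and the likelihood ratio statistic $LR$ compares the fitted model with a restricted one, such as an intercept-only model.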

Tumour Regression via Integrative Regulation of Neurological, Inflammatory, and Hypoxic Tumour Microenvironment

  • Lee, Chang Hoon;Cho, Jungsook;Lee, Kyeong
    • Biomolecules & Therapeutics / v.28 no.2 / pp.119-130 / 2020
  • Changing trends in anticancer research have altered the treatment paradigm to the extent that it is difficult to investigate any anticancer drugs without mentioning immunotherapy. Thus, we are finally contemplating tumour regression using magic bullets known as immunotherapy drugs. This review explores the possible options and pitfalls in tumour regression by first elucidating the features of cancer and the importance of tumour microenvironments. Next, we evaluated the trends of anticancer therapeutics regulating tumour microenvironment. Finally, we introduced the concept of tumour regression and various targets of tumour microenvironment, which can be used in combination with current immunotherapy for tumour regression. In particular, we emphasize the importance of regulating the neurological manifestations of tumour microenvironment (N) in addition to inflammation (I) and hypoxia (H) in cancer.

Adaptive Regression by Mixing for Fixed Design

  • Oh, Jong-Chul;Lu, Yun;Yang, Yuhong
    • Communications for Statistical Applications and Methods / v.12 no.3 / pp.713-727 / 2005
  • Among different regression approaches, nonparametric procedures perform well under different conditions. In practice it is very hard to identify which procedure is best for the data at hand, so model combination is of practical importance. In this paper, we focus on one-dimensional regression with fixed design. Polynomial regression, local regression, and smoothing splines are considered. The data are split into two parts: one part is used for estimation and the other for prediction. Prediction performance is used to assign weights to the different regression procedures. Simulation results show that the combined estimator performs better than, or similarly to, the estimator chosen by cross validation. The combined estimator has a risk similar to that of the best candidate procedure for the data.
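
The split-then-weight scheme described in this abstract can be sketched as follows. This is an illustrative toy, not the authors' procedure: the abstract does not give the weighting formula, so the exponential weighting on held-out squared error, the known noise variance `sigma2`, the even/odd split, and the two simple candidate fitters (a constant and a straight line, standing in for the polynomial, local, and spline procedures) are all assumptions.

```python
import math


def fit_mean(xs, ys):
    """Candidate 1 (hypothetical): constant prediction at the sample mean."""
    m = sum(ys) / len(ys)
    return lambda x: m


def fit_line(xs, ys):
    """Candidate 2 (hypothetical): simple least squares line."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    a = my - b * mx
    return lambda x: a + b * x


def mixing_weights(fitters, xs, ys, sigma2=1.0):
    """Weight each procedure by its prediction error on the held-out half."""
    # Even-indexed points are used for estimation, odd-indexed for prediction.
    x_est, y_est = xs[0::2], ys[0::2]
    x_pred, y_pred = xs[1::2], ys[1::2]
    models = [fit(x_est, y_est) for fit in fitters]
    sse = [sum((y - m(x)) ** 2 for x, y in zip(x_pred, y_pred)) for m in models]
    # Assumed rule: weights proportional to exp(-SSE / (2 * sigma2)).
    best = min(sse)  # subtracting the minimum avoids underflow
    raw = [math.exp(-(s - best) / (2 * sigma2)) for s in sse]
    total = sum(raw)
    return [r / total for r in raw]


# Noise-free linear data on a fixed design: the line should dominate.
xs = [i / 10 for i in range(20)]
ys = [1.0 + 2.0 * x for x in xs]
weights = mixing_weights([fit_mean, fit_line], xs, ys)
```

A combined estimator would then be the weighted average of the candidate fits; here almost all of the weight goes to the straight line because its held-out error is essentially zero.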

Fused sliced inverse regression in survival analysis

  • Yoo, Jae Keun
    • Communications for Statistical Applications and Methods / v.24 no.5 / pp.533-541 / 2017
  • Sufficient dimension reduction (SDR) replaces the original p-dimensional predictors with a lower-dimensional linearly transformed predictor. Sliced inverse regression (SIR) has the longest and most popular history among SDR methodologies. The critical weakness of SIR is its known sensitivity to the number of slices. Recently, a fused sliced inverse regression was developed to overcome this deficit by combining SIR kernel matrices constructed from various choices of the number of slices. In this paper, the fused sliced inverse regression and SIR are compared to show that the former has a practical advantage over the latter in survival regression. Numerical studies confirm this, and a real data example is presented.

Comparison of Nonparametric Function Estimation Methods for Discontinuous Regression Functions

  • Park, Dong-Ryeon
    • The Korean Journal of Applied Statistics / v.23 no.6 / pp.1245-1253 / 2010
  • There are two main approaches for estimating a discontinuous regression function nonparametrically: the direct approach and the indirect approach. The major goals of the two approaches are different. The direct approach focuses on good overall estimation of the regression function itself, whereas the indirect approach focuses on good estimation of the jump locations. The two approaches are thus quite different in nature. Gijbels et al. (2007) argue that a comparison of the two approaches does not make much sense and that it is even difficult to choose an appropriate criterion for comparison. However, the indirect approach also yields a regression curve estimate as a subsidiary result. It is therefore necessary to verify the appropriateness of the indirect approach as an estimator of the discontinuous regression function itself. Park (2009a) compared the performance of the two approaches through a simulation study. In this paper, we consider a more general case and draw some useful conclusions.