• Title/Summary/Keyword: Regression model.

Search Result 9,515, Processing Time 0.036 seconds

Analysis of health-related quality of life using Beta regression (베타회귀분석 방법을 이용한 건강 관련 삶의 질 자료 분석)

  • Jang, Eun Jin
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.3
    • /
    • pp.547-557
    • /
    • 2017
  • The health-related quality of life data are commonly skewed and bounded with spike at the perfect health status, and the variance tended to be heteroscedastic. In this study, we have developed a prediction model for EQ-5D using linear regression model, beta regression model, and extended beta regression model with mean and precision submodel, and also compared the predictive accuracy. The extended beta regression model allows to model skewness and differences in dispersion related to covariates. Although the extended beta regression model has higher prediction accuracy than the linear regression model, the overlapped confidence intervals suggested that the extended beta regression model was superior to the linear regression model. However, the expended beta regression model could explain the heteroscedasticity and predict within the bounded range. Therefore, the expended beta regression model are appropriate for fitting the health-related quality of life data such as EQ-5D.

Exact Confidence Intervals on the Regression Coeffcients in Multiple Regression Model with Nested Error Structure

  • Park, Dong-Joon
    • Communications for Statistical Applications and Methods
    • /
    • v.4 no.2
    • /
    • pp.541-548
    • /
    • 1997
  • In regression model with nested error structure interval estimations on regression coefficients in different stages are proposed. Ordinary least square estimators and generalized least square estimators of the regression coefficients in this model are derived for between and within group model. The confidence intervals are dervied by using independent idstributional properties between regression coefficient estimators and quadratic froms obtained from the model.

  • PDF

Application of discrete Weibull regression model with multiple imputation

  • Yoo, Hanna
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.3
    • /
    • pp.325-336
    • /
    • 2019
  • In this article we extend the discrete Weibull regression model in the presence of missing data. Discrete Weibull regression models can be adapted to various type of dispersion data however, it is not widely used. Recently Yoo (Journal of the Korean Data and Information Science Society, 30, 11-22, 2019) adapted the discrete Weibull regression model using single imputation. We extend their studies by using multiple imputation also with several various settings and compare the results. The purpose of this study is to address the merit of using multiple imputation in the presence of missing data in discrete count data. We analyzed the seventh Korean National Health and Nutrition Examination Survey (KNHANES VII), from 2016 to assess the factors influencing the variable, 1 month hospital stay, and we compared the results using discrete Weibull regression model with those of Poisson, negative Binomial and zero-inflated Poisson regression models, which are widely used in count data analyses. The results showed that the discrete Weibull regression model using multiple imputation provided the best fit. We also performed simulation studies to show the accuracy of the discrete Weibull regression using multiple imputation given both under- and over-dispersed distribution, as well as varying missing rates and sample size. Sensitivity analysis showed the influence of mis-specification and the robustness of the discrete Weibull model. Using imputation with discrete Weibull regression to analyze discrete data will increase explanatory power and is widely applicable to various types of dispersion data with a unified model.

Combining Regression Model and Time Series Model to a Set of Autocorrelated Data

  • Jee, Man-Won
    • Journal of the military operations research society of Korea
    • /
    • v.8 no.1
    • /
    • pp.71-76
    • /
    • 1982
  • A procedure is established for combining a regression model and a time series model to fit to a set of autocorrelated data. This procedure is based on an iterative method to compute regression parameter estimates and time series parameter estimates simultaneously. The time series model which is discussed is basically AR(p) model, since MA(q) model or ARMA(p,q) model can be inverted to AR({$\infty$) model which can be approximated by AR(p) model. The procedure discussed in this articled is applied in general to any combination of regression model and time series model.

  • PDF

CENSORED FUZZY REGRESSION MODEL

  • Choi, Seung-Hoe;Kim, Kyung-Joong
    • Journal of the Korean Mathematical Society
    • /
    • v.43 no.3
    • /
    • pp.623-634
    • /
    • 2006
  • Various methods have been studied to construct a fuzzy regression model in order to present a fuzzy relation between a dependent variable and an independent variable. However, in the fuzzy regression analysis the value of the center point of estimated fuzzy output may be either greater than the value of the right endpoint or smaller than the value of the left endpoint. In the case, we cannot predict the fuzzy output properly. This paper presents sufficient conditions to construct the fuzzy regression model using several methods investigated by some authors and then introduces the censored fuzzy regression model using the censored samples to manipulate the problem of crossing of the center and the end points of the estimated fuzzy number. Examples show that the censored fuzzy regression model is an extension of the fuzzy regression model and also it improves the problem of crossing.

MapReduce-based Localized Linear Regression for Electricity Price Forecasting (전기 가격 예측을 위한 맵리듀스 기반의 로컬 단위 선형회귀 모델)

  • Han, Jinju;Lee, Ingyu;On, Byung-Won
    • The Transactions of the Korean Institute of Electrical Engineers P
    • /
    • v.67 no.4
    • /
    • pp.183-190
    • /
    • 2018
  • Predicting accurate electricity prices is an important task in the electricity trading market. To address the electricity price forecasting problem, various approaches have been proposed so far and it is known that linear regression-based approaches are the best. However, the use of such linear regression-based methods is limited due to low accuracy and performance. In traditional linear regression methods, it is not practical to find a nonlinear regression model that explains the training data well. If the training data is complex (i.e., small-sized individual data and large-sized features), it is difficult to find the polynomial function with n terms as the model that fits to the training data. On the other hand, as a linear regression model approximating a nonlinear regression model is used, the accuracy of the model drops considerably because it does not accurately reflect the characteristics of the training data. To cope with this problem, we propose a new electricity price forecasting method that divides the entire dataset to multiple split datasets and find the best linear regression models, each of which is the optimal model in each dataset. Meanwhile, to improve the performance of the proposed method, we modify the proposed localized linear regression method in the map and reduce way that is a framework for parallel processing data stored in a Hadoop distributed file system. Our experimental results show that the proposed model outperforms the existing linear regression model. Specifically, the accuracy of the proposed method is improved by 45% and the performance is faster 5 times than the existing linear regression-based model.

Graphical regression and model assessment in logistic model (로지스틱모형에서 그래픽을 이용한 회귀와 모형평가)

  • Kahng, Myung-Wook;Kim, Bu-Yong;Hong, Ju-Hee
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.1
    • /
    • pp.21-32
    • /
    • 2010
  • Graphical regression is a paradigm for obtaining regression information using plots without model assumptions. The general goal of this approach is to find lowdimensional sufficient summary plots without loss of important information. Model assessments using residual plots are less likely to be successful in models that are not linear. As an alternative approach, marginal model plots provide a general graphical method for assessing the model. We apply the methods of graphical regression and model assessment using marginal model plots to the logistic regression model.

Estimation of Pollutant Load Using Genetic-algorithm and Regression Model (유전자 알고리즘과 회귀식을 이용한 오염부하량의 예측)

  • Park, Youn Shik
    • Korean Journal of Environmental Agriculture
    • /
    • v.33 no.1
    • /
    • pp.37-43
    • /
    • 2014
  • BACKGROUND: Water quality data are collected less frequently than flow data because of the cost to collect and analyze, while water quality data corresponding to flow data are required to compute pollutant loads or to calibrate other hydrology models. Regression models are applicable to interpolate water quality data corresponding to flow data. METHODS AND RESULTS: A regression model was suggested which is capable to consider flow and time variance, and the regression model coefficients were calibrated using various measured water quality data with genetic-algorithm. Both LOADEST and the regression using genetic-algorithm were evaluated by 19 water quality data sets through calibration and validation. The regression model using genetic-algorithm displayed the similar model behaviors to LOADEST. The load estimates by both LOADEST and the regression model using genetic-algorithm indicated that use of a large proportion of water quality data does not necessarily lead to the load estimates with smaller error to measured load. CONCLUSION: Regression models need to be calibrated and validated before they are used to interpolate pollutant loads, as separating water quality data into two data sets for calibration and validation.

Application of Logit Model in Qualitative Dependent Variables (로짓모형을 이용한 질적 종속변수의 분석)

  • Lee, Kil-Soon;Yu, Wann
    • Journal of Families and Better Life
    • /
    • v.10 no.1 s.19
    • /
    • pp.131-138
    • /
    • 1992
  • Regression analysis has become a standard statistical tool in the behavioral science. Because of its widespread popularity. regression has been often misused. Such is the case when the dependent variable is a qualitative measure rather than a continuous, interval measure. Regression estimates with a qualitative dependent variable does not meet the assumptions underlying regression. It can lead to serious errors in the standard statistical inference. Logit model is recommended as alternatives to the regression model for qualitative dependent variables. Researchers can employ this model to measure the relationship between independent variables and qualitative dependent variables without assuming that logit model was derived from probabilistic choice theory. Coefficients in logit model are typically estimated by the method of Maximum Likelihood Estimation in contrast to ordinary regression model which estimated by the method of Least Squares Estimation. Goodness of fit in logit model is based on the likelihood ratio statistics and the t-statistics is used for testing the null hypothesis.

  • PDF