• Title/Summary/Keyword: 로버스트회귀분석

Search Result 23, Processing Time 0.028 seconds

Comparison of GEE Estimation Methods for Repeated Binary Data with Time-Varying Covariates on Different Missing Mechanisms (시간-종속적 공변량이 포함된 이분형 반복측정자료의 GEE를 이용한 분석에서 결측 체계에 따른 회귀계수 추정방법 비교)

  • Park, Boram;Jung, Inkyung
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.5
    • /
    • pp.697-712
    • /
    • 2013
  • When analyzing repeated binary data, the generalized estimating equations(GEE) approach produces consistent estimates for regression parameters even if an incorrect working correlation matrix is used. However, time-varying covariates experience larger changes in coefficients than time-invariant covariates across various working correlation structures for finite samples. In addition, the GEE approach may give biased estimates under missing at random(MAR). Weighted estimating equations and multiple imputation methods have been proposed to reduce biases in parameter estimates under MAR. This article studies if the two methods produce robust estimates across various working correlation structures for longitudinal binary data with time-varying covariates under different missing mechanisms. Through simulation, we observe that time-varying covariates have greater differences in parameter estimates across different working correlation structures than time-invariant covariates. The multiple imputation method produces more robust estimates under any working correlation structure and smaller biases compared to the other two methods.

A Study of the Roust Degradation Model by Analyzing the Filament Lamp Degradation Data (헤드램프용 필라멘트 램프 가속열화데이터 분석을 통한 로버스트 열화모형 연구)

  • Sung, Ki-Woo
    • Transactions of the Korean Society of Automotive Engineers
    • /
    • v.20 no.6
    • /
    • pp.132-139
    • /
    • 2012
  • It is generally needed to test durability and lifetime when we develop parts in new technology. In this paper, the accelerated degradation analysis methods are developed to test them. This study is presented robust model estimation method that is less affected by outlier in regresstion model estimation. In addition, the lifetime can be predicted by Degradation-stress relationship in stress level.

Fuzzy Theil regression Model (Theil방법을 이용한 퍼지회귀모형)

  • Yoon, Jin Hee;Lee, Woo-Joo;Choi, Seung-Hoe
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.23 no.4
    • /
    • pp.366-370
    • /
    • 2013
  • Regression Analysis is an analyzing method of regression model to explain the statistical relationship between explanatory variable and response variables. This paper introduce Theil's method to find a fuzzy regression model which explain the relationship between explanatory variable and response variables. Theil's method is a robust method which is not sensive to outliers. Theil's method use medians of rate of increment based on randomly chosen pairs of each components of ${\alpha}$-level sets of fuzzy data in order to estimate the coefficients of fuzzy regression model. We propose an example to show Theil's estimator is robust than the Least squares estimator.

Characterization of low frequency between Droughts and Meteorological factor in Korea (우리나라 가뭄특성과 기상인자간의 저빈도 특성 분석)

  • So, Byung-Jin;Kwon, Hyun-Han
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2012.05a
    • /
    • pp.418-418
    • /
    • 2012
  • 현재 전 세계적으로 온실가스 농도 증가로 호우나 가뭄, 대설 등 지역에 따라 서로 상반되는 변화를 가져올 수 있다고 경고되고 있으며, 우리나라에서도 남해안지역과 경기북부지역에서 호우빈도가 증가하는 반면, 충정도 내륙지역과 경상북도에서는 호우빈도가 감소하고 5일 누적 강수량 또한 감소하여, 해당지역에서 가뭄이 발생할 경우 심화될 가능성이 높아진다고 보고된 바 있다. 기후변화 시나리오에 분석결과에서도 우리나라의 경우 평균적으로 강우일수는 작아지며, 강우강도는 커지는 결과들이 도출되었다. 이러한 결과들은 가뭄의 발생가능성이 높아지고 있음을 보여주고 있다. 본 연구에서는 우리나라에서 발생된 가뭄의 특성을 분석하고 가뭄의 특성과 기상인자간의 관계를 Quantile regression 분석을 통해 살펴보고자 한다. 가뭄의 특성과 기상인자(엘니뇨, 강수량 등)의 관계에 있어서 기상인자들의 평균을 이용하는 일반적인 회귀분석은 전체 데이터의 영향에 따른 가뭄특성인자와의 관계를 보여준다. 하지만 강수량과 가뭄과의 관계에서와 같이 강수량의 극값보다는 적은 강수량 혹은 무강우일수가 가뭄과 밀접한 관련을 보여준다. 이러한 점에서 이상치들에 영향을 배재할 수 있는 Quantile regression을 사용하여 Quantile에 따른 기상인자와 가뭄특성과의 관계를 규명하고 평가해 보고자 한다. 본 연구에서 적용한 Quantile Regression 기법은 회귀계수의 추정에 있어서 회귀인자의 신뢰성을 아래와 같은 Quantile-회귀계수 그래프를 통해 분석할 수 있으며, 로버스트 통계량의 특징인 분산이 적은 안정적인 추정량을 확보할 수 있는 장점을 갖는다. 아래식은 Quantile regression의 회귀계수 추정식을 나타낸다. $$arg\;in\;{n\\\;p(y_i-f(x_i,\;z_i,\;{\cdots}))\\ =1}$$ 여기서, $y_i$는 가뭄특성값을 $x_i$, $z_i$, $\cdots$는 기상인자를 나타낸다. $$p(y-q)={{\beta}(y-q)\;y{\geq_-}q \\ (1-{\beta})(q-y)\;y<q}$$ ${\beta}$는 quantile을 나타내며 0< ${\beta}$ <1범위를 갖는다.

  • PDF

Asymptotic Test for Dimensionality in Sliced Inverse Regression (분할 역회귀모형에서 차원결정을 위한 점근검정법)

  • Park, Chang-Sun;Kwak, Jae-Guen
    • The Korean Journal of Applied Statistics
    • /
    • v.18 no.2
    • /
    • pp.381-393
    • /
    • 2005
  • As a promising technique for dimension reduction in regression analysis, Sliced Inverse Regression (SIR) and an associated chi-square test for dimensionality were introduced by Li (1991). However, Li's test needs assumption of Normality for predictors and found to be heavily dependent on the number of slices. We will provide a unified asymptotic test for determining the dimensionality of the SIR model which is based on the probabilistic principal component analysis and free of normality assumption on predictors. Illustrative results with simulated and real examples will also be provided.

Outlier detection for multivariate long memory processes (다변량 장기 종속 시계열에서의 이상점 탐지)

  • Kim, Kyunghee;Yu, Seungyeon;Baek, Changryong
    • The Korean Journal of Applied Statistics
    • /
    • v.35 no.3
    • /
    • pp.395-406
    • /
    • 2022
  • This paper studies the outlier detection method for multivariate long memory time series. The existing outlier detection methods are based on a short memory VARMA model, so they are not suitable for multivariate long memory time series. It is because higher order of autoregressive model is necessary to account for long memory, however, it can also induce estimation instability as the number of parameter increases. To resolve this issue, we propose outlier detection methods based on the VHAR structure. We also adapt the robust estimation method to estimate VHAR coefficients more efficiently. Our simulation results show that our proposed method performs well in detecting outliers in multivariate long memory time series. Empirical analysis with stock index shows RVHAR model finds additional outliers that existing model does not detect.

Unified Approach to Coefficient of Determination $R^2$ Using Likelihood Distancd (우도거리에 의한 결정계수 $R^2$에의한 통합적 접근)

  • 허명회;이종한;정진환
    • The Korean Journal of Applied Statistics
    • /
    • v.4 no.2
    • /
    • pp.117-127
    • /
    • 1991
  • Coefficient of determination $R^2$ is most frequently used descriptive measure in practical use of linear regression analysis. But there have been controversies on defining this measure in the cases of linear regression without the intercept, weighted linear regression and robust linear regression. Several authors such as Kvalseth(1985) and Willet and Singer(1988) proposed many variations of $R^2$ to meet the situations. However, theire measures are not satisfactory due to the lack of a universal principle. In this study, we propose a unfied approach to defining the coefficient of determination $R^2$ using the concept of likelihood distance. This new measure is in good accordance with typical $R^2$ in linear regression and, moreover, can be applied to nonlinear regression models and generalized linear models such as logit and log-linear models.

  • PDF

A Confirmation of Identified Multiple Outliers and Leverage Points in Linear Model (다중 선형 모형에서 식별된 다중 이상점과 다중 지렛점의 재확인 방법에 대한 연구)

  • 유종영;안기수
    • The Korean Journal of Applied Statistics
    • /
    • v.15 no.2
    • /
    • pp.269-279
    • /
    • 2002
  • We considered the problem for confirmation of multiple outliers and leverage points. Identification of multiple outliers and leverage points is difficult because of the masking effect and swamping effect. Rousseeuw and van Zomeren(1990) identified multiple outliers and leverage points by using the Least Median of Squares and Minimum Value of Ellipsoids which are high-breakdown robust estimators. But their methods tend to declare too many observations as extremes. Atkinson(1987) suggested a method for confirming of outliers and Fung(1993) pointed out Atkinson method's limitation and proposed another method by using the add-back model. But we analyzed that Fung's method is affected by adjacent effect. In this thesis, we proposed one procedure for confirmation of outliers and leverage points and compared three example with Fung's method.

Robust ridge regression for nonlinear mixed effects models with applications to quantitative high throughput screening assay data (비선형 혼합효과모형에서의 로버스트 능형회귀 방법과 정량적 고속 대량 스크리닝 자료에의 응용)

  • Yoo, Jiseon;Lim, Changwon
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.1
    • /
    • pp.123-137
    • /
    • 2018
  • A nonlinear mixed effects model is mainly used to analyze repeated measurement data in various fields. A nonlinear mixed effects model consists of two stages: the first-stage individual-level model considers intra-individual variation and the second-stage population model considers inter-individual variation. The individual-level model, which is the first stage of the nonlinear mixed effects model, estimates the parameters of the nonlinear regression model. It is the same as the general nonlinear regression model, and usually estimates parameters using the least squares estimation method. However, the least squares estimation method may have a problem that the estimated value of the parameters and standard errors become extremely large if the assumed nonlinear function is not explicitly revealed by the data. In this paper, a new estimation method is proposed to solve this problem by introducing the ridge regression method recently proposed in the nonlinear regression model into the first-stage individual-level model of the nonlinear mixed effects model. The performance of the proposed estimator is compared with the performance with the standard estimator through a simulation study. The proposed methodology is also illustrated using quantitative high throughput screening data obtained from the US National Toxicology Program.

Preliminary test estimation method accounting for error variance structure in nonlinear regression models (비선형 회귀모형에서 오차의 분산에 따른 예비검정 추정방법)

  • Yu, Hyewon;Lim, Changwon
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.4
    • /
    • pp.595-611
    • /
    • 2016
  • We use nonlinear regression models (such as the Hill Model) when we analyze data in toxicology and/or pharmacology. In nonlinear regression models an estimator of parameters and estimation of measurement about uncertainty of the estimator are influenced by the variance structure of the error. Thus, estimation methods should be different depending on whether the data are homoscedastic or heteroscedastic. However, we do not know the variance structure of the error until we actually analyze the data. Therefore, developing estimation methods robust to the variance structure of the error is an important problem. In this paper we propose a method to estimate parameters in nonlinear regression models based on a preliminary test. We define an estimator which uses either the ordinary least square estimation method or the iterative weighted least square estimation method according to the results of a simple preliminary test for the equality of the error variance. The performance of the proposed estimator is compared to those of existing estimators by simulation studies. We also compare estimation methods using real data obtained from the National Toxicology program of the United States.