• 제목/요약/키워드: generalized estimating equations

검색결과 63건 처리시간 0.019초

A Flexible Modeling Approach for Current Status Survival Data via Pseudo-Observations

  • Han, Seungbong;Andrei, Adin-Cristian;Tsui, Kam-Wah
    • 응용통계연구
    • /
    • 제25권6호
    • /
    • pp.947-958
    • /
    • 2012
  • When modeling event times in biomedical studies, the outcome might be incompletely observed. In this paper, we assume that the outcome is recorded as current status failure time data. Despite well-developed literature the routine practical use of many current status data modeling methods remains infrequent due to the lack of specialized statistical software, the difficulty to assess model goodness-of-fit, as well as the possible loss of information caused by covariate grouping or discretization. We propose a model based on pseudo-observations that is convenient to implement and that allows for flexibility in the choice of the outcome. Parameter estimates are obtained based on generalized estimating equations. Examples from studies in bile duct hyperplasia and breast cancer in conjunction with simulated data illustrate the practical advantages of this model.

시간-종속적 공변량이 포함된 이분형 반복측정자료의 GEE를 이용한 분석에서 결측 체계에 따른 회귀계수 추정방법 비교 (Comparison of GEE Estimation Methods for Repeated Binary Data with Time-Varying Covariates on Different Missing Mechanisms)

  • 박보람;정인경
    • 응용통계연구
    • /
    • 제26권5호
    • /
    • pp.697-712
    • /
    • 2013
  • 다시점 자료 연구에서 일반화추정방정식은 가상관행렬을 잘못 가정하더라도 모수의 일치추정량을 도출하므로 많이 이용된다. 하지만, 결측 체계가 완전임의결측이 아닌 경우에는 편의추정량을 제공하고, 시간-종속적 공변량이 포함된 경우에는 가상관행렬에 따라 회귀계수 추정값이 다르게 도출될 수 있는 문제점이 있다. 결측 체계가 임의결측인 경우에 발생하는 문제를 해결하기 위해 가중 방법과 다중대체 방법을 사용하는 것이 제안되었다. 본 논문에서는 시간-종속적 공변량이 포함된 이분형 반복측정자료를 GEE를 이용하여 분석할 때 다양한 결측 체계에서 일반화추정방정식 방법, 가중 방법, 다중대체 방법의 회귀계수 추정에 대한 로버스트성과 정확성을 모의실험을 통하여 비교해 보았다. 세 가지 방법 모두에서 시간-종속적 공변량의 회귀계수가 시간-독립적 공변량의 회귀계수에 비해 가상관행렬에 따라 추정값의 차이가 크게 나타났다. 다른 두 방법에 비해 다중대체 방법이 가상관행렬의 형태에 대해 더 로버스트하고 편의도 작은 추정치를 도출하였다.

An Analysis of Panel Count Data from Multiple random processes

  • 박유성;김희영
    • 한국통계학회:학술대회논문집
    • /
    • 한국통계학회 2002년도 추계 학술발표회 논문집
    • /
    • pp.265-272
    • /
    • 2002
  • An Integer-valued autoregressive integrated (INARI) model is introduced to eliminate stochastic trend and seasonality from time series of count data. This INARI extends the previous integer-valued ARMA model. We show that it is stationary and ergodic to establish asymptotic normality for conditional least squares estimator. Optimal estimating equations are used to reflect categorical and serial correlations arising from panel count data and variations arising from three random processes for obtaining observation into estimation. Under regularity conditions for martingale sequence, we show asymptotic normality for estimators from the estimating equations. Using cancer mortality data provided by the U.S. National Center for Health Statistics (NCHS), we apply our results to estimate the probability of cells classified by 4 causes of death and 6 age groups and to forecast death count of each cell. We also investigate impact of three random processes on estimation.

  • PDF

A marginal logit mixed-effects model for repeated binary response data

  • Choi, Jae-Sung
    • Journal of the Korean Data and Information Science Society
    • /
    • 제19권2호
    • /
    • pp.413-420
    • /
    • 2008
  • This paper suggests a marginal logit mixed-effects for analyzing repeated binary response data. Since binary repeated measures are obtained over time from each subject, observations will have a certain covariance structure among them. As a plausible covariance structure, 1st order auto-regressive correlation structure is assumed for analyzing data. Generalized estimating equations(GEE) method is used for estimating fixed effects in the model.

  • PDF

Quasi-Likelihood Approach for Linear Models with Censored Data

  • Ha, Il-Do;Cho, Geon-Ho
    • Journal of the Korean Data and Information Science Society
    • /
    • 제9권2호
    • /
    • pp.219-225
    • /
    • 1998
  • The parameters in linear models with censored normal responses are usually estimated by the iterative maximum likelihood and least square methods. However, the iterative least square method is simple but hardly has theoretical justification, and the iterative maximum likelihood estimating equations are complicatedly derived. In this paper, we justify these methods via Wedderburn (1974)'s quasi-likelihood approach. This provides an explicit justification for the iterative least square method and also directly the iterative maximum likelihood method for estimating the regression coefficients.

  • PDF

민수부문의 에너지원별 수요예측모형 (An Energy Demand Forecasting Model for the Residential and Commercial Sector)

  • 유병우
    • 한국경영과학회지
    • /
    • 제8권2호
    • /
    • pp.45-56
    • /
    • 1983
  • This paper presents a generalized fuel choice model in which restrictive constraints on cross-price coefficients as Baughman-Joskow-FEA Logit Model need not be imposed, but all demand elasticities are uniquely determined. The model is applied to estimating aggregate energy demand and fuel choices for the residential and commercial sector. The structural equations are estimated by a generalized least squares procedure using national-level EPB, KDI, BK, KRIS, MOER data for 1965 and 1980, and other related reports. The econometric results support the argument that “third-price” and “fourth-price” coefficients should not be constrained in estimating relative market share models. Furthermore, by using this fuel choice model, it has forecasted energy demands by fuel sources in, the residential and commercial sector until 1991. The results are turned out good estimates to compare with existing demands forecasted from other institutes.

  • PDF

Regression Analysis of Longitudinal Data Based on M-estimates

  • Jung, Sin-Ho;Terry M. Therneau
    • Journal of the Korean Statistical Society
    • /
    • 제29권2호
    • /
    • pp.201-217
    • /
    • 2000
  • The method of generalized estimating equations (GEE) has become very popular for the analysis of longitudinal data. We extend this work to the use of M-estimators; the resultant regression estimates are robust to heavy tailed errors and to outliers. The proposed method does not require correct specification of the dependence structure between observation, and allows for heterogeneity of the error. However, an estimate of the dependence structure may be incorporated, and if it is correct this guarantees a higher efficiency for the regression estimators. A goodness-of-fit test for checking the adequacy of the assumed M-estimation regression model is also provided. Simulation studies are conducted to show the finite-sample performance of the new methods. The proposed methods are applied to a real-life data set.

  • PDF

A GEE approach for the semiparametric accelerated lifetime model with multivariate interval-censored data

  • Maru Kim;Sangbum Choi
    • Communications for Statistical Applications and Methods
    • /
    • 제30권4호
    • /
    • pp.389-402
    • /
    • 2023
  • Multivariate or clustered failure time data often occur in many medical, epidemiological, and socio-economic studies when survival data are collected from several research centers. If the data are periodically observed as in a longitudinal study, survival times are often subject to various types of interval-censoring, creating multivariate interval-censored data. Then, the event times of interest may be correlated among individuals who come from the same cluster. In this article, we propose a unified linear regression method for analyzing multivariate interval-censored data. We consider a semiparametric multivariate accelerated failure time model as a statistical analysis tool and develop a generalized Buckley-James method to make inferences by imputing interval-censored observations with their conditional mean values. Since the study population consists of several heterogeneous clusters, where the subjects in the same cluster may be related, we propose a generalized estimating equations approach to accommodate potential dependence in clusters. Our simulation results confirm that the proposed estimator is robust to misspecification of working covariance matrix and statistical efficiency can increase when the working covariance structure is close to the truth. The proposed method is applied to the dataset from a diabetic retinopathy study.

AR 프로세스를 이용한 도산예측모형 (Bankruptcy Prediction Model with AR process)

  • 이군희;지용희
    • 한국경영과학회지
    • /
    • 제26권1호
    • /
    • pp.109-116
    • /
    • 2001
  • The detection of corporate failures is a subject that has been particularly amenable to cross-sectional financial ratio analysis. In most of firms, however, the financial data are available over past years. Because of this, a model utilizing these longitudinal data could provide useful information on the prediction of bankruptcy. To correctly reflect the longitudinal and firm-specific data, the generalized linear model with assuming the first order AR(autoregressive) process is proposed. The method is motivated by the clinical research that several characteristics are measured repeatedly from individual over the time. The model is compared with several other predictive models to evaluate the performance. By using the financial data from manufacturing corporations in the Korea Stock Exchange (KSE) list, we will discuss some experiences learned from the procedure of sampling scheme, variable transformation, imputation, variable selection, and model evaluation. Finally, implications of the model with repeated measurement and future direction of research will be discussed.

  • PDF

복합표본조사 데이터 분석을 위한 회귀모형 접근법의 비교: 소규모사업체조사 데이터 분석을 중심으로 (Comparison of Regression Model Approaches fined to Complex Survey Data)

  • 이기재
    • 한국조사연구학회지:조사연구
    • /
    • 제2권1호
    • /
    • pp.73-86
    • /
    • 2001
  • 본 논문은 복합표본조사 분석에서 회귀모형 접근법으로 사용되는 모형 기반 접근법, 설계 기반 접근법과 일반화 추정 방정식 접근법을 설명하고, 이들을 실증적으로 비교한 것이다. 또한 설계 기반 접근법과 일반화 추정 방정식 접근법에 대해서 설계효과와 가중치 효과 분석을 통해서 표본 설계가 모수 추정에 미치는 영향을 살펴보았다.

  • PDF