• Title/Summary/Keyword: semiparametric regression model

Search Result 27, Processing Time 0.017 seconds

Semiparametric kernel logistic regression with longitudinal data

  • Shim, Joo-Yong;Seok, Kyung-Ha
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.2
    • /
    • pp.385-392
    • /
    • 2012
  • Logistic regression is a well known binary classification method in the field of statistical learning. Mixed-effect regression models are widely used for the analysis of correlated data such as those found in longitudinal studies. We consider kernel extensions with semiparametric fixed effects and parametric random effects for the logistic regression. The estimation is performed through the penalized likelihood method based on kernel trick, and our focus is on the efficient computation and the effective hyperparameter selection. For the selection of optimal hyperparameters, cross-validation techniques are employed. Numerical results are then presented to indicate the performance of the proposed procedure.

Residual-based copula parameter estimation (잔차를 이용한 코플라 모수 추정)

  • Na, Okyoung;Kwon, Sunghoon
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.1
    • /
    • pp.267-277
    • /
    • 2016
  • This paper considers we consider the estimation of copula parameters based on residuals in stochastic regression models. We prove that a semiparametric estimator using residual empirical distributions is consistent under some conditions and apply the results to the copula-ARMA model. We provide simulation results for illustration.

A study on robust regression estimators in heteroscedastic error models

  • Son, Nayeong;Kim, Mijeong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.5
    • /
    • pp.1191-1204
    • /
    • 2017
  • Weighted least squares (WLS) estimation is often easily used for the data with heteroscedastic errors because it is intuitive and computationally inexpensive. However, WLS estimator is less robust to a few outliers and sometimes it may be inefficient. In order to overcome robustness problems, Box-Cox transformation, Huber's M estimation, bisquare estimation, and Yohai's MM estimation have been proposed. Also, more efficient estimations than WLS have been suggested such as Bayesian methods (Cepeda and Achcar, 2009) and semiparametric methods (Kim and Ma, 2012) in heteroscedastic error models. Recently, Çelik (2015) proposed the weight methods applicable to the heteroscedasticity patterns including butterfly-distributed residuals and megaphone-shaped residuals. In this paper, we review heteroscedastic regression estimators related to robust or efficient estimation and describe their properties. Also, we analyze cost data of U.S. Electricity Producers in 1955 using the methods discussed in the paper.

Bayesian ordinal probit semiparametric regression models: KNHANES 2016 data analysis of the relationship between smoking behavior and coffee intake (베이지안 순서형 프로빗 준모수 회귀 모형 : 국민건강영양조사 2016 자료를 통한 흡연양태와 커피섭취 간의 관계 분석)

  • Lee, Dasom;Lee, Eunji;Jo, Seogil;Choi, Taeryeon
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.1
    • /
    • pp.25-46
    • /
    • 2020
  • This paper presents ordinal probit semiparametric regression models using Bayesian Spectral Analysis Regression (BSAR) method. Ordinal probit regression is a way of modeling ordinal responses - usually more than two categories - by connecting the probability of falling into each category explained by a combination of available covariates using a probit (an inverse function of normal cumulative distribution function) link. The Bayesian probit model facilitates posterior sampling by bringing a latent variable following normal distribution, therefore, the responses are categorized by the cut-off points according to values of latent variables. In this paper, we extend the latent variable approach to a semiparametric model for the Bayesian ordinal probit regression with nonparametric functions using a spectral representation of Gaussian processes based BSAR method. The latent variable is decomposed into a parametric component and a nonparametric component with or without a shape constraint for modeling ordinal responses and predicting outcomes more flexibly. We illustrate the proposed methods with simulation studies in comparison with existing methods and real data analysis applied to a Korean National Health and Nutrition Examination Survey (KNHANES) 2016 for investigating nonparametric relationship between smoking behavior and coffee intake.

A semiparametric method to measure predictive accuracy of covariates for doubly censored survival outcomes

  • Han, Seungbong;Lee, JungBok
    • Communications for Statistical Applications and Methods
    • /
    • v.23 no.4
    • /
    • pp.343-353
    • /
    • 2016
  • In doubly-censored data, an originating event time and a terminating event time are interval-censored. In certain analyses of such data, a researcher might be interested in the elapsed time between the originating and terminating events as well as regression modeling with risk factors. Therefore, in this study, we introduce a model evaluation method to measure the predictive ability of a model based on negative predictive values. We use a semiparametric estimate of the predictive accuracy to provide a simple and flexible method for model evaluation of doubly-censored survival outcomes. Additionally, we used simulation studies and tested data from a prostate cancer trial to illustrate the practical advantages of our approach. We believe that this method could be widely used to build prediction models or nomograms.

Survival Function Estimation for the Proportional Hazards Regression Model

  • Cha, Young Joon
    • Journal of Korean Society for Quality Management
    • /
    • v.18 no.1
    • /
    • pp.9-20
    • /
    • 1990
  • The purpose of this paper is to propose the modified semiparametric estimators for survival function in the Cox's regression model with randomly censored data based on Tsiatis and Breslow estimators, and present their asymptotic variances estimates. The proposed estimators are compared to Tsiatis, Breslow, and Kaplan-Meier estimators through a small-sample Monte Carlo study. The simulation results show that the proposed estimators are preferred for small sample sizes.

  • PDF

Negative Binomial Varying Coefficient Partially Linear Models

  • Kim, Young-Ju
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.6
    • /
    • pp.809-817
    • /
    • 2012
  • We propose a semiparametric inference for a generalized varying coefficient partially linear model(VCPLM) for negative binomial data. The VCPLM is useful to model real data in that varying coefficients are a special type of interaction between explanatory variables and partially linear models fit both parametric and nonparametric terms. The negative binomial distribution often arise in modelling count data which usually are overdispersed. The varying coefficient function estimators and regression parameters in generalized VCPLM are obtained by formulating a penalized likelihood through smoothing splines for negative binomial data when the shape parameter is known. The performance of the proposed method is then evaluated by simulations.

SOME PROPERTIES OF SIMEX ESTIMATOR IN PARTIALLY LINEAR MEASUREMENT ERROR MODEL

  • Meeseon Jeong;Kim, Choongrak
    • Journal of the Korean Statistical Society
    • /
    • v.32 no.1
    • /
    • pp.85-92
    • /
    • 2003
  • We consider the partially linear model E(Y) : X$^{t}$ $\beta$+η(Z) when the X's are measured with additive error. The semiparametric likelihood estimation ignoring the measurement error gives inconsistent estimator for both $\beta$ and η(.). In this paper we suggest the SIMEX estimator for f to correct the bias induced by measurement error, and explore its properties. We show that the rational linear extrapolant is proper in extrapolation step in the sense that the SIMEX method under this extrapolant gives consistent estimator It is also shown that the SIMEX estimator is asymptotically equivalent to the semiparametric version of the usual parametric correction for attenuation suggested by Liang et al. (1999) A simulation study is given to compare two variance estimating methods for SIMEX estimator.

Shrinkage Small Area Estimation Using a Semiparametric Mixed Model (준모수혼합모형을 이용한 축소소지역추정)

  • Jeong, Seok-Oh;Choo, Manho;Shin, Key-Il
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.4
    • /
    • pp.605-617
    • /
    • 2014
  • Small area estimation is a statistical inference method to overcome large variance due to a small sample size allocated in a small area. A shrinkage estimator obtained by minimizing relative error(RE) instead of MSE has been suggested. The estimator takes advantage of good interpretation when the data range is large. A semiparametric estimator is also studied for small area estimation. In this study, we suggest a semiparametric shrinkage small area estimator and compare small area estimators using labor statistics.

A GEE approach for the semiparametric accelerated lifetime model with multivariate interval-censored data

  • Maru Kim;Sangbum Choi
    • Communications for Statistical Applications and Methods
    • /
    • v.30 no.4
    • /
    • pp.389-402
    • /
    • 2023
  • Multivariate or clustered failure time data often occur in many medical, epidemiological, and socio-economic studies when survival data are collected from several research centers. If the data are periodically observed as in a longitudinal study, survival times are often subject to various types of interval-censoring, creating multivariate interval-censored data. Then, the event times of interest may be correlated among individuals who come from the same cluster. In this article, we propose a unified linear regression method for analyzing multivariate interval-censored data. We consider a semiparametric multivariate accelerated failure time model as a statistical analysis tool and develop a generalized Buckley-James method to make inferences by imputing interval-censored observations with their conditional mean values. Since the study population consists of several heterogeneous clusters, where the subjects in the same cluster may be related, we propose a generalized estimating equations approach to accommodate potential dependence in clusters. Our simulation results confirm that the proposed estimator is robust to misspecification of working covariance matrix and statistical efficiency can increase when the working covariance structure is close to the truth. The proposed method is applied to the dataset from a diabetic retinopathy study.