• Title/Summary/Keyword: Censored Data

Search Result 405, Processing Time 0.022 seconds

Estimation of hazard function and hazard change-point for the rectal cancer data (직장암 데이터에 대한 위험률 함수 추정 및 위험률 변화점 추정)

  • Lee, Sieun;Shim, Byoung Yong;Kim, Jaehee
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.6
    • /
    • pp.1225-1238
    • /
    • 2015
  • In this research, we fit various survival models and conduct tests and estimation for the hazard change-point with the rectal cancer data. By the log-rank tests, at significance level ${\alpha}=0.10$, survival functions are significantly different according to the uniporter of glucose (GLUT1), clinical stage (cstage) and pathologic stage (ypstage). From the Cox proportional hazard model, the most significant covariates are GLUT1 and ypstage. Assuming that the rectal cancer data follows the exponential distribution, we estimate one hazard change-point using Matthews and Farewell (1982), Henderson (1990) and Loader (1991) methods.

Parametric survival model based on the Lévy distribution

  • Valencia-Orozco, Andrea;Tovar-Cuevas, Jose R.
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.5
    • /
    • pp.445-461
    • /
    • 2019
  • It is possible that data are not always fitted with sufficient precision by the existing distributions; therefore this article presents a methodology that enables the use of families of asymmetric distributions as alternative probabilistic models for survival analysis, with censorship on the right, different from those usually studied (the Exponential, Gamma, Weibull, and Lognormal distributions). We use a more flexible parametric model in terms of density behavior, assuming that data can be fit by a distribution of stable distribution families considered unconventional in the analyses of survival data that are appropriate when extreme values occur, with small probabilities that should not be ignored. In the methodology, the determination of the analytical expression of the risk function h(t) of the $L{\acute{e}}vy$ distribution is included, as it is not usually reported in the literature. A simulation was conducted to evaluate the performance of the candidate distribution when modeling survival times, including the estimation of parameters via the maximum likelihood method, survival function ${\hat{S}}$(t) and Kaplan-Meier estimator. The obtained estimates did not exhibit significant changes for different sample sizes and censorship fractions in the sample. To illustrate the usefulness of the proposed methodology, an application with real data, regarding the survival times of patients with colon cancer, was considered.

Estimation methods and interpretation of competing risk regression models (경쟁 위험 회귀 모형의 이해와 추정 방법)

  • Kim, Mijeong
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.7
    • /
    • pp.1231-1246
    • /
    • 2016
  • Cause-specific hazard model (Prentice et al., 1978) and subdistribution hazard model (Fine and Gray, 1999) are mostly used for the right censored survival data with competing risks. Some other models for survival data with competing risks have been subsequently introduced; however, those models have not been popularly used because the models cannot provide reliable statistical estimation methods or those are overly difficult to compute. We introduce simple and reliable competing risk regression models which have been recently proposed as well as compare their methodologies. We show how to use SAS and R for the data with competing risks. In addition, we analyze survival data with two competing risks using five different models.

A Study on the Survival Probability and Survival Factors of Small and Medium-sized Enterprises Using Technology Rating Data (기술평가 자료를 이용한 중소기업의 생존율 추정 및 생존요인 분석)

  • Lee, Young-Chan
    • Knowledge Management Research
    • /
    • v.11 no.2
    • /
    • pp.95-109
    • /
    • 2010
  • The objectives of this study are to identify the survival function (hazard function) of small and medium enterprises by using technology rating data for the companies guaranteed by Korea Technology Finance Corporation (KOTEC), and to figure out the factors that affects their survival. To serve the purposes, this study uses Kaplan-Meier Analysis as a non-parametric method and Cox proportional hazards model as a semi-parametric one. The 17,396 guaranteed companies that assessed from July 1st in 2005 to December 31st in 2009 are selected as samples (16,504 censored data and 829 accident data). The survival time is computed with random censoring (Type III) from July in 2005 as a starting point. The results of the analysis show that Kaplan-Meier Analysis and Cox proportional hazards model are able to readily estimate survival and hazard function and to perform comparative study among group variables such as industry and technology rating level. In particular, Cox proportional hazards model is recognized that it is useful to understand which technology rating items are meaningful to company's survival and how much they affect it. It is considered that these results will provide valuable knowledge for practitioners to find and manage the significant items for survival of the guaranteed companies through future technology rating.

  • PDF

Empirical Bayesian Prediction Analysis on Accelerated Lifetime Data (가속수명자료를 이용한 경험적 베이즈 예측분석)

  • Cho, Geon-Ho
    • Journal of the Korean Data and Information Science Society
    • /
    • v.8 no.1
    • /
    • pp.21-30
    • /
    • 1997
  • In accelerated life tests, the failure time of an item is observed under a high stress level, and based on the time the performances of items are investigated at the normal stress level. In this paper, when the mean of the prior of a failure rate is known in the exponential lifetime distribution with censored accelerated failure time data, we utilize the empirical Bayesian method by using the moment estimators in order to estimate the parameters of the prior distribution and obtain the empirical Bayesian predictive density and predictive intervals for a future observation under the normal stress level.

  • PDF

Smoothing Kaplan-Meier estimate using monotone support vector regression (단조 서포트벡터기계를 이용한 카플란-마이어 생존함수의 평활)

  • Hwang, Changha;Shim, Jooyong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.6
    • /
    • pp.1045-1054
    • /
    • 2012
  • Support vector machine is known to be the very useful statistical method in classification and nonlinear function estimation. In this paper we propose a monotone support vector regression (SVR) for the estimation of monotonically decreasing function. The proposed monotone SVR is applied to smooth the Kaplan-Meier estimate of survival function. Experimental results are then presented which indicate the performance of the proposed monotone SVR using survival functions obtained by exponential distribution.

A Study of the Small Sample Warranty Data Analysis Using the Bayesian Approach (베이지안 기법을 이용한 소표본 보증데이터 분석 방법 연구)

  • Kim, Jong-Gurl;Sung, Ki-Woo;Song, Jung-Moo
    • Proceedings of the Safety Management and Science Conference
    • /
    • 2013.04a
    • /
    • pp.517-531
    • /
    • 2013
  • 보증 데이터를 통해 제품의 수명 및 형상모수를 추정할 때 최우추정법과 같은 전통적인 통계 분석방법(Classical Statistical Method)을 많이 사용하였다. 그러나 전통적인 통계 분석방법을 통해 수명과 형상모수의 추정 시 표본의 크기가 작거나 불완전한 경우 추정량의 신뢰성이 떨어진다는 단점이 있고 또 누적된 경험과 과거자료를 충분히 이용하지 못하는 단점도 있다. 이러한 문제점을 해결하기 위해 모수의 사전분포를 가정하는 베이지안(Bayesian) 기법의 적용이 필요하다. 하지만 보증 데이터분석에 있어서 베이지안 기법을 이용한 연구는 아직 미흡한 실정이다. 본 연구에서는 수명분포가 와이블 분포를 갖는 보증데이터를 활용하여 모수 추정의 효율성을 비교 분석하고자 한다. 이를 위해 와이블 분포의 모수가 대수정규분포를 따르는 사전분포를 갖는 베이지안 기법과 전통적 통계기법인 생명표법(Actuarial method)을 활용하여 추정량을 도출하고 비교 분석하였다. 이를 통해 충분한 관측 데이터를 확보할 수 없는 경우에 베이지안 기법을 이용한 보증 데이터 분석방법의 성능을 확인하고자 한다.

  • PDF

The determinants of the youth employment rate using panel tobit model (패널 토빗모형을 이용한 청년채용비율 결정요인 분석)

  • Park, Sungik;Ryu, Jangsoo;Kim, Jonghan;Cho, Jangsik
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.4
    • /
    • pp.853-862
    • /
    • 2017
  • In this study, we analyse the determinants of the youth employment rate of public agencies and local public enterprises. On the other hand the youth employment rate contains information of the youth employment rate and the size of the youth employment. We use pooled tobit model and panel tobit model since dependent variable is a censored form observed only in a certain area. The results of the analysis are summarized as follows. First, the panel tobit model is more statistically significant as compared to the combined tobit model. Second, the youth employment rate is more statistically significantly higher in 2014 and 2015 than in 2011. Third, the youth employment rate in public enterprises is more statistically significantly higher than that in local public agencies. Finally, the higher the average wage is, the lower the youth employment ratio is.

Intrinsic Bayes Factors for Exponential Model Comparison with Censored Data

  • Kim, Dal-Ho;Kang, Sang-Gil;Kim, Seong W.
    • Journal of the Korean Statistical Society
    • /
    • v.29 no.1
    • /
    • pp.123-135
    • /
    • 2000
  • This paper addresses the Bayesian hypotheses testing for the comparison of exponential population under type II censoring. In Bayesian testing problem, conventional Bayes factors can not typically accommodate the use of noninformative priors which are improper and are defined only up to arbitrary constants. To overcome such problem, we use the recently proposed hypotheses testing criterion called the intrinsic Bayes factor. We derive the arithmetic, expected and median intrinsic Bayes factors for our problem. The Monte Carlo simulation is used for calculating intrinsic Bayes factors which are compared with P-values of the classical test.

  • PDF

The Step Stress Life Testing for the Parallel System with Censored Data (절단된 자료가 있는 병렬형 시스템의 단계적 충격수명검사)

  • Park, Hee-Chang;Lee, Suk-Hoon
    • Journal of Korean Society for Quality Management
    • /
    • v.23 no.1
    • /
    • pp.15-28
    • /
    • 1995
  • We consider a step-stress life testing which is devised for a two-component parallel system with considerably long life time. To describe such a system, we use an exponential distribution as the survival function. The lift distribution is assumed between the log mean life time and the stress with the cumulative exposure model. The criterion for optimality is to minimize the sum of the variances of the maximum likelihood estimators of the mean life times of each part under the normal stress.

  • PDF