• Title/Summary/Keyword: statistical probability models

Search Result 213, Processing Time 0.022 seconds

Does Breast Cancer Drive the Building of Survival Probability Models among States? An Assessment of Goodness of Fit for Patient Data from SEER Registries

  • Khan, Hafiz;Saxena, Anshul;Perisetti, Abhilash;Rafiq, Aamrin;Gabbidon, Kemesha;Mende, Sarah;Lyuksyutova, Maria;Quesada, Kandi;Blakely, Summre;Torres, Tiffany;Afesse, Mahlet
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.17 no.12
    • /
    • pp.5287-5294
    • /
    • 2016
  • Background: Breast cancer is a worldwide public health concern and is the most prevalent type of cancer in women in the United States. This study concerned the best fit of statistical probability models on the basis of survival times for nine state cancer registries: California, Connecticut, Georgia, Hawaii, Iowa, Michigan, New Mexico, Utah, and Washington. Materials and Methods: A probability random sampling method was applied to select and extract records of 2,000 breast cancer patients from the Surveillance Epidemiology and End Results (SEER) database for each of the nine state cancer registries used in this study. EasyFit software was utilized to identify the best probability models by using goodness of fit tests, and to estimate parameters for various statistical probability distributions that fit survival data. Results: Statistical analysis for the summary of statistics is reported for each of the states for the years 1973 to 2012. Kolmogorov-Smirnov, Anderson-Darling, and Chi-squared goodness of fit test values were used for survival data, the highest values of goodness of fit statistics being considered indicative of the best fit survival model for each state. Conclusions: It was found that California, Connecticut, Georgia, Iowa, New Mexico, and Washington followed the Burr probability distribution, while the Dagum probability distribution gave the best fit for Michigan and Utah, and Hawaii followed the Gamma probability distribution. These findings highlight differences between states through selected sociodemographic variables and also demonstrate probability modeling differences in breast cancer survival times. The results of this study can be used to guide healthcare providers and researchers for further investigations into social and environmental factors in order to reduce the occurrence of and mortality due to breast cancer.

Comparison of Three Binomial-related Models in the Estimation of Correlations

  • Moon, Myung-Sang
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.2
    • /
    • pp.585-594
    • /
    • 2003
  • It has been generally recognized that conventional binomial or Poisson model provides poor fits to the actual correlated binary data due to the extra-binomial variation. A number of generalized statistical models have been proposed to account for this additional variation. Among them, beta-binomial, correlated-binomial, and modified-binomial models are binomial-related models which are frequently used in modeling the sum of n correlated binary data. In many situations, it is reasonable to assume that n correlated binary data are exchangeable, which is a special case of correlated binary data. The sum of n exchangeable correlated binary data is modeled relatively well when the above three binomial-related models are applied. But the estimation results of correlation coefficient turn to be quite different. Hence, it is important to identify which model provides better estimates of model parameters(success probability, correlation coefficient). For this purpose, a small-scale simulation study is performed to compare the behavior of above three models.

Development of Integrated Model for Accelerated Life Test Using Linkage Parameter (연계모수를 이용한 가속수명시험 통합모형의 개발)

  • Choi, Sung-Woon
    • Journal of the Korea Safety Management & Science
    • /
    • v.9 no.5
    • /
    • pp.43-48
    • /
    • 2007
  • This paper is to present linkage parameter to integrate statistical models and physical models for accelerated life test. Statistical models represent the relationship of probability distribution and life. Physical models show the relationship of life and stress. Moreover, this study proposes the four steps for construction of integrated models for accelerated life test using linkage parameter. Finally, this paper develops new integrated models such as extreme value distribution-general Eyring, linearly increasing failure rate function-general Eyring, etc., and estimates various reliability measures.

Test for Distribution Change of Dependent Errors (종속 오차에 대한 분포 변화 검정법)

  • Na, Seong-Ryong
    • Communications for Statistical Applications and Methods
    • /
    • v.16 no.4
    • /
    • pp.587-594
    • /
    • 2009
  • In this paper the change point problem of the error terms in linear regression models is considered. Since fixed or stochastic independent variables and weakly dependent errors are assumed, usual multiple regression models and time series models including ARMA are covered. We use the estimates of probability density function based on residuals in order to test the distribution change of the unobserved errors. Under some mild conditions, the test using the residuals is proved to have the same limiting distribution as the test based on true errors.

Statistical Applications for the Prediction of White Hispanic Breast Cancer Survival

  • Khan, Hafiz Mohammad Rafiqullah;Saxena, Anshul;Gabbidon, Kemesha;Ross, Elizabeth;Shrestha, Alice
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.15 no.14
    • /
    • pp.5571-5575
    • /
    • 2014
  • Background: The ability to predict the survival time of breast cancer patients is important because of the potential high morbidity and mortality associated with the disease. To develop a predictive inference for determining the survival of breast cancer patients, we applied a novel Bayesian method. In this paper, we propose the development of a databased statistical probability model and application of the Bayesian method to predict future survival times for White Hispanic female breast cancer patients, diagnosed in the US during 1973-2009. Materials and Methods: A stratified random sample of White Hispanic female patient survival data was selected from the Surveillance Epidemiology and End Results (SEER) database to derive statistical probability models. Four were considered to identify the best-fit model. We used three standard model-building criteria, which included Akaike Information Criteria (AIC), Bayesian Information Criteria (BIC), and Deviance Information Criteria (DIC) to measure the goodness of fit. Furthermore, the Bayesian method was used to derive future survival inferences for survival times. Results: The highest number of White Hispanic female breast cancer patients in this sample was from New Mexico and the lowest from Hawaii. The mean (SD) age at diagnosis (years) was 58.2 (14.2). The mean (SD) of survival time (months) for White Hispanic females was 72.7 (32.2). We found that the exponentiated Weibull model best fit the survival times compared to other widely known statistical probability models. The predictive inference for future survival times is presented using the Bayesian method. Conclusions: The findings are significant for treatment planning and health-care cost allocation. They should also contribute to further research on breast cancer survival issues.

Prediction of Extreme Sloshing Pressure Using Different Statistical Models

  • Cetin, Ekin Ceyda;Lee, Jeoungkyu;Kim, Sangyeob;Kim, Yonghwan
    • Journal of Advanced Research in Ocean Engineering
    • /
    • v.4 no.4
    • /
    • pp.185-194
    • /
    • 2018
  • In this study, the extreme sloshing pressure was predicted using various statistical models: three-parameter Weibull distribution, generalized Pareto distribution, generalized extreme value distribution, and three-parameter log-logistic distribution. The estimation of sloshing impact pressure is important in design of liquid cargo tank in severe sea state. In order to get the extreme values of local impact pressures, a lot of model tests have been carried out and statistical analysis has been performed. Three-parameter Weibull distribution and generalized Pareto distribution are widely used as the statistical analysis method in sloshing phenomenon, but generalized extreme value distribution and three-parameter log-logistic distribution are added in this study. Additionally, statistical distributions are fitted to peak pressure data using three different parameter estimation methods. The data were obtained from a three-dimensional sloshing model text conducted at Seoul National University. The loading conditions were 20%, 50%, and 95% of tank height, and the analysis was performed based on the measured impact pressure on four significant panels with large sloshing impacts. These fittings were compared by observing probability of exceedance diagrams and probability plot correlation coefficient test for goodness-of-fit.

A New Form of Nondestructive Strength-Estimating Statistical Models Accounting for Uncertainty of Model and Aging Effect of Concrete

  • Hong, Kee-Jeung;Kim, Jee-Sang
    • Journal of the Korean Society for Nondestructive Testing
    • /
    • v.29 no.3
    • /
    • pp.230-234
    • /
    • 2009
  • As concrete ages, the surrounding environment is expected to have growing influences on the concrete. As all the impacts of the environment cannot be considered in the strength-estimating model of a nondestructive concrete test, the increase in concrete age leads to growing uncertainty in the strength-estimating model. Therefore, the variation of the model error increases. It is necessary to include those impacts in the probability model of concrete strength attained from the nondestructive tests so as to build a more accurate reliability model for structural performance evaluation. This paper reviews and categorizes the existing strength-estimating statistical models of nondestructive concrete test, and suggests a new form of the strength-estimating statistical models to properly reflect the model uncertainty due to aging of the concrete. This new form of the statistical models will lay foundation for more accurate structural performance evaluation.

On the Conditional Tolerance Probability in Time Series Models

  • Lee, Sang-Yeol
    • Journal of the Korean Statistical Society
    • /
    • v.26 no.3
    • /
    • pp.407-416
    • /
    • 1997
  • Suppose that { $X_{i}$ } is a stationary AR(1) process and { $Y_{j}$ } is an ARX process with { $X_{i}$ } as exogeneous variables. Let $Y_{j}$ $^{*}$ be the stochastic process which is the sum of $Y_{j}$ and a nonstochastic trend. In this paper we consider the problem of estimating the conditional probability that $Y_{{n+1}}$$^{*}$ is bigger than $X_{{n+1}}$, given $X_{1}$, $Y_{1}$$^{*}$,..., $X_{n}$ , $Y_{n}$ $^{*}$. As an estimator for the tolerance probability, an Mann-Whitney statistic based on least squares residuars is suggested. It is shown that the deviations between the estimator and true probability are stochatically bounded with $n^{{-1}$2}/ order. The result may be applied to the stress-strength reliability theory when the stress and strength variables violate the classical iid assumption.umption.n.

  • PDF

On the Moving Average Models with Multivariate geometric Distributions

  • Baek, Jong-ill
    • Communications for Statistical Applications and Methods
    • /
    • v.6 no.3
    • /
    • pp.677-686
    • /
    • 1999
  • In this paper we introduce a class of moving-average(MA) sequences of multivariate random vectors with geometric marginals. The theory of positive dependence is used to show that in various cases the class of MA sequences consists of associated random variables. We utilize positive dependence properties to obtain weakly probability inequality of the multivariate processes.

  • PDF

Unbiasedness or Statistical Efficiency: Comparison between One-stage Tobit of MLE and Two-step Tobit of OLS

  • Park, Sun-Young
    • International Journal of Human Ecology
    • /
    • v.4 no.2
    • /
    • pp.77-87
    • /
    • 2003
  • This paper tried to construct statistical and econometric models on the basis of economic theory in order to discuss the issue of statistical efficiency and unbiasedness including the sample selection bias correcting problem. Comparative analytical tool were one stage Tobit of Maximum Likelihood estimation and Heckman's two-step Tobit of Ordinary Least Squares. The results showed that the adequacy of model for the analysis on demand and choice, we believe that there is no big difference in explanatory variables between the first selection model and the second linear probability model. Since the Lambda, the self- selectivity correction factor, in the Type II Tobit is not statistically significant, there is no self-selectivity in the Type II Tobit model, indicating that Type I Tobit model would give us better explanation in the demand for and choice which is less complicated statistical method rather than type II model.