• Title/Summary/Keyword: goodness of fit criteria

Search Result 53, Processing Time 0.023 seconds

Modelling Count Responses with Overdispersion

  • Jeong, Kwang Mo
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.6
    • /
    • pp.761-770
    • /
    • 2012
  • We frequently encounter outcomes of count that have extra variation. This paper considers several alternative models for overdispersed count responses such as a quasi-Poisson model, zero-inflated Poisson model and a negative binomial model with a special focus on a generalized linear mixed model. We also explain various goodness-of-fit criteria by discussing their appropriateness of applicability and cautions on misuses according to the patterns of response categories. The overdispersion models for counts data have been explained through two examples with different response patterns.

Statistical Applications for the Prediction of White Hispanic Breast Cancer Survival

  • Khan, Hafiz Mohammad Rafiqullah;Saxena, Anshul;Gabbidon, Kemesha;Ross, Elizabeth;Shrestha, Alice
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.15 no.14
    • /
    • pp.5571-5575
    • /
    • 2014
  • Background: The ability to predict the survival time of breast cancer patients is important because of the potential high morbidity and mortality associated with the disease. To develop a predictive inference for determining the survival of breast cancer patients, we applied a novel Bayesian method. In this paper, we propose the development of a databased statistical probability model and application of the Bayesian method to predict future survival times for White Hispanic female breast cancer patients, diagnosed in the US during 1973-2009. Materials and Methods: A stratified random sample of White Hispanic female patient survival data was selected from the Surveillance Epidemiology and End Results (SEER) database to derive statistical probability models. Four were considered to identify the best-fit model. We used three standard model-building criteria, which included Akaike Information Criteria (AIC), Bayesian Information Criteria (BIC), and Deviance Information Criteria (DIC) to measure the goodness of fit. Furthermore, the Bayesian method was used to derive future survival inferences for survival times. Results: The highest number of White Hispanic female breast cancer patients in this sample was from New Mexico and the lowest from Hawaii. The mean (SD) age at diagnosis (years) was 58.2 (14.2). The mean (SD) of survival time (months) for White Hispanic females was 72.7 (32.2). We found that the exponentiated Weibull model best fit the survival times compared to other widely known statistical probability models. The predictive inference for future survival times is presented using the Bayesian method. Conclusions: The findings are significant for treatment planning and health-care cost allocation. They should also contribute to further research on breast cancer survival issues.

Application of Jackknife Method for Determination of Representative Probability Distribution of Annual Maximum Rainfall (연최대강우량의 대표확률분포형 결정을 위한 Jackknife기법의 적용)

  • Lee, Jae-Joon;Lee, Sang-Won;Kwak, Chang-Jae
    • Journal of Korea Water Resources Association
    • /
    • v.42 no.10
    • /
    • pp.857-866
    • /
    • 2009
  • In this study, basic data is consisted annual maximum rainfall at 56 stations that has the rainfall records more than 30years in Korea. The 14 probability distributions which has been widely used in hydrologic frequency analysis are applied to the basic data. The method of moments, method of maximum likelihood and probability weighted moments method are used to estimate the parameters. And 4-tests (chi-square test, Kolmogorov-Smirnov test, Cramer von Mises test, probability plot correlation coefficient (PPCC) test) are used to determine the goodness of fit of probability distributions. This study emphasizes the necessity for considering the variability of the estimate of T-year event in hydrologic frequency analysis and proposes a framework for evaluating probability distribution models. The variability (or estimation error) of T-year event is used as a criterion for model evaluation as well as three goodness of fit criteria (SLSC, MLL, and AIC) in the framework. The Jackknife method plays a important role in estimating the variability. For the annual maxima of rainfall at 56 stations, the Gumble distribution is regarded as the best one among probability distribution models with two or three parameters.

The Structural Relationship between Surfer's Experience, Lovemark, Flow and Customer Behavior Intention (서퍼의 체험과 러브마크, 몰입, 소비행동의도 간의 관계)

  • Ryu, Jean-Seung;Kim, Sangyoo;Kim, Soo-Hyun
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.5
    • /
    • pp.129-136
    • /
    • 2022
  • The purpose of this study was to investigate the structural relationship between surfer's experience, lovemark, flow and customer behavioral intention. The model's goodness-of-fit test was conducted to analyze the mediating effect of experience, lovemark, flow and consumer behavioral intention. The result is as follows. First, As a result of the conformity test of the research model, all the fit indexes met the criteria for goodness of fit to predict the causality of experience, lovemark, flow, and customer behavioral intention. Second, experience factors positively effected the lovemark. Experience factors had a positive effect on the flow. Flow had a positive effect on customer behavioral intention. Lovemarks had a positive effect on customer behavioral intention. Experience factors did not have a positive effect on customer behavioral intention.

The analysis of validity and Development of Evaluation Criteria for Cyber Teachers' Satisfaction Factors (사이버교사의 만족도 요인 평가 준거 개발 및 타당도 분석)

  • Kim, Ja-Mee;Kim, Yong;Kim, Jung-Hoon
    • Journal of The Korean Association of Information Education
    • /
    • v.13 no.1
    • /
    • pp.31-40
    • /
    • 2009
  • In the Cyber Home Learning Service (CHLS) for primary and secondary level students, Cyber Teachers are in charge of providing learning assistance and guidance to students through various activities. Thus, improving the Cyber Teachers' satisfaction level could contribute to increased effectiveness of the CHLS. In this study, we developed the evaluation criteria to assess the satisfaction level of teachers participating in the CHLS, and analyzed factors influencing the satisfaction level. We verified the content validity of the evaluation criteria. We also analyzed the evaluation criteria model by conducting an item characteristic analysis and a goodness of fit test using the SPSS and AMOS. Finally, we developed 20 items from five factors based on the analysis results.

  • PDF

Mediating Effects of Basic Psychological Needs in Parent-Child Relationships between Perceived Parental Attachment and the Life Satisfaction of College Students (대학생이 지각한 부모애착과 삶의 만족의 관계에서 부모-자녀관계 기본심리욕구의 매개효과)

  • YU, Shin-Bok
    • Journal of Fisheries and Marine Sciences Education
    • /
    • v.29 no.2
    • /
    • pp.466-478
    • /
    • 2017
  • The purpose of this study was to examine the relationships between perceived parental attachment and the life satisfaction of college students, focusing on the mediating role of psychological basic needs(autonomy, competence and relatedness). The Participants of this study were 208 college students. The result was statistically treated using SPSS 21.0 program, Amos 21.0. Additionally, PROCESS Macro was used to verify the significant mediating effect. Results from structural equation modeling analyses indicated that a research model produced a better fit to the data than a alternative structural model. The final SEM model fit indices of $x^2$(df), CFI, TLI, RMSEA were met the acceptable criteria of model fitness. In other words, among the goodness-of-fit indexes of the final study model, $x^2=261.075$(p<.001), RMSEA is .082, TLI equals .925, CFI equals .940. The results showed the following: First, Parental attachment has a direct effect on autonomy, competence and relatedness. Also competence and relatedness have a direct effect on the life satisfaction. Second, Competence and relatedness showed a mediating effects on Parental attachment and the life satisfaction. The implications of these results were discussed and the further studies were suggested.

Multivariate design estimations under copulas constructions. Stage-1: Parametrical density constructions for defining flood marginals for the Kelantan River basin, Malaysia

  • Latif, Shahid;Mustafa, Firuza
    • Ocean Systems Engineering
    • /
    • v.9 no.3
    • /
    • pp.287-328
    • /
    • 2019
  • Comprehensive understanding of the flood risk assessments via frequency analysis often demands multivariate designs under the different notations of return periods. Flood is a tri-variate random consequence, which often pointing the unreliability of univariate return period and demands for the joint dependency construction by accounting its multiple intercorrelated flood vectors i.e., flood peak, volume & durations. Selecting the most parsimonious probability functions for demonstrating univariate flood marginals distributions is often a mandatory pre-processing desire before the establishment of joint dependency. Especially under copulas methodology, which often allows the practitioner to model univariate marginals separately from their joint constructions. Parametric density approximations often hypothesized that the random samples must follow some specific or predefine probability density functions, which usually defines different estimates especially in the tail of distributions. Concentrations of the upper tail often seem interesting during flood modelling also, no evidence exhibited in favours of any fixed distributions, which often characterized through the trial and error procedure based on goodness-of-fit measures. On another side, model performance evaluations and selections of best-fitted distributions often demand precise investigations via comparing the relative sample reproducing capabilities otherwise, inconsistencies might reveal uncertainty. Also, the strength & weakness of different fitness statistics usually vary and having different extent during demonstrating gaps and dispensary among fitted distributions. In this literature, selections efforts of marginal distributions of flood variables are incorporated by employing an interactive set of parametric functions for event-based (or Block annual maxima) samples over the 50-years continuously-distributed streamflow characteristics for the Kelantan River basin at Gulliemard Bridge, Malaysia. Model fitness criteria are examined based on the degree of agreements between cumulative empirical and theoretical probabilities. Both the analytical as well as graphically visual inspections are undertaken to strengthen much decisive evidence in favour of best-fitted probability density.

Models for Estimating Genetic Parameters of Milk Production Traits Using Random Regression Models in Korean Holstein Cattle

  • Cho, C.I.;Alam, M.;Choi, T.J.;Choy, Y.H.;Choi, J.G.;Lee, S.S.;Cho, K.H.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.29 no.5
    • /
    • pp.607-614
    • /
    • 2016
  • The objectives of the study were to estimate genetic parameters for milk production traits of Holstein cattle using random regression models (RRMs), and to compare the goodness of fit of various RRMs with homogeneous and heterogeneous residual variances. A total of 126,980 test-day milk production records of the first parity Holstein cows between 2007 and 2014 from the Dairy Cattle Improvement Center of National Agricultural Cooperative Federation in South Korea were used. These records included milk yield (MILK), fat yield (FAT), protein yield (PROT), and solids-not-fat yield (SNF). The statistical models included random effects of genetic and permanent environments using Legendre polynomials (LP) of the third to fifth order (L3-L5), fixed effects of herd-test day, year-season at calving, and a fixed regression for the test-day record (third to fifth order). The residual variances in the models were either homogeneous (HOM) or heterogeneous (15 classes, HET15; 60 classes, HET60). A total of nine models (3 orders of $polynomials{\times}3$ types of residual variance) including L3-HOM, L3-HET15, L3-HET60, L4-HOM, L4-HET15, L4-HET60, L5-HOM, L5-HET15, and L5-HET60 were compared using Akaike information criteria (AIC) and/or Schwarz Bayesian information criteria (BIC) statistics to identify the model(s) of best fit for their respective traits. The lowest BIC value was observed for the models L5-HET15 (MILK; PROT; SNF) and L4-HET15 (FAT), which fit the best. In general, the BIC values of HET15 models for a particular polynomial order was lower than that of the HET60 model in most cases. This implies that the orders of LP and types of residual variances affect the goodness of models. Also, the heterogeneity of residual variances should be considered for the test-day analysis. The heritability estimates of from the best fitted models ranged from 0.08 to 0.15 for MILK, 0.06 to 0.14 for FAT, 0.08 to 0.12 for PROT, and 0.07 to 0.13 for SNF according to days in milk of first lactation. Genetic variances for studied traits tended to decrease during the earlier stages of lactation, which were followed by increases in the middle and decreases further at the end of lactation. With regards to the fitness of the models and the differential genetic parameters across the lactation stages, we could estimate genetic parameters more accurately from RRMs than from lactation models. Therefore, we suggest using RRMs in place of lactation models to make national dairy cattle genetic evaluations for milk production traits in Korea.

Survival Analysis for White Non-Hispanic Female Breast Cancer Patients

  • Khan, Hafiz Mohammad Rafiqullah;Saxena, Anshul;Gabbidon, Kemesha;Stewart, Tiffanie Shauna-Jeanne;Bhatt, Chintan
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.15 no.9
    • /
    • pp.4049-4054
    • /
    • 2014
  • Background: Race and ethnicity are significant factors in predicting survival time of breast cancer patients. In this study, we applied advanced statistical methods to predict the survival of White non-Hispanic female breast cancer patients, who were diagnosed between the years 1973 and 2009 in the United States (U.S.). Materials and Methods: Demographic data from the Surveillance Epidemiology and End Results (SEER) database were used for the purpose of this study. Nine states were randomly selected from 12 U.S. cancer registries. A stratified random sampling method was used to select 2,000 female breast cancer patients from these nine states. We compared four types of advanced statistical probability models to identify the best-fit model for the White non-Hispanic female breast cancer survival data. Three model building criterion were used to measure and compare goodness of fit of the models. These include Akaike Information Criteria (AIC), Bayesian Information Criteria (BIC), and Deviance Information Criteria (DIC). In addition, we used a novel Bayesian method and the Markov Chain Monte Carlo technique to determine the posterior density function of the parameters. After evaluating the model parameters, we selected the model having the lowest DIC value. Using this Bayesian method, we derived the predictive survival density for future survival time and its related inferences. Results: The analytical sample of White non-Hispanic women included 2,000 breast cancer cases from the SEER database (1973-2009). The majority of cases were married (55.2%), the mean age of diagnosis was 63.61 years (SD = 14.24) and the mean survival time was 84 months (SD = 35.01). After comparing the four statistical models, results suggested that the exponentiated Weibull model (DIC= 19818.220) was a better fit for White non-Hispanic females' breast cancer survival data. This model predicted the survival times (in months) for White non-Hispanic women after implementation of precise estimates of the model parameters. Conclusions: By using modern model building criteria, we determined that the data best fit the exponentiated Weibull model. We incorporated precise estimates of the parameter into the predictive model and evaluated the survival inference for the White non-Hispanic female population. This method of analysis will assist researchers in making scientific and clinical conclusions when assessing survival time of breast cancer patients.

Measurement Instruments for Superior Product Development: A Case Study of Deli Serdang Cassava in Indonesia

  • P, Remus Hasiholan;TARMIZI, Hasan Basri;RAHMANTA, Rahmanta;PURWOKO, Agus
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.8 no.5
    • /
    • pp.1139-1145
    • /
    • 2021
  • This study is aimed at developing a measurement instrument for the superior product development program of Deli Serdang cassava, Indonesia. This research population is the target population of Deli Serdang Micro, Small and Medium-sized Enterprises (MSMEs) which produces cassava. The sample was randomly selected and consisted of 300 MSMEs. The study method is research and development with confirmatory factor analysis using Amos software. The data collection technique was a questionnaire. Study results used the maximum likelihood method which showed that the validity and reliability instruments met the ideal loading factor value > 0.5 and a significance value of p (0.000). The model built also meets the fit criteria based on the Goodness of Fit Model Standard. All instruments are presented to build and measure the superior cassava product development program by Deli Serdang MSMEs. This superior product development program comprises (1) economic contribution (with a loading factor value of 0.76) (2) social aspects (with a loading factor value of 0.76) (3) cultural aspects (with a loading factor value of 0.99) and (4) institutional (with a loading factor value of 0.87). This result means that all instruments have proven construct validity.