• Title/Summary/Keyword: Akaike Information Criterion

Search Result 114, Processing Time 0.025 seconds

Examining Impact of Weather Factors on Apple Yield (사과생산량에 영향을 미치는 기상요인 분석)

  • Kim, Mi Ri;Kim, Seung Gyu
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.16 no.4
    • /
    • pp.274-284
    • /
    • 2014
  • Crops and varieties are mostly affected by temperature, the amount of precipitation, and duration of sunshine. This study aims to identify the weather factors that directly influence to apple yield among the series of daily measured weather variables during growing seasons. In order to identify them, 1) a priori natural scientific knowledge with respect to the growth stage of apples and 2) pure statistical approaches to minimize bias due to the subject selection of variables are considered. Each result estimated by the Panel regression using fixed/random effect models is evaluated through suitability (i.e., Akaike information criterion and Bayesian information criterion) and predictability (i.e., mean absolute error, root mean square error, mean absolute percentage). The Panel data of apple yield and weather factors are collected from fifteen major producing areas of apples from 2006 to 2013 in Korea for the case study. The result shows that variable selection using factor analysis, which is one of the statistical approaches applied in the analysis, increases predictability and suitability most. It may imply that all the weather factors are important to predict apple yield if statistical problems, such as multicollinearity and lower degree of freedom due to too many explanatory variables used in the regression, can be controlled effectively. This may be because whole growth stages, such as germination, florescence, fruit setting, fatting, ripening, coloring, and harvesting, are affected by weather.

Nonlinear mixed models for characterization of growth trajectory of New Zealand rabbits raised in tropical climate

  • de Sousa, Vanusa Castro;Biagiotti, Daniel;Sarmento, Jose Lindenberg Rocha;Sena, Luciano Silva;Barroso, Priscila Alves;Barjud, Sued Felipe Lacerda;de Sousa Almeida, Marisa Karen;da Silva Santos, Natanael Pereira
    • Animal Bioscience
    • /
    • v.35 no.5
    • /
    • pp.648-658
    • /
    • 2022
  • Objective: The identification of nonlinear mixed models that describe the growth trajectory of New Zealand rabbits was performed based on weight records and carcass measures obtained using ultrasonography. Methods: Phenotypic records of body weight (BW) and loin eye area (LEA) were collected from 66 animals raised in a didactic-productive module of cuniculture located in the southern Piaui state, Brazil. The following nonlinear models were tested considering fixed parameters: Brody, Gompertz, Logistic, Richards, Meloun 1, modified Michaelis-Menten, Santana, and von Bertalanffy. The coefficient of determination (R2), mean squared error, percentage of convergence of each model (%C), mean absolute deviation of residuals, Akaike information criterion (AIC), and Bayesian information criterion (BIC) were used to determine the best model. The model that best described the growth trajectory for each trait was also used under the context of mixed models, considering two parameters that admit biological interpretation (A and k) with random effects. Results: The von Bertalanffy model was the best fitting model for BW according to the highest value of R2 (0.98) and lowest values of AIC (6,675.30) and BIC (6,691.90). For LEA, the Logistic model was the most appropriate due to the results of R2 (0.52), AIC (783.90), and BIC (798.40) obtained using this model. The absolute growth rates estimated using the von Bertalanffy and Logistic models for BW and LEA were 21.51g/d and 3.16 cm2, respectively. The relative growth rates at the inflection point were 0.028 for BW (von Bertalanffy) and 0.014 for LEA (Logistic). Conclusion: The von Bertalanffy and Logistic models with random effect at the asymptotic weight are recommended for analysis of ponderal and carcass growth trajectories in New Zealand rabbits. The inclusion of random effects in the asymptotic weight and maturity rate improves the quality of fit in comparison to fixed models.

Analysis and probabilistic modeling of wind characteristics of an arch bridge using structural health monitoring data during typhoons

  • Ye, X.W.;Xi, P.S.;Su, Y.H.;Chen, B.
    • Structural Engineering and Mechanics
    • /
    • v.63 no.6
    • /
    • pp.809-824
    • /
    • 2017
  • The accurate evaluation of wind characteristics and wind-induced structural responses during a typhoon is of significant importance for bridge design and safety assessment. This paper presents an expectation maximization (EM) algorithm-based angular-linear approach for probabilistic modeling of field-measured wind characteristics. The proposed method has been applied to model the wind speed and direction data during typhoons recorded by the structural health monitoring (SHM) system instrumented on the arch Jiubao Bridge located in Hangzhou, China. In the summer of 2015, three typhoons, i.e., Typhoon Chan-hom, Typhoon Soudelor and Typhoon Goni, made landfall in the east of China and then struck the Jiubao Bridge. By analyzing the wind monitoring data such as the wind speed and direction measured by three anemometers during typhoons, the wind characteristics during typhoons are derived, including the average wind speed and direction, turbulence intensity, gust factor, turbulence integral scale, and power spectral density (PSD). An EM algorithm-based angular-linear modeling approach is proposed for modeling the joint distribution of the wind speed and direction. For the marginal distribution of the wind speed, the finite mixture of two-parameter Weibull distribution is employed, and the finite mixture of von Mises distribution is used to represent the wind direction. The parameters of each distribution model are estimated by use of the EM algorithm, and the optimal model is determined by the values of $R^2$ statistic and the Akaike's information criterion (AIC). The results indicate that the stochastic properties of the wind field around the bridge site during typhoons are effectively characterized by the proposed EM algorithm-based angular-linear modeling approach. The formulated joint distribution of the wind speed and direction can serve as a solid foundation for the purpose of accurately evaluating the typhoon-induced fatigue damage of long-span bridges.

Simulation Study on Model Selection Based on AIC under Unbalanced Design in Linear Mixed Effect Models (불균형 자료에서 AIC를 이용한 선형혼합모형 선택법의 효율에 대한 모의실험 연구)

  • Lee, Yong-Hee
    • The Korean Journal of Applied Statistics
    • /
    • v.23 no.6
    • /
    • pp.1169-1178
    • /
    • 2010
  • This article consider a performance model selection based on AIC under unbalanced deign in linear mixed effect models. Vaida and Balanchard (2005) proposed conditional AIC for model selection in linear mixed effect models when the prediction of random effects is of primary interest. Theoretical properties of cAIC and related criteria have been investigated by Liang et al. (2008) and Greven and Kneib (2010). However, all of the simulation studies were performed under a balanced design. Even though functional form of AIC remain same even under the unbalanced deign, it is worthwhile to investigate performance of AIC based model selection criteria under the unbalanced design. The simulation study in this article shows how unbalancedness affects model selection in linear mixed effect models.

Multiphasic Analysis of Growth Curve of Body Weight in Mice

  • Kurnianto, E.;Shinjo, A.;Suga, D.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.12 no.3
    • /
    • pp.331-335
    • /
    • 1999
  • The present study describes the analysis of the multiphasic growth function (MGF) to body weight in laboratory and wild mice. Three genetic groups of laboratory mice (Mus musculus domesticus) designated $CF_{{\sharp}1}$, C3H/HeNCrj and C57BL/6NCrj, and a genetic group of Yonakuni wild mice (Mus musculus molossinus yonakuni, Yk) were used. Mean body weights of each genetic group-sex subclass from birth to 69 days of age taken at 3-day intervals were analyzed by a monophasic, diphasic and triphasic functions for describing growth patterns. A comparison among the three functions of the MGF was based on the goodness-of-fit criteria: residual standard deviation (RSD), adjusted R-square (Adj $R^2$) and Akaike's information criterion (AIC). Result of this study indicated that body weight averaged heavier for males than for females. Among the four genetic groups within both sexes, $CF_{{\sharp}1}$ showed the highest, subsequent followed by C3H/HeNCrj, C57BL/6NCrj and Yk. Comparison among the three functions revealed that the triphasic function was the best fit to growth data, with the lowest RSD, the highest Adj $R^2$ and the lowest AIC, for the four genetic groups. For the triphasic function, RSD within each genetic group-sex subclass was similar for males and females. Adj $R^2$ was 0.999 for all genetic group-sex subclasses. AIC for laboratory mice males and females ranged from -70.48 to 66.50 and from -92.81 to -68.64, respectively; whereas for Yk wild mice males was -74.29 and females -78.42.

Differences in Prognostic Factors between Early and Late Recurrence Breast Cancers

  • Payandeh, Mehrdad;Sadeghi, Masoud;Sadeghi, Edris
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.16 no.15
    • /
    • pp.6575-6579
    • /
    • 2015
  • Background: Breast cancer (BC) is the most frequent malignancy among females and is a leading cause of death of middle-aged women. Herein, we evaluated baseline characteristics for BC patients and also compared these variables across ealry and late recurrence groups. Materials and Methods: Between 1995 to 2014, among female breast cancer patients referred to our oncology clinic, eighty-six were entered into our study. All had distant metastasis. Early recurrence was defined as initial recurrence within 5 years following curative surgery irrespective of site. Likewise, late recurrence was defined as initial recurrence after 5 years. No recurrence was defined for survivors to a complete minimum of 10 years follow-up. Significant prognostic factors associated with early or late recurrence were selected according to the Akaike Information Criterion. Results: The median follow-up was 9 years (range, 1-18 years). During follow-up period, 51 recurrences occurred (distant metastasis), 31 early and 20 late. According to the site of recurrence, there were 51 distant. In this follow-up period, 19 patients died. Compared with the early recurrence group, the no recurrence group had lower lymph node involvement and more p53 positive lesions but the late recurrence group had lower tumor size. In comparison to no recurrence, p53 (odds ratio [OR] 6.94, 95% CI 1.49-32.16) was a significant prognostic factor for early recurrence within 5 years. Conclusions: Tumor size, p53 and LN metastasis are the most important risk factors for distance recurrence especially in early recurrence and also between of them, p53 is significant prognostic factor for early recurrence.

Application of Cox and Parametric Survival Models to Assess Social Determinants of Health Affecting Three-Year Survival of Breast Cancer Patients

  • Mohseny, Maryam;Amanpour, Farzaneh;Mosavi-Jarrahi, Alireza;Jafari, Hossein;Moradi-Joo, Mohammad;Monfared, Esmat Davoudi
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.17 no.sup3
    • /
    • pp.311-316
    • /
    • 2016
  • Breast cancer is one of the most common causes of cancer mortality in Iran. Social determinants of health are among the key factors affecting the pathogenesis of diseases. This cross-sectional study aimed to determine the social determinants of breast cancer survival time with parametric and semi-parametric regression models. It was conducted on male and female patients diagnosed with breast cancer presenting to the Cancer Research Center of Shohada-E-Tajrish Hospital from 2006 to 2010. The Cox proportional hazard model and parametric models including the Weibull, log normal and log-logistic models were applied to determine the social determinants of survival time of breast cancer patients. The Akaike information criterion (AIC) was used to assess the best fit. Statistical analysis was performed with STATA (version 11) software. This study was performed on 797 breast cancer patients, aged 25-93 years with a mean age of 54.7 (${\pm}11.9$) years. In both semi-parametric and parametric models, the three-year survival was related to level of education and municipal district of residence (P<0.05). The AIC suggested that log normal distribution was the best fit for the three-year survival time of breast cancer patients. Social determinants of health such as level of education and municipal district of residence affect the survival of breast cancer cases. Future studies must focus on the effect of childhood social class on the survival times of cancers, which have hitherto only been paid limited attention.

Applying Conventional and Saturated Generalized Gamma Distributions in Parametric Survival Analysis of Breast Cancer

  • Yavari, Parvin;Abadi, Alireza;Amanpour, Farzaneh;Bajdik, Chris
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.13 no.5
    • /
    • pp.1829-1831
    • /
    • 2012
  • Background: The generalized gamma distribution statistics constitute an extensive family that contains nearly all of the most commonly used distributions including the exponential, Weibull and log normal. A saturated version of the model allows covariates having effects through all the parameters of survival time distribution. Accelerated failure-time models assume that only one parameter of the distribution depends on the covariates. Methods: We fitted both the conventional GG model and the saturated form for each of its members including the Weibull and lognormal distribution; and compared them using likelihood ratios. To compare the selected parameter distribution with log logistic distribution which is a famous distribution in survival analysis that is not included in generalized gamma family, we used the Akaike information criterion (AIC; r=l(b)-2p). All models were fitted using data for 369 women age 50 years or more, diagnosed with stage IV breast cancer in BC during 1990-1999 and followed to 2010. Results: In both conventional and saturated parametric models, the lognormal was the best candidate among the GG family members; also, the lognormal fitted better than log-logistic distribution. By the conventional GG model, the variables "surgery", "radiotherapy", "hormone therapy", "erposneg" and interaction between "hormone therapy" and "erposneg" are significant. In the AFT model, we estimated the relative time for these variables. By the saturated GG model, similar significant variables are selected. Estimating the relative times in different percentiles of extended model illustrate the pattern in which the relative survival time change during the time. Conclusions: The advantage of using the generalized gamma distribution is that it facilitates estimating a model with improved fit over the standard Weibull or lognormal distributions. Alternatively, the generalized F family of distributions might be considered, of which the generalized gamma distribution is a member and also includes the commonly used log-logistic distribution.

Evaluation of goodness of fit of semiparametric and parametric models in analysis of factors associated with length of stay in neonatal intensive care unit

  • Kheiry, Fatemeh;Kargarian-Marvasti, Sadegh;Afrashteh, Sima;Mohammadbeigi, Abolfazl;Daneshi, Nima;Naderi, Salma;Saadat, Seyed Hossein
    • Clinical and Experimental Pediatrics
    • /
    • v.63 no.9
    • /
    • pp.361-367
    • /
    • 2020
  • Background: Length of stay is a significant indicator of care effectiveness and hospital performance. Owing to the limited number of healthcare centers and facilities, it is important to optimize length of stay and associated factors. Purpose: The present study aimed to investigate factors associated with neonatal length of stay in the neonatal intensive care unit (NICU) using parametric and semiparametric models and compare model fitness according to Akaike information criterion (AIC) between 2016 and 2018. Methods: This retrospective cohort study reviewed 600 medical records of infants admitted to the NICU of Bandar Abbas Hospital. Samples were identified using census sampling. Factors associated with NICU length of stay were investigated based on semiparametric Cox model and 4 parametric models including Weibull, exponential, log-logistic, and log-normal to determine the best fitted model. The data analysis was conducted using R software. The significance level was set at 0.05. Results: The study findings suggest that breastfeeding, phototherapy, acute renal failure, presence of mechanical ventilation, and availability of central venous catheter were commonly identified as factors associated with NICU length of stay in all 5 models (P<0.05). Parametric models showed better fitness than the Cox model in this study. Conclusion: Breastfeeding and availability of central venous catheter had protective effects against length of stay, whereas phototherapy, acute renal failure, and mechanical ventilation increased length of stay in NICU. Therefore, the identification of factors associated with NICU length of stay can help establish effective interventions aimed at decreasing the length of stay among infants.

Size selectivity of the gill net for spinyhead sculpin, Dasycottus setiger in the eastern coastal waters of Korea (동해안 자망에 대한 고무꺽정이 (Dasycottus setiger )의 망목 선택성)

  • PARK, Chang-Doo;BAE, Jae-Hyun;CHO, Sam-Kwang;AN, Heui-Chun;KIM, In-Ok
    • Journal of the Korean Society of Fisheries and Ocean Technology
    • /
    • v.52 no.4
    • /
    • pp.281-289
    • /
    • 2016
  • Spinyhead sculpin Dasycottus setiger, a species of cold water fish, is distributed along the eastern coastal waters of Korea. A series of fishing experiments was carried out in the waters near Uljin from June, 2002 to November, 2004, using the experimental monofilament gill nets of different mesh sizes (82.2, 89.4, 104.8, and 120.2 mm) to describe the selectivity of the gill net for the fish. The SELECT (Share Each Length's Catch Total) analysis with maximum likelihood method was applied to fit the different functional models (normal, lognormal, and bi-normal models) for selection curves to the catch data. The bi-normal model with the fixed relative fishing intensity was selected as the best-fit selection curve by AIC (Akaike's Information Criterion) comparison. For the best-fit selection curve, the optimum relative length (the ratio of fish total length to mesh size) with the maximum efficiency and the selection range ($R_{50%,large}-R_{50%,small}$) of 50% retention were obtained as 2.363 and 0.851, respectively. The ratios of body girth to mesh perimeter at 100% retention where the selection curve of each mesh size represented the optimum total length were calculated as the range of 0.86 ~ 0.87.