• 제목/요약/키워드: AIC(Akaike Information Criterion)

검색결과 68건 처리시간 0.028초

Application of single-step genomic evaluation using social genetic effect model for growth in pig

  • Hong, Joon Ki;Kim, Young Sin;Cho, Kyu Ho;Lee, Deuk Hwan;Min, Ye Jin;Cho, Eun Seok
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제32권12호
    • /
    • pp.1836-1843
    • /
    • 2019
  • Objective: Social genetic effects (SGE) are an important genetic component for growth, group productivity, and welfare in pigs. The present study was conducted to evaluate i) the feasibility of the single-step genomic best linear unbiased prediction (ssGBLUP) approach with the inclusion of SGE in the model in pigs, and ii) the changes in the contribution of heritable SGE to the phenotypic variance with different scaling ${\omega}$ constants for genomic relationships. Methods: The dataset included performance tested growth rate records (average daily gain) from 13,166 and 21,762 pigs Landrace (LR) and Yorkshire (YS), respectively. A total of 1,041 (LR) and 964 (YS) pigs were genotyped using the Illumina PorcineSNP60 v2 BeadChip panel. With the BLUPF90 software package, genetic parameters were estimated using a modified animal model for competitive traits. Giving a fixed weight to pedigree relationships (${\tau}:1$), several weights (${\omega}_{xx}$, 0.1 to 1.0; with a 0.1 interval) were scaled with the genomic relationship for best model fit with Akaike information criterion (AIC). Results: The genetic variances and total heritability estimates ($T^2$) were mostly higher with ssGBLUP than in the pedigree-based analysis. The model AIC value increased with any level of ${\omega}$ other than 0.6 and 0.5 in LR and YS, respectively, indicating the worse fit of those models. The theoretical accuracies of direct and social breeding value were increased by decreasing ${\omega}$ in both breeds, indicating the better accuracy of ${\omega}_{0.1}$ models. Therefore, the optimal values of ${\omega}$ to minimize AIC and to increase theoretical accuracy were 0.6 in LR and 0.5 in YS. Conclusion: In conclusion, single-step ssGBLUP model fitting SGE showed significant improvement in accuracy compared with the pedigree-based analysis method; therefore, it could be implemented in a pig population for genomic selection based on SGE, especially in South Korean populations, with appropriate further adjustment of tuning parameters for relationship matrices.

AFT 생존분석 기법을 이용한 고속도로 교통사고 지속시간 예측모형 (A Prediction Model on Freeway Accident Duration using AFT Survival Analysis)

  • 정연식;송상규;최기주
    • 대한교통학회지
    • /
    • 제25권5호
    • /
    • pp.135-148
    • /
    • 2007
  • 교통사고의 특성과 사고에 대한 지속시간 사이의 관계에 대한 이해는 사고의 효과적인 대응과 사고에 의한 혼잡을 감소시키는데 핵심 요소가 된다. 때문에 본 연구의 목적은 AFT metric 모형을 적용한 사고 지속시간을 분석하는 것이다. 비록 로그 로지스틱 및 로그 정규 AFT 모형이 통계적 이론과 기존 연구 사례를 기반으로 선정되었으나, 로그 로지스틱 모형이 보다 우수하게 추정되었다. AFT 모형은 예측 목적으로도 널리 사용되기 때문에, 추정된 모형은 사고 발생시 사고 관련 기본 정보 접수 즉시 고속도에서의 사고 지속시간 예측에 사용될 수 있다. 결과적으로, 예측된 사고 지속시간 정보는 사고를 처리하기 위한 제반 의사 결정에 도움을 줄 뿐 아니라 교통 혼잡의 감소 및 추가 사상자의 감소로 그 효과가 이어질 것으로 판단된다.

Statistical analysis and probabilistic modeling of WIM monitoring data of an instrumented arch bridge

  • Ye, X.W.;Su, Y.H.;Xi, P.S.;Chen, B.;Han, J.P.
    • Smart Structures and Systems
    • /
    • 제17권6호
    • /
    • pp.1087-1105
    • /
    • 2016
  • Traffic load and volume is one of the most important physical quantities for bridge safety evaluation and maintenance strategies formulation. This paper aims to conduct the statistical analysis of traffic volume information and the multimodal modeling of gross vehicle weight (GVW) based on the monitoring data obtained from the weigh-in-motion (WIM) system instrumented on the arch Jiubao Bridge located in Hangzhou, China. A genetic algorithm (GA)-based mixture parameter estimation approach is developed for derivation of the unknown mixture parameters in mixed distribution models. The statistical analysis of one-year WIM data is firstly performed according to the vehicle type, single axle weight, and GVW. The probability density function (PDF) and cumulative distribution function (CDF) of the GVW data of selected vehicle types are then formulated by use of three kinds of finite mixed distributions (normal, lognormal and Weibull). The mixture parameters are determined by use of the proposed GA-based method. The results indicate that the stochastic properties of the GVW data acquired from the field-instrumented WIM sensors are effectively characterized by the method of finite mixture distributions in conjunction with the proposed GA-based mixture parameter identification algorithm. Moreover, it is revealed that the Weibull mixture distribution is relatively superior in modeling of the WIM data on the basis of the calculated Akaike's information criterion (AIC) values.

Modelling Stem Diameter Variability in Pinus caribaea (Morelet) Plantations in South West Nigeria

  • Adesoye, Peter Oluremi
    • Journal of Forest and Environmental Science
    • /
    • 제32권3호
    • /
    • pp.280-290
    • /
    • 2016
  • Stem diameter variability is an essential inventory result that provides useful information in forest management decisions. Little has been done to explore the modelling potentials of standard deviation (SDD) and coefficient of variation (CVD) of diameter at breast height (dbh). This study, therefore, was aimed at developing and testing models for predicting SDD and CVD in stands of Pinus caribaea Morelet (pine) in south west Nigeria. Sixty temporary sample plots of size $20m{\times}20m$, ranging between 15 and 37 years were sampled, covering the entire range of pine in south west Nigeria. The dbh (cm), total and merchantable heights (m), number of stems and age of trees were measured within each plot. Basal area ($m^2$), site index (m), relative spacing and percentile positions of dbh at $24^{th}$, $63^{rd}$, $76^{th}$ and $93^{rd}$ (i.e. $P_{24}$, $P_{63}$, $P_{76}$ and $P_{93}$) were computed from measured variables for each plot. Linear mixed model (LMM) was used to test the effects of locations (fixed) and plots (random). Six candidate models (3 for SDD and 3 for CVD), using three categories of explanatory variables (i.e. (i) only stand size measures, (ii) distribution measures, and (iii) combination of i and ii). The best model was chosen based on smaller relative standard error (RSE), prediction residual sum of squares (PRESS), corrected Akaike Information Criterion ($AIC_c$) and larger coefficient of determination ($R^2$). The results of the LMM indicated that location and plot effects were not significant. The CVD and SDD models having only measures of percentiles (i.e. $P_{24}$ and $P_{93}$) as predictors produced better predictions than others. However, CVD model produced the overall best predictions, because of the lower RSE and stability in measuring variability across different stand developments. The results demonstrate the potentials of CVD in modelling stem diameter variability in relationship with percentiles variables.

Analysis and probabilistic modeling of wind characteristics of an arch bridge using structural health monitoring data during typhoons

  • Ye, X.W.;Xi, P.S.;Su, Y.H.;Chen, B.
    • Structural Engineering and Mechanics
    • /
    • 제63권6호
    • /
    • pp.809-824
    • /
    • 2017
  • The accurate evaluation of wind characteristics and wind-induced structural responses during a typhoon is of significant importance for bridge design and safety assessment. This paper presents an expectation maximization (EM) algorithm-based angular-linear approach for probabilistic modeling of field-measured wind characteristics. The proposed method has been applied to model the wind speed and direction data during typhoons recorded by the structural health monitoring (SHM) system instrumented on the arch Jiubao Bridge located in Hangzhou, China. In the summer of 2015, three typhoons, i.e., Typhoon Chan-hom, Typhoon Soudelor and Typhoon Goni, made landfall in the east of China and then struck the Jiubao Bridge. By analyzing the wind monitoring data such as the wind speed and direction measured by three anemometers during typhoons, the wind characteristics during typhoons are derived, including the average wind speed and direction, turbulence intensity, gust factor, turbulence integral scale, and power spectral density (PSD). An EM algorithm-based angular-linear modeling approach is proposed for modeling the joint distribution of the wind speed and direction. For the marginal distribution of the wind speed, the finite mixture of two-parameter Weibull distribution is employed, and the finite mixture of von Mises distribution is used to represent the wind direction. The parameters of each distribution model are estimated by use of the EM algorithm, and the optimal model is determined by the values of $R^2$ statistic and the Akaike's information criterion (AIC). The results indicate that the stochastic properties of the wind field around the bridge site during typhoons are effectively characterized by the proposed EM algorithm-based angular-linear modeling approach. The formulated joint distribution of the wind speed and direction can serve as a solid foundation for the purpose of accurately evaluating the typhoon-induced fatigue damage of long-span bridges.

Applying Conventional and Saturated Generalized Gamma Distributions in Parametric Survival Analysis of Breast Cancer

  • Yavari, Parvin;Abadi, Alireza;Amanpour, Farzaneh;Bajdik, Chris
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제13권5호
    • /
    • pp.1829-1831
    • /
    • 2012
  • Background: The generalized gamma distribution statistics constitute an extensive family that contains nearly all of the most commonly used distributions including the exponential, Weibull and log normal. A saturated version of the model allows covariates having effects through all the parameters of survival time distribution. Accelerated failure-time models assume that only one parameter of the distribution depends on the covariates. Methods: We fitted both the conventional GG model and the saturated form for each of its members including the Weibull and lognormal distribution; and compared them using likelihood ratios. To compare the selected parameter distribution with log logistic distribution which is a famous distribution in survival analysis that is not included in generalized gamma family, we used the Akaike information criterion (AIC; r=l(b)-2p). All models were fitted using data for 369 women age 50 years or more, diagnosed with stage IV breast cancer in BC during 1990-1999 and followed to 2010. Results: In both conventional and saturated parametric models, the lognormal was the best candidate among the GG family members; also, the lognormal fitted better than log-logistic distribution. By the conventional GG model, the variables "surgery", "radiotherapy", "hormone therapy", "erposneg" and interaction between "hormone therapy" and "erposneg" are significant. In the AFT model, we estimated the relative time for these variables. By the saturated GG model, similar significant variables are selected. Estimating the relative times in different percentiles of extended model illustrate the pattern in which the relative survival time change during the time. Conclusions: The advantage of using the generalized gamma distribution is that it facilitates estimating a model with improved fit over the standard Weibull or lognormal distributions. Alternatively, the generalized F family of distributions might be considered, of which the generalized gamma distribution is a member and also includes the commonly used log-logistic distribution.

Evaluation of goodness of fit of semiparametric and parametric models in analysis of factors associated with length of stay in neonatal intensive care unit

  • Kheiry, Fatemeh;Kargarian-Marvasti, Sadegh;Afrashteh, Sima;Mohammadbeigi, Abolfazl;Daneshi, Nima;Naderi, Salma;Saadat, Seyed Hossein
    • Clinical and Experimental Pediatrics
    • /
    • 제63권9호
    • /
    • pp.361-367
    • /
    • 2020
  • Background: Length of stay is a significant indicator of care effectiveness and hospital performance. Owing to the limited number of healthcare centers and facilities, it is important to optimize length of stay and associated factors. Purpose: The present study aimed to investigate factors associated with neonatal length of stay in the neonatal intensive care unit (NICU) using parametric and semiparametric models and compare model fitness according to Akaike information criterion (AIC) between 2016 and 2018. Methods: This retrospective cohort study reviewed 600 medical records of infants admitted to the NICU of Bandar Abbas Hospital. Samples were identified using census sampling. Factors associated with NICU length of stay were investigated based on semiparametric Cox model and 4 parametric models including Weibull, exponential, log-logistic, and log-normal to determine the best fitted model. The data analysis was conducted using R software. The significance level was set at 0.05. Results: The study findings suggest that breastfeeding, phototherapy, acute renal failure, presence of mechanical ventilation, and availability of central venous catheter were commonly identified as factors associated with NICU length of stay in all 5 models (P<0.05). Parametric models showed better fitness than the Cox model in this study. Conclusion: Breastfeeding and availability of central venous catheter had protective effects against length of stay, whereas phototherapy, acute renal failure, and mechanical ventilation increased length of stay in NICU. Therefore, the identification of factors associated with NICU length of stay can help establish effective interventions aimed at decreasing the length of stay among infants.

동해안 자망에 대한 고무꺽정이 (Dasycottus setiger )의 망목 선택성 (Size selectivity of the gill net for spinyhead sculpin, Dasycottus setiger in the eastern coastal waters of Korea)

  • 박창두;배재현;조삼광;안희춘;김인옥
    • 수산해양기술연구
    • /
    • 제52권4호
    • /
    • pp.281-289
    • /
    • 2016
  • Spinyhead sculpin Dasycottus setiger, a species of cold water fish, is distributed along the eastern coastal waters of Korea. A series of fishing experiments was carried out in the waters near Uljin from June, 2002 to November, 2004, using the experimental monofilament gill nets of different mesh sizes (82.2, 89.4, 104.8, and 120.2 mm) to describe the selectivity of the gill net for the fish. The SELECT (Share Each Length's Catch Total) analysis with maximum likelihood method was applied to fit the different functional models (normal, lognormal, and bi-normal models) for selection curves to the catch data. The bi-normal model with the fixed relative fishing intensity was selected as the best-fit selection curve by AIC (Akaike's Information Criterion) comparison. For the best-fit selection curve, the optimum relative length (the ratio of fish total length to mesh size) with the maximum efficiency and the selection range ($R_{50%,large}-R_{50%,small}$) of 50% retention were obtained as 2.363 and 0.851, respectively. The ratios of body girth to mesh perimeter at 100% retention where the selection curve of each mesh size represented the optimum total length were calculated as the range of 0.86 ~ 0.87.

Line Transect에서 발견율함수 추정에 사용되는 모델에 따른 상괭이, Neophocaena phocaenoides의 자원개체수 추정 (Abundance Estimation of the Finless Porpoise, Neophocaena phocaenoides, Using Models of the Detection Function in a Line Transect)

  • 박겸준;김장근;장창익
    • 한국수산과학회지
    • /
    • 제40권4호
    • /
    • pp.201-209
    • /
    • 2007
  • Line transect sampling in a sighting survey is one of most widely used methods for assessing animal abundance. This study applied distance data, collected from three sighting surveys using line transects for finless porpoise that were conducted in 2004 and 2005 off the west coast of Korea, to four models (hazard-rate, uniform, half-normal and exponential) that can use a variety of detection functions, g (x). The hazard-rate model, a derived model for the detection function, should have a shoulder condition chosen using the AIC (Akaike Information Criterion), as the most suitable model. However, it did not describe a shoulder shape for the value of g(x) near the track tine and underestimated g (x), just as the exponential model did. The hazard-rate model showed a bias toward overestimating the densities of finless porpoises with a higher coefficient of variation (CV) than the other models did. The uniform model underestimated the densities of finless porpoise but had the lowest CV. The half-normal model described a detection function with a shape similar to that of the uniform model. The half-normal model was robust for finless porpoise data and should be able to avoid density underestimation. The estimated abundance of finless porpoise was 3,602 individuals (95% CI=1,251-10,371) inshore in 2005 and 33,045 individuals (95% CI=24,274-44,985) offshore in 2004.

평점에 따른 OTT 서비스 콘텐츠의 성공과 실패 요인 분석: 넷플릭스를 중심으로 (Analysis of Success and Failure Factors of OTT Service Contents According to the Rating: Focus on Netflix)

  • 홍지수;박진수;강성우
    • 산업경영시스템학회지
    • /
    • 제44권4호
    • /
    • pp.65-75
    • /
    • 2021
  • This study explores multiple variables of an OTT service for discovering hidden relationship between rating and the other variables of each successful and failed content, respectively. In order to extract key variables that are strongly correlated to the rating across the contents, this work analyzes 170 Netflix original dramas and 419 movies. These contents are classified as success and failure by using the rating site IMDb, respectively. The correlation between the contents, which are classified via rating, and variables such as violence, lewdness and running time are analyzed to determine whether a certain variable appears or not in each successful and failure content. This study employs a regression analysis to discover correlations across the variables as a main analysis method. Since the correlation between independent variables should be low, check multicollinearity and select the variable. Cook's distance is used to detect and remove outliers. To improve the accuracy of the model, a variable selection based on AIC(Akaike Information Criterion) is performed. Finally, the basic assumptions of regression analysis are identified by residual diagnosis and Dubin Watson test. According to the whole analysis process, it is concluded that the more director awards exist and the less immatatable tend to be successful in movies. On the contrary, lower fear tend to be failure in movies. In case of dramas, there are close correlations between failure dramas and lower violence, higher fear, higher drugs.