• Title/Summary/Keyword: Akaike's Information (AIC)

Search Result 36, Processing Time 0.031 seconds

Allometric equation for estimating aboveground biomass of Acacia-Commiphora forest, southern Ethiopia

  • Wondimagegn Amanuel;Chala Tadesse;Moges Molla;Desalegn Getinet;Zenebe Mekonnen
    • Journal of Ecology and Environment
    • /
    • v.48 no.2
    • /
    • pp.196-206
    • /
    • 2024
  • Background: Most of the biomass equations were developed using sample trees collected mainly from pan-tropical and tropical regions that may over- or underestimate biomass. Site-specific models would improve the accuracy of the biomass estimates and enhance the country's measurement, reporting, and verification activities. The aim of the study is to develop site-specific biomass estimation models and validate and evaluate the existing generic models developed for pan-tropical forest and newly developed allometric models. Total of 140 trees was harvested from each diameter class biomass model development. Data was analyzed using SAS procedures. All relevant statistical tests (normality, multicollinearity, and heteroscedasticity) were performed. Data was transformed to logarithmic functions and multiple linear regression techniques were used to develop model to estimate aboveground biomass (AGB). The root mean square error (RMSE) was used for measuring model bias, precision, and accuracy. The coefficient of determination (R2 and adjusted [adj]-R2), the Akaike Information Criterion (AIC) and the Schwarz Bayesian information Criterion was employed to select most appropriate models. Results: For the general total AGB models, adj-R2 ranged from 0.71 to 0.85, and model 9 with diameter at stump height at 10 cm (DSH10), ρ and crown width (CW) as predictor variables, performed best according to RMSE and AIC. For the merchantable stem models, adj-R2 varied from 0.73 to 0.82, and model 8) with combination of ρ, diameter at breast height and height (H), CW and DSH10 as predictor variables, was best in terms of RMSE and AIC. The results showed that a best-fit model for above-ground biomass of tree components was developed. AGBStem = exp {-1.8296 + 0.4814 natural logarithm (Ln) (ρD2H) + 0.1751 Ln (CW) + 0.4059 Ln (DSH30)} AGBBranch = exp {-131.6 + 15.0013 Ln (ρD2H) + 13.176 Ln (CW) + 21.8506 Ln (DSH30)} AGBFoliage = exp {-0.9496 + 0.5282 Ln (DSH30) + 2.3492 Ln (ρ) + 0.4286 Ln (CW)} AGBTotal = exp {-1.8245 + 1.4358 Ln (DSH30) + 1.9921 Ln (ρ) + 0.6154 Ln (CW)} Conclusions: The results demonstrated that the development of local models derived from an appropriate sample of representative species can greatly improve the estimation of total AGB.

Statistical analysis and probabilistic modeling of WIM monitoring data of an instrumented arch bridge

  • Ye, X.W.;Su, Y.H.;Xi, P.S.;Chen, B.;Han, J.P.
    • Smart Structures and Systems
    • /
    • v.17 no.6
    • /
    • pp.1087-1105
    • /
    • 2016
  • Traffic load and volume is one of the most important physical quantities for bridge safety evaluation and maintenance strategies formulation. This paper aims to conduct the statistical analysis of traffic volume information and the multimodal modeling of gross vehicle weight (GVW) based on the monitoring data obtained from the weigh-in-motion (WIM) system instrumented on the arch Jiubao Bridge located in Hangzhou, China. A genetic algorithm (GA)-based mixture parameter estimation approach is developed for derivation of the unknown mixture parameters in mixed distribution models. The statistical analysis of one-year WIM data is firstly performed according to the vehicle type, single axle weight, and GVW. The probability density function (PDF) and cumulative distribution function (CDF) of the GVW data of selected vehicle types are then formulated by use of three kinds of finite mixed distributions (normal, lognormal and Weibull). The mixture parameters are determined by use of the proposed GA-based method. The results indicate that the stochastic properties of the GVW data acquired from the field-instrumented WIM sensors are effectively characterized by the method of finite mixture distributions in conjunction with the proposed GA-based mixture parameter identification algorithm. Moreover, it is revealed that the Weibull mixture distribution is relatively superior in modeling of the WIM data on the basis of the calculated Akaike's information criterion (AIC) values.

A Prediction Model on Freeway Accident Duration using AFT Survival Analysis (AFT 생존분석 기법을 이용한 고속도로 교통사고 지속시간 예측모형)

  • Jeong, Yeon-Sik;Song, Sang-Gyu;Choe, Gi-Ju
    • Journal of Korean Society of Transportation
    • /
    • v.25 no.5
    • /
    • pp.135-148
    • /
    • 2007
  • Understanding the relation between characteristics of an accident and its duration is crucial for the efficient response of accidents and the reduction of total delay caused by accidents. Thus the objective of this study is to model accident duration using an AFT metric model. Although the log-logistic and log-normal AFT models were selected based on the previous studies and statistical theory, the log-logistic model was better fitted. Since the AFT model is commonly used for the purpose of prediction, the estimated model can be also used for the prediction of duration on freeways as soon as the base accident information is reported. Therefore, the predicted information will be directly useful to make some decisions regarding the resources needed to clear accident and dispatch crews as well as will lead to less traffic congestion and much saving the injured.

Development of drought frequency analysis program (가뭄빈도해석 프로그램 개발)

  • Lee, Jeong Ju;Kang, Shin Uk;Chun, Gun Il;Kim, Hyeon Sik
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2020.06a
    • /
    • pp.14-14
    • /
    • 2020
  • 일반적으로 수문빈도해석은 치수계획 수립에 이용되는 설계강수량, 계획홍수량 등을 산정하기 위해 연최대치계열 또는 연초과치계열 자료를 이용한 극치빈도해석을 수행하고, 확률분포의 우측꼬리(right tail) 부분을 이용하여 확장된 재현기간에 해당하는 확률수문량을 추정한다. 하지만 가뭄 관련 분석에서는 확률분포의 좌측꼬리(left tail) 부분은 이용해 확장된 재현기간별 확률수문량을 추정해야할 경우가 발생한다. 또한 물관리 실무에서 장 단기 운영계획 수립을 위해 이용하는 갈수빈도 유입량 산정 등에서도 평년보다 작은 수문량에 대한 빈도해석이 필요한 경우가 있다. 국가 가뭄정보분석센터에서는 기존에 K-water연구원에서 개발한 빈도해석 프로그램인 K-FAT의 분석모듈을 이용해 극소치계열 또는 갈수빈도 유입량 분석에 특화된 가뭄빈도해석 프로그램을 개발하였다. 본 프로그램은 GEV, Gumbel, Weibull 등 14개의 확률분포형을 포함하며, 모멘트법, 최우도법 및 L-모멘트법을 사용하여 매개변수를 추정한다. 적합도 검정의 경우 χ2, K-S, CVM, PPCC 및 수정 Anderson-Darling test를 이용하여 다각적인 검정을 할 수 있도록 하였다. 분석을 위한 입력 자료의 경우 사용자가 전처리를 통해 준비한 연최소치계열 등 연도별 시계열자료를 이용할 수 있으며, 일단위 및 월단위의 강수량 또는 댐 유입량 자료를 이용해 사용자가 원하는 기간의 누적강수량, 평균 유입량으로 변환할 수 있는 자료변환 기능을 추가하여 실무 활용성을 높였다. 또한 최적 확률분포 선정을 위해 참고할 수 있도록 AIC(Akaike information criteria)와 BIC(Bayesian information criteria) 분석이 포함되어 있으며, Bootstrap 기법 등을 이용한 불확실성 산정을 통해 추정 값의 신뢰구간을 표시하도록 하였다. 개발된 프로그램은 베타버전 시험배포를 거쳐 가뭄정보포털을 통해 배포할 예정이다.

  • PDF

Analysis and probabilistic modeling of wind characteristics of an arch bridge using structural health monitoring data during typhoons

  • Ye, X.W.;Xi, P.S.;Su, Y.H.;Chen, B.
    • Structural Engineering and Mechanics
    • /
    • v.63 no.6
    • /
    • pp.809-824
    • /
    • 2017
  • The accurate evaluation of wind characteristics and wind-induced structural responses during a typhoon is of significant importance for bridge design and safety assessment. This paper presents an expectation maximization (EM) algorithm-based angular-linear approach for probabilistic modeling of field-measured wind characteristics. The proposed method has been applied to model the wind speed and direction data during typhoons recorded by the structural health monitoring (SHM) system instrumented on the arch Jiubao Bridge located in Hangzhou, China. In the summer of 2015, three typhoons, i.e., Typhoon Chan-hom, Typhoon Soudelor and Typhoon Goni, made landfall in the east of China and then struck the Jiubao Bridge. By analyzing the wind monitoring data such as the wind speed and direction measured by three anemometers during typhoons, the wind characteristics during typhoons are derived, including the average wind speed and direction, turbulence intensity, gust factor, turbulence integral scale, and power spectral density (PSD). An EM algorithm-based angular-linear modeling approach is proposed for modeling the joint distribution of the wind speed and direction. For the marginal distribution of the wind speed, the finite mixture of two-parameter Weibull distribution is employed, and the finite mixture of von Mises distribution is used to represent the wind direction. The parameters of each distribution model are estimated by use of the EM algorithm, and the optimal model is determined by the values of $R^2$ statistic and the Akaike's information criterion (AIC). The results indicate that the stochastic properties of the wind field around the bridge site during typhoons are effectively characterized by the proposed EM algorithm-based angular-linear modeling approach. The formulated joint distribution of the wind speed and direction can serve as a solid foundation for the purpose of accurately evaluating the typhoon-induced fatigue damage of long-span bridges.

Size selectivity of the gill net for spinyhead sculpin, Dasycottus setiger in the eastern coastal waters of Korea (동해안 자망에 대한 고무꺽정이 (Dasycottus setiger )의 망목 선택성)

  • PARK, Chang-Doo;BAE, Jae-Hyun;CHO, Sam-Kwang;AN, Heui-Chun;KIM, In-Ok
    • Journal of the Korean Society of Fisheries and Ocean Technology
    • /
    • v.52 no.4
    • /
    • pp.281-289
    • /
    • 2016
  • Spinyhead sculpin Dasycottus setiger, a species of cold water fish, is distributed along the eastern coastal waters of Korea. A series of fishing experiments was carried out in the waters near Uljin from June, 2002 to November, 2004, using the experimental monofilament gill nets of different mesh sizes (82.2, 89.4, 104.8, and 120.2 mm) to describe the selectivity of the gill net for the fish. The SELECT (Share Each Length's Catch Total) analysis with maximum likelihood method was applied to fit the different functional models (normal, lognormal, and bi-normal models) for selection curves to the catch data. The bi-normal model with the fixed relative fishing intensity was selected as the best-fit selection curve by AIC (Akaike's Information Criterion) comparison. For the best-fit selection curve, the optimum relative length (the ratio of fish total length to mesh size) with the maximum efficiency and the selection range ($R_{50%,large}-R_{50%,small}$) of 50% retention were obtained as 2.363 and 0.851, respectively. The ratios of body girth to mesh perimeter at 100% retention where the selection curve of each mesh size represented the optimum total length were calculated as the range of 0.86 ~ 0.87.

The Analysis of the Number of Donations Based on a Mixture of Poisson Regression Model (포아송 분포의 혼합모형을 이용한 기부 횟수 자료 분석)

  • Kim In-Young;Park Su-Bum;Kim Byung-Soo;Park Tae-Kyu
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.1
    • /
    • pp.1-12
    • /
    • 2006
  • The aim of this study is to analyse a survey data on the number of charitable donations using a mixture of two Poisson regression models. The survey was conducted in 2002 by Volunteer 21, an nonprofit organization, based on Koreans, who were older than 20. The mixture of two Poisson distributions is used to model the number of donations based on the empirical distribution of the data. The mixture of two Poisson distributions implies the whole population is subdivided into two groups, one with lesser number of donations and the other with larger number of donations. We fit the mixture of Poisson regression models on the number of donations to identify significant covariates. The expectation-maximization algorithm is employed to estimate the parameters. We computed 95% bootstrap confidence interval based on bias-corrected and accelerated method and used then for selecting significant explanatory variables. As a result, the income variable with four categories and the volunteering variable (1: experience of volunteering, 0: otherwise) turned out to be significant with the positive regression coefficients both in the lesser and the larger donation groups. However, the regression coefficients in the lesser donation group were larger than those in larger donation group.

Evaluation of goodness of fit of semiparametric and parametric models in analysis of factors associated with length of stay in neonatal intensive care unit

  • Kheiry, Fatemeh;Kargarian-Marvasti, Sadegh;Afrashteh, Sima;Mohammadbeigi, Abolfazl;Daneshi, Nima;Naderi, Salma;Saadat, Seyed Hossein
    • Clinical and Experimental Pediatrics
    • /
    • v.63 no.9
    • /
    • pp.361-367
    • /
    • 2020
  • Background: Length of stay is a significant indicator of care effectiveness and hospital performance. Owing to the limited number of healthcare centers and facilities, it is important to optimize length of stay and associated factors. Purpose: The present study aimed to investigate factors associated with neonatal length of stay in the neonatal intensive care unit (NICU) using parametric and semiparametric models and compare model fitness according to Akaike information criterion (AIC) between 2016 and 2018. Methods: This retrospective cohort study reviewed 600 medical records of infants admitted to the NICU of Bandar Abbas Hospital. Samples were identified using census sampling. Factors associated with NICU length of stay were investigated based on semiparametric Cox model and 4 parametric models including Weibull, exponential, log-logistic, and log-normal to determine the best fitted model. The data analysis was conducted using R software. The significance level was set at 0.05. Results: The study findings suggest that breastfeeding, phototherapy, acute renal failure, presence of mechanical ventilation, and availability of central venous catheter were commonly identified as factors associated with NICU length of stay in all 5 models (P<0.05). Parametric models showed better fitness than the Cox model in this study. Conclusion: Breastfeeding and availability of central venous catheter had protective effects against length of stay, whereas phototherapy, acute renal failure, and mechanical ventilation increased length of stay in NICU. Therefore, the identification of factors associated with NICU length of stay can help establish effective interventions aimed at decreasing the length of stay among infants.

Abundance Estimation of the Finless Porpoise, Neophocaena phocaenoides, Using Models of the Detection Function in a Line Transect (Line Transect에서 발견율함수 추정에 사용되는 모델에 따른 상괭이, Neophocaena phocaenoides의 자원개체수 추정)

  • Park, Kyum-Joon;Kim, Zang-Geun;Zhang, Chang-Ik
    • Korean Journal of Fisheries and Aquatic Sciences
    • /
    • v.40 no.4
    • /
    • pp.201-209
    • /
    • 2007
  • Line transect sampling in a sighting survey is one of most widely used methods for assessing animal abundance. This study applied distance data, collected from three sighting surveys using line transects for finless porpoise that were conducted in 2004 and 2005 off the west coast of Korea, to four models (hazard-rate, uniform, half-normal and exponential) that can use a variety of detection functions, g (x). The hazard-rate model, a derived model for the detection function, should have a shoulder condition chosen using the AIC (Akaike Information Criterion), as the most suitable model. However, it did not describe a shoulder shape for the value of g(x) near the track tine and underestimated g (x), just as the exponential model did. The hazard-rate model showed a bias toward overestimating the densities of finless porpoises with a higher coefficient of variation (CV) than the other models did. The uniform model underestimated the densities of finless porpoise but had the lowest CV. The half-normal model described a detection function with a shape similar to that of the uniform model. The half-normal model was robust for finless porpoise data and should be able to avoid density underestimation. The estimated abundance of finless porpoise was 3,602 individuals (95% CI=1,251-10,371) inshore in 2005 and 33,045 individuals (95% CI=24,274-44,985) offshore in 2004.

Analysis of Success and Failure Factors of OTT Service Contents According to the Rating: Focus on Netflix (평점에 따른 OTT 서비스 콘텐츠의 성공과 실패 요인 분석: 넷플릭스를 중심으로)

  • Hong, Ji-Soo;Park, Jin-Soo;Kang, Sung-Woo
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.44 no.4
    • /
    • pp.65-75
    • /
    • 2021
  • This study explores multiple variables of an OTT service for discovering hidden relationship between rating and the other variables of each successful and failed content, respectively. In order to extract key variables that are strongly correlated to the rating across the contents, this work analyzes 170 Netflix original dramas and 419 movies. These contents are classified as success and failure by using the rating site IMDb, respectively. The correlation between the contents, which are classified via rating, and variables such as violence, lewdness and running time are analyzed to determine whether a certain variable appears or not in each successful and failure content. This study employs a regression analysis to discover correlations across the variables as a main analysis method. Since the correlation between independent variables should be low, check multicollinearity and select the variable. Cook's distance is used to detect and remove outliers. To improve the accuracy of the model, a variable selection based on AIC(Akaike Information Criterion) is performed. Finally, the basic assumptions of regression analysis are identified by residual diagnosis and Dubin Watson test. According to the whole analysis process, it is concluded that the more director awards exist and the less immatatable tend to be successful in movies. On the contrary, lower fear tend to be failure in movies. In case of dramas, there are close correlations between failure dramas and lower violence, higher fear, higher drugs.