• Title/Summary/Keyword: Bayesian model

Prediction of genomic breeding values of carcass traits using whole genome SNP data in Hanwoo (Korean cattle) (한우에 있어서 유전체 육종가 추정)

  • Lee, Seung Hwan;Kim, Heong Cheul;Lim, Dajeong;Dang, Chang Gwan;Cho, Yong Min;Kim, Si Dong;Lee, Hak Kyo;Lee, Jun Heon;Yang, Boh Suk;Oh, Sung Jong;Hong, Seong Koo;Chang, Won Kyung
    • Korean Journal of Agricultural Science / v.39 no.3 / pp.357-364 / 2012
  • Genomic breeding values (GEBV) have recently become available in the beef cattle industry. Genomic selection methods are exceptionally valuable for selecting on traits, such as marbling, that are difficult to measure until later in life. One method for using information from sparse marker panels is Bayesian model selection with reversible jump Markov chain Monte Carlo (RJMCMC). When only SNP information is used, the accuracy of prediction differs between a multiple SNP model with RJMCMC (0.47 to 0.73) and a least squares method (0.11 to 0.41); when a polygenic effect is included, accuracy increases for both the multiple SNP model (0.56 to 0.90) and the least squares method (0.21 to 0.63). In the multiple SNP model with RJMCMC model selection, the accuracy ($r^2$) of GEBV for marbling predicted from SNP effects alone was 0.47, while the $r^2$ of GEBV predicted from SNP plus polygenic effects was 0.56. The accuracies of GEBV predicted using only SNP information were 0.62, 0.68 and 0.73 for carcass weight (CWT), eye muscle area (EMA) and backfat thickness (BF), respectively. When polygenic effects were included, the accuracies of GEBV increased to 0.89, 0.90 and 0.89 for CWT, EMA and BF, respectively. Our data demonstrate that SNP information alone misses part of the genetic variation that contributes to carcass trait phenotypes, and that polygenic effects compensate for the genetic variation that whole genome SNP data do not explain. Overall, the multiple SNP model with RJMCMC model selection provides a better prediction of GEBV than the least squares method (single marker regression).

Character-based Subtitle Generation by Learning of Multimodal Concept Hierarchy from Cartoon Videos (멀티모달 개념계층모델을 이용한 만화비디오 컨텐츠 학습을 통한 등장인물 기반 비디오 자막 생성)

  • Kim, Kyung-Min;Ha, Jung-Woo;Lee, Beom-Jin;Zhang, Byoung-Tak
    • Journal of KIISE / v.42 no.4 / pp.451-458 / 2015
  • Previous multimodal learning methods focus on problem-solving aspects, such as image and video search and tagging, rather than on knowledge acquisition via content modeling. In this paper, we propose the Multimodal Concept Hierarchy (MuCH), a content modeling method trained on a cartoon video dataset, together with a method that generates character-based subtitles from the learned model. The MuCH model has a multimodal hypernetwork layer, in which the patterns of the words and image patches are represented, and a concept layer, in which each concept variable is represented by a probability distribution over the words and the image patches. Using a Bayesian learning method, the model learns the characteristics of the characters as concepts from the video subtitles and scene images, and it can generate character-based subtitles from the learned model when text queries are provided. As an experiment, the MuCH model learned concepts from 'Pororo' cartoon videos totaling 268 minutes and generated character-based subtitles. Finally, we compare the results with those of other multimodal learning models. The experimental results indicate that, given the same text query, our model generates more accurate and more character-specific subtitles than other models.

Analysis of Elderly Drivers' Accident Models Considering Operations and Physical Characteristics (고령운전자 운전 및 신체특성을 반영한 교통사고 분석 연구)

  • Lim, Sam Jin;Park, Jun Tae;Kim, Young Il;Kim, Tae Ho
    • Journal of Korean Society of Transportation / v.30 no.6 / pp.37-46 / 2012
  • The number of traffic accidents caused by elderly drivers over the age of 65 has surged over the past ten years from 37,000 to 274,000 cases. The proportion of elderly drivers' accidents among all traffic accidents has jumped 3.1 times, from 1.2% to 3.7%, and traffic safety organizations are pursuing diverse measures to address the situation. Above all, it will prove vital to connect safety measures with in-depth research on the behavioral and physical characteristics of elderly drivers. This study conducted empirical research linking the driving characteristics of elderly drivers to their traffic accidents, based on Driving Aptitude Test items and traffic accident data, which made it possible to measure the behavioral characteristics of elderly drivers. In developing the Influence Model, we applied the zero-inflated Poisson (ZIP) regression model and selected an accident prediction model based on the Bayesian Influence, comparing the ZIP regression model with the zero-inflated negative binomial (ZINB) regression model. According to the results of the AAE analysis, the ZIP regression model was more appropriate, and three variables (prediction of velocity, diversion, and cognitive ability) were found to influence traffic accidents caused by elderly drivers.
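
The ZIP model named in this abstract mixes a point mass at zero with an ordinary Poisson count. The sketch below shows only the textbook probability mass function, not the fitted regression from the paper; the function name `zip_pmf` and the parameter names `lam` (Poisson mean) and `pi` (extra-zero probability) are assumptions made for illustration.

```python
import math

def zip_pmf(k, lam, pi):
    """Zero-inflated Poisson probability of observing count k.

    lam: mean of the Poisson component (hypothetical parameter name)
    pi:  probability of a structural (extra) zero (hypothetical name)
    """
    if k < 0 or lam <= 0 or not 0.0 <= pi <= 1.0:
        raise ValueError("need k >= 0, lam > 0 and 0 <= pi <= 1")
    poisson = math.exp(-lam) * lam ** k / math.factorial(k)
    if k == 0:
        # A zero can come from the point mass or from the Poisson part.
        return pi + (1.0 - pi) * poisson
    return (1.0 - pi) * poisson
```

In a ZIP regression, `lam` and `pi` would each depend on covariates (for example through log and logit links); that step is omitted here.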

Managing the Reverse Extrapolation Model of Radar Threats Based Upon an Incremental Machine Learning Technique (점진적 기계학습 기반의 레이더 위협체 역추정 모델 생성 및 갱신)

  • Kim, Chulpyo;Noh, Sanguk
    • The Journal of Korean Institute of Next Generation Computing / v.13 no.4 / pp.29-39 / 2017
  • Various electronic warfare situations drive the need to develop an integrated electronic warfare simulator that can perform electronic warfare modeling and simulation on radar threats. In this paper, we analyze the components of a simulation system that reversely models radar threats emitting electromagnetic signals, based on the parameters of the electronic information, and propose a method to incrementally maintain the reverse extrapolation model of RF threats. In the experiments, we evaluate the effectiveness of the incremental model update and also assess the method for integrating reverse extrapolation models. The individual models of RF threats are constructed using a decision tree, a naive Bayesian classifier, an artificial neural network, and clustering algorithms based on Euclidean distance and cosine similarity, respectively. Experimental results show that the accuracy of the reverse extrapolation models improves as the size of the threat sample increases. In addition, we use voting, weighted voting, and the Dempster-Shafer algorithm to integrate the results of the five different models of RF threats. The final decision of reverse extrapolation made through the Dempster-Shafer algorithm shows the best accuracy.
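
To illustrate the kind of evidence fusion the abstract attributes to the Dempster-Shafer algorithm, here is a minimal sketch of Dempster's rule of combination for two sources. It is not the authors' implementation; the function name `dempster_combine` and the representation of mass functions as dictionaries keyed by `frozenset` are assumptions.

```python
from itertools import product

def dempster_combine(m1, m2):
    """Combine two mass functions with Dempster's rule.

    m1, m2: dicts mapping frozenset hypotheses to masses that sum to 1.
    Returns the normalized combined mass function.
    """
    combined = {}
    conflict = 0.0
    for (a, mass_a), (b, mass_b) in product(m1.items(), m2.items()):
        intersection = a & b
        if intersection:
            combined[intersection] = combined.get(intersection, 0.0) + mass_a * mass_b
        else:
            # Mass assigned to incompatible hypotheses counts as conflict.
            conflict += mass_a * mass_b
    if conflict >= 1.0:
        raise ValueError("sources are in total conflict; rule is undefined")
    return {h: m / (1.0 - conflict) for h, m in combined.items()}
```

With more than two classifiers, the rule can be applied pairwise in sequence, since it is associative.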

Allometric equation for estimating aboveground biomass of Acacia-Commiphora forest, southern Ethiopia

  • Wondimagegn Amanuel;Chala Tadesse;Moges Molla;Desalegn Getinet;Zenebe Mekonnen
    • Journal of Ecology and Environment / v.48 no.2 / pp.196-206 / 2024
  • Background: Most biomass equations were developed using sample trees collected mainly from pan-tropical and tropical regions, and they may over- or underestimate biomass. Site-specific models would improve the accuracy of the biomass estimates and enhance the country's measurement, reporting, and verification activities. The aim of the study is to develop site-specific biomass estimation models and to validate and evaluate both the existing generic models developed for pan-tropical forests and the newly developed allometric models. A total of 140 trees were harvested from the diameter classes for biomass model development. Data were analyzed using SAS procedures. All relevant statistical tests (normality, multicollinearity, and heteroscedasticity) were performed. Data were log-transformed, and multiple linear regression techniques were used to develop models to estimate aboveground biomass (AGB). The root mean square error (RMSE) was used for measuring model bias, precision, and accuracy. The coefficient of determination ($R^2$ and adjusted [adj]-$R^2$), the Akaike Information Criterion (AIC) and the Schwarz Bayesian Information Criterion were employed to select the most appropriate models. Results: For the general total AGB models, adj-$R^2$ ranged from 0.71 to 0.85, and model 9, with diameter at stump height at 10 cm (DSH10), ρ and crown width (CW) as predictor variables, performed best according to RMSE and AIC. For the merchantable stem models, adj-$R^2$ varied from 0.73 to 0.82, and model 8, with a combination of ρ, diameter at breast height and height (H), CW and DSH10 as predictor variables, was best in terms of RMSE and AIC. The results showed that a best-fit model for the aboveground biomass of each tree component was developed:
    $\mathrm{AGB}_{\mathrm{Stem}} = \exp\{-1.8296 + 0.4814 \ln(\rho D^2 H) + 0.1751 \ln(\mathrm{CW}) + 0.4059 \ln(\mathrm{DSH30})\}$
    $\mathrm{AGB}_{\mathrm{Branch}} = \exp\{-131.6 + 15.0013 \ln(\rho D^2 H) + 13.176 \ln(\mathrm{CW}) + 21.8506 \ln(\mathrm{DSH30})\}$
    $\mathrm{AGB}_{\mathrm{Foliage}} = \exp\{-0.9496 + 0.5282 \ln(\mathrm{DSH30}) + 2.3492 \ln(\rho) + 0.4286 \ln(\mathrm{CW})\}$
    $\mathrm{AGB}_{\mathrm{Total}} = \exp\{-1.8245 + 1.4358 \ln(\mathrm{DSH30}) + 1.9921 \ln(\rho) + 0.6154 \ln(\mathrm{CW})\}$
    Conclusions: The results demonstrated that local models derived from an appropriate sample of representative species can greatly improve the estimation of total AGB.
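
As a rough illustration of how the total AGB equation in this abstract would be evaluated, the sketch below plugs inputs into the coefficients exactly as printed. The function name `agb_total` is hypothetical, ρ is assumed to denote wood density, and the abstract does not state units, so inputs are assumed to be in whatever units the original fit used.

```python
import math

def agb_total(dsh30, rho, cw):
    """Evaluate the AGBTotal equation as printed in the abstract.

    dsh30: diameter at stump height (DSH30 in the abstract)
    rho:   ρ in the abstract (assumed here to be wood density)
    cw:    crown width
    Units are not given in the abstract and are an assumption of the caller.
    """
    if min(dsh30, rho, cw) <= 0:
        raise ValueError("all inputs must be positive for the log transform")
    log_agb = (-1.8245
               + 1.4358 * math.log(dsh30)
               + 1.9921 * math.log(rho)
               + 0.6154 * math.log(cw))
    # Back-transform from the log scale on which the model was fitted.
    return math.exp(log_agb)
```

Because the model is fitted on log-transformed data, a plain back-transform like this can be biased; whether the authors applied a correction factor is not stated in the abstract.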

Estimating Fine Particulate Matter Concentration using GLDAS Hydrometeorological Data (GLDAS 수문기상인자를 이용한 초미세먼지 농도 추정)

  • Lee, Seulchan;Jeong, Jaehwan;Park, Jongmin;Jeon, Hyunho;Choi, Minha
    • Korean Journal of Remote Sensing / v.35 no.6_1 / pp.919-932 / 2019
  • Fine particulate matter (PM2.5) is not only affected by anthropogenic emissions, but is also intensified, transported, and reduced by hydrometeorological factors. It is therefore essential to understand the relationships between hydrometeorological factors and PM2.5 concentration. In Korea, PM2.5 concentration is measured at ground observatories, and estimated values are assigned to locations where no observatory is present. Such data are not suitable for representing an area, so the concentration at those locations cannot be known accurately. In addition, it is hard to trace the migration, intensification, and reduction of PM2.5. In this study, we analyzed the relationships between hydrometeorological factors, acquired from the Global Land Data Assimilation System (GLDAS), and PM2.5 by means of Bayesian Model Averaging (BMA). Using BMA, we also selected the factors that have a meaningful relationship with the variation of PM2.5 concentration. Four PM2.5 concentration models for different seasons were developed using the selected factors, together with Aerosol Optical Depth (AOD) from the MODerate resolution Imaging Spectroradiometer (MODIS). Finally, we mapped the model results to show the spatial distribution of PM2.5. The model correlated well with the observed PM2.5 concentration (R ~0.7; IOA ~0.78; RMSE ~7.66 ㎍/㎥). When the models were compared with the observed PM2.5 concentrations at different locations, the correlation coefficients differed (R: 0.32-0.82), although there were similarities in data distribution. The concentration map developed using the models was able to represent the temporal and spatial variation of PM2.5 concentration. The results of this study are expected to facilitate research that aims to analyze the sources and movements of PM2.5, if the study area is extended to East Asia.
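
The abstract does not say how the BMA weights were obtained. One common approximation, shown here only as a sketch and not as the paper's method, weights each candidate model in proportion to exp(-BIC/2) and averages the model predictions with those weights. The function names `bma_weights` and `bma_predict` are hypothetical.

```python
import math

def bma_weights(bics):
    """Approximate posterior model weights from BIC values.

    Uses weight_i proportional to exp(-0.5 * (BIC_i - min BIC)); subtracting
    the minimum keeps the exponentials numerically stable.
    """
    best = min(bics)
    raw = [math.exp(-0.5 * (b - best)) for b in bics]
    total = sum(raw)
    return [r / total for r in raw]

def bma_predict(predictions, weights):
    """Weighted average of per-model predictions for one observation."""
    return sum(p * w for p, w in zip(predictions, weights))
```

Summing these weights over all models that include a given predictor gives that predictor's posterior inclusion probability, which is one way factors can be screened.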

Effects of Financial College Tuition Support by Korean Parents using a Hierarchical Bayes Model (계층적 베이즈 모형을 이용한 대학등록금에 대한 부모님의 경제적 지원 영향 분석)

  • Oh, Man-Suk;Oh, Hyun Sook;Oh, Min Jung
    • The Korean Journal of Applied Statistics / v.26 no.2 / pp.267-280 / 2013
  • College tuition is a significant economic, social, and political issue in Korea. We conduct a Bayesian analysis of a hierarchical model to address the factors related to college tuition, based on survey data collected by Statistics Korea. A binary response variable is defined according to whether more than 70% of tuition costs are supported by parents, and a hierarchical probit model is constructed with areas as groups. A set of explanatory variables is selected from a factor analysis of the variables available in the survey. A Markov chain Monte Carlo algorithm is used to estimate the parameters. The analysis shows that income and stress are significantly related to college tuition support from parents. Parents with high income tend to support their children's college tuition, and students with parents' financial support tend to be less mentally stressed; this shows that the economic status of parents significantly affects the mental health of college students. Gender, a healthy lifestyle, and college satisfaction are not significant factors. Comparing areas in terms of the degree of correlation between stress/income and tuition support from parents, students in Kangwon-do are the most mentally stressed when parents' support is limited; in addition, the positive correlation between parents' support and income is stronger in big cities than in provincial areas.

The development of water circulation model based on quasi-realtime hydrological data for drought monitoring (수문학적 가뭄 모니터링을 위한 실적자료 기반 물순환 모델 개발)

  • Kim, Jin-Young;Kim, Jin-Guk;Kim, Jang-Gyeng;Chun, Gun-il;Kang, Shin-uk;Lee, Jeong-Ju;Nam, Woo-Sung;Kwon, Hyun-Han
    • Journal of Korea Water Resources Association / v.53 no.8 / pp.569-582 / 2020
  • Recently, Korea has faced a change in the pattern of water use due to urbanization, which has made it difficult to understand the rainfall-runoff process and to optimize the allocation of available water resources. From this perspective, a spatially downscaled analysis of the water balance is required for the efficient operation of water resources under the National Water Management Plan and the River Basin Water Resource Management Plan. However, the existing water balance analysis does not fully consider water circulation and availability in the basin, so its results provide limited information for decision making. This study aims to develop a novel water circulation analysis model designed to support a quasi-real-time assessment of water availability along the river. The water circulation model proposed in this study addresses the problems that appear in the existing water balance analysis. More importantly, the results showed a significant improvement over the existing model, especially in low flow simulation. The proposed modeling framework is expected to provide primary information for more realistic hydrological drought monitoring and drought countermeasures by supplying streamflow information in quasi-real-time through a more accurate natural flow estimation approach for a highly complex network.

The PIC Bumper Beam Design Method with Machine Learning Technique (머신 러닝 기법을 이용한 PIC 범퍼 빔 설계 방법)

  • Ham, Seokwoo;Ji, Seungmin;Cheon, Seong S.
    • Composites Research / v.35 no.5 / pp.317-321 / 2022
  • In this study, the PIC design method with machine learning, which automatically assigns different stacking sequences according to loading types, was applied to a bumper beam. The input values and labels of the training data for machine learning were defined as the coordinates and loading types, respectively, of reference elements that are a subset of all the elements. To compare the 2D and 3D implementation methods, which are two ways of representing the coordinate values, training data were generated and machine learning models were trained with each method. The 2D implementation method divides the FE model into its faces, then generates training data and trains a machine learning model for each face. The 3D implementation method trains a single machine learning model on training data generated from the entire finite element model. The hyperparameters were tuned to optimal values through a Bayesian algorithm, and the k-NN classification method showed the highest prediction rate and AUC-ROC among the tuned models. The 3D implementation method showed higher performance than the 2D implementation method. The loading type data predicted by the machine learning model were mapped to the finite element model and comparatively verified through FE analysis. The 3D implementation PIC bumper beam was found to be superior to both the 2D implementation and the uni-stacking sequence composite bumper beam.

A Reliability Analysis of Slope Stability of Earth-Rockfill Dam (Earth-Rockfill Dam사면파괴에 대한 신뢰도 연구(I))

  • 박현종;이인모
    • Geotechnical Engineering / v.7 no.3 / pp.21-32 / 1991
  • The purpose of this paper is to develop a reliability model for the slope stability of earth-rockfill dams that accounts for all the uncertainties encountered. The uncertain design variables include the cohesion, the angle of internal friction, and the porewater pressure in each zone. More specifically, the model errors in estimating those variables are studied in depth. To reduce the uncertainties due to model errors, updated design variables are obtained using Bayesian theory. For the reliability calculations, both two-dimensional stability analysis and three-dimensional stability analysis are used, the latter accounting for end effects and the system reliability concept. The deterministic safety factor from the three-dimensional analysis is larger than that from the two-dimensional analysis. However, the probability of failure from the three-dimensional analysis is about 3.5 times larger than that from the two-dimensional analysis, because the system reliability concept is used in the three-dimensional analysis. The sensitivity analysis shows that the probability of failure is more sensitive to the uncertainty of the cohesion than to that of the angle of internal friction.
