• Title/Abstract/Keywords: Gibbs sampling

Search results: 168 items (processing time 0.026 seconds)

Genetic Variability of Show Jumping Attributes in Young Horses Commencing Competing

  • Prochniak, Tomasz;Rozempolska-Rucinska, Iwona;Zieba, Grzegorz;Lukaszewicz, Marek
    • Asian-Australasian Journal of Animal Sciences / Vol. 28, No. 8 / pp. 1090-1094 / 2015
  • The aim of the study was to select traits that may constitute a prospective criterion for breeding value prediction of young horses. The results of 1,232 starts of 894 four-, five-, six-, and seven-year-old horses, obtained during jumping championships for young horses that had not been evaluated in training centres (the alternative to championships), were analysed. Nine traits were chosen from those recorded: ranking in the championship, elimination (y/n), conformation, rating of style on days one, two, and three, and penalty points on days one, two, and three of a championship. (Co)variance components were estimated via the Gibbs sampling procedure, and the relevant (co)variance component ratios were calculated. Statistical classifications were trait dependent, but all models fitted random additive genetic and permanent environment effects. Penalty points and jumping style were found to be potential indicators of jumping ability, and the genetic variability of the traits ranged from 14% to 27%. Given the low genetic correlations between conformation and the other results achieved on the course, the relevance of assessing conformation in four-year-old horses is questioned.
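As a hedged illustration of how (co)variance component ratios can be derived from Gibbs output, the sketch below computes heritability as the additive genetic share of total variance for each retained draw and then averages the ratios. The function name, the three-component layout, and the example draws are hypothetical and are not taken from the study.

```python
from statistics import fmean


def heritability_draws(variance_draws):
    """Return one heritability value per Gibbs draw.

    Each draw is a tuple (additive, permanent_env, residual) of variance
    components. This three-component layout is an assumption made for
    illustration.
    """
    return [a / (a + pe + e) for a, pe, e in variance_draws]


# Hypothetical draws, for illustration only (not results from the paper).
example_draws = [(2.0, 1.0, 7.0), (3.0, 2.0, 5.0)]
posterior_mean_h2 = fmean(heritability_draws(example_draws))
```

Computing the ratio inside each draw and averaging afterwards gives the posterior mean of the ratio itself, which generally differs from the ratio of posterior mean variances.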

Investigation of Biases for Variance Components on Multiple Traits with Varying Number of Categories in Threshold Models Using Bayesian Inferences

  • Lee, D.H.
    • Asian-Australasian Journal of Animal Sciences / Vol. 15, No. 7 / pp. 925-931 / 2002
  • Gibbs sampling algorithms were implemented for multi-trait threshold animal models with any combination of binary, ordered categorical, and linear traits, and the amount of bias in these models was investigated under two kinds of parameterization and algorithms for generating underlying liabilities. Statistical models that included additive genetic and residual effects as random and contemporary group effects as fixed were fitted to simulated data. Posterior means of heritabilities and genetic (residual) correlations were calculated from 1,000 samples, retained as every 10th sample after the first 15,000 samples were discarded as a "burn-in" period. Under the models considered, several combinations of three traits (binary, multiple ordered categories, and continuous) were analyzed. Five replicates were carried out. Posterior mean estimates of heritabilities and genetic (residual) correlations were unbiased when the underlying liabilities for a categorical trait were generated conditionally on the underlying liabilities of the other traits and the threshold estimates were rescaled. Otherwise, when a threshold of zero and a residual variance of one were used to parameterize binary traits, heritability estimates were inflated by 7-10%. Genetic correlation estimates were biased upward if positively correlated and downward if negatively correlated when the underlying liabilities were generated without accounting for correlated traits in the prior information. In that case, residual correlation estimates were consequently strongly biased downward if positively correlated and upward if negatively correlated. The more categories a categorical trait had, the better the mixing rate.
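The sampling scheme described above (liabilities generated by data augmentation, 15,000 burn-in iterations, then every 10th of the next 10,000 draws kept) can be sketched for a far simpler case than the paper's multi-trait animal model. The sketch assumes a single binary trait with only an overall mean, a flat prior on that mean, a threshold of zero, and a residual variance of one. All function and parameter names are hypothetical.

```python
import random
from statistics import NormalDist, fmean

STD_NORMAL = NormalDist()


def sample_truncated_liability(mean, positive, rng):
    """Draw from N(mean, 1) truncated to (0, inf) if positive, else (-inf, 0]."""
    p_below = STD_NORMAL.cdf(-mean)  # P(liability <= 0) given the mean
    u = rng.uniform(p_below, 1.0) if positive else rng.uniform(0.0, p_below)
    u = min(max(u, 1e-12), 1.0 - 1e-12)  # keep inv_cdf inside its open domain
    return mean + STD_NORMAL.inv_cdf(u)


def gibbs_binary_threshold(y, burn_in=15000, thin=10, n_keep=1000, seed=1):
    """Gibbs sampler for the mean liability of one binary trait.

    Assumed simplification: no genetic or contemporary group effects.
    Runs burn_in + thin * n_keep iterations and returns n_keep draws.
    """
    rng = random.Random(seed)
    n = len(y)
    mu = 0.0
    kept = []
    for iteration in range(1, burn_in + thin * n_keep + 1):
        # Step 1: sample each liability given mu and the observed category.
        liabilities = [sample_truncated_liability(mu, yi == 1, rng) for yi in y]
        # Step 2: sample mu given the liabilities (flat prior, unit variance).
        mu = rng.gauss(fmean(liabilities), (1.0 / n) ** 0.5)
        # Keep every thin-th draw once the burn-in period is over.
        if iteration > burn_in and (iteration - burn_in) % thin == 0:
            kept.append(mu)
    return kept
```

With the default settings the chain runs 25,000 iterations in total, which in pure Python is slow for large data sets; shorter chains are adequate for trying the sketch out.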

Analysis on Topic Trends and Topic Modeling of KSHSM Journal Papers using Text Mining (텍스트마이닝을 활용한 보건의료산업학회지의 토픽 모델링 및 토픽트렌드 분석)

  • 조경원;배성권;우영운
    • The Korean Journal of Health Service Management (보건의료산업학회지) / Vol. 11, No. 4 / pp. 213-224 / 2017
  • Objectives: The purpose of this study was to analyze representative topics and topic trends of papers in the Korean Society and Health Service Management (KSHSM) Journal. Methods: We collected English abstracts and keywords of 516 papers published in the KSHSM Journal from 2007 to 2017. We used Python web scraping programs to collect the papers from the Korea Citation Index web site, and RStudio software for topic analysis based on the latent Dirichlet allocation algorithm. Results: Perplexity analysis identified nine as the best number of topics, and the resulting nine topics for all the papers were extracted using the Gibbs sampling method. We refined the nine topics to five by careful consideration of the meaning of each topic and analysis of the intertopic distance map. In the topic trend analysis from 2007 to 2017, 'Health Management' and 'Hospital Service' were the two representative topics; 'Hospital Service' was the prevalent topic until 2011, but the ratio of the two topics became similar from 2012. Conclusions: We found that five was the best number of topics and that the topic trends reflected the main issues of the KSHSM Journal, such as the name revision of the society in 2012.
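To make the "topics extracted using the Gibbs sampling method" step concrete, here is a minimal pure-Python sketch of collapsed Gibbs sampling for latent Dirichlet allocation. It is not the R workflow the authors used. The function name, the hyperparameter defaults (`alpha`, `beta`), and the integer-coded documents are assumptions chosen for illustration.

```python
import random


def lda_collapsed_gibbs(docs, n_topics, vocab_size, alpha=0.1, beta=0.01,
                        n_iter=200, seed=0):
    """Collapsed Gibbs sampler for LDA on documents given as lists of word ids."""
    rng = random.Random(seed)
    doc_topic = [[0] * n_topics for _ in docs]                 # n(d, k)
    topic_word = [[0] * vocab_size for _ in range(n_topics)]   # n(k, w)
    topic_total = [0] * n_topics                               # n(k)
    assignments = []
    # Random initial topic for every token.
    for d, doc in enumerate(docs):
        z_doc = []
        for w in doc:
            z = rng.randrange(n_topics)
            z_doc.append(z)
            doc_topic[d][z] += 1
            topic_word[z][w] += 1
            topic_total[z] += 1
        assignments.append(z_doc)
    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the token's current assignment from the counts.
                z = assignments[d][i]
                doc_topic[d][z] -= 1
                topic_word[z][w] -= 1
                topic_total[z] -= 1
                # Full conditional of the topic, up to a constant.
                weights = [
                    (doc_topic[d][k] + alpha) * (topic_word[k][w] + beta)
                    / (topic_total[k] + vocab_size * beta)
                    for k in range(n_topics)
                ]
                z = rng.choices(range(n_topics), weights=weights)[0]
                assignments[d][i] = z
                doc_topic[d][z] += 1
                topic_word[z][w] += 1
                topic_total[z] += 1
    # Smoothed document-topic proportions from the final state.
    theta = [
        [(count + alpha) / (len(doc) + n_topics * alpha) for count in doc_topic[d]]
        for d, doc in enumerate(docs)
    ]
    return theta, topic_word
```

Running the sampler for several candidate values of `n_topics` and comparing held-out perplexity is one common way to choose the number of topics, which is the role perplexity analysis plays in the abstract.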

A Bayesian Prediction of the Generalized Pareto Model (일반화 파레토 모형에서의 베이지안 예측)

  • 판허;손중권
    • The Korean Journal of Applied Statistics (응용통계연구) / Vol. 27, No. 6 / pp. 1069-1076 / 2014
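  • As interest grows in the torrential rainfall regarded as one manifestation of climate warming, a predictive model for rainfall is needed. In dealing with such environmental problems, research has used the generalized Pareto model as one way of specifying the model. This paper uses daily rainfall data for Seoul for every July from 1973 to 2011 and applies the generalized Pareto model to study the distribution of rainfall above a threshold (70 mm). Gamma and inverse gamma prior distributions, or alternatively Jeffreys' noninformative prior, are placed on the parameters; Bayesian posterior predictive distributions are obtained by Gibbs sampling, and the results are compared.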
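One way to read "posterior predictive distribution via Gibbs sampling" is as a Monte Carlo average of the generalized Pareto survival function over posterior draws of the scale and shape parameters. The sketch below assumes such draws are already available as `(sigma, xi)` pairs. The function names and the calling convention are hypothetical, and the sampler that would produce the draws is not shown.

```python
import math
from statistics import fmean


def gpd_survival(excess, sigma, xi):
    """P(Y > excess) for a generalized Pareto excess with scale sigma, shape xi."""
    if excess <= 0.0:
        return 1.0
    if abs(xi) < 1e-12:
        return math.exp(-excess / sigma)  # exponential limit as xi -> 0
    base = 1.0 + xi * excess / sigma
    if base <= 0.0:
        return 0.0  # beyond the upper endpoint when xi < 0
    return base ** (-1.0 / xi)


def predictive_exceedance(level_mm, posterior_draws, threshold_mm=70.0):
    """Posterior predictive P(rain > level_mm | rain > threshold_mm).

    posterior_draws is an iterable of (sigma, xi) pairs, assumed to come
    from a sampler that has already converged.
    """
    excess = level_mm - threshold_mm
    return fmean(gpd_survival(excess, sigma, xi) for sigma, xi in posterior_draws)
```

The result is conditional on the 70 mm threshold being exceeded; multiplying by an estimate of the threshold exceedance rate would be needed for an unconditional probability.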

Estimation of Genetic Parameters for Body Weight in Chinese Simmental Cattle Using Random Regression Model

  • Yang, R.Q.;Ren, H.Y.;Xu, S.Z.;Pan, Y.C.
    • Asian-Australasian Journal of Animal Sciences / Vol. 17, No. 7 / pp. 914-918 / 2004
  • The random regression model methodology was applied to the estimation of genetic parameters for body weights in Chinese Simmental cattle, replacing the traditional multiple trait models. The variance components were estimated using a Gibbs sampling procedure based on Bayesian theory. The data were extracted for Chinese Simmental cattle born from 1980 to 2000 on 6 national breeding farms, and only records from 3 months to 36 months of age were used in this study. A Legendre polynomial of order 3 was defined as the submodel to describe the general pattern of body weight change with months of age in the population. The heritabilities of body weights from 3 months to 36 months varied between 0.31 and 0.48; the heritabilities from 3 months to 12 months decreased slightly with months of age, whereas those from 13 months to 36 months increased with months of age. In particular, the heritabilities at the eighteenth and twenty-fourth months of age were 0.33 and 0.36, respectively, which were slightly greater than the 0.30 and 0.31 obtained from multiple trait models. In addition, the genetic and phenotypic correlations between body weights at different months of age were also obtained using the random regression model.
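In a random regression model of this kind, each age is mapped to a short vector of Legendre polynomial covariates. The sketch below assumes that "order 3" means polynomials up to degree 3 (four covariates) and that ages are standardized from the 3 to 36 month range onto [-1, 1]. The function name, these conventions, and the normalization are assumptions, not details confirmed by the abstract.

```python
import math


def legendre_covariates(age_months, min_age=3.0, max_age=36.0, degree=3):
    """Normalized Legendre covariates for one age, degrees 0..degree."""
    # Standardize age onto [-1, 1], the interval where the polynomials are orthogonal.
    x = -1.0 + 2.0 * (age_months - min_age) / (max_age - min_age)
    polys = [1.0, x]
    # Bonnet recurrence: (k + 1) P_{k+1}(x) = (2k + 1) x P_k(x) - k P_{k-1}(x)
    for k in range(1, degree):
        polys.append(((2 * k + 1) * x * polys[k] - k * polys[k - 1]) / (k + 1))
    # Scale each P_k by sqrt((2k + 1) / 2) so the basis is orthonormal.
    return [math.sqrt((2 * k + 1) / 2.0) * polys[k] for k in range(degree + 1)]
```

Given a covariance matrix K of the random regression coefficients, the genetic variance at a given age is the quadratic form of that age's covariate vector with K, which is how age-specific heritabilities such as those quoted above are obtained.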

Topic Extraction and Classification Method Based on Comment Sets

  • Tan, Xiaodong
    • Journal of Information Processing Systems / Vol. 16, No. 2 / pp. 329-342 / 2020
  • In recent years, emotional text classification has become one of the essential research topics in the field of natural language processing. It has been widely used in the sentiment analysis of reviews of commodities such as hotels and of other commentary corpora. This paper proposes an improved W-LDA (weighted latent Dirichlet allocation) topic model to address the shortcomings of traditional LDA topic models. In the Gibbs sampling of word topics and the calculation of the expected word distribution in the W-LDA topic model, an average weighted value is adopted to prevent topic-related words from being submerged by high-frequency words and thereby to improve the distinctiveness of the topics. The method further integrates the high classification performance of the support vector machine algorithm, based on the extracted high-quality document-topic distribution and topic-word vectors. Finally, an efficient integrated method is constructed for the analysis and extraction of emotional words, topic distribution calculation, and sentiment classification. Tests on real teaching evaluation data and on a public comment test set show that the proposed method has distinct advantages over two other typical algorithms in terms of subject differentiation, classification precision, and F1-measure.

Bayesian Variable Selection in Linear Regression Models with Inequality Constraints on the Coefficients (제한조건이 있는 선형회귀 모형에서의 베이지안 변수선택)

  • 오만숙
    • The Korean Journal of Applied Statistics (응용통계연구) / Vol. 15, No. 1 / pp. 73-84 / 2002
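  • Linear regression models with inequality constraints on the coefficients are among the most commonly treated models in economics. This is because such models restrict the sign of the coefficient of a particular explanatory variable to be either positive or negative, or impose an order relation among the coefficients. This paper considers a Bayesian approach to selecting significant explanatory variables in linear regression models with such inequality constraints. Bayesian variable selection requires computing the posterior probabilities of all possible models, and this paper presents a method for computing these posterior probabilities simultaneously. Specifically, posterior samples of the parameters of the most general model are obtained by Gibbs sampling and then used to compute the posterior probabilities of all possible models; the proposed method is applied to real data.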

Adaptive Reconstruction of Harmonic Time Series Using Point-Jacobian Iteration MAP Estimation and Dynamic Compositing: Simulation Study

  • Lee, Sang-Hoon
    • Korean Journal of Remote Sensing (대한원격탐사학회지) / Vol. 24, No. 1 / pp. 79-89 / 2008
  • Irregular temporal sampling is a common feature of geophysical and biological time series in remote sensing. This study proposes an on-line system for reconstructing observed image series contaminated by noise resulting from mechanical problems or environmental sensing conditions. There is also a high likelihood that, during the data acquisition periods, the target site corresponding to any given pixel is covered by fog or cloud, resulting in bad or missing observations. The surface parameters associated with the land usually depend on the climate, and many physical processes displayed in images sensed from the land therefore exhibit temporal variation with seasonal periodicity. The feedback system proposed in this study reconstructs a sequence of images remotely sensed from a land surface whose physical processes have seasonal periodicity. A harmonic model is used to track seasonal variation through time, and a Gibbs random field (GRF) is used to represent the spatial dependency of digital image processes. The experimental results of this simulation study show the potential of the proposed system to reconstruct image series observed by imperfect sensing technology in environments that are frequently affected by bad weather. This study provides fundamental information on the elements of the proposed system for its proper use in applications.
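The harmonic part of such a system can be illustrated for a single pixel. The sketch below fits a mean level plus one annual cosine and sine term to irregularly spaced observations by ordinary least squares. This is a deliberately simpler substitute for the paper's point-Jacobian iteration MAP estimation with a Gibbs random field, and it ignores spatial dependency altogether. The function name, the 365-day period, and the synthetic data are assumptions.

```python
import math


def fit_first_harmonic(times, values, period=365.0):
    """Least-squares fit of y(t) = a0 + a1*cos(2*pi*t/T) + b1*sin(2*pi*t/T)."""
    rows = [
        (1.0, math.cos(2 * math.pi * t / period), math.sin(2 * math.pi * t / period))
        for t in times
    ]
    # Normal equations (A'A) x = A'y, stored as an augmented 3x4 matrix.
    aug = [
        [sum(r[i] * r[j] for r in rows) for j in range(3)]
        + [sum(r[i] * v for r, v in zip(rows, values))]
        for i in range(3)
    ]
    # Gaussian elimination with partial pivoting.
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, 3):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, 4):
                aug[r][c] -= factor * aug[col][c]
    # Back substitution.
    coef = [0.0, 0.0, 0.0]
    for i in (2, 1, 0):
        tail = sum(aug[i][j] * coef[j] for j in range(i + 1, 3))
        coef[i] = (aug[i][3] - tail) / aug[i][i]
    return tuple(coef)


# Synthetic, noise-free observations on irregular days of the year.
times = [3, 20, 47, 80, 131, 190, 244, 300, 351]
values = [
    10.0 + 4.0 * math.cos(2 * math.pi * t / 365.0) - 2.0 * math.sin(2 * math.pi * t / 365.0)
    for t in times
]
mean_level, cos_coef, sin_coef = fit_first_harmonic(times, values)
```

Because the fit uses the observation times directly, missing acquisitions (for example cloud-covered dates) simply drop out of the sums rather than needing to be interpolated first.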

An Analysis of Indoor Environment Research Trends in Korea Using Topic Modeling: Case Study on Abstracts from the Journal of the Korean Society for Indoor Environment (토픽모델링을 활용한 실내환경 분야 연구동향 파악 : 실내환경학회지 초록 사례연구)

  • 전형진;김도연;한국진;김동우;손승우;이철민
    • Journal of Odor and Indoor Environment (실내환경 및 냄새 학회지) / Vol. 17, No. 4 / pp. 322-329 / 2018
  • The objective of this study is to identify research trends in the field of indoor environment in Korea. We collected 419 papers published in the Journal of the Korean Society for Indoor Environment between 2004 and 2018, and attempted to produce datasets using a topic modeling technique, Latent Dirichlet Allocation (LDA). The topic modeling showed that 8 topics ("VOCs investigation", "Subway environment", "Building thermal environment", "School health", "Building particulate matter", "Asbestos risk", "Radon risk", "Air cleaner and treatment") could be extracted using the Gibbs sampling method. In terms of topic trends, investigation of volatile organic compounds, subway environment, school health, and building particulate matter showed a decreasing tendency, while building thermal environment, asbestos risk, radon risk, air cleaners, and air treatment showed an increasing tendency. The results of this topic modeling could help us to understand current trends related to the indoor environment and provide valuable information for developing future research and policy frameworks.

Bayesian estimates of genetic parameters of non-return rate and success in first insemination in Japanese Black cattle

  • Setiaji, Asep;Arakaki, Daichi;Oikawa, Takuro
    • Animal Bioscience / Vol. 34, No. 7 / pp. 1100-1104 / 2021
  • Objective: The objective of the present study was to estimate the heritability of non-return rate (NRR) and success of first insemination (SFI) using the Bayesian approach with Gibbs sampling. Methods: Heifer traits were denoted as NRR-h and SFI-h, and cow traits as NRR-c and SFI-c. The variance-covariance components were estimated using a threshold model under Bayesian procedures in THRGIBBS1F90. Results: SFI was more relevant for evaluating the success of insemination because a high percentage of animals that showed no return in NRR did not successfully conceive. Estimated heritabilities of NRR and SFI in heifers were 0.032 and 0.039, and the corresponding estimates for cows were 0.020 and 0.027. The model showed low values of Geweke (p-value ranging between 0.012 and 0.018) and a low Monte Carlo chain error, indicating that the posterior sample for the heritability estimates was valid for binary traits. Genetic correlations between the same traits in heifers and cows, estimated with the two-trait threshold model, were low: 0.485 and 0.591 for NRR and SFI, respectively. High genetic correlations were observed between NRR-h and SFI-h (0.922) and between NRR-c and SFI-c (0.954). Conclusion: SFI showed slightly higher heritability than NRR, but the two traits are genetically correlated. Based on this result, both could be used as early indicators for evaluating the capacity of cows to conceive.
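The Geweke check mentioned above compares the mean of an early segment of a chain with the mean of a late segment. The sketch below is a naive version that uses plain sample variances, so it assumes the draws within each segment are roughly independent; the standard diagnostic instead uses spectral density estimates to allow for autocorrelation. The function name, the 10% and 50% segment fractions, and the example chains are assumptions for illustration and do not reproduce the THRGIBBS1F90 output.

```python
from statistics import fmean, variance


def geweke_z_naive(chain, first=0.1, last=0.5):
    """Naive Geweke z-score: early-segment mean versus late-segment mean.

    Uses sample variances in place of spectral density estimates, which
    understates the standard error for autocorrelated chains.
    """
    n = len(chain)
    head = chain[: int(first * n)]
    tail = chain[n - int(last * n):]
    std_error = (variance(head) / len(head) + variance(tail) / len(tail)) ** 0.5
    return (fmean(head) - fmean(tail)) / std_error


# A steadily drifting chain should produce a large |z| ...
drifting_chain = [i / 1000 for i in range(1000)]
z_drift = geweke_z_naive(drifting_chain)

# ... while a chain with the same mean in both segments gives z = 0.
alternating_chain = [float(i % 2) for i in range(1000)]
z_stable = geweke_z_naive(alternating_chain)
```

A |z| well above about 2 suggests the early and late parts of the chain have not settled on the same distribution, which would argue for a longer burn-in.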