• Title/Summary/Keyword: bayesian method


Genetic Contribution of Indigenous Yakutian Cattle to Two Hybrid Populations, Revealed by Microsatellite Variation

  • Li, M.H.;Nogovitsina, E.;Ivanova, Z.;Erhardt, G.;Vilkki, J.;Popov, R.;Ammosov, I.;Kiselyova, T.;Kantanen, J.
    • Asian-Australasian Journal of Animal Sciences / v.18 no.5 / pp.613-619 / 2005
  • Indigenous Yakutian cattle's adaptation to the harshest subarctic conditions makes them a valuable genetic resource for cattle breeding in the Siberian area. Since early last century, crossbreeding between native Yakutian cattle and imported Simmental and Kholmogory breeds has been widely adopted. In this study, variations at 22 polymorphic microsatellite loci in 5 populations of Yakutian, Kholmogory, Simmental, Yakutian-Kholmogory and Yakutian-Simmental cattle were analysed to estimate the genetic contribution of Yakutian cattle to the two hybrid populations. Three statistical approaches were used: the weighted least-squares (WLS) method, which considers all allele frequencies; a recently developed implementation of a Markov chain Monte Carlo (MCMC) method called likelihood-based estimation of admixture (LEA); and a model-based Bayesian admixture analysis method (STRUCTURE). In the population-level admixture analyses, the estimate based on the LEA was consistent with that obtained by the WLS method. Both methods showed that the genetic contribution of the indigenous Yakutian cattle in Yakutian-Kholmogory was small (9.6% by the LEA and 14.2% by the WLS method). In the Yakutian-Simmental population, the genetic contribution of the indigenous Yakutian cattle was considerably higher (62.8% by the LEA and 56.9% by the WLS method). Individual-level admixture analyses using STRUCTURE proved to be more informative than the multidimensional scaling analysis (MDSA) based on individual-based genetic distances. Of the 9 Yakutian-Simmental animals studied, 8 showed admixed origin, whereas of the 14 Yakutian-Kholmogory animals studied only 2 showed Yakutian ancestry (>5%). The mean posterior distributions of the individual admixture coefficient (q) varied greatly among the samples in both hybrid populations. This study revealed a minor existing contribution of the Yakutian cattle in the Yakutian-Kholmogory hybrid population, whereas in the Yakutian-Simmental hybrid population a major genetic contribution of the Yakutian cattle was seen. The results reflect the different crossbreeding patterns used in the development of the two hybrid populations. Additionally, molecular evidence for differences among individual admixture proportions was seen in both hybrid populations, resulting from the stochastic process in crossing over generations.

Comparison of ISO-GUM and Monte Carlo Method for Evaluation of Measurement Uncertainty (몬테카를로 방법과 ISO-GUM 방법의 불확도 평가 결과 비교)

  • Ha, Young-Cheol;Her, Jae-Young;Lee, Seung-Jun;Lee, Kang-Jin
    • Transactions of the Korean Society of Mechanical Engineers B / v.38 no.7 / pp.647-656 / 2014
  • To supplement the ISO-GUM method for the evaluation of measurement uncertainty, a simulation program using the Monte Carlo method (MCM) was developed, and the MCM and GUM methods were compared. The results are as follows: (1) Even when the probability distribution of the measurand is non-normal, MCM provides an accurate coverage interval; (2) Even if a probability distribution obtained by combining a few non-normal distributions appears normal, there are cases in which the actual distribution is not normal, and the non-normality can be determined from the probability distribution of the combined variance; and (3) If type-A standard uncertainties are involved in the evaluation of measurement uncertainty, GUM generally gives an underestimated coverage interval. However, this problem can be solved by the Bayesian evaluation of type-A standard uncertainty. In this case, the effective degrees of freedom for the combined variance are not required in the evaluation of expanded uncertainty, and the appropriate coverage factor for a 95% level of confidence was determined to be 1.96.
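To make the GUM-versus-MCM comparison concrete, here is a minimal sketch of result (1) only, under stated assumptions. It is not the authors' simulation program: the measurement model Y = X1 + X2, the rectangular input distributions on [-1, 1], and the function names `gum_interval` and `mcm_interval` are all hypothetical choices made for illustration.

```python
import math
import random


def gum_interval(half_widths, k=1.96):
    """GUM-style interval: combine standard uncertainties, then expand by k."""
    # A rectangular distribution of half-width a has standard uncertainty a / sqrt(3).
    u_c = math.sqrt(sum((a / math.sqrt(3)) ** 2 for a in half_widths))
    return -k * u_c, k * u_c


def mcm_interval(half_widths, trials=100_000, coverage=0.95, seed=0):
    """Monte Carlo interval: sample the inputs, sort the outputs, read off percentiles."""
    rng = random.Random(seed)
    draws = sorted(
        sum(rng.uniform(-a, a) for a in half_widths) for _ in range(trials)
    )
    tail = (1.0 - coverage) / 2.0
    low = draws[int(tail * trials)]
    high = draws[int((1.0 - tail) * trials) - 1]
    return low, high


# Hypothetical model Y = X1 + X2, each input rectangular on [-1, 1] (assumed values).
gum_low, gum_high = gum_interval([1.0, 1.0])
mcm_low, mcm_high = mcm_interval([1.0, 1.0])
print(f"GUM (k=1.96): [{gum_low:.3f}, {gum_high:.3f}]")
print(f"MCM (95%):    [{mcm_low:.3f}, {mcm_high:.3f}]")
```

For this toy model the output distribution is triangular rather than normal, and its exact 95% half-width is 2(1 - sqrt(0.05)), about 1.553. The normal-theory interval with k = 1.96 is about 1.600, so the sampled interval should come out slightly narrower. The sketch does not cover result (3), which concerns type-A uncertainties.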

Development and application of GLS OD matrix estimation with genetic algorithm for Seoul inner-ringroad (유전알고리즘을 이용한 OD 추정모형의 개발과 적용에 관한 연구 (서울시 내부순환도로를 대상으로))

  • 임용택;김현명;백승걸
    • Journal of Korean Society of Transportation / v.18 no.4 / pp.117-126 / 2000
  • Conventional methods for collecting origin-destination trips have mainly relied on home or roadside interview surveys. However, these methods tend to be costly, labor intensive and time disruptive to the trip makers, so they are not considered suitable for planning applications such as route guidance, arterial management and information provision, which are deployed as parts of Intelligent Transport Systems. Motivated by these problems, more economical ways to estimate origin-destination trip tables have been studied since the late 1970s. Methods that estimate the O-D table from link traffic counts generally include entropy maximizing, maximum likelihood, generalized least squares (GLS), and Bayesian inference estimation. In this paper, we formulate a GLS problem with a user equilibrium constraint for estimating O-D trips and develop a solution algorithm using a genetic algorithm, which is known as a global search technique. To evaluate the method, we apply it to the Seoul inner ring road and compare it with the gradient method proposed by Spiess (1990). From the results we found that the method developed in this paper is superior to the other method.


An estimation method for non-response model using Monte-Carlo expectation-maximization algorithm (Monte-Carlo expectation-maximaization 방법을 이용한 무응답 모형 추정방법)

  • Choi, Boseung;You, Hyeon Sang;Yoon, Yong Hwa
    • Journal of the Korean Data and Information Science Society / v.27 no.3 / pp.587-598 / 2016
  • Non-response is one of the major issues in predicting the outcome of an election ahead of the vote. A variety of non-response imputation methods may be employed to address it, but forecasting results tend to vary according to the method. In this study, in order to improve electoral forecasts, we studied a model-based method of non-response imputation that applies the Monte Carlo Expectation Maximization (MCEM) algorithm introduced by Wei and Tanner (1990). The MCEM algorithm using maximum likelihood estimates (MLEs) is applied to solve the boundary solution problem under the non-ignorable non-response mechanism. We performed simulation studies to compare estimation performance among MCEM, maximum likelihood estimation, and the Bayesian estimation method. The results of the simulation studies showed that the MCEM method can be a reasonable candidate for non-response model estimation. We also applied the MCEM method to the Korean presidential election exit poll data of 2012 and investigated prediction performance using the modified within precinct error (MWPE) criterion (Bautista et al., 2007).

Performance Improvement of Collaborative Filtering System Using Associative User's Clustering Analysis for the Recalculation of Preference and Representative Attribute-Neighborhood (선호도 재계산을 위한 연관 사용자 군집 분석과 Representative Attribute-Neighborhood를 이용한 협력적 필터링 시스템의 성능향상)

  • Jung, Kyung-Yong;Kim, Jin-Su;Kim, Tae-Yong;Lee, Jung-Hyun
    • The KIPS Transactions:PartB / v.10B no.3 / pp.287-296 / 2003
  • There has been much research on collaborative filtering techniques in recommender systems. However, these studies have suffered from the first-rater problem and the sparsity problem. The main purpose of this paper is to solve these problems. We suggest a method for predicting user preference that uses Bayesian estimated values and associative user clustering for the recalculation of preference. In addition, to compensate for the shortcoming that item attributes are not taken into account, we use the Representative Attribute-Neighborhood method, which makes predictions by finding similar neighbors after extracting the representative attributes that most affect preference. We improved efficiency by adding associative user clustering analysis to the collaborative filtering algorithm in order to calculate the preference for a specific item within the cluster's item vector. To address the sparsity and first-rater problems, associative users are clustered by genre using the Association Rule Hypergraph Partitioning algorithm. New users are classified into one of these genres by a Naive Bayes classifier. In addition, to obtain the similarity between users belonging to the classified genre and new users, this paper assigns different estimated values to the items each user has rated, through Naive Bayes learning. Applying the preferences that have been given estimated values to the Pearson correlation coefficient yields higher accuracy, because errors caused by missing values are reduced. We evaluate our method on a large collaborative filtering database of user ratings, and it significantly outperforms the previously proposed method.
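The Pearson correlation step mentioned in this abstract can be sketched as follows. This is only the plain user-to-user similarity over co-rated items, not the paper's full method: the Naive Bayes estimation of missing ratings and the genre clustering are omitted, and the function name `pearson_similarity` and the sample ratings are hypothetical.

```python
import math


def pearson_similarity(ratings_a, ratings_b):
    """Pearson correlation between two users over the items both have rated."""
    common = sorted(set(ratings_a) & set(ratings_b))
    if len(common) < 2:
        return 0.0  # not enough overlap to measure a correlation
    mean_a = sum(ratings_a[i] for i in common) / len(common)
    mean_b = sum(ratings_b[i] for i in common) / len(common)
    cov = sum((ratings_a[i] - mean_a) * (ratings_b[i] - mean_b) for i in common)
    var_a = sum((ratings_a[i] - mean_a) ** 2 for i in common)
    var_b = sum((ratings_b[i] - mean_b) ** 2 for i in common)
    if var_a == 0 or var_b == 0:
        return 0.0  # a user with constant ratings has no defined correlation
    return cov / math.sqrt(var_a * var_b)


# Hypothetical ratings on a 1-5 scale, keyed by item id.
alice = {"m1": 5, "m2": 3, "m3": 4}
bob = {"m1": 4, "m2": 2, "m3": 3, "m4": 5}
carol = {"m1": 1, "m2": 5, "m3": 3}

sim_alice_bob = pearson_similarity(alice, bob)
sim_alice_carol = pearson_similarity(alice, carol)
print(sim_alice_bob, sim_alice_carol)
```

Items rated by only one of the two users are simply dropped here, which is the missing-value weakness the abstract aims to reduce by filling in estimated ratings before computing the correlation.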

An Active Learning-based Method for Composing Training Document Set in Bayesian Text Classification Systems (베이지언 문서분류시스템을 위한 능동적 학습 기반의 학습문서집합 구성방법)

  • 김제욱;김한준;이상구
    • Journal of KIISE:Software and Applications / v.29 no.12 / pp.966-978 / 2002
  • There are two important problems in improving text classification systems based on a machine learning approach. The first one, called the "selection problem", is how to select a minimum number of informative documents from a given document collection. The second one, called the "composition problem", is how to reorganize selected training documents so that they fit the adopted learning method. The former problem is addressed in "active learning" algorithms, and the latter is discussed in "boosting" algorithms. This paper proposes a new learning method, called AdaBUS, which proactively solves the above problems in the context of Naive Bayes classification systems. The proposed method constructs a more accurate classification hypothesis by increasing the variance in the "weak" hypotheses that determine the final classification hypothesis. Consequently, the proposed algorithm yields a perturbation effect that makes the boosting algorithm work properly. Through an empirical experiment using the Reuters-21578 document collection, we show that the AdaBUS algorithm improves the Naive Bayes-based classification system more significantly than other conventional learning methods.

Comparison of Dynamic Origin Destination Demand Estimation Models in Highway Network (고속도로 네트워크에서 동적기종점수요 추정기법 비교연구)

  • 이승재;조범철;김종형
    • Journal of Korean Society of Transportation / v.18 no.5 / pp.83-97 / 2000
  • Traffic management schemes based on traffic signal control and information provision can be effective when link-level data and trip-level data are used simultaneously in the analysis procedures. However, trip-level data, such as origin, destination and departure time, cannot be obtained directly through existing surveillance systems, so they must be estimated from link-level data, which can be obtained easily. The objective of this study is therefore to develop a model that estimates O-D demand in real time using only the link flows in a highway network. The methodological approaches in this study are the Kalman filter, the least-squares method and the normalized least-squares method. The Kalman filter is developed on the basis of the Bayesian update. The normalized least-squares method is developed on the basis of the least-squares method and the natural constraint equation. These three models were tested using two kinds of simulated data: one has two abruptly changing patterns in traffic flow rates, and the other is 24-hour data with three peak times in a day. Among these models, the Kalman filter produced more accurate and adaptive results than the others. It therefore seems that this model could be used in traffic demand management and control, travel time forecasting, dynamic assignment, and so forth.
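The statement that the Kalman filter rests on the Bayesian update can be illustrated with a scalar sketch. This is not the model from the paper: it assumes a single O-D demand x observed through one link count z = h * x + noise, where h (the share of trips using the link), the noise variance r, and all numbers below are hypothetical.

```python
def kalman_step(x_prior, p_prior, z, h, r, q=0.0):
    """One predict/update cycle of a scalar Kalman filter.

    x_prior, p_prior: prior mean and variance of the O-D demand.
    z: observed link count; h: assumed share of the demand using that link.
    r: observation noise variance; q: process noise variance (random-walk state).
    """
    # Predict: under a random-walk model the mean is unchanged and variance grows by q.
    p_pred = p_prior + q
    # Update: condition the Gaussian prior on the Gaussian observation (Bayes' rule).
    gain = p_pred * h / (h * h * p_pred + r)
    x_post = x_prior + gain * (z - h * x_prior)
    p_post = (1.0 - gain * h) * p_pred
    return x_post, p_post


# Hypothetical scenario: true demand 100 trips, 60% of which cross the counted link.
x, p = 50.0, 400.0  # deliberately poor initial guess with a wide prior
for count in [60.0] * 20:  # noise-free counts, to keep the example deterministic
    x, p = kalman_step(x, p, z=count, h=0.6, r=25.0)
print(f"estimated demand: {x:.2f}, variance: {p:.2f}")
```

Each update is the posterior of a Gaussian prior combined with a Gaussian likelihood, so the estimate moves toward the demand implied by the counts while the variance shrinks. A real O-D estimator would use vector states and an assignment matrix in place of the scalar h.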


Continual Reassessment Method in Phase I Clinical Trials for Leukemia Patients (백혈병환자 대상의 제1상임상시험 연속재평가방법)

  • Lee, Joo-Hyoung;Song, Hae-Hiang
    • Communications for Statistical Applications and Methods / v.18 no.5 / pp.581-594 / 2011
  • The traditional 3+3 standard design and the model-based Bayesian continual reassessment method (CRM) are commonly used in Phase I clinical trials to identify the maximum tolerated dose (MTD) of a new drug. In this paper we review clinical examples of Phase I trials that were carried out in patients with refractory or relapsed leukemia and myelodysplastic syndrome. The recently proposed 3+1+1 design and rolling-6 design can shorten the trial duration when very slow patient accrual under a simple 3+3 standard design may result in the untimely termination of trials. Overly conservative approaches to determining the dose levels in Phase I clinical trials can leave clinical investigators unable to accurately determine the MTD. When determining future patient doses, designs that use a time-to-event CRM can incorporate late toxicities by accounting for the proportion of the observation period of each enrolled patient. With the CRM design, simulations under different scenarios during the trial are important in detecting under- or over-estimation of the initial estimate of the dose-limiting toxicity rate for each dose level. We present the advantages and drawbacks of the designs used in Phase I clinical trials for leukemia patients.
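As a rough sketch of how a Bayesian CRM picks the next dose, the code below uses the one-parameter power model p_i(a) = skeleton_i ** exp(a) with a normal prior on a, evaluated on a grid. It illustrates the plain CRM only, not the time-to-event variant or any design reviewed in the paper; the skeleton, target rate, prior standard deviation, and the function name `crm_recommend` are assumptions made for illustration.

```python
import math


def crm_recommend(skeleton, doses_given, toxicities, target=0.25, prior_sd=1.16):
    """Return (recommended dose index, posterior mean toxicity per dose).

    skeleton: prior guesses of the toxicity probability at each dose level.
    doses_given: dose index assigned to each treated patient.
    toxicities: 1 if that patient had a dose-limiting toxicity, else 0.
    """
    grid = [-4.0 + 8.0 * i / 400 for i in range(401)]  # grid over the parameter a
    weights = []
    for a in grid:
        log_w = -0.5 * (a / prior_sd) ** 2  # log of the normal prior, up to a constant
        for dose, tox in zip(doses_given, toxicities):
            p = skeleton[dose] ** math.exp(a)  # power-model toxicity probability
            log_w += math.log(p) if tox else math.log(1.0 - p)
        weights.append(math.exp(log_w))
    total = sum(weights)
    post_tox = [
        sum(w * s ** math.exp(a) for a, w in zip(grid, weights)) / total
        for s in skeleton
    ]
    # Recommend the dose whose posterior mean toxicity is closest to the target.
    best = min(range(len(skeleton)), key=lambda i: abs(post_tox[i] - target))
    return best, post_tox


skeleton = [0.05, 0.12, 0.25, 0.40, 0.55]  # hypothetical prior toxicity guesses
dose_after_tox, tox_profile = crm_recommend(skeleton, [2, 2, 2], [1, 1, 1])
dose_after_safe, safe_profile = crm_recommend(skeleton, [2, 2, 2], [0, 0, 0])
print("after 3/3 toxicities at dose 2:", dose_after_tox)
print("after 0/3 toxicities at dose 2:", dose_after_safe)
```

Three toxicities at the middle dose push the recommendation down, while three tolerated doses push it up, which is the model-based reassessment that distinguishes the CRM from rule-based designs such as 3+3.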

Semantic Topic Selection Method of Document for Classification (문서분류를 위한 의미적 주제선정방법)

  • Ko, Kwang-Sup;Kim, Pan-Koo;Lee, Chang-Hoon;Hwang, Myung-Gwon
    • Journal of the Korea Institute of Information and Communication Engineering / v.11 no.1 / pp.163-172 / 2007
  • The web, as a global network, includes text documents, video, sound, and other media, and connects distributed information using links. As the web has developed, it has accumulated abundant information, most of which is text-based documents. Most users use the web to retrieve the information they want. Numerous studies have therefore addressed text document retrieval using methods such as probability, statistics, vector similarity, and Bayesian approaches. These studies, however, could not consider both the subject and the semantics of documents. As a result, users have to search again manually. Korean documents are especially hard to find because research on Korean document classification is insufficient. To overcome these problems, we propose a Korean document classification method for semantic retrieval. The method first extracts the TF value and RV value of the concepts included in a document and maps them onto U-WIN, a Korean vocabulary dictionary, to select the topic of the document. The method can classify documents semantically, and its efficiency was shown through experiments.

Enhancement of Buckling Characteristics for Composite Square Tube by Load Type Analysis (하중유형 분석을 통한 좌굴에 강한 복합재료 사각관 설계에 관한 연구)

  • Seokwoo Ham;Seungmin Ji;Seong S. Cheon
    • Composites Research / v.36 no.1 / pp.53-58 / 2023
  • The PIC design method assigns a different stacking sequence to each shell element based on the results of a preliminary FE analysis. In a previous study, machine learning was applied to the PIC design method to assign the regions efficiently, with the training data labeled by classifying each region as tension, compression, or shear according to the preliminary FE analysis results. However, because buckling was not considered, the regions could not be classified into the appropriate loading type when buckling occurred. The present study proposes PIC-NTL (PIC design using a novel technique for analyzing load type), which applies a novel load-type analysis technique that considers buckling to the conventional PIC design. The stress triaxiality of each ply was analyzed for the buckling analysis, and a representative loading type was designated from the loading types determined within each decision area, where each element is divided into two decision areas of equal size in the thickness direction. The inputs and labels of the training data consisted of the element coordinates and the representative loading type of each decision area, respectively. Machine learning models were trained on these data, and the hyperparameters that affect model performance were tuned to optimal values using a Bayesian algorithm. Among the tuned machine learning models, the SVM model showed the highest performance. The most effective stacking sequences were mapped onto the PIC tube based on the trained SVM model. FE analysis results show that the design method proposed in this study provides superior external load resistance and energy absorption compared with the previous study.