• Title/Summary/Keyword: Bayes method

Search Result 365, Processing Time 0.024 seconds

An Efficient Algorithm for NaiveBayes with Matrix Transposition (행렬 전치를 이용한 효율적인 NaiveBayes 알고리즘)

  • Lee, Jae-Moon
    • The KIPS Transactions:PartB
    • /
    • v.11B no.1
    • /
    • pp.117-124
    • /
    • 2004
  • This paper proposes an efficient algorithm of NaiveBayes without loss of its accuracy. The proposed method uses the transposition of category vectors, and minimizes the computation of the probability of NaiveBayes. The proposed method was implemented on the existing framework of the text categorization, so called, AI::Categorizer and it was compared with the conventional NaiveBayes with the well-known data, Router-21578. The comparisons show that the proposed method outperforms NaiveBayes about two times with respect to the executing time.

How to Improve Classical Estimators via Linear Bayes Method?

  • Wang, Lichun
    • Communications for Statistical Applications and Methods
    • /
    • v.22 no.6
    • /
    • pp.531-542
    • /
    • 2015
  • In this survey, we use the normal linear model to demonstrate the use of the linear Bayes method. The superiorities of linear Bayes estimator (LBE) over the classical UMVUE and MLE are established in terms of the mean squared error matrix (MSEM) criterion. Compared with the usual Bayes estimator (obtained by the MCMC method) the proposed LBE is simple and easy to use with numerical results presented to illustrate its performance. We also examine the applications of linear Bayes method to some other distributions including two-parameter exponential family, uniform distribution and inverse Gaussian distribution, and finally make some remarks.

Default Bayes Factors for Testing the Equality of Poisson Population Means

  • Son, Young Sook;Kim, Seong W.
    • Communications for Statistical Applications and Methods
    • /
    • v.7 no.2
    • /
    • pp.549-562
    • /
    • 2000
  • Default Bayes factors are computed to test the equality of one Poisson population mean and the equality of two independent Possion population means. As default priors are assumed Jeffreys priors, noninformative improper priors, and default Bayes factors such as three intrinsic Bayes factors of Berger and Pericchi(1996, 1998), the arithmetic, the median, and the geometric intrinsic Bayes factor, and the factional Bayes factor of O'Hagan(1995) are computed. The testing results by each default Bayes factor are compared with those by the classical method in the simulation study.

  • PDF

A Novel Posterior Probability Estimation Method for Multi-label Naive Bayes Classification

  • Kim, Hae-Cheon;Lee, Jaesung
    • Journal of the Korea Society of Computer and Information
    • /
    • v.23 no.6
    • /
    • pp.1-7
    • /
    • 2018
  • A multi-label classification is to find multiple labels associated with the input pattern. Multi-label classification can be achieved by extending conventional single-label classification. Common extension techniques are known as Binary relevance, Label powerset, and Classifier chains. However, most of the extended multi-label naive bayes classifier has not been able to accurately estimate posterior probabilities because it does not reflect the label dependency. And the remaining extended multi-label naive bayes classifier has a problem that it is unstable to estimate posterior probability according to the label selection order. To estimate posterior probability well, we propose a new posterior probability estimation method that reflects the probability between all labels and labels efficiently. The proposed method reflects the correlation between labels. And we have confirmed through experiments that the extended multi-label naive bayes classifier using the proposed method has higher accuracy then the existing multi-label naive bayes classifiers.

A Bayes Criterion for Testing Homogeneity of Two Multivariate Normal Covariances

  • Kim, Hea-Jung
    • Journal of the Korean Statistical Society
    • /
    • v.27 no.1
    • /
    • pp.11-23
    • /
    • 1998
  • A Bayes criterion for testing the equality of covariance matrices of two multivariate normal distributions is proposed and studied. Development of the criterion invloves calculation of Bayes factor using the imaginary sample method introduced by Spiegelhalter and Smith (1982). The criterion is designed to develop a Bayesian test criterion, so that it provides an alternative test criterion to those based upon asymptotic sampling theory (such as Box's M test criterion). For the constructed criterion, numerical studies demonstrate routine application and give comparisons with the traditional test criteria.

  • PDF

A Study on the Posterior Density under the Bayes-empirical Bayes Models

  • Sohn, Joong-K.Sohn;Kim, Heon-Joo-Kim
    • Communications for Statistical Applications and Methods
    • /
    • v.3 no.3
    • /
    • pp.215-223
    • /
    • 1996
  • By using Tukey's generalized lambda distribution, appoximate posterior density is derived under the Bayes-empirical Bayes model. The sensitivity of posterior distribution to the hyperprior distribution is examined by using Tukey's generalized lambda distriburion which approximate many well-knmown distributions. Based upon Monte Varlo simulation studies it can be said that posterior distribution is sensitive to the cariance of the prior distribution and to the symmetry of the hyperprior distribution. Also posterior distribution is approximately obtained by using the following methods : Lindley method, Laplace method and Gibbs sampler method.

  • PDF

A Study on the Fuzzy-Bayes Method

  • Kyeoi, Tae-Hwa;Sohn, Joong-Kweon
    • Journal of the Korean Data and Information Science Society
    • /
    • v.15 no.1
    • /
    • pp.173-181
    • /
    • 2004
  • In this paper, we study and examine the sensitivity of the fuzzy-Bayes method whose properties are relatively not known much. Two fuzzy conditions and two actions are considered. Also some normal distributions and uniform distributions are assumed as a prior distribution for a parameter in the fuzzy-Bayes method.

  • PDF

Comparison of Sentiment Analysis from Large Twitter Datasets by Naïve Bayes and Natural Language Processing Methods

  • Back, Bong-Hyun;Ha, Il-Kyu
    • Journal of information and communication convergence engineering
    • /
    • v.17 no.4
    • /
    • pp.239-245
    • /
    • 2019
  • Recently, effort to obtain various information from the vast amount of social network services (SNS) big data generated in daily life has expanded. SNS big data comprise sentences classified as unstructured data, which complicates data processing. As the amount of processing increases, a rapid processing technique is required to extract valuable information from SNS big data. We herein propose a system that can extract human sentiment information from vast amounts of SNS unstructured big data using the naïve Bayes algorithm and natural language processing (NLP). Furthermore, we analyze the effectiveness of the proposed method through various experiments. Based on sentiment accuracy analysis, experimental results showed that the machine learning method using the naïve Bayes algorithm afforded a 63.5% accuracy, which was lower than that yielded by the NLP method. However, based on data processing speed analysis, the machine learning method by the naïve Bayes algorithm demonstrated a processing performance that was approximately 5.4 times higher than that by the NLP method.

Development and evaluation of dam inflow prediction method based on Bayesian method (베이지안 기법 기반의 댐 예측유입량 산정기법 개발 및 평가)

  • Kim, Seon-Ho;So, Jae-Min;Kang, Shin-Uk;Bae, Deg-Hyo
    • Journal of Korea Water Resources Association
    • /
    • v.50 no.7
    • /
    • pp.489-502
    • /
    • 2017
  • The objective of this study is to propose and evaluate the BAYES-ESP, which is a dam inflow prediction method based on Ensemble Streamflow Prediction method (ESP) and Bayesian theory. ABCD rainfall-runoff model was used to predict monthly dam inflow. Monthly meteorological data collected from KMA, MOLIT and K-water and dam inflow data collected from K-water were used for the model calibration and verification. To estimate the performance of ABCD model, ESP and BAYES-ESP method, time series analysis and skill score (SS) during 1986~2015 were used. In time series analysis monthly ESP dam inflow prediction values were nearly similar for every years, particularly less accurate in wet and dry years. The proposed BAYES-ESP improved the performance of ESP, especially in wet year. The SS was used for quantitative analysis of monthly mean of observed dam inflows, predicted values from ESP and BAYES-ESP. The results indicated that the SS values of ESP were relatively high in January, February and March but negative values in the other months. It also showed that the BAYES-ESP improved ESP when the values from ESP and observation have a relatively apparent linear relationship. We concluded that the existing ESP method has a limitation to predict dam inflow in Korea due to the seasonality of precipitation pattern and the proposed BAYES-ESP is meaningful for improving dam inflow prediction accuracy of ESP.

Logit Confidence Intervals Using Pseudo-Bayes Estimators for the Common Odds Ratio in 2 X 2 X K Contingency Tables

  • Kim, Donguk;Chun, Eunhee
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.2
    • /
    • pp.479-496
    • /
    • 2003
  • We investigate logit confidence intervals for the odds ratio based on the delta method. These intervals are constructed using pseudo-Bayes estimators. The Gart method and Agresti method smooth the observed counts toward the model of equiprobability and independence, respectively. We obtain better coverage probability by smoothing the observed counts toward the pseudo-Bayes estimators in 2$\times$2 table. We also improve legit confidence intervals in 2$\times$2$\times$K tables by generalizing these ideas. Utilizing pseudo-Bayes estimators, we obtain better coverage probability by smoothing the observed counts toward the conditional independence model, no three-factor interaction model and saturated model in 2$\times$2$\times$K tables.