• 제목/요약/키워드: Symmetrical conditional probability

검색결과 2건 처리시간 0.016초

대칭 조건부 확률과 TF-IDF 기반 텍스트 분류를 위한 N-gram 특질 선택 (N-gram Feature Selection for Text Classification Based on Symmetrical Conditional Probability and TF-IDF)

  • 최우식;김성범
    • 대한산업공학회지
    • /
    • 제41권4호
    • /
    • pp.381-388
    • /
    • 2015
  • The rapid growth of the World Wide Web and online information services has generated and made accessible a huge number of text documents. To analyze texts, selecting important keywords is an essential step. In this paper, we propose a feature selection method that combines a term frequency-inverse document frequency technique and symmetrical conditional probability. The proposed method can identify features with N-gram, the sequential multiword. The effectiveness of the proposed method is demonstrated through a real text data from the machine learning repository, University of California, Irvine.

COMPARISON STUDY OF BIVARIATE LAPLACE DISTRIBUTIONS WITH THE SAME MARGINAL DISTRIBUTION

  • Hong, Chong-Sun;Hong, Sung-Sick
    • Journal of the Korean Statistical Society
    • /
    • 제33권1호
    • /
    • pp.107-128
    • /
    • 2004
  • Bivariate Laplace distributions for which both marginal distributions and Laplace are discussed. Three kinds of bivariate Laplace distributions which are extended bivariate exponential distributions of Gumbel (1960) are introduced in this paper. These symmetrical distributions are compared with asymmetrical distributions of Kotz et al. (2000). Their probability density functions, cumulative distribution functions are derived. Conditional skewnesses and kurtoses are also defined. Their correlation coefficients are calculated and compared with others. We proposed bivariate random vector generating methods whose distributions are bivariate Laplace. With sample means and medians obtained from generated random vectors, variance and covariance matrices of means and medians are calculated and discussed with those of bivariate normal distribution.