• Title/Summary/Keyword: Statistics Classification

Search Result 867, Processing Time 0.019 seconds

On the Use of Modified Adaptive Nearest Neighbors for Classification (수정된 적응 최근접 방법을 활용한 판별분류방법에 대한 연구)

  • Maeng, Jin-Woo;Bang, Sung-Wan;Jhun, Myoung-Shic
    • The Korean Journal of Applied Statistics
    • /
    • v.23 no.6
    • /
    • pp.1093-1102
    • /
    • 2010
  • Even though the k-Nearest Neighbors Classification(KNNC) is one of the popular non-parametric classification methods, it does not consider the local features and class information for each observation. In order to overcome such limitations, several methods have been developed such as Adaptive Nearest Neighbors Classification(ANNC) and Modified k-Nearest Neighbors Classification(MKNNC). In this paper, we propose the Modified Adaptive Nearest Neighbors Classification(MANNC) that employs the advantages of both the ANNC and MKNNC. Through a real data analysis and a simulation study, we show that the proposed MANNC outperforms other methods in terms of classification accuracy.

Classification of Microarray Gene Expression Data by MultiBlock Dimension Reduction

  • Oh, Mi-Ra;Kim, Seo-Young;Kim, Kyung-Sook;Baek, Jang-Sun;Son, Young-Sook
    • Communications for Statistical Applications and Methods
    • /
    • v.13 no.3
    • /
    • pp.567-576
    • /
    • 2006
  • In this paper, we applied the multiblock dimension reduction methods to the classification of tumor based on microarray gene expressions data. This procedure involves clustering selected genes, multiblock dimension reduction and classification using linear discrimination analysis and quadratic discrimination analysis.

A Musical Genre Classification Method Based on the Octave-Band Order Statistics (옥타브밴드 순서 통계량에 기반한 음악 장르 분류)

  • Seo, Jin Soo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.33 no.1
    • /
    • pp.81-86
    • /
    • 2014
  • This paper presents a study on the effectiveness of using the spectral and the temporal octave-band order statistics for musical genre classification. In order to represent the relative disposition of the harmonic and non-harmonic components, we utilize the octave-band order statistics of power spectral distribution. Experiments on the widely used two music datasets were performed; the results show that the octave-band order statistics improve genre classification accuracy by 2.61 % for one dataset and 8.9 % for another dataset compared with the mel-frequency cepstral coefficients and the octave-band spectral contrast. Experimental results show that the octave-band order statistics are promising for musical genre classification.

Suggestions for KDC Improvement According to Academic Characteristics of Statistics (통계학의 학문적 특성에 따른 KDC 문헌분류의 개선방안)

  • Park, JaeHyeok;Kim, BeeYeon
    • Journal of Korean Library and Information Science Society
    • /
    • v.44 no.2
    • /
    • pp.399-422
    • /
    • 2013
  • This study suggests some ideas for improvement of mixing classification and illogical subdivisions arrangement of Statistics in Social Science and Mathematical Statistics in Natural Science on KDC. We investigate the characteristics, educational system, and curriculum of Statistics in Korea. Besides, we compare and analyze classification systems such as KDC, DDC, LCC, NDC and Research Fields Code by National Research Foundation of Korea. As a result, Statistics in Social Science is relocated and integrated with the subfield of Natural Science according to the academic background. Existing social statistics topics are subdivided into statistical research methods complementing social science research methods. The heading 'Probabilities, Statistical mathematics' in Natural Science is changed to 'Statistics', and the subdivisions are expanded and revised.

Bivariate ROC Curve and Optimal Classification Function

  • Hong, C.S.;Jeong, J.A.
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.4
    • /
    • pp.629-638
    • /
    • 2012
  • We propose some methods to obtain optimal thresholds and classification functions by using various cutoff criterion based on the bivariate ROC curve that represents bivariate cumulative distribution functions. The false positive rate and false negative rate are calculated with these classification functions for bivariate normal distributions.

Classification via principal differential analysis

  • Jang, Eunseong;Lim, Yaeji
    • Communications for Statistical Applications and Methods
    • /
    • v.28 no.2
    • /
    • pp.135-150
    • /
    • 2021
  • We propose principal differential analysis based classification methods. Computations of squared multiple correlation function (RSQ) and principal differential analysis (PDA) scores are reviewed; in addition, we combine principal differential analysis results with the logistic regression for binary classification. In the numerical study, we compare the principal differential analysis based classification methods with functional principal component analysis based classification. Various scenarios are considered in a simulation study, and principal differential analysis based classification methods classify the functional data well. Gene expression data is considered for real data analysis. We observe that the PDA score based method also performs well.

Development of Cause Classification Method for Improving Reliability of Electrical Fire Statistics (전기화재 조사 및 통계의 신뢰성 향상을 위한 원인분류방법의 개발)

  • Jeon, Jeong-Chay;Jeon, Hyun-Jae;Lee, Sang-Ick;Yoo, Jae-Geun
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.8 no.3
    • /
    • pp.466-471
    • /
    • 2007
  • Electrical fires form over 30 percent of fires, but the study on the reliability of electrical fire statistics is not performed. Electrical roe occupancy was very high due to investigating and classifying fires, which is not directly continuous with electrical cause, as electrical fire because insufficiency of cause classification method or system, and the problems of the reliability of electrical fire statistics were presented. So, the reliability of electrical fire statistics must be guaranteed by improvement of the existing cause classification method of electrical fire. This paper analyzed the problems of electrical rue statistics by the existing cause classification method of electrical fire and presented the new method to classify causes of electrical fire.

  • PDF

Binary classification on compositional data

  • Joo, Jae Yun;Lee, Seokho
    • Communications for Statistical Applications and Methods
    • /
    • v.28 no.1
    • /
    • pp.89-97
    • /
    • 2021
  • Due to boundedness and sum constraint, compositional data are often transformed by logratio transformation and their transformed data are put into traditional binary classification or discriminant analysis. However, it may be problematic to directly apply traditional multivariate approaches to the transformed data because class distributions are not Gaussian and Bayes decision boundary are not polynomial on the transformed space. In this study, we propose to use flexible classification approaches to transformed data for compositional data classification. Empirical studies using synthetic and real examples demonstrate that flexible approaches outperform traditional multivariate classification or discriminant analysis.

A comparative study of feature screening methods for ultrahigh dimensional multiclass classification (초고차원 다범주분류를 위한 변수선별 방법 비교 연구)

  • Lee, Kyungeun;Kim, Kyoung Hee;Shin, Seung Jun
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.5
    • /
    • pp.793-808
    • /
    • 2017
  • We compare various variable screening methods on multiclass classification problems when the data is ultrahigh-dimensional. Two different approaches were considered: (1) pairwise extension from binary classification via one versus one or one versus rest comparisons and (2) direct classification of multiclass responses. We conducted extensive simulation studies under different conditions: heavy tailed explanatory variables, correlated signal and noise variables, correlated joint distributions but uncorrelated marginals, and unbalanced response variables. We then analyzed real data to examine the performance of the methods. The results showed that model-free methods perform better for multiclass classification problems as well as binary ones.

Object-oriented Classification and QuickBird Multi-spectral Imagery in Forest Density Mapping

  • Jayakumar, S.;Ramachandran, A.;Lee, Jung-Bin;Heo, Joon
    • Korean Journal of Remote Sensing
    • /
    • v.23 no.3
    • /
    • pp.153-160
    • /
    • 2007
  • Forest cover density studies using high resolution satellite data and object oriented classification are limited in India. This article focuses on the potential use of QuickBird satellite data and object oriented classification in forest density mapping. In this study, the high-resolution satellite data was classified based on NDVI/pixel based and object oriented classification methods and results were compared. The QuickBird satellite data was found to be suitable in forest density mapping. Object oriented classification was superior than the NDVI/pixel based classification. The Object oriented classification method classified all the density classes of forest (dense, open, degraded and bare soil) with higher producer and user accuracies and with more kappa statistics value compared to pixel based method. The overall classification accuracy and Kappa statistics values of the object oriented classification were 83.33% and 0.77 respectively, which were higher than the pixel based classification (68%, 0.56 respectively). According to the Z statistics, the results of these two classifications were significantly different at 95% confidence level.