• Title/Summary/Keyword: 통계 분류

Search Result 2,155, Processing Time 0.026 seconds

A Technique for Classifying Requirement/Stakeholder and Generating Information for Negotiation Using Kano Model and Statistical Method (Kano 모델과 통계 기법을 이용한 요구사항 분류 및 협상을 위한 정보 생성 기법)

  • Byun, Jung-Won;Kim, Ji-Hyeok;Rhew, Sung-Yul;Hwang, Man-Soo
    • Journal of KIISE:Software and Applications
    • /
    • v.37 no.3
    • /
    • pp.161-169
    • /
    • 2010
  • The requirement elicitation is the task of eliciting requirements from needs of stakeholders, and preparing for information for negotiation. However, there are methods for gathering needs, but there is no specific method for classifying them, generating information for negotiation. Therefore, To solve the problems, this paper proposes a method to classify requirement and to generate information for negotiation. The proposed methods use Kano model, statistical technique, and identifying relationship between requirements and problems within scope. Finally, we validate the proposed method on simulations, Rough Set Theory, and case study of model.

AROC Curve and Optimal Threshold (AROC 곡선과 최적분류점)

  • Hong, Chong-Sun;Lee, Hee-Jung
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.1
    • /
    • pp.185-191
    • /
    • 2011
  • In the credit evaluation study with the assumption of mixture distributions, the ROC curve is a useful method to explore the discriminatory power of default and non-default borrowers. The AROC curve is an adjusted ROC curve that can be identified with the corresponding score and is mathematically analyzed in this work. We obtain patterns of this curve by applying normal distributions. Moreover, the relationship between the AROC curve and many classification accuracy statistics are explored to find the optimal threshold. In the case of equivalent variances of two distributions, we obtain that the local minimum of the AROC curve is estimated at the optimal threshold to maximize certain classification accuracies.

Light weight architecture for acoustic scene classification (음향 장면 분류를 위한 경량화 모형 연구)

  • Lim, Soyoung;Kwak, Il-Youp
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.6
    • /
    • pp.979-993
    • /
    • 2021
  • Acoustic scene classification (ASC) categorizes an audio file based on the environment in which it has been recorded. This has long been studied in the detection and classification of acoustic scenes and events (DCASE). In this study, we considered the problem that ASC faces in real-world applications that the model used should have low-complexity. We compared several models that apply light-weight techniques. First, a base CNN model was proposed using log mel-spectrogram, deltas, and delta-deltas features. Second, depthwise separable convolution, linear bottleneck inverted residual block was applied to the convolutional layer, and Quantization was applied to the models to develop a low-complexity model. The model considering low-complexity was similar or slightly inferior to the performance of the base model, but the model size was significantly reduced from 503 KB to 42.76 KB.

Bayesian logit models with auxiliary mixture sampling for analyzing diabetes diagnosis data (보조 혼합 샘플링을 이용한 베이지안 로지스틱 회귀모형 : 당뇨병 자료에 적용 및 분류에서의 성능 비교)

  • Rhee, Eun Hee;Hwang, Beom Seuk
    • The Korean Journal of Applied Statistics
    • /
    • v.35 no.1
    • /
    • pp.131-146
    • /
    • 2022
  • Logit models are commonly used to predicting and classifying categorical response variables. Most Bayesian approaches to logit models are implemented based on the Metropolis-Hastings algorithm. However, the algorithm has disadvantages of slow convergence and difficulty in ensuring adequacy for the proposal distribution. Therefore, we use auxiliary mixture sampler proposed by Frühwirth-Schnatter and Frühwirth (2007) to estimate logit models. This method introduces two sequences of auxiliary latent variables to make logit models satisfy normality and linearity. As a result, the method leads that logit model can be easily implemented by Gibbs sampling. We applied the proposed method to diabetes data from the Community Health Survey (2020) of the Korea Disease Control and Prevention Agency and compared performance with Metropolis-Hastings algorithm. In addition, we showed that the logit model using auxiliary mixture sampling has a great classification performance comparable to that of the machine learning models.

The Design and Implement of Microarry Data Classification Model for Tumor Classification (종양 분류를 위한 마이크로어레이 데이터 분류 모델 설계와 구현)

  • Park, Su-Young;Jung, Chai-Yeoung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.11 no.10
    • /
    • pp.1924-1929
    • /
    • 2007
  • Nowadays, a lot of related data obtained from these research could be given a new present meaning to accomplish the original purpose of the whole research as a human project. The method of tumor classification based on microarray could contribute to being accurate tumor classification by finding differently expressing gene pattern statistically according to a tumor type. Therefore, the process to select a closely related informative gene with a particular tumor classification to classify tumor using present microarray technology with effect is essential. In this thesis, we used cDNA microarrays of 3840 genes obtained from neuronal differentiation experiment of cortical stem cells on white mouse with cancer, constructed accurate tumor classification model by extracting informative gene list through normalization separately and then did performance estimation by analyzing and comparing each of the experiment results. Result classifying Multi-Perceptron classifier for selected genes using Pearson correlation coefficient represented the accuracy of 95.6%.

Comparison of recently developed classification tools in microarray data analysis (마이크로어레이자료분석에서의 최신 분류방법들의 비교연구)

  • Lee, Jae-Won;Lee, Jeong-Bok;Park, Mi-Ra
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2002.05a
    • /
    • pp.99-104
    • /
    • 2002
  • cDNA 마이크로어레이자료를 이용한 분류방법은 수많은 유전자의 발현을 동시에 모니터링 할 수 있으므로 특정 질병간의 분자생물학적 변이를 이해하는데 있어 기존의 분류방법보다 신뢰성이 훨씬 높을 것으로 기대되고 있다 최근에 Dudoit et al.(2001)은 cDNA 마이크로어레이를 이용한 유전자발현자료의 분석에 있어 분류를 위한 여러 고전적인 판별분류기법 및 최근에 개발된 기법들을 비교, 평가하였다. 본 논문에서는 Dudoit et al.(2001)에서 다루지 않았던 많은 최신 기법들을 포함하여 인간의 종양 자료뿐만이 아니라 농작물을 포함한 동식물 자료에 적용하여 보다 폭넓은 비교연구를 하였다.

  • PDF

Optimal Threshold from ROC and CAP Curves (ROC와 CAP 곡선에서의 최적 분류점)

  • Hong, Chong-Sun;Choi, Jin-Soo
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.5
    • /
    • pp.911-921
    • /
    • 2009
  • Receiver Operating Characteristic(ROC) and Cumulative Accuracy Profile(CAP) curves are two methods used to assess the discriminatory power of different credit-rating approaches. The points of optimal classification accuracy on an ROC curve and of maximal profit on a CAP curve can be found by using iso-performance tangent lines, which are based on the standard notion of accuracy. In this paper, we offer an alternative accuracy measure called the true rate. Using this rate, one can obtain alternative optimal threshold points on both ROC and CAP curves. For most real populations of borrowers, the number of the defaults is much less than that of the non-defaults, and in such cases the true rate may be more efficient than the accuracy rate in terms of cost functions. Moreover, it is shown that both alternative scores of optimal classification accuracy and maximal profit are the identical, and this single score coincides with the score corresponding to Kolmogorov-Smirnov statistic used to test the homogeneous distribution functions of the defaults and non-defaults.

The Measures of Agreement between the Classification Standard of BMI and that of CDRS in Women university students (여자대학생의 BMI와 신체상평정척도(CDRS) 분류기준에 대한 일치도 검정)

  • Nam, Duck-Hyun
    • Journal of Digital Convergence
    • /
    • v.14 no.2
    • /
    • pp.519-527
    • /
    • 2016
  • This research aims at investigating the measures of agreement between BMI classification standard and that of 9-point contour drawing rating scale(CDRS), verifying their usefulness for the application to the filed, examining university students' substantial understanding of their bodies, and offering correct information regarding the distorted recognition of their bodies. In order to examine the measures of agreement between the classification standard of BMI and that of CDRS, and the women university students' recognition of their body images depending on BMI, Cross tabulation was carried out, and ${\chi}^2$, Spearman rank correlation coefficient and kappa statistics were calculated. As the analysis results, the classification standards of CDRS and BMI judged by general female college students showed statistically the correlation was high with ${\rho}=.719$(p<.001) and an average level of confirmity with ${\kappa}=.506$(p<.001). Based on these results, regarding body shape, sizes and shapes according to racial characteristics need to be controlled later.

韓國統計의 現況과 將來 - 統計와 電算

  • 박성현
    • Journal of the Korean Statistical Society
    • /
    • v.10
    • /
    • pp.81-82
    • /
    • 1981
  • 허문열 박사에 의하여 분류된 EDP의 3단계에 대하여 전적으로 동감하며, 우리나라가 현재 초기단계를 벗어나 성장단계에 접어들고 있는 것도 사실이라고 하겠다. 또한 조만간 (5-6년후)에 성숙단계에 접어들기 시작하면 컴퓨터에 의하여 처리되는 통계 package의 사용이 급증하고 이를 다룰 줄 아는 통계인의 수요가 매우 크리라고 믿어진다. 통계를 다루는 학자 또는 실무자들은 사회에서 요구하는 통계서비스(statistical consulting service)를 할 수 있는 것이 바람직하고, 우리의 역할을 다 할 수 있도록 우리 스스로를 준비시켜야 할 것이다. 사회에 통계를 보급시키고 통계의 활용을 더욱 촉진시키기 위하여 다음의 몇가지를 제안하는 바이다. 이들은 또한 허문열 박사의 주제논문에서도 암시되고 있는 방향이라고도 하겠다.

  • PDF

Kernel Pattern Recognition using K-means Clustering Method (K-평균 군집방법을 이요한 가중커널분류기)

  • 백장선;심정욱
    • The Korean Journal of Applied Statistics
    • /
    • v.13 no.2
    • /
    • pp.447-455
    • /
    • 2000
  • We propose a weighted kernel pattern recognition method using the K -means clustering algorithm to reduce computation and storage required for the full kernel classifier. This technique finds a set of reference vectors and weights which are used to approximate the kernel classifier. Since the hierarchical clustering method implemented in the 'Weighted Parzen Window (WP\V) classifier is not able to rearrange the proper clusters, we adopt the K -means algorithm to find reference vectors and weights from the more properly rearranged clusters \Ve find that the proposed method outperforms the \VP\V method for the repre~entativeness of the reference vectors and the data reduction.

  • PDF