• Title/Summary/Keyword: optimal classification point (최적분류점)

Cost Ratios for Cost and ROC Curves (비용곡선과 ROC곡선에서의 비용비율)

  • Hong, Chong-Sun;Yoo, Hyun-Sang
    • Communications for Statistical Applications and Methods / v.17 no.6 / pp.755-765 / 2010
  • For classification problems on mixture distributions, a threshold based on cost functions is optimal from the viewpoint of minimum expected cost. Assuming that no cost information is available, we propose cost ratios in the expected cost corresponding to the thresholds at which the total accuracy and the true rate are maximized, in order to explain how these cost ratios relate to the one that minimizes the expected cost. Other cost ratios are also proposed by comparing the normalized expected costs when classification accuracy is maximized. The values of these cost ratios lie between the two cost ratios for the expected costs based on classification accuracies, and converge to that of the minimum expected cost. This work suggests two kinds of cost ratios: one that minimizes the expected cost and the normalized expected cost, and another for which the expected cost and normalized expected cost functions correspond to maximized classification accuracies. We discuss their compatibility based on the relation between these cost ratios.
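
The idea of a cost-optimal threshold in the abstract above can be illustrated with a minimal sketch. It assumes empirical classifier scores, binary labels (1 = positive), and user-supplied misclassification costs; the function names `expected_cost` and `best_threshold` are hypothetical, and this is not the cost-ratio construction proposed in the paper.

```python
def expected_cost(scores, labels, threshold, cost_fn, cost_fp):
    """Average misclassification cost when score >= threshold is called positive."""
    # False negatives: true positives that fall below the threshold.
    fn = sum(1 for s, y in zip(scores, labels) if y == 1 and s < threshold)
    # False positives: true negatives that reach the threshold.
    fp = sum(1 for s, y in zip(scores, labels) if y == 0 and s >= threshold)
    return (cost_fn * fn + cost_fp * fp) / len(scores)


def best_threshold(scores, labels, cost_fn, cost_fp):
    """Return the observed score that minimizes the empirical expected cost.

    Ties are broken in favor of the smallest candidate threshold.
    """
    candidates = sorted(set(scores))
    return min(
        candidates,
        key=lambda t: expected_cost(scores, labels, t, cost_fn, cost_fp),
    )
```

Changing the ratio `cost_fn / cost_fp` moves the selected threshold: a costly false negative pushes it down, a costly false positive pushes it up.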

Premature Ventricular Contraction Classification through R Peak Pattern and RR Interval based on Optimal R Wave Detection (최적 R파 검출 기반의 R피크 패턴과 RR간격을 통한 조기심실수축 분류)

  • Cho, Ik-sung;Kwon, Hyeog-soong
    • Journal of the Korea Institute of Information and Communication Engineering / v.22 no.2 / pp.233-242 / 2018
  • Previous work on arrhythmia detection has mostly used nonlinear methods such as artificial neural networks, fuzzy theory, and support vector machines to increase classification accuracy. Most of these methods incur high computational cost and long processing time. It is therefore necessary to design an efficient algorithm that classifies PVC (premature ventricular contraction) at lower computational cost by accurately detecting feature points based only on the R peak, through optimal R wave detection. For this purpose, we detected the R wave using an optimal threshold value and extracted the RR interval and R peak pattern from the noise-free ECG signal obtained by preprocessing. We then classified PVC in real time using the RR interval and R peak pattern. The performance of R wave detection and PVC classification was evaluated using 9 records of the MIT-BIH arrhythmia database that included over 30. The achieved scores average 99.02% for R wave detection and 94.85% for PVC classification.
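
The pipeline described above (threshold-based R wave detection followed by RR interval analysis) can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' algorithm: the fixed threshold, the refractory period, and the rule that flags a beat as premature when its RR interval falls below 80% of the mean of the preceding intervals are all hypothetical choices, as are the function names.

```python
def detect_r_peaks(signal, threshold, refractory):
    """Indices of local maxima above `threshold`, at least `refractory` samples apart."""
    peaks = []
    for i in range(1, len(signal) - 1):
        is_local_max = signal[i - 1] < signal[i] >= signal[i + 1]
        if signal[i] > threshold and is_local_max:
            # Skip candidates that fall inside the refractory window.
            if not peaks or i - peaks[-1] >= refractory:
                peaks.append(i)
    return peaks


def rr_intervals(peaks, fs):
    """RR intervals in seconds for peak indices sampled at `fs` Hz."""
    return [(b - a) / fs for a, b in zip(peaks, peaks[1:])]


def flag_premature(rr, ratio=0.8):
    """Flag intervals shorter than `ratio` times the mean of all earlier intervals."""
    flags = []
    for k, interval in enumerate(rr):
        if k == 0:
            flags.append(False)  # no history to compare against
        else:
            baseline = sum(rr[:k]) / k
            flags.append(interval < ratio * baseline)
    return flags
```

A real detector would also need the preprocessing step the abstract mentions (noise removal) before thresholding.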

Review on Genetic Algorithms for Pattern Recognition (패턴 인식을 위한 유전 알고리즘의 개관)

  • Oh, Il-Seok
    • The Journal of the Korea Contents Association / v.7 no.1 / pp.58-64 / 2007
  • In the pattern recognition field, there are many optimization problems with exponential search spaces. To solve them, sequential search algorithms that seek sub-optimal solutions have been used. These algorithms are limited in that they can stop at local optima. Recently, many studies have attempted to solve these problems using genetic algorithms. This paper explains the huge search spaces of typical problems such as feature selection, classifier ensemble selection, neural network pruning, and clustering, and reviews the genetic algorithms for solving them. Additionally, we present several topics worth pursuing in future research.
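
As an illustration of how a genetic algorithm searches the exponential space of feature subsets, here is a minimal sketch. It assumes a bit-mask encoding (1 = feature kept), binary tournament selection, one-point crossover, bit-flip mutation, and elitism; these operators and the name `genetic_feature_selection` are generic textbook choices, not taken from the reviewed paper. The `fitness` callable is supplied by the caller, for example a cross-validated accuracy.

```python
import random


def genetic_feature_selection(fitness, n_features, pop_size=20,
                              generations=30, mutation_rate=0.05, seed=0):
    """Search for a feature mask (list of 0/1) that maximizes `fitness`.

    Requires n_features >= 2 so that a one-point crossover cut exists.
    """
    rng = random.Random(seed)
    population = [[rng.randint(0, 1) for _ in range(n_features)]
                  for _ in range(pop_size)]
    best = max(population, key=fitness)
    for _ in range(generations):
        next_population = [best[:]]  # elitism: the best mask always survives
        while len(next_population) < pop_size:
            # Binary tournament selection for each parent.
            parent_a = max(rng.sample(population, 2), key=fitness)
            parent_b = max(rng.sample(population, 2), key=fitness)
            # One-point crossover.
            cut = rng.randint(1, n_features - 1)
            child = parent_a[:cut] + parent_b[cut:]
            # Bit-flip mutation.
            child = [1 - bit if rng.random() < mutation_rate else bit
                     for bit in child]
            next_population.append(child)
        population = next_population
        best = max(population, key=fitness)
    return best
```

Because of elitism, the fitness of the returned mask is never worse than the best mask in the initial population, although the search can still settle on a local optimum.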

Selecting the optimal threshold based on impurity index in imbalanced classification (불균형 자료에서 불순도 지수를 활용한 분류 임계값 선택)

  • Jang, Shuin;Yeo, In-Kwon
    • The Korean Journal of Applied Statistics / v.34 no.5 / pp.711-721 / 2021
  • In this paper, we propose a method of adjusting thresholds using impurity indices in classification analysis on imbalanced data. Suppose that, for imbalanced binomial data, the minority category is Positive and the majority category is Negative. When categories are determined using the commonly used threshold of 0.5, the specificity tends to be high in imbalanced data while the sensitivity is relatively low. Increasing sensitivity matters when correctly classifying objects in the minority category is relatively important. We explore how to increase sensitivity by adjusting thresholds. Existing studies have adjusted thresholds based on measures such as G-Mean and F1-score; in this paper, we propose a method to select optimal thresholds using the chi-square statistic of CHAID, the Gini index of CART, and the entropy of C4.5. We also describe how to obtain a unique value when multiple optimal thresholds are obtained. Empirical analysis with classification performance metrics shows the improvements over results based on the 0.5 threshold.
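
One way to read the Gini-based variant of this idea is sketched below: treat each candidate threshold as a split of the predicted probabilities into two groups and pick the split with the lowest weighted Gini impurity. This is an assumption about the general approach, not the paper's exact procedure; the function names are hypothetical, and ties are simply broken toward the smallest threshold rather than by the paper's uniqueness rule.

```python
def gini(labels):
    """Gini impurity of a list of 0/1 labels (0.0 for an empty list)."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2.0 * p * (1.0 - p)


def weighted_gini(scores, labels, threshold):
    """Size-weighted Gini impurity of the groups score < threshold and score >= threshold."""
    low = [y for s, y in zip(scores, labels) if s < threshold]
    high = [y for s, y in zip(scores, labels) if s >= threshold]
    n = len(labels)
    return len(low) / n * gini(low) + len(high) / n * gini(high)


def gini_threshold(scores, labels):
    """Observed score that minimizes the weighted Gini impurity of the split."""
    candidates = sorted(set(scores))
    return min(candidates, key=lambda t: weighted_gini(scores, labels, t))
```

On imbalanced data the selected threshold typically falls below 0.5, which trades some specificity for higher sensitivity.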

Vehicle Detection Using Optimal Features for Adaboost (Adaboost 최적 특징점을 이용한 차량 검출)

  • Kim, Gyu-Yeong;Lee, Geun-Hoo;Kim, Jae-Ho;Park, Jang-Sik
    • The Journal of the Korea Institute of Electronic Communication Sciences / v.8 no.8 / pp.1129-1135 / 2013
  • A new vehicle detection algorithm based on multiple optimal Adaboost classifiers with optimal feature selection is proposed. It consists of two major modules: 1) theoretical DDISF (Distance Dependent Image Scaling Factor) based image scaling by site modeling of the installed cameras, and 2) optimal feature selection by Haar-like feature analysis depending on the distance of the vehicles. The experimental results show that the proposed algorithm improves the recognition rate for vehicles and non-vehicles compared to previous methods. The proposed algorithm achieves a detection rate of about 96.43% and a false alarm rate of about 3.77%. These are improvements of 3.69% and 1.28%, respectively, over the standard Adaboost algorithm.

Advanced detection of sentence boundaries based on hybrid method (하이브리드 방법을 이용한 개선된 문장경계인식)

  • Lee, Chung-Hee;Jang, Myung-Gil;Seo, Young-Hoon
    • Annual Conference on Human and Language Technology / 2009.10a / pp.61-66 / 2009
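  • This paper proposes an improved sentence boundary detection technique, based on statistical language information and post-processing rules, for application to web documents of various forms. To handle web documents in which punctuation is often omitted and spacing errors are frequent, the proposed method is trained on every syllable that can serve as a sentence boundary. To maximize sentence boundary detection performance, we selected the optimal features and training data through various experiments, compared various machine-learning-based classification models to choose the optimal one, and used rules to correct the errors of the statistical model, which depends on its training data. To measure performance by document type, the experiments used newspaper articles and blog documents that mix written and colloquial styles (evaluation set 1), the Sejong corpus and encyclopedia text consisting mainly of written style (evaluation set 2), and bulletin board posts from web sites with frequent punctuation omission and spacing errors (evaluation set 3). F-measure was used as the performance metric. When sentence boundary detection was evaluated on punctuation marks only, the results were 96.5% on evaluation set 1 and 99.4% on evaluation set 2, showing that sentence boundary detection is harder for colloquial text. For evaluation set 1, rule-based post-processing also raised precision from 92.1% to 99.4%, which shows the need for post-processing rules. In the final evaluation, the baseline engine trained on punctuation marks only and the improved engine, which recognizes all sentence boundary candidates, were compared on evaluation set 3; the improved engine showed a 32.0% performance gain over the baseline engine (61.1%), demonstrating that the proposed method is effective for web documents.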

Parameter Extraction for Based on AR and Arrhythmia Classification through Deep Learning (AR 기반의 특징점 추출과 딥러닝을 통한 부정맥 분류)

  • Cho, Ik-sung;Kwon, Hyeog-soong
    • Journal of the Korea Institute of Information and Communication Engineering / v.24 no.10 / pp.1341-1347 / 2020
  • Previous studies on arrhythmia classification have used neural networks, fuzzy methods, machine learning, and similar techniques to improve classification accuracy. In particular, deep learning, which overcomes the limit on the number of hidden layers that is a problem of neural networks, is most frequently used for arrhythmia classification with the error backpropagation algorithm. To apply a deep learning model to an ECG signal, it is necessary to select an optimal model and parameters. In this paper, we propose parameter extraction based on AR and arrhythmia classification through deep learning. For this purpose, the R wave is detected in the ECG signal from which noise has been removed, and the QRS and RR interval are modelled. The weights are then learned by supervised learning through deep learning, and the model is evaluated on the verification data. The classification rate of PVC is evaluated on the MIT-BIH arrhythmia database. The achieved scores indicate an arrhythmia classification rate of over 97%.

A Study on a Data Management Classification Scheme in Distributed Environments (분산 환경하에서의 데이타관리 분류체계에 대한 연구)

  • 박주석;편흥렬
    • Proceedings of the Korean Operations and Management Science Society Conference / 1994.04a / pp.49-57 / 1994
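  • Implementing the distributed databases needed to achieve downsizing in the true sense currently involves various technical problems. Optimal methods must therefore be developed to solve the technical problems of databases in distributed environments, such as concurrency control, update propagation, recovery, query processing, and catalog management. Such methods can be developed by selecting and operating alternatives through a data management classification scheme for relational databases. We established a data management classification scheme for relational databases usable in distributed environments, divided into basic tables and views, from the viewpoints of availability, expression, and currency. Because current update is essential for basic tables, they were classified from the viewpoints of availability and expression, while views were classified by whether a physical file exists and by the difference in execution time. We then discussed how to use the characteristics that follow from these classification criteria in building distributed databases. In other words, we presented as one approach the development of a distributed view update architecture that supports current materialized views as well as non-current materialized views at the same time. That is, we proposed developing an efficient distributed system in which immediate update and deferred update use current views and periodical update uses non-current views, so that 100% of the distributed data resources can be managed. The significance of this paper is that, from the standpoint of database theory, it establishes in its own way a data management classification scheme for relational databases in distributed environments, which had not yet been established. Its application is also important in that it can, to some extent, mitigate the technical problems that currently arise in building distributed databases.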

Landcover classification by coherence analysis from multi-temporal SAR images (다중시기 SAR 영상자료 긴밀도 분석을 통한 토지피복 분류)

  • Yoon, Bo-Yeol;Kim, Youn-Soo
    • Aerospace Engineering and Technology / v.8 no.1 / pp.132-137 / 2009
  • This study concerns classification using multi-temporal SAR data. Multi-temporal JERS-1 SAR images are used to extract land cover information and to examine the possibility of doing so. So far, land cover information has been extracted from high-resolution aerial photos, satellite images, and field surveys. This study shows that multi-temporal land cover status monitoring and coherence information mapping can be performed with L-band SAR images. Coherence values of JERS SAR images (9 scenes) from July 1997 to October 1998 are analyzed, and land cover information factors are then extracted. This technique, which forms the basis of what is called SAR interferometry, or InSAR for short, has also been employed in spaceborne systems. In such systems, the separation of the antennas, called the baseline, is obtained by utilizing a single antenna in a repeat pass.
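
The coherence values mentioned above are commonly estimated as the magnitude of the normalized complex cross-correlation between two co-registered SAR acquisitions over a local window. The sketch below shows that standard sample estimator for one window; the abstract does not state the window size or estimator the authors used, so the function name `coherence` and the flat-list input format are assumptions made for illustration.

```python
import math


def coherence(s1, s2):
    """Sample coherence magnitude (0 to 1) of two equally long complex sample lists."""
    if len(s1) != len(s2) or not s1:
        raise ValueError("inputs must be non-empty and of equal length")
    # Complex cross-correlation between the two acquisitions.
    cross = sum(a * b.conjugate() for a, b in zip(s1, s2))
    power1 = sum(abs(a) ** 2 for a in s1)
    power2 = sum(abs(b) ** 2 for b in s2)
    if power1 == 0 or power2 == 0:
        return 0.0  # undefined for an all-zero window; treated as incoherent
    return abs(cross) / math.sqrt(power1 * power2)
```

Values near 1 indicate a surface that stayed stable between passes, while values near 0 indicate decorrelation, which is what makes coherence usable as a land cover feature.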

An Efficient Training of Multilayer Neural Networks Using Stochastic Approximation and Conjugate Gradient Method (확률적 근사법과 공액기울기법을 이용한 다층신경망의 효율적인 학습)

  • 조용현
    • Journal of the Korean Institute of Intelligent Systems / v.8 no.5 / pp.98-106 / 1998
  • This paper proposes an efficient learning algorithm for improving the training performance of neural networks. The proposed method improves training performance by applying a backpropagation algorithm based on a global optimization method that is a hybrid of stochastic approximation and the conjugate gradient method. The approximate initial point for global optimization is first estimated by applying stochastic approximation, and then the conjugate gradient method, which is a fast gradient descent method, is applied for high-speed optimization. The proposed method has been applied to parity checking and pattern classification, and the simulation results show that its performance is superior to that of conventional backpropagation and of a backpropagation algorithm that is a hybrid of stochastic approximation and the steepest descent method.
