• 제목/요약/키워드: classification method

Search Result 7,153, Processing Time 0.039 seconds

A Study on the Relationship between Class Similarity and the Performance of Hierarchical Classification Method in a Text Document Classification Problem (텍스트 문서 분류에서 범주간 유사도와 계층적 분류 방법의 성과 관계 연구)

  • Jang, Soojung;Min, Daiki
    • The Journal of Society for e-Business Studies
    • /
    • v.25 no.3
    • /
    • pp.77-93
    • /
    • 2020
  • The literature has reported that hierarchical classification methods generally outperform the flat classification methods for a multi-class document classification problem. Unlike the literature that has constructed a class hierarchy, this paper evaluates the performance of hierarchical and flat classification methods under a situation where the class hierarchy is predefined. We conducted numerical evaluations for two data sets; research papers on climate change adaptation technologies in water sector and 20NewsGroup open data set. The evaluation results show that the hierarchical classification method outperforms the flat classification methods under a certain condition, which differs from the literature. The performance of hierarchical classification method over flat classification method depends on class similarities at levels in the class structure. More importantly, the hierarchical classification method works better when the upper level similarity is less that the lower level similarity.

A Classification Method of Anthropometric Variables for Improved Usability of Anthropometric Data (인체측정자료의 사용성 제고를 위한 인체측정변수 분류 방법)

  • Yu, Hui-Cheon;Sin, Seung-U;Ryu, Tae-Beom
    • Journal of the Ergonomics Society of Korea
    • /
    • v.23 no.3
    • /
    • pp.13-24
    • /
    • 2004
  • Anthropometric data is a fundamental resource in developing ergonomic products and workplaces. However, designers often experience difficulty in searching anthropometric data relevant to the design due to the technicality of anthropometric terminologies, ambiguity in the description of measurement method for some anthropometric variables, and inefficiency of existing search methods for anthropometric data. The present study suggests a method to develop a classification system of anthropometric variables for systematic, efficient search of anthropometric data. The proposed method first classifies anthropometric variables according to body segment and type of variable, and then arranges anthropometric variables of the same body segment and variable type by comparing the heights of their reference points. The proposed classification method was applied to establish a classification system of 66 anthropometric variables that were selected for an automotive interior design. Then the established anthropometric classification system was utilized to design a search interface of a web-based anthropometric data retrieval system.

Power Efficient Classification Method for Sensor Nodes in BSN Based ECG Monitoring System

  • Zeng, Min;Lee, Jeong-A
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.35 no.9B
    • /
    • pp.1322-1329
    • /
    • 2010
  • As body sensor network (BSN) research becomes mature, the need for managing power consumption of sensor nodes has become evident since most of the applications are designed for continuous monitoring. Real time Electrocardiograph (ECG) analysis on sensor nodes is proposed as an optimal choice for saving power consumption by reducing data transmission overhead. Smart sensor nodes with the ability to categorize lately detected ECG cycles communicate with base station only when ECG cycles are classified as abnormal. In this paper, ECG classification algorithms are described, which categorize detected ECG cycles as normal or abnormal, or even more specific cardiac diseases. Our Euclidean distance (ED) based classification method is validated to be most power efficient and very accurate in determining normal or abnormal ECG cycles. A close comparison of power efficiency and classification accuracy between our ED classification algorithm and generalized linear model (GLM) based classification algorithm is provided. Through experiments we show that, CPU cycle power consumption of ED based classification algorithm can be reduced by 31.21% and overall power consumption can be reduced by 13.63% at most when compared with GLM based method. The accuracy of detecting NSR, APC, PVC, SVT, VT, and VF using GLM based method range from 55% to 99% meanwhile, we show that the accuracy of detecting normal and abnormal ECG cycles using our ED based method is higher than 86%.

Design and Performance Measurement of a Genetic Algorithm-based Group Classification Method : The Case of Bond Rating (유전 알고리듬 기반 집단분류기법의 개발과 성과평가 : 채권등급 평가를 중심으로)

  • Min, Jae-H.;Jeong, Chul-Woo
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.32 no.1
    • /
    • pp.61-75
    • /
    • 2007
  • The purpose of this paper is to develop a new group classification method based on genetic algorithm and to com-pare its prediction performance with those of existing methods in the area of bond rating. To serve this purpose, we conduct various experiments with pilot and general models. Specifically, we first conduct experiments employing two pilot models : the one searching for the cluster center of each group and the other one searching for both the cluster center and the attribute weights in order to maximize classification accuracy. The results from the pilot experiments show that the performance of the latter in terms of classification accuracy ratio is higher than that of the former which provides the rationale of searching for both the cluster center of each group and the attribute weights to improve classification accuracy. With this lesson in mind, we design two generalized models employing genetic algorithm : the one is to maximize the classification accuracy and the other one is to minimize the total misclassification cost. We compare the performance of these two models with those of existing statistical and artificial intelligent models such as MDA, ANN, and Decision Tree, and conclude that the genetic algorithm-based group classification method that we propose in this paper significantly outperforms the other methods in respect of classification accuracy ratio as well as misclassification cost.

Comparison of Land Use Change Detection Methods with Satellite Image (위성영상을 이용한 토지이용 변화 검색기법 비교연구)

  • Park, Soon-Ho;Kim, Woo-Kwan
    • Journal of the Korean association of regional geographers
    • /
    • v.5 no.1
    • /
    • pp.137-150
    • /
    • 1999
  • Five land use change detection methods were applied to 1994 and 1997 Landsat Thematic Mapper (TM) images of Pook-Gu, Taegu city to determine the land-cover changes between the two dates. The two images were coregistred to UTM coordinates. A post-classification comparison method was the most commonly used quantitative method of change detection. A pre-classification comparison method was more effective method to change detection of land cover than a post-classification comparison method. Two indices were used to assess the accuracies of the studied methods. A image differencing method was found to be most accurate for detecting change verse no change among five land use change detection methods. The difference image of band 2 was found to be most accurate. The overall accuracy and Kappa index agreement of the difference image of band 2 were 0.810 and 0.447.

  • PDF

A New Method for Classification of Structural Textures

  • Lee, Bongkyu
    • International Journal of Control, Automation, and Systems
    • /
    • v.2 no.1
    • /
    • pp.125-133
    • /
    • 2004
  • In this paper, we present a new method that combines the characteristics of edge in-formation and second-order neural networks for the classification of structural textures. The edges of a texture are extracted using an edge detection approach. From this edge information, classification features called second-order features are obtained. These features are fed into a second-order neural network for training and subsequent classification. It will be shown that the main disadvantage of using structural methods in texture classifications, namely, the difficulty of the extraction of texels, is overcome by the proposed method.

A New Pattern Classification and the Analysis of the Lung Sound by Using Cepstrum (Cepstrum을 이용한 폐음의 분석 및 패턴 분류)

  • 김종원;김성환
    • Journal of Biomedical Engineering Research
    • /
    • v.15 no.2
    • /
    • pp.159-166
    • /
    • 1994
  • A new pattern classification algorithm using cepstrum to analyze lung sounds for the classification of pattern with pulmonary and bronchial disorders is proposed. To evaluate the perfomance of the proposed method, the results are compared to the pattern classification with the AR modeling method. In the experiment lung sounds recorded for the training of physician used. As a results, the accuracy of the cepstrum classification is 92.3 % and AR modeling is the 53.8 %, therefore cepstrum modeling method has very high performance than AR and it turned out to be a very efficient algorithm.

  • PDF

A Feature Selection-based Ensemble Method for Arrhythmia Classification

  • Namsrai, Erdenetuya;Munkhdalai, Tsendsuren;Li, Meijing;Shin, Jung-Hoon;Namsrai, Oyun-Erdene;Ryu, Keun Ho
    • Journal of Information Processing Systems
    • /
    • v.9 no.1
    • /
    • pp.31-40
    • /
    • 2013
  • In this paper, a novel method is proposed to build an ensemble of classifiers by using a feature selection schema. The feature selection schema identifies the best feature sets that affect the arrhythmia classification. Firstly, a number of feature subsets are extracted by applying the feature selection schema to the original dataset. Then classification models are built by using the each feature subset. Finally, we combine the classification models by adopting a voting approach to form a classification ensemble. The voting approach in our method involves both classification error rate and feature selection rate to calculate the score of the each classifier in the ensemble. In our method, the feature selection rate depends on the extracting order of the feature subsets. In the experiment, we applied our method to arrhythmia dataset and generated three top disjointed feature sets. We then built three classifiers based on the top-three feature subsets and formed the classifier ensemble by using the voting approach. Our method can improve the classification accuracy in high dimensional dataset. The performance of each classifier and the performance of their ensemble were higher than the performance of the classifier that was based on whole feature space of the dataset. The classification performance was improved and a more stable classification model could be constructed with the proposed approach.

Classification of Rural Villages Using Information Theory (정보이론을 이용한 농촌마을 권역화 연구)

  • Lee, Ji-Min;Lee, Jeong-Jae
    • Journal of The Korean Society of Agricultural Engineers
    • /
    • v.49 no.1
    • /
    • pp.23-33
    • /
    • 2007
  • Classification results of rural villages provide useful information about rural village characteristics to select similar villages in rural development project; many researches about regional classification have been practiced. Recently rural amenity was introduced as an alternative for rural development, and rural villages have been surveyed to find potential resources for rural development by 'Rural Amenity Resources Survey Project'. Accumulated information through this survey project could be used to classify rural villages. However existing rural classification method using statistical data is not efficient method to use rural amenity resources information described with text. We introduced Information Bottleneck Method (IBM) based on information theory and implemented this method to classification with rural amenity resources information of Yanggang-myen, Yeongdong-gun in Chungbuk province.

Evaluation of the classification method using ancestry SNP markers for ethnic group

  • Lee, Hyo Jung;Hong, Sun Pyo;Lee, Soong Deok;Rhee, Hwan seok;Lee, Ji Hyun;Jeong, Su Jin;Lee, Jae Won
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.1
    • /
    • pp.1-9
    • /
    • 2019
  • Various probabilistic methods have been proposed for using interpopulation allele frequency differences to infer the ethnic group of a DNA specimen. The selection of the statistical method is critical because the accuracy of the statistical classification results vary. For the ancestry classification, we proposed a new ancestry evaluation method that estimate the combined ethnicity index as well as compared its performance with various classical classification methods using two real data sets. We selected 13 SNPs that are useful for the inference of ethnic origin. These single nucleotide polymorphisms (SNPs) were analyzed by restriction fragment mass polymorphism assay and followed by classification among ethnic groups. We genotyped 400 individuals from four ethnic groups (100 African-American, 100 Caucasian, 100 Korean, and 100 Mexican-American) for 13 SNPs and allele frequencies that differed among the four ethnic groups. Additionally, we applied our new method to HapMap SNP genotypes for 1,011 samples from 4 populations (African, European, East Asian, and Central-South Asian). Our proposed method yielded the highest accuracy among statistical classification methods. Our ethnic group classification system based on the analysis of ancestry informative SNP markers can provide a useful statistical tool to identify ethnic groups.