• Title/Summary/Keyword: Feature Classification

Search Result 2,161, Processing Time 0.035 seconds

Short Note on Optimizing Feature Selection to Improve Medical Diagnosis

  • Guo, Cui;Ryoo, Hong Seo
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.39 no.4
    • /
    • pp.71-74
    • /
    • 2014
  • A new classification framework called 'support feature machine' was introduced in [2] for analyzing medical data. Contrary to authors' claim, however, the proposed method is not designed to guarantee minimizing the use of the spatial feature variables. This paper mathematically remedies this drawback and provides comments on models from [2].

Fault Diagnosis of Low Speed Bearing Using Support Vector Machine

  • Widodo, Achmad;Son, Jong-Duk;Yang, Bo-Suk;Gu, Dong-Sik;Choi, Byeong-Keun;Kim, Yong-Han;Tan, Andy C.C;Mathew, Joseph
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2007.11a
    • /
    • pp.891-894
    • /
    • 2007
  • This study presents fault diagnosis of low speed bearing using support vector machine (SVM). The data used in the experiment was acquired using acoustic emission (AE) sensor and accelerometer. The aim of this study is to compare the performance of fault diagnosis based on AE signal and vibration signal with same load and speed. A low speed test rig was developed to simulate various defects with shaft speeds as low as 10 rpm under several loading conditions. In this study, component analysis was also performed to extract the feature and reduce the dimensionality of original data feature. Moreover, the classification for fault diagnosis was also conducted using original data feature without feature extraction. The result shows that extracted feature from AE sensor gave better performance in faults classification.

  • PDF

Hepatitis C Stage Classification with hybridization of GA and Chi2 Feature Selection

  • Umar, Rukayya;Adeshina, Steve;Boukar, Moussa Mahamat
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.1
    • /
    • pp.167-174
    • /
    • 2022
  • In metaheuristic algorithms such as Genetic Algorithm (GA), initial population has a significant impact as it affects the time such algorithm takes to obtain an optimal solution to the given problem. In addition, it may influence the quality of the solution obtained. In the machine learning field, feature selection is an important process to attaining a good performance model; Genetic algorithm has been utilized for this purpose by scientists. However, the characteristics of Genetic algorithm, namely random initial population generation from a vector of feature elements, may influence solution and execution time. In this paper, the use of a statistical algorithm has been introduced (Chi2) for feature relevant checks where p-values of conditional independence were considered. Features with low p-values were discarded and subject relevant subset of features to Genetic Algorithm. This is to gain a level of certainty of the fitness of features randomly selected. An ensembled-based learning model for Hepatitis has been developed for Hepatitis C stage classification. 1385 samples were used using Egyptian-dataset obtained from UCI repository. The comparative evaluation confirms decreased in execution time and an increase in model performance accuracy from 56% to 63%.

Unsupervised Multispectral Image Segmentation Based on 1D Combined Neighborhood Differences (1D 통합된 근접차이에 기반한 자율적인 다중분광 영상 분할)

  • Saipullah, Khairul Muzzammil;Yun, Byung-Choon;Kim, Deok-Hwan
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2010.11a
    • /
    • pp.625-628
    • /
    • 2010
  • This paper proposes a novel feature extraction method for unsupervised multispectral image segmentation based in one dimensional combined neighborhood differences (1D CND). In contrast with the original CND, which is applied with traditional image, 1D CND is computed on a single pixel with various bands. The proposed algorithm utilizes the sign of differences between bands of the pixel. The difference values are thresholded to form a binary codeword. A binomial factor is assigned to these codeword to form another unique value. These values are then grouped to construct the 1D CND feature image where is used in the unsupervised image segmentation. Various experiments using two LANDSAT multispectral images have been performed to evaluate the segmentation and classification accuracy of the proposed method. The result shows that 1D CND feature outperforms the spectral feature, with average classification accuracy of 87.55% whereas that of spectral feature is 55.81%.

A Comprehensive Approach for Tamil Handwritten Character Recognition with Feature Selection and Ensemble Learning

  • Manoj K;Iyapparaja M
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.18 no.6
    • /
    • pp.1540-1561
    • /
    • 2024
  • This research proposes a novel approach for Tamil Handwritten Character Recognition (THCR) that combines feature selection and ensemble learning techniques. The Tamil script is complex and highly variable, requiring a robust and accurate recognition system. Feature selection is used to reduce dimensionality while preserving discriminative features, improving classification performance and reducing computational complexity. Several feature selection methods are compared, and individual classifiers (support vector machines, neural networks, and decision trees) are evaluated through extensive experiments. Ensemble learning techniques such as bagging, and boosting are employed to leverage the strengths of multiple classifiers and enhance recognition accuracy. The proposed approach is evaluated on the HP Labs Dataset, achieving an impressive 95.56% accuracy using an ensemble learning framework based on support vector machines. The dataset consists of 82,928 samples with 247 distinct classes, contributed by 500 participants from Tamil Nadu. It includes 40,000 characters with 500 user variations. The results surpass or rival existing methods, demonstrating the effectiveness of the approach. The research also offers insights for developing advanced recognition systems for other complex scripts. Future investigations could explore the integration of deep learning techniques and the extension of the proposed approach to other Indic scripts and languages, advancing the field of handwritten character recognition.

A Document Sentiment Classification System Based on the Feature Weighting Method Improved by Measuring Sentence Sentiment Intensity (문장 감정 강도를 반영한 개선된 자질 가중치 기법 기반의 문서 감정 분류 시스템)

  • Hwang, Jae-Won;Ko, Young-Joong
    • Journal of KIISE:Software and Applications
    • /
    • v.36 no.6
    • /
    • pp.491-497
    • /
    • 2009
  • This paper proposes a new feature weighting method for document sentiment classification. The proposed method considers the difference of sentiment intensities among sentences in a document. Sentiment features consist of sentiment vocabulary words and the sentiment intensity scores of them are estimated by the chi-square statistics. Sentiment intensity of each sentence can be measured by using the obtained chi-square statistics value of each sentiment feature. The calculated intensity values of each sentence are finally applied to the TF-IDF weighting method for whole features in the document. In this paper, we evaluate the proposed method using support vector machine. Our experimental results show that the proposed method performs about 2.0% better than the baseline which doesn't consider the sentiment intensity of a sentence.

A New Tempo Feature Extraction Based on Modulation Spectrum Analysis for Music Information Retrieval Tasks

  • Kim, Hyoung-Gook
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.6 no.2
    • /
    • pp.95-106
    • /
    • 2007
  • This paper proposes an effective tempo feature extraction method for music information retrieval. The tempo information is modeled by the narrow-band temporal modulation components, which are decomposed into a modulation spectrum via joint frequency analysis. In implementation, the tempo feature is directly extracted from the modified discrete cosine transform coefficients, which is the output of partial MP3(MPEG 1 Layer 3) decoder. Then, different features are extracted from the amplitudes of modulation spectrum and applied to different music information retrieval tasks. The logarithmic scale modulation frequency coefficients are employed in automatic music emotion classification and music genre classification. The classification precision in both systems is improved significantly. The bit vectors derived from adaptive modulation spectrum is used in audio fingerprinting task That is proved to be able to achieve high robustness in this application. The experimental results in these tasks validate the effectiveness of the proposed tempo feature.

  • PDF

Hyperspectral Image Fusion for Tumor Detection (초분광 영상 융합을 이용한 종양인식)

  • Xu Cheng-Zhe;Kim In-Taek
    • Journal of the Institute of Electronics Engineers of Korea SC
    • /
    • v.43 no.4 s.310
    • /
    • pp.11-20
    • /
    • 2006
  • This paper presents a method for detecting tumors on chicken carcasses by fusion of hyperspectral fluorescence and reflectance images. Classification of normal skin and tumor is performed by the image obtain 어 from optimal band ratio which minimizes the overlapping area of PDFs for normal skin and tumor. This method yields four feature images, each of them represents the ratio of two intensity values from a pixel. Classification is achieved by applying ISODATA to each pixel from the feature images. For the analysis of reflectance image, band selection method is proposed based on the information quantity, many effective features are acquired for the classification by defining the linear transformation selecting the projection axis, accordingly, accurate interpretation of images is possible in the reflectance image and automatic feature selection method is realized. Feature images from reflectance images are also classified by ISODATA and combined with the result from fluorescence images. Experimental result indicates that improved performance in term of reducing false detection rate is observed.

Feature Expansion based on LDA Word Distribution for Performance Improvement of Informal Document Classification (비격식 문서 분류 성능 개선을 위한 LDA 단어 분포 기반의 자질 확장)

  • Lee, Hokyung;Yang, Seon;Ko, Youngjoong
    • Journal of KIISE
    • /
    • v.43 no.9
    • /
    • pp.1008-1014
    • /
    • 2016
  • Data such as Twitter, Facebook, and customer reviews belong to the informal document group, whereas, newspapers that have grammar correction step belong to the formal document group. Finding consistent rules or patterns in informal documents is difficult, as compared to formal documents. Hence, there is a need for additional approaches to improve informal document analysis. In this study, we classified Twitter data, a representative informal document, into ten categories. To improve performance, we revised and expanded features based on LDA(Latent Dirichlet allocation) word distribution. Using LDA top-ranked words, the other words were separated or bundled, and the feature set was thus expanded repeatedly. Finally, we conducted document classification with the expanded features. Experimental results indicated that the proposed method improved the micro-averaged F1-score of 7.11%p, as compared to the results before the feature expansion step.

Design and Implementation of an Information Visualization System based on Structured Classification Technique (구조적 분류 기법을 기반으로 한 정보 시각화 시스템 설계 및 구현)

  • Kim, Young-Ran;Koo, Yeon-Seol
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.12
    • /
    • pp.3514-3522
    • /
    • 1999
  • While the method of information collection and visual interface technique have been researched actively on web information retrieval, a study on structured modeling for effective classification of a wide collective information leaves to be desired. In this paper, we represent information feature based on structured information model. It aims at carrying out effectively the user's retrieval environment through visualization technique with analyzing the information feature. We propose a information classification method using Facet units and we construct the object model, table model, SQL code to define the relation of the information, and represent the information feature based on a wide range of views. After users gain a better global understanding of the information feature, retrieve more easily through their information. Conventional information retrieval is user-oriented to be what user want, but proposed technique it data-oriented which helps users to understand what exist in database by showing information feature.

  • PDF