• Title/Summary/Keyword: Feature Classification

Search Result 2,155, Processing Time 0.032 seconds

An Approach to Feature Selection for Classification of Resume (이력서의 Classification을 위한 Feature Selection 방안)

  • Lee, Manyu;Cho, Hyungsuk;Lee, Yu-jin;Hong, Jiwon;Kim, Sang-Wook
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2016.10a
    • /
    • pp.535-536
    • /
    • 2016
  • 사람이 수많은 지원자의 이력서들을 모두 꼼꼼히 읽는 데에는 엄청난 시간과 노동이 필요하다. 만약 컴퓨터가 이력서를 알맞은 직군으로 분류해 줄 수 있다면 이러한 어려움을 해소할 수 있다. 이를 위해 본 논문에서는 알맞은 직군으로 분류하기 위한 이력서를 학습할 때에 feature를 어떤 방법으로 선택할 수 있는지 그리고 feature의 개수는 몇 개가 적절한지에 대해 알아본다.

The Classification and Frequency Analysis in Radial Pulse (맥파의 인식상의 분류와 주파수 해석)

  • Kil, S.K.;Han, S.H.;Kwon, O.S.;Park, S.H.;Hong, S.H.
    • Proceedings of the KOSOMBE Conference
    • /
    • v.1998 no.11
    • /
    • pp.263-264
    • /
    • 1998
  • In this paper, we present the result of feature points recognition and classification of radial pulse by the shape of pulse wave. And we analyze radial pulse in frequency domain. The recognition algorithm use the method which runs in parallel with both the data of ECG and differential pulse simultaneously to recognize the feature points. Also fie specified 3-time elements of pulse wave as main parameters for diagnosis and measured them by execution of algorithm, then we classify the shape of radial pulse by existence and position of feature points. lastly we execute frequency analysis on the feature points and get the power spectrum of radial pulse.

  • PDF

Generation of 3D STEP Model from 2D Drawings Using Feature Definition of Ship Structure (선체구조 특징형상 정의에 의한 2D 도면에서 3D STEP 선체 모델의 생성)

  • 황호진;한순흥;김용대
    • Korean Journal of Computational Design and Engineering
    • /
    • v.8 no.2
    • /
    • pp.122-132
    • /
    • 2003
  • STEP AP218 has a standard schema to represent the structural model of a midship section. While it helps to exchange ship structural models among heterogeneous automation systems, most shipyards and classification societies still exchange information using 2D paper drawings. We propose a feature parameter input method to generate a 3D STEP model of a ship structure from 2D drawings. We have analyzed the ship structure information contained in 2D drawings and have defined a data model to express the contents of the drawing. We also developed a QUI for the feature parameter input. To translate 2D information extracted from the drawing into a STEP AP2l8 model, we have developed a shape generation library, and generated the 3D ship model through this library. The generated 3D STEP model of a ship structure can be used to exchange information between design departments in a shipyard as well as between classification societies and shipyards.

Term Frequency-Inverse Document Frequency (TF-IDF) Technique Using Principal Component Analysis (PCA) with Naive Bayes Classification

  • J.Uma;K.Prabha
    • International Journal of Computer Science & Network Security
    • /
    • v.24 no.4
    • /
    • pp.113-118
    • /
    • 2024
  • Pursuance Sentiment Analysis on Twitter is difficult then performance it's used for great review. The present be for the reason to the tweet is extremely small with mostly contain slang, emoticon, and hash tag with other tweet words. A feature extraction stands every technique concerning structure and aspect point beginning particular tweets. The subdivision in a aspect vector is an integer that has a commitment on ascribing a supposition class to a tweet. The cycle of feature extraction is to eradicate the exact quality to get better the accurateness of the classifications models. In this manuscript we proposed Term Frequency-Inverse Document Frequency (TF-IDF) method is to secure Principal Component Analysis (PCA) with Naïve Bayes Classifiers. As the classifications process, the work proposed can produce different aspects from wildly valued feature commencing a Twitter dataset.

Using the fusion of spatial and temporal features for malicious video classification (공간과 시간적 특징 융합 기반 유해 비디오 분류에 관한 연구)

  • Jeon, Jae-Hyun;Kim, Se-Min;Han, Seung-Wan;Ro, Yong-Man
    • The KIPS Transactions:PartB
    • /
    • v.18B no.6
    • /
    • pp.365-374
    • /
    • 2011
  • Recently, malicious video classification and filtering techniques are of practical interest as ones can easily access to malicious multimedia contents through the Internet, IPTV, online social network, and etc. Considerable research efforts have been made to developing malicious video classification and filtering systems. However, the malicious video classification and filtering is not still being from mature in terms of reliable classification/filtering performance. In particular, the most of conventional approaches have been limited to using only the spatial features (such as a ratio of skin regions and bag of visual words) for the purpose of malicious image classification. Hence, previous approaches have been restricted to achieving acceptable classification and filtering performance. In order to overcome the aforementioned limitation, we propose new malicious video classification framework that takes advantage of using both the spatial and temporal features that are readily extracted from a sequence of video frames. In particular, we develop the effective temporal features based on the motion periodicity feature and temporal correlation. In addition, to exploit the best data fusion approach aiming to combine the spatial and temporal features, the representative data fusion approaches are applied to the proposed framework. To demonstrate the effectiveness of our method, we collect 200 sexual intercourse videos and 200 non-sexual intercourse videos. Experimental results show that the proposed method increases 3.75% (from 92.25% to 96%) for classification of sexual intercourse video in terms of accuracy. Further, based on our experimental results, feature-level fusion approach (for fusing spatial and temporal features) is found to achieve the best classification accuracy.

RFA: Recursive Feature Addition Algorithm for Machine Learning-Based Malware Classification

  • Byeon, Ji-Yun;Kim, Dae-Ho;Kim, Hee-Chul;Choi, Sang-Yong
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.2
    • /
    • pp.61-68
    • /
    • 2021
  • Recently, various technologies that use machine learning to classify malicious code have been studied. In order to enhance the effectiveness of machine learning, it is most important to extract properties to identify malicious codes and normal binaries. In this paper, we propose a feature extraction method for use in machine learning using recursive methods. The proposed method selects the final feature using recursive methods for individual features to maximize the performance of machine learning. In detail, we use the method of extracting the best performing features among individual feature at each stage, and then combining the extracted features. We extract features with the proposed method and apply them to machine learning algorithms such as Decision Tree, SVM, Random Forest, and KNN, to validate that machine learning performance improves as the steps continue.

Spectral Signatures of Tombs and their Classification (묘지의 분광적 특성과 통계적 분류)

  • Eunmi Change;Kyeong Park;Minho Kim
    • Journal of the Korean Geographical Society
    • /
    • v.39 no.2
    • /
    • pp.283-296
    • /
    • 2004
  • More than 0.5 percent of land in Korea is used for cemetery and the rate is growing in spite of the increase in cremation these days. The systematic management of tombs may be possible through the ‘Feature Extraction’ method which is applied to the high-resolution satellite imagery. For this reason, this research focused on finding out the radiometric characteristics of tombs and the classification of them. An IKONOS image of northwest areas of Seoul with 8km x 10km dimension was analyzed. After sampling 24 tombs in the study area, the statistical radiometric characteristics of tombs are analyzed. And tombs were classified based on the criteria such as landscape, NDVI, and cluster analysis. In addition, it was investigated if the aspect or slope of the terrain influenced to the classification of tombs. As a result of this research, authors find that there is similarity between the classification tv NDVI and the classification through cluster analysis. And aspect or slope didn't have much influence on the classification of tombs.

A Genre-based Classification of Digital Documents by using Deviation Statistic of Genre-revealing Term and Subject-revealing Term (장르와 주제 범주간 용어 편차정보를 이용한 디지털 문서의 장르기반 분류)

  • 이용배;맹성현
    • Journal of KIISE:Software and Applications
    • /
    • v.30 no.11
    • /
    • pp.1062-1071
    • /
    • 2003
  • A genre-based classification means classifying documents by the purpose for which they were written, not by the semantics or subject areas. Most genre classifying methods in the past were based on the existing documents categorization algorithms and ineffective for feature selections, resulting in low quality classification results. In this research, we propose a new method for automatic classification of digital documents by genre. The genre classifier we developed uses the deviation statistic between the genre-revealing term frequencies and between the subject-revealing term frequencies within a genre. We collected Web documents to evaluate the proposed genre classification method. The experimental results show that the proposed method outperforms a direct application of a kai-square feature selection and bayesian classifier often used for subject classification by proving an excellent accuracy of about 30 percent.

An Analytical Study on Automatic Classification of Domestic Journal articles Using Random Forest (랜덤포레스트를 이용한 국내 학술지 논문의 자동분류에 관한 연구)

  • Kim, Pan Jun
    • Journal of the Korean Society for information Management
    • /
    • v.36 no.2
    • /
    • pp.57-77
    • /
    • 2019
  • Random Forest (RF), a representative ensemble technique, was applied to automatic classification of journal articles in the field of library and information science. Especially, I performed various experiments on the main factors such as tree number, feature selection, and learning set size in terms of classification performance that automatically assigns class labels to domestic journals. Through this, I explored ways to optimize the performance of random forests (RF) for imbalanced datasets in real environments. Consequently, for the automatic classification of domestic journal articles, Random Forest (RF) can be expected to have the best classification performance when using tree number interval 100~1000(C), small feature set (10%) based on chi-square statistic (CHI), and most learning sets (9-10 years).

A Study on Feature Selection and Feature Extraction for Hyperspectral Image Classification Using Canonical Correlation Classifier (정준상관분류에 의한 하이퍼스펙트럴영상 분류에서 유효밴드 선정 및 추출에 관한 연구)

  • Park, Min-Ho
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.29 no.3D
    • /
    • pp.419-431
    • /
    • 2009
  • The core of this study is finding out the efficient band selection or extraction method discovering the optimal spectral bands when applying canonical correlation classifier (CCC) to hyperspectral data. The optimal efficient bands grounded on each separability decision technique are selected using Multispec$^{(C)}$ software developed by Purdue university of USA. Total 6 separability decision techniques are used, which are Divergence, Transformed Divergence, Bhattacharyya, Mean Bhattacharyya, Covariance Bhattacharyya, Noncovariance Bhattacharyya. For feature extraction, PCA transformation and MNF transformation are accomplished by ERDAS Imagine and ENVI software. For the comparison and assessment on the effect of feature selection and feature extraction, land cover classification is performed by CCC. The overall accuracy of CCC using the firstly selected 60 bands is 71.8%, the highest classification accuracy acquired by CCC is 79.0% as the case that executes CCC after appling Noncovariance Bhattacharyya. In conclusion, as a matter of fact, only Noncovariance Bhattacharyya separability decision method was valuable as feature selection algorithm for hyperspectral image classification depended on CCC. The lassification accuracy using other feature selection and extraction algorithms except Divergence rather declined in CCC.