• Title/Summary/Keyword: Feature Collection

Search Result 197, Processing Time 0.026 seconds

A Study on Feature Selection for kNN Classifier using Document Frequency and Collection Frequency (문헌빈도와 장서빈도를 이용한 kNN 분류기의 자질선정에 관한 연구)

  • Lee, Yong-Gu
    • Journal of Korean Library and Information Science Society
    • /
    • v.44 no.1
    • /
    • pp.27-47
    • /
    • 2013
  • This study investigated the classification performance of a kNN classifier using the feature selection methods based on document frequency(DF) and collection frequency(CF). The results of the experiments, which used HKIB-20000 data, were as follows. First, the feature selection methods that used high-frequency terms and removed low-frequency terms by the CF criterion achieved better classification performance than those using the DF criterion. Second, neither DF nor CF methods performed well when low-frequency terms were selected first in the feature selection process. Last, combining CF and DF criteria did not result in better classification performance than using the single feature selection criterion of DF or CF.

Indoor Path Recognition Based on Wi-Fi Fingerprints

  • Donggyu Lee;Jaehyun Yoo
    • Journal of Positioning, Navigation, and Timing
    • /
    • v.12 no.2
    • /
    • pp.91-100
    • /
    • 2023
  • The existing indoor localization method using Wi-Fi fingerprinting has a high collection cost and relatively low accuracy, thus requiring integrated correction of convergence with other technologies. This paper proposes a new method that significantly reduces collection costs compared to existing methods using Wi-Fi fingerprinting. Furthermore, it does not require labeling of data at collection and can estimate pedestrian travel paths even in large indoor spaces. The proposed pedestrian movement path estimation process is as follows. Data collection is accomplished by setting up a feature area near an indoor space intersection, moving through the set feature areas, and then collecting data without labels. The collected data are processed using Kernel Linear Discriminant Analysis (KLDA) and the valley point of the Euclidean distance value between two data is obtained within the feature space of the data. We build learning data by labeling data corresponding to valley points and some nearby data by feature area numbers, and labeling data between valley points and other valley points as path data between each corresponding feature area. Finally, for testing, data are collected randomly through indoor space, KLDA is applied as previous data to build test data, the K-Nearest Neighbor (K-NN) algorithm is applied, and the path of movement of test data is estimated by applying a correction algorithm to estimate only routes that can be reached from the most recently estimated location. The estimation results verified the accuracy by comparing the true paths in indoor space with those estimated by the proposed method and achieved approximately 90.8% and 81.4% accuracy in two experimental spaces, respectively.

Prototype-based Classifier with Feature Selection and Its Design with Particle Swarm Optimization: Analysis and Comparative Studies

  • Park, Byoung-Jun;Oh, Sung-Kwun
    • Journal of Electrical Engineering and Technology
    • /
    • v.7 no.2
    • /
    • pp.245-254
    • /
    • 2012
  • In this study, we introduce a prototype-based classifier with feature selection that dwells upon the usage of a biologically inspired optimization technique of Particle Swarm Optimization (PSO). The design comprises two main phases. In the first phase, PSO selects P % of patterns to be treated as prototypes of c classes. During the second phase, the PSO is instrumental in the formation of a core set of features that constitute a collection of the most meaningful and highly discriminative coordinates of the original feature space. The proposed scheme of feature selection is developed in the wrapper mode with the performance evaluated with the aid of the nearest prototype classifier. The study offers a complete algorithmic framework and demonstrates the effectiveness (quality of solution) and efficiency (computing cost) of the approach when applied to a collection of selected data sets. We also include a comparative study which involves the usage of genetic algorithms (GAs). Numerical experiments show that a suitable selection of prototypes and a substantial reduction of the feature space could be accomplished and the classifier formed in this manner becomes characterized by low classification error. In addition, the advantage of the PSO is quantified in detail by running a number of experiments using Machine Learning datasets.

A Novel Statistical Feature Selection Approach for Text Categorization

  • Fattah, Mohamed Abdel
    • Journal of Information Processing Systems
    • /
    • v.13 no.5
    • /
    • pp.1397-1409
    • /
    • 2017
  • For text categorization task, distinctive text features selection is important due to feature space high dimensionality. It is important to decrease the feature space dimension to decrease processing time and increase accuracy. In the current study, for text categorization task, we introduce a novel statistical feature selection approach. This approach measures the term distribution in all collection documents, the term distribution in a certain category and the term distribution in a certain class relative to other classes. The proposed method results show its superiority over the traditional feature selection methods.

Vehicle Information Recognition and Electronic Toll Collection System with Detection of Vehicle feature Information in the Rear-Side of Vehicle (차량후면부 차량특징정보 검출을 통한 차량정보인식 및 자동과금시스템)

  • 이응주
    • Journal of Korea Multimedia Society
    • /
    • v.7 no.1
    • /
    • pp.35-43
    • /
    • 2004
  • In this paper, we proposed a vehicle recognition and electronic toll collection system with detection and classification of vehicle identification mark and emblem as well as recognition of vehicle license plate to unman toll fee collection system or incoming/outcoming vehicles to an institution. In the proposed algorithm, we first process pre-processing step such as noise reduction and thinning from the rear side input image of vehicle and detect vehicle mark, emblem and license plate region using intensity variation informations, template masking and labeling operation. And then, we classify the detected vehicle features regions into vehicle mark and emblem as well as recognize characters and numbers of vehicle license plate using hybrid and seven segment pattern vector. To show the efficiency of the proposed algorithm, we tested it on real vehicle images of implemented vehicle recognition system in highway toll gate and found that the proposed method shows good feature detection/classification performance regardless of irregular environment conditions as well as noise, size, and location of vehicles. And also, the proposed algorithm may be utilized for catching criminal vehicles, unmanned toll collection system, and unmanned checking incoming/outcoming vehicles to an institution.

  • PDF

Analysis of cellular fatty acid methyl esters (FAMEs) for the identification of leuconostoc strains isolated from kimchi

  • Lee, Jung-Sook;Chun, Chang-Ouk;Kim, Hong-Joong;Joo, Yun-Jung;Lee, Hun-Joo;Park, Chan-Sum;Park, Yong-Ha;Mheen, Tae-Ick
    • Journal of Microbiology
    • /
    • v.34 no.3
    • /
    • pp.225-228
    • /
    • 1996
  • The cellular fatty acid methyl esters (FAMEs) analysis data obtained for clusters defined at a Euclidian distance of 17.5, in the classification of lactic acid bacteria isolated from kimchi, described by Lee et al. (4), was used for the identification of 79 Leuconostoc isolates. The test strains were isolated using a selective isolation medium specific for the genus Leuconostoc. These strains were then characterized according to their fatty acid profiles. The results show that all seventy nine test strains were identified to the known Leuconostoc clusters B, C, and D. Cluster B had the highest relative amount of the saturated fatty acid 16 : 0. The saturated fatty acid 16 : 0 and summed feature 9 were found as a major components in cluster C, which had a higher level of summed feature 9 than cluster B. Cluster D is characterized by the highest relative amount of the unsaturated fatty acid 18 : 1 w9c. It is suggested that FAMEs analysis can be successfully applied in the identification of lactic acid bacteria isolated from kimchi.

  • PDF

Feature Voting for Object Localization via Density Ratio Estimation

  • Wang, Liantao;Deng, Dong;Chen, Chunlei
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.12
    • /
    • pp.6009-6027
    • /
    • 2019
  • Support vector machine (SVM) classifiers have been widely used for object detection. These methods usually locate the object by finding the region with maximal score in an image. With bag-of-features representation, the SVM score of an image region can be written as the sum of its inside feature-weights. As a result, the searching process can be executed efficiently by using strategies such as branch-and-bound. However, the feature-weight derived by optimizing region classification cannot really reveal the category knowledge of a feature-point, which could cause bad localization. In this paper, we represent a region in an image by a collection of local feature-points and determine the object by the region with the maximum posterior probability of belonging to the object class. Based on the Bayes' theorem and Naive-Bayes assumptions, the posterior probability is reformulated as the sum of feature-scores. The feature-score is manifested in the form of the logarithm of a probability ratio. Instead of estimating the numerator and denominator probabilities separately, we readily employ the density ratio estimation techniques directly, and overcome the above limitation. Experiments on a car dataset and PASCAL VOC 2007 dataset validated the effectiveness of our method compared to the baselines. In addition, the performance can be further improved by taking advantage of the recently developed deep convolutional neural network features.

A Study about the Characteristics of Designs in John Galliano Collection - focusing on Christian Dior's Collection - (존 갈리아노 컬렉션의 디자인 특성에 관한 연구 - 크리스찬 디오르의 컬렉션을 중심으로 -)

  • Lee, Kwuy-Young;Cho, Kyu-Hwa
    • Journal of Fashion Business
    • /
    • v.13 no.2
    • /
    • pp.50-65
    • /
    • 2009
  • The main purpose of this study is to identify characteristics of shapes of John Galliano's Dior Collection as the chief executive designer of Christian Dior Maison during $1996{\sim}2007$ after he showed himself in Paris in 1990. This study was based on the analyses of John Galliano's design trends of his collections, the pictures of his works in Christian Dior's collection, real works, documents and fashion magazines, newspapers, mass media, internet sites and other visual materials. The study identified characteristics of shapes in Dior Collection until 07/08 F/W as the chief executive designer of Christian Dior Maison, and the design trends before his post-Paris period. Followings are the conclusions of the study. First, Galliano was open to any types of cultures as a liberalist, and also respectful to the tradition or principles. He led the fashion business with new trends by exploring both sides. Second, he succeeded in commercializing his avant-garde feature. Especially, His creativity changed the image of Christian Dior to younger and more casual one. Third, born in England and worked in French, he always took both English (Victorian Style) and French(Napoleon era, Femme Fatal style) sides, and showed excellent formulation that the times needed by combining topical Chinese, Japanese, Egyptian styles.

Robust Image Mosaic using Geometrical Feature Model (기하학적 특징 모델을 이용한 강건한 영상 모자이크 기법)

  • 김정훈;김대현;윤용인;최종수
    • Proceedings of the IEEK Conference
    • /
    • 2000.11d
    • /
    • pp.13-16
    • /
    • 2000
  • This paper presents a robust method to combine a collection of images with small fields of view to obtain an image with a large field of view. In the previous works, there are two main areas which one is a cross correlation-based method and the other is a feature-based method. The former is based on motion estimation from video sequences. so there are a problem on rotating a camera about optical axis. In the latter method, it is difficult to match correspondence feature points correctly.'re find correct correspondences, we proposed the geometrical feature model and correspondence filters and the Gaussian distribution weight function to blend the images smoothly. The experiments show that our method is robust and effective.

  • PDF

A Study for Feature Selection in the Intrusion Detection System (침입탐지시스템에서의 특징 선택에 대한 연구)

  • Han, Myung-Mook
    • Convergence Security Journal
    • /
    • v.6 no.3
    • /
    • pp.87-95
    • /
    • 2006
  • An intrusion can be defined as any set of actors that attempt to compromise the integrity, confidentiality and availability of computer resource and destroy the security policy of computer system. The Intrusion Detection System that detects the intrusion consists of data collection, data reduction, analysis and detection, and report and response. It is important for feature selection to detect the intrusion efficiently after collecting the large set of data of Intrusion Detection System. In this paper, the feature selection method using Genetic Algorithm and Decision Tree is proposed. Also the method is verified by the simulation with KDD data.

  • PDF