• Title/Summary/Keyword: hierarchical classifier (계층적 분류기)

Search Result 95, Processing Time 0.025 seconds

Performance Comparison of Automatic Classification Using Word Embeddings of Book Titles (단행본 서명의 단어 임베딩에 따른 자동분류의 성능 비교)

  • Yong-Gu Lee
    • Journal of the Korean Society for Information Management
    • /
    • v.40 no.4
    • /
    • pp.307-327
    • /
    • 2023
  • To analyze the impact of word embedding on book titles, this study used word embedding models (Word2vec, GloVe, fastText) to generate embedding vectors from book titles. These vectors were then used as features for automatic classification. The classifier was the k-nearest neighbors (kNN) algorithm, and the target categories were based on the DDC (Dewey Decimal Classification) main class 300 assigned to books by libraries. In the automatic classification experiment applying word embeddings to book titles, the Skip-gram architectures of Word2vec and fastText gave better kNN classification performance than TF-IDF features. In the optimization of various hyperparameters across the three models, the Skip-gram architecture of the fastText model showed good performance overall. Specifically, this model performed better with hierarchical softmax and larger embedding dimensions. From a performance perspective, fastText can generate embeddings for substrings or subwords using the n-gram method, which has been shown to increase recall. The Skip-gram architecture of the Word2vec model generally performed well at low dimensions (size 300) and with small negative sampling sizes (3 or 5).
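
The pipeline described in this abstract (title embeddings fed to a kNN classifier) can be sketched roughly as follows. This is a minimal illustration, not the study's implementation: the three-dimensional vectors in `TOY_VECTORS`, the training titles, and the class labels are all hypothetical, and averaging word vectors into a title vector and using cosine similarity are assumptions of the sketch. In practice the vectors would come from a trained Word2vec, GloVe, or fastText model.

```python
import math
from collections import Counter

# Hypothetical 3-dimensional word vectors (assumption for illustration only).
TOY_VECTORS = {
    "economics": [0.9, 0.1, 0.0],
    "market": [0.8, 0.2, 0.1],
    "law": [0.1, 0.9, 0.0],
    "court": [0.0, 0.8, 0.2],
    "education": [0.1, 0.1, 0.9],
    "school": [0.2, 0.0, 0.8],
}

# Hypothetical training titles labeled with DDC divisions under class 300.
TRAINING = [
    ("market economics", "330"),
    ("economics", "330"),
    ("court law", "340"),
    ("law", "340"),
    ("school education", "370"),
    ("education", "370"),
]


def title_vector(title, vectors):
    """Average the vectors of the title words that have an embedding."""
    known = [vectors[w] for w in title.lower().split() if w in vectors]
    if not known:
        return None
    dim = len(known[0])
    return [sum(v[i] for v in known) / len(known) for i in range(dim)]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def knn_predict(title, training, vectors, k=3):
    """Majority vote among the k most cosine-similar training titles."""
    query = title_vector(title, vectors)
    if query is None:
        return None
    scored = []
    for train_title, label in training:
        vec = title_vector(train_title, vectors)
        if vec is not None:
            scored.append((cosine(query, vec), label))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    votes = Counter(label for _, label in scored[:k])
    return votes.most_common(1)[0][0] if votes else None


print(knn_predict("the market", TRAINING, TOY_VECTORS))
```

A title with no known words returns `None` here; a subword model such as fastText could still produce a vector in that case, which is one plausible reading of the recall gain the abstract mentions.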

A Study on the Word 'is' in a Sentence "A Parallelogram is Trapezoid." ("평행사변형은 사다리꼴이다."에서 '이다'에 대한 고찰)

  • Yi, Gyuhee;Choi, Younggi
    • School Mathematics
    • /
    • v.18 no.3
    • /
    • pp.527-539
    • /
    • 2016
  • The word 'is' in "A parallelogram is trapezoid." is ambiguous and rich in meaning. In this paper, 'is' in everyday language is identified as a semantic prime that can be interpreted in different ways depending on context and situation, and the meanings of 'is' in mathematics are discussed separately. Focusing on 'identity', 'is' is reinterpreted from the viewpoint of equivalence relations and the van Hieles' work. As a mathematical sign, 'is' is thought to play a significant role in producing mathematical ideas meaningfully.

Progressive Image Transmission Using Hierarchical Pyramid Structure and Classified Vector Quantizer in DCT Domain (계층적 피라미드 구조와 DCT 영역에서의 분류 벡터 양지기를 이용한 점진적 영상전송)

  • 박섭형;이상욱
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.26 no.8
    • /
    • pp.1227-1237
    • /
    • 1989
  • In this paper, we propose a lossless progressive image transmission scheme using a hierarchical pyramid structure and a classified vector quantizer in the DCT domain. By applying the DCT to the hierarchical pyramid signals, we can reduce the spatial redundancy. Moreover, the DCT coefficients can be encoded efficiently by using a classified vector quantizer in the DCT domain. The classifier is based simply on the variance of a subblock. In addition, adding the mirror set of the training images can improve the robustness of the codebooks. Progressive image transmission is achieved through the following processes: from the top to the bottom level of planes in a pyramid, and from the high to the low AC variance class within a plane. Simulation results with real images show that the proposed coding scheme yields good performance below 0.3 bpp and an excellent result at 0.409 bpp. The proposed coding scheme is well suited for lossless progressive image transmission as well as image data compression.
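
The variance-based subblock classifier and the high-to-low ordering mentioned in this abstract might look roughly like the sketch below. The abstract gives no thresholds or class count, so the two thresholds (and the resulting three classes) are hypothetical, and the variance is computed here on pixel values as a simplifying assumption.

```python
def block_variance(block):
    """Population variance of the values in a 2-D subblock."""
    pixels = [p for row in block for p in row]
    mean = sum(pixels) / len(pixels)
    return sum((p - mean) ** 2 for p in pixels) / len(pixels)


def classify_block(block, thresholds=(25.0, 400.0)):
    """Return a class index: 0 = low, 1 = medium, 2 = high variance.

    The thresholds are hypothetical; the abstract does not specify any.
    """
    variance = block_variance(block)
    return sum(variance >= t for t in thresholds)


def transmission_order(blocks):
    """Indices of blocks ordered from the high to the low variance class."""
    classes = [classify_block(b) for b in blocks]
    return sorted(range(len(blocks)), key=lambda i: classes[i], reverse=True)


flat = [[10, 10], [10, 10]]
edge = [[0, 255], [0, 255]]
texture = [[10, 30], [30, 10]]
print(transmission_order([flat, edge, texture]))
```

In a classified vector quantizer each class would normally have its own codebook; that step is omitted from this sketch.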

Face Detection Using Multiple Filters and Hybrid Neural Networks (다중 필터와 복합형 신경망을 이용한 얼굴 검출 기법)

  • Cho, Il-Gook;Park, Hyun-Jung;Kim, Ho-Joon
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2005.11a
    • /
    • pp.191-194
    • /
    • 2005
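  • This paper presents a face-pattern detection technique for broadcast video that is robust to illumination effects and scale changes. The proposed face detection model consists of an image preprocessing stage and a face-pattern detection stage. The preprocessing stage is divided into a correction function for illumination changes and a candidate-region selection function based on multiple filters. The face-pattern detection stage consists of multi-stage feature-map generation and pattern classification. To generate the feature maps, we introduce a CNN (Convolutional Neural Network) model that includes a Gabor filter layer. For effective learning that takes diverse backgrounds into account, this paper applies a CNN model whose structure includes inhibitory neurons. The feature set extracted from the CNN is classified in the final stage using the WFMM (Weighted Fuzzy Min-Max) model. The size of this feature set plays a decisive role in the scale and computational cost of the classifier. We therefore propose an adaptive feature selection technique that uses the FMM model to effectively reduce the number of features used in the final classification step. We also examine the validity of the proposed approach through experimental results on real images.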
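
As a rough illustration of what a Gabor filter layer computes, the sketch below samples the real part of the standard 2-D Gabor function on a small grid and builds a bank at four orientations. The kernel size, wavelength, sigma, and aspect ratio are hypothetical values chosen for the example; the abstract does not state the parameters used in the paper.

```python
import math


def gabor_kernel(size, wavelength, theta, sigma, gamma=0.5, psi=0.0):
    """Real part of a 2-D Gabor filter sampled on a size x size grid."""
    if size % 2 == 0:
        raise ValueError("size must be odd so the kernel has a center")
    half = size // 2
    kernel = []
    for y in range(-half, half + 1):
        row = []
        for x in range(-half, half + 1):
            # Rotate the coordinates by the filter orientation theta.
            xr = x * math.cos(theta) + y * math.sin(theta)
            yr = -x * math.sin(theta) + y * math.cos(theta)
            # Gaussian envelope multiplied by a cosine carrier.
            envelope = math.exp(-(xr * xr + gamma * gamma * yr * yr) / (2 * sigma * sigma))
            carrier = math.cos(2 * math.pi * xr / wavelength + psi)
            row.append(envelope * carrier)
        kernel.append(row)
    return kernel


# Hypothetical bank: four orientations, 45 degrees apart.
bank = [gabor_kernel(7, wavelength=4.0, theta=k * math.pi / 4, sigma=2.0) for k in range(4)]
print(len(bank), len(bank[0]), len(bank[0][0]))
```

Convolving an image with each kernel in the bank would give one orientation-selective feature map per kernel.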

Version Management System of Hierarchy Interface System for CAD Database (CAD 데이터 베이스를 위한 HIS에서의 버전 관리 시스템)

  • Ahn, Syung-Og;Park, Dong-Won
    • The Journal of Engineering Research
    • /
    • v.2 no.1
    • /
    • pp.23-30
    • /
    • 1997
  • For effective management and easy tool integration of CAD databases, the Hierarchy Interface System (HIS) was designed and the GROCO (Graph Representation fOr Complex Objects) model was presented in a separate paper [10]. The Hierarchy Interface System, which is composed of two subsystems, a configurator and a converter, is designed as the interface between a conventional database management system and CAD tools. In this paper, a Version Management System is presented to support effective operation of HIS using the GROCO model. The Version Management System efficiently supports the characteristics of CAD databases, which have a hierarchical structure of composite objects. In the Version Management System, a design evolves in discrete states through mutation and derivation as it passes through the phases of design, giving rise to multiple versions. Operations and rules are provided for transitions between these states and for controlling update propagation and preventing version proliferation. A Version Modeling Graph is proposed for dealing with versioning at the instance and type levels.

A Noise-Tolerant Hierarchical Image Classification System based on Autoencoder Models (오토인코더 기반의 잡음에 강인한 계층적 이미지 분류 시스템)

  • Lee, Jong-kwan
    • Journal of Internet Computing and Services
    • /
    • v.22 no.1
    • /
    • pp.23-30
    • /
    • 2021
  • This paper proposes a noise-tolerant image classification system using multiple autoencoders. The development of deep learning technology has dramatically improved the performance of image classifiers. However, if the images are contaminated by noise, the performance degrades rapidly. Noise is inevitably added to an image in the process of obtaining and transmitting it. Therefore, in order to use the classifier in a real environment, we have to deal with the noise. An autoencoder, meanwhile, is an artificial neural network model that is trained so that its output values are similar to its input values. If the input data is similar to the training data, the error between the input data and output data of the autoencoder will be small. However, if the input data is not similar to the training data, the error will be large. The proposed system uses this relationship between the input data and the output data of the autoencoder, and it classifies images in two phases. In the first phase, the classes with the highest likelihood of classification are selected, and they are subjected to the procedure again in the second phase. For the performance analysis of the proposed system, classification accuracy was tested on a Gaussian noise-contaminated MNIST dataset. The experiment confirmed that, in the noisy environment, the proposed system has higher accuracy than the CNN-based classification technique.
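
The reconstruction-error idea in this abstract can be illustrated with a deliberately simple stand-in. The sketch below does not train any autoencoder: each class is represented by a single hypothetical direction vector, and "reconstruction" is the projection of the input onto that direction (a rank-1 linear encoder and decoder). It only shows a first-phase style selection of candidate classes by smallest error; how the paper's second phase re-applies the procedure is not specified in the abstract and is not modeled here.

```python
def project(x, direction):
    """Reconstruct x from its projection onto one direction vector.

    Stand-in for a trained per-class autoencoder (hypothetical).
    """
    scale = sum(a * b for a, b in zip(x, direction)) / sum(d * d for d in direction)
    return [scale * d for d in direction]


def reconstruction_error(x, direction):
    """Squared error between the input and its reconstruction."""
    recon = project(x, direction)
    return sum((a - b) ** 2 for a, b in zip(x, recon))


def candidate_classes(x, class_directions, top_k=2):
    """Keep the top_k classes whose stand-in model reconstructs x best."""
    ranked = sorted(
        class_directions,
        key=lambda c: reconstruction_error(x, class_directions[c]),
    )
    return ranked[:top_k]


# Hypothetical class models and input.
directions = {"a": [1.0, 0.0, 0.0], "b": [0.0, 1.0, 0.0], "c": [1.0, 1.0, 0.0]}
sample = [0.9, 0.1, 0.0]
print(candidate_classes(sample, directions))
```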

Real-Time Face Detection and Tracking Using the AdaBoost Algorithm (AdaBoost 알고리즘을 이용한 실시간 얼굴 검출 및 추적)

  • Lee, Wu-Ju;Kim, Jin-Chul;Lee, Bae-Ho
    • Journal of Korea Multimedia Society
    • /
    • v.9 no.10
    • /
    • pp.1266-1275
    • /
    • 2006
  • In this paper, we propose a real-time face detection and tracking algorithm using the AdaBoost (Adaptive Boosting) algorithm. The proposed algorithm consists of two stages: face detection and face tracking. First, face detection uses eight wavelet feature models, which are very simple. Each feature model is applied at variable sizes and positions to create the initial feature set. The initial feature set and the training images, which consist of face and non-face images, are fed to the AdaBoost algorithm. The basic principle of the AdaBoost algorithm is to create a final strong classifier by linearly combining weak classifiers. For AdaBoost training, we propose the SAT (Summed-Area Table) method. Face tracking is accomplished in real time using the position and size information of the detected face, and the view region is extended dynamically using a Pan-Tilt camera. The camera is set to move the center of the detected face to the center of the image. The experimental results were amply satisfactory in terms of computational efficiency and detection rate. In a real-time application using the Pan-Tilt camera, the detector runs at about 12 frames per second.
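
The summed-area table mentioned in this abstract lets the sum over any rectangle be read with four lookups, which is what makes rectangle-based features cheap to evaluate. The sketch below is a generic illustration; the two-rectangle feature at the end is a hypothetical example and is not one of the paper's eight feature models.

```python
def summed_area_table(image):
    """Build a table padded with one zero row and one zero column.

    sat[y][x] holds the sum of all pixels above row y and left of column x.
    """
    height, width = len(image), len(image[0])
    sat = [[0] * (width + 1) for _ in range(height + 1)]
    for y in range(height):
        row_sum = 0
        for x in range(width):
            row_sum += image[y][x]
            sat[y + 1][x + 1] = sat[y][x + 1] + row_sum
    return sat


def rect_sum(sat, top, left, height, width):
    """Sum of the rectangle with the given top-left corner and size."""
    bottom, right = top + height, left + width
    return sat[bottom][right] - sat[top][right] - sat[bottom][left] + sat[top][left]


def two_rect_feature(sat, top, left, height, width):
    """Left half minus right half (hypothetical two-rectangle feature)."""
    half = width // 2
    left_sum = rect_sum(sat, top, left, height, half)
    right_sum = rect_sum(sat, top, left + half, height, half)
    return left_sum - right_sum


image = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12]]
sat = summed_area_table(image)
print(rect_sum(sat, 1, 1, 2, 2), two_rect_feature(sat, 0, 0, 3, 4))
```

Each weak classifier in a boosted detector would typically threshold one such feature value, and the strong classifier would combine their weighted votes.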

Pattern Classification of Chromosome Images using the Image Reconstruction Method (영상 재구성방법을 이용한 염색체 영상의 패턴 분류)

  • 김충석;남재현;장용훈
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.7 no.4
    • /
    • pp.839-844
    • /
    • 2003
  • To improve classification accuracy, this paper proposes an algorithm for chromosome image reconstruction in the image preprocessing stage. We also propose a pattern classification method using a hierarchical multilayer neural network (HMNN) to classify the chromosome karyotype. Chromosome images of twenty normal humans were reconstructed by the image reconstruction algorithm. Four morphological and ten density feature parameters were extracted from the 920 reconstructed chromosome images. The combined feature parameters of the chromosome images of ten humans were used to train the HMNN, and the rest were used to classify the chromosome images. In the experiments, an optimized HMNN was constructed and a recognition rate of about 98.26% was obtained.

Scene Text Extraction in Natural Images using Hierarchical Feature Combination and Verification (계층적 특징 결합 및 검증을 이용한 자연이미지에서의 장면 텍스트 추출)

  • 최영우;김길천;송영자;배경숙;조연희;노명철;이성환;변혜란
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.4
    • /
    • pp.420-438
    • /
    • 2004
  • Texts contained artificially or naturally in natural images carry significant and detailed information about the scenes. If we develop a method that can extract and recognize those texts in real time, the method can be applied to many important applications. In this paper, we suggest a new method that extracts the text areas in natural images using the low-level image features of color continuity, gray-level variation, and color variance, and that verifies the extracted candidate regions using high-level text features such as strokes. The two levels of features are combined hierarchically. The color continuity is used since most of the characters in the same text region have the same color, and the gray-level variation is used since the text strokes are distinct from the background in their gray values. The color variance is also used since the text strokes are distinct from the background in their gray values, and this value is more sensitive than the gray-level variation. The text-level stroke features are extracted using a multi-resolution wavelet transform on local image areas, and the feature vectors are input to an SVM (Support Vector Machine) classifier for verification. We have tested the proposed method on various kinds of natural images and confirmed that the extraction rates are very high even in images with complex backgrounds.

Influences on Health Behaviors Execution and Self Rated Health as Socioeconomic Class by the Age Bracket (연령층별 사회경제적 계층에 따른 건강행위 실천과 주관적 건강수준에 미치는 영향)

  • Lee, Jung-Min;Kim, Won-Joong;Sohn, Hae-Sook;Chun, Jin-Ho;Lee, Myeong-Jin;Park, Hyun-Suk
    • The Journal of the Korea Contents Association
    • /
    • v.12 no.6
    • /
    • pp.317-327
    • /
    • 2012
  • The purpose of the present study was to examine the paths and effects among socioeconomic class (SEC), health practices, and self-rated health (SRH) by age bracket. The subjects were 4,987 adults over 25 years old who participated in the 2008 Korea National Health and Nutrition Examination Survey and could be classified into SEC in terms of three characteristics: education, income, and occupation. Path analysis was conducted on the effects of health behavior practice on differences in SRH, and complex-samples analysis was performed using the chi-square test, t-test, and ANOVA. As a result, lower SRH paralleled lower SEC, and more health behaviors differed by SEC in the younger and middle-aged groups. The lower the SEC, the lower the SRH: non-smoking and weight control for younger women and exercise for aged men had indirect effects as parameters. In conclusion, when planning a health promotion program, it would be important to select the correct target populations with consideration of age bracket, gender, and SEC, and to establish tailored contents fit for each population.