• Title/Summary/Keyword: LDA Method

Search Result 269, Processing Time 0.025 seconds

Face Recognition Based on PCA and LDA Combining Clustering (Clustering을 결합한 PCA와 LDA 기반 얼굴 인식)

  • Guo, Lian-Hua;Kim, Pyo-Jae;Chang, Hyung-Jin;Choi, Jin-Young
    • Proceedings of the IEEK Conference
    • /
    • 2006.06a
    • /
    • pp.387-388
    • /
    • 2006
  • In this paper, we propose an efficient algorithm based on PCA and LDA combining K-means clustering method, which has better accuracy of face recognition than Eigenface and Fisherface. In this algorithm, PCA is firstly used to reduce the dimensionality of original face image. Secondly, a truncated face image data are sub-clustered by K-means clustering method based on Euclidean distances, and all small subclusters are labeled in sequence. Then LDA method project data into low dimension feature space and group data easier to classify. Finally we use nearest neighborhood method to determine the label of test data. To show the recognition accuracy of the proposed algorithm, we performed several simulations using the Yale and ORL (Olivetti Research Laboratory) database. Simulation results show that proposed method achieves better performance in recognition accuracy.

  • PDF

Real-Time Face Recognition Based on Subspace and LVQ Classifier (부분공간과 LVQ 분류기에 기반한 실시간 얼굴 인식)

  • Kwon, Oh-Ryun;Min, Kyong-Pil;Chun, Jun-Chul
    • Journal of Internet Computing and Services
    • /
    • v.8 no.3
    • /
    • pp.19-32
    • /
    • 2007
  • This paper present a new face recognition method based on LVQ neural net to construct a real time face recognition system. The previous researches which used PCA, LDA combined neural net usually need much time in training neural net. The supervised LVQ neural net needs much less time in training and can maximize the separability between the classes. In this paper, the proposed method transforms the input face image by PCA and LDA sequentially into low-dimension feature vectors and recognizes the face through LVQ neural net. In order to make the system robust to external light variation, light compensation is performed on the detected face by max-min normalization method as preprocessing. PCA and LDA transformations are applied to the normalized face image to produce low-level feature vectors of the image. In order to determine the initial centers of LVQ and speed up the convergency of the LVQ neural net, the K-Means clustering algorithm is adopted. Subsequently, the class representative vectors can be produced by LVQ2 training using initial center vectors. The face recognition is achieved by using the euclidean distance measure between the center vector of classes and the feature vector of input image. From the experiments, we can prove that the proposed method is more effective in the recognition ratio for the cases of still images from ORL database and sequential images rather than using conventional PCA of a hybrid method with PCA and LDA.

  • PDF

Language Model Adaptation Based on Topic Probability of Latent Dirichlet Allocation

  • Jeon, Hyung-Bae;Lee, Soo-Young
    • ETRI Journal
    • /
    • v.38 no.3
    • /
    • pp.487-493
    • /
    • 2016
  • Two new methods are proposed for an unsupervised adaptation of a language model (LM) with a single sentence for automatic transcription tasks. At the training phase, training documents are clustered by a method known as Latent Dirichlet allocation (LDA), and then a domain-specific LM is trained for each cluster. At the test phase, an adapted LM is presented as a linear mixture of the now trained domain-specific LMs. Unlike previous adaptation methods, the proposed methods fully utilize a trained LDA model for the estimation of weight values, which are then to be assigned to the now trained domain-specific LMs; therefore, the clustering and weight-estimation algorithms of the trained LDA model are reliable. For the continuous speech recognition benchmark tests, the proposed methods outperform other unsupervised LM adaptation methods based on latent semantic analysis, non-negative matrix factorization, and LDA with n-gram counting.

Operation diagnostic based on PCA for wastewater treatment (PCA를 이용한 하폐수처리시설 운전상태진단)

  • Jun Byong-Hee;Park Jang-Hwan;Chun Myung-Geun
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.16 no.3
    • /
    • pp.383-388
    • /
    • 2006
  • SBR is one of the most general sewage/wastewater treatment processes and, particularly, has an advantage in high concentration wastewater treatment like sewage wastewater. A Kernel PCA based fault diagnosis system for biological reaction in full-scale wastewater treatment plant was proposed using only common bio-chemical sensors such as ORP(Oxidation-Reduction Potential) and DO(Dissolved Oxygen). During the SBR operation, the operation status could be divided into normal status and abnormal status such as controller malfunction, influent disturbance and instrumental trouble. For the classification and diagnosis of these statuses, a series of preprocessing, dimension reduction using PCA, LDA, K-PCA and feature reduction was performed. Also, the diagnosis result using differential data was superior to that of raw data, and the fusion data show better results than other data. Also, the results of combination of K-PCA and LDA were better than those of LDA or (PCA+LDA). Finally, the fault recognition rate in case of using only ORP or DO was around maximum 97.03% and the fusion method showed better result of maximum 98.02%.

Fault Diagnosis of Induction Motor using Linear Discriminant Analysis (선형판별분석기법을 이용한 유도전동기의 고장진단)

  • 전병석;이상혁;박장환;유정웅;전명근
    • Journal of the Korean Institute of Illuminating and Electrical Installation Engineers
    • /
    • v.18 no.4
    • /
    • pp.104-111
    • /
    • 2004
  • In this paper, we propose a diagnosis algorithm to detect faults of induction motor using LDA First, after reducing the input dimension of a current value measured by experiment at each period using PCA method, we extract characteristic vectors for each fault using LDA Next, we analyze the driving condition of an induction motor using the Euclidean distance between a precalculated characteristic vector and an input vector. Finally, from the experiments under various noise conditions showing the properties of the LDA method, we obtained better results than the case of using the PCA method.

Topic Extraction and Classification Method Based on Comment Sets

  • Tan, Xiaodong
    • Journal of Information Processing Systems
    • /
    • v.16 no.2
    • /
    • pp.329-342
    • /
    • 2020
  • In recent years, emotional text classification is one of the essential research contents in the field of natural language processing. It has been widely used in the sentiment analysis of commodities like hotels, and other commentary corpus. This paper proposes an improved W-LDA (weighted latent Dirichlet allocation) topic model to improve the shortcomings of traditional LDA topic models. In the process of the topic of word sampling and its word distribution expectation calculation of the Gibbs of the W-LDA topic model. An average weighted value is adopted to avoid topic-related words from being submerged by high-frequency words, to improve the distinction of the topic. It further integrates the highest classification of the algorithm of support vector machine based on the extracted high-quality document-topic distribution and topic-word vectors. Finally, an efficient integration method is constructed for the analysis and extraction of emotional words, topic distribution calculations, and sentiment classification. Through tests on real teaching evaluation data and test set of public comment set, the results show that the method proposed in the paper has distinct advantages compared with other two typical algorithms in terms of subject differentiation, classification precision, and F1-measure.

Hot Topic Discovery across Social Networks Based on Improved LDA Model

  • Liu, Chang;Hu, RuiLin
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.11
    • /
    • pp.3935-3949
    • /
    • 2021
  • With the rapid development of Internet and big data technology, various online social network platforms have been established, producing massive information every day. Hot topic discovery aims to dig out meaningful content that users commonly concern about from the massive information on the Internet. Most of the existing hot topic discovery methods focus on a single network data source, and can hardly grasp hot spots as a whole, nor meet the challenges of text sparsity and topic hotness evaluation in cross-network scenarios. This paper proposes a novel hot topic discovery method across social network based on an im-proved LDA model, which first integrates the text information from multiple social network platforms into a unified data set, then obtains the potential topic distribution in the text through the improved LDA model. Finally, it adopts a heat evaluation method based on the word frequency of topic label words to take the latent topic with the highest heat value as a hot topic. This paper obtains data from the online social networks and constructs a cross-network topic discovery data set. The experimental results demonstrate the superiority of the proposed method compared to baseline methods.

Futures Price Prediction based on News Articles using LDA and LSTM (LDA와 LSTM를 응용한 뉴스 기사 기반 선물가격 예측)

  • Jin-Hyeon Joo;Keun-Deok Park
    • Journal of Industrial Convergence
    • /
    • v.21 no.1
    • /
    • pp.167-173
    • /
    • 2023
  • As research has been published to predict future data using regression analysis or artificial intelligence as a method of analyzing economic indicators. In this study, we designed a system that predicts prospective futures prices using artificial intelligence that utilizes topic probability data obtained from past news articles using topic modeling. Topic probability distribution data for each news article were obtained using the Latent Dirichlet Allocation (LDA) method that can extract the topic of a document from past news articles via unsupervised learning. Further, the topic probability distribution data were used as the input for a Long Short-Term Memory (LSTM) network, a derivative of Recurrent Neural Networks (RNN) in artificial intelligence, in order to predict prospective futures prices. The method proposed in this study was able to predict the trend of futures prices. Later, this method will also be able to predict the trend of prices for derivative products like options. However, because statistical errors occurred for certain data; further research is required to improve accuracy.

Face Detection using PCA-LDA and Color Information (색상정보와 PCA-LDA를 이용한 얼굴검출)

  • Lee, Ju-Seung;Han, Young-Hwan;Hong, Seung-Hong
    • Journal of IKEEE
    • /
    • v.6 no.1 s.10
    • /
    • pp.72-79
    • /
    • 2002
  • This paper presents an efficient face detection algorithm for color images with a complex background. The presented algorithm utilizes the color information and eigenface that is calculated by PCA-LDA (Principle Component Analysis - Linear Discriminant Analysis). The method of using the color information is faster than any other methods. Eigenface includes average information of the whole test faces. Therefore eigenface can decide that the candidate region is a face. The whole process is composed of two steps. First, it finds first face candidates region of skin tone using a color information in image. We can get a size and position of face candidate region. Second, we compare first face candidate region with eigenface, so decide that an image whether include a face or not. The advantages of the proposed approach include that increasing the detection speed by deciding a size and position of first face candidates region. Also, Betting 97% of the detection rate by comparing the eigenfaces calculated in PCA-LDA.

  • PDF

The LDA+U Effect on the Electronic Structure and Magnetism of Bulk, Monolayer, and Linear Chain of Iron (덩어리, 단층 및 사슬 구조 철의 전자구조와 자성에 대한 LDA+U 효과)

  • Landge, Kalpana K.;Bialek, Beata;Lee, Jae-Il
    • Journal of the Korean Magnetics Society
    • /
    • v.19 no.3
    • /
    • pp.81-84
    • /
    • 2009
  • We examine the effect of U term (U = 3 eV) describing the Coulomb interactions between electrons on the results of electronic band structure calculations carried out for bcc Fe bulk, monolayer, and chain. We investigated the properties of the three Fe structures by using the all-electron total-energy full-potential linearized augmented plane wave method. The U term was included in the exchange - correlation functionals constructed on the basis of local density approximation (LDA) and general gradient approximation (GGA). We found that in the case of bcc Fe bulk structure inclusion of the U term leads to the overestimated values of magnetic moment on Fe atom. The values of magnetic moment calculated for Fe in monolayer and chain are in accordance with calculations in which the U term was not included. In general, for each system the calculated values of magnetic moment on Fe sites were larger when the U term was incorporated in the energy functional. In Fe bulk, the value of magnetic moment $2.54{\mu}_B$ for LDA+U larger than $2.25{\mu}_B$ for LDA.