• Title/Summary/Keyword: Non-Negative Matrix Factorization

Search Result 103, Processing Time 0.05 seconds

Vehicle Recognition using NMF in Urban Scene (도심 영상에서의 비음수행렬분해를 이용한 차량 인식)

  • Ban, Jae-Min;Lee, Byeong-Rae;Kang, Hyun-Chul
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.37 no.7C
    • /
    • pp.554-564
    • /
    • 2012
  • The vehicle recognition consists of two steps; the vehicle region detection step and the vehicle identification step based on the feature extracted from the detected region. Features using linear transformations have the effect of dimension reduction as well as represent statistical characteristics, and show the robustness in translation and rotation of objects. Among the linear transformations, the NMF(Non-negative Matrix Factorization) is one of part-based representation. Therefore, we can extract NMF features with sparsity and improve the vehicle recognition rate by the representation of local features of a car as a basis vector. In this paper, we propose a feature extraction using NMF suitable for the vehicle recognition, and verify the recognition rate with it. Also, we compared the vehicle recognition rate for the occluded area using the SNMF(sparse NMF) which has basis vectors with constraint and LVQ2 neural network. We showed that the feature through the proposed NMF is robust in the urban scene where occlusions are frequently occur.

Comparison of independent component analysis algorithms for low-frequency interference of passive line array sonars (수동 선배열 소나의 저주파 간섭 신호에 대한 독립성분분석 알고리즘 비교)

  • Kim, Juho;Ashraf, Hina;Lee, Chong-Hyun;Cheong, Myoung Jun
    • The Journal of the Acoustical Society of Korea
    • /
    • v.38 no.2
    • /
    • pp.177-183
    • /
    • 2019
  • In this paper, we proposed an application method of ICA (Independent Component Analysis) to passive line array sonar to separate interferences from target signals in low frequency band and compared performance of three conventional ICA algorithms. Since the low frequency signals are received through larger bearing angles than other frequency bands, neighboring beam signals can be used to perform ICA as measurement signals of the ICA. We use three ICA algorithms such as Fast ICA, NNMF (Non-negative Matrix Factorization) and JADE (Joint Approximation Diagonalization of Eigen-matrices). Through experiments on real data obtained from passive line array sonar, it is verified that the interference can be separable from target signals by the suggested method and the JADE algorithm shows the best separation performance among the three algorithms.

Mono-To-Stereo Blind Upmix Using Non-Negative Matrix Factorization and Decorrelator (비음수 행렬 분해와 디코릴레이터를 이용한 모노-스테레오 블라인드 업믹스 기법)

  • Choi, Keun-Woo;Chon, Sang-Bae;Lee, Seok-Jin;Sung, Koeng-Mo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.29 no.8
    • /
    • pp.509-515
    • /
    • 2010
  • This paper presents a new method for upmixing mono signal to stereo signal with guaranteeing high stereophonic image quality (SIQ) and large apparent source width (ASW). The proposed method consists of analysis phase and synthesis phase. In analysis phase, a mono signal is first decomposed into multiple sound sources by the use of high-rank nonnegative matrix factorization. Then the multiple sources are clustered into two groups based on tonality criterion. In synthesis phase, one group is directly fed into left and right channels while the other group is decorrelated before being fed into each channel. Subjective tests reveals that the proposed method gives listener high SIQ and large ASW with minimizing timbral distortions.

An Implementation of Story Path Recommendation System of Interactive Drama Using PCA and NMF (PCA와 NMF를 이용한 대화식 드라마의 스토리 경로 추천 시스템 구현)

  • Lee, Yeon-Chang;Jang, Jae-Hee;Kim, Myung-Gwan
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.12 no.4
    • /
    • pp.95-102
    • /
    • 2012
  • Interactive drama is a story which requires user's free choice and participation. In this study, we grasp user's preference by making training data that utilize characters of interactive drama. Furthermore, we describe process of implementing systems which recommend new users path of stories that correspond with their preference. We used PCA and NMF to extract characteristic of preference. The success rate of recommending was 75% with PCA, while 62.5% with NMF.

Query-Based Summarization using Semantic Feature Matrix and Semantic Variable Matrix (의미 특징 행렬과 의미 가변행렬을 이용한 질의 기반의 문서 요약)

  • Park, Sun
    • Journal of Advanced Navigation Technology
    • /
    • v.12 no.4
    • /
    • pp.372-377
    • /
    • 2008
  • This paper proposes a new query-based document summarization method using the semantic feature matrix and the semantic variable matrix. The proposed method doesn't need the training phase using training data comprising queries and query specific documents. And it exactly summarizes documents for the given query by using semantic features and semantic variables that is better at identifying sub-topics of document. Because the NMF have a great power to naturally extract semantic features representing the inherent structure of a document. The experimental results show that the proposed method achieves better performance than other methods.

  • PDF

Research on Designing Korean Emotional Dictionary using Intelligent Natural Language Crawling System in SNS (SNS대상의 지능형 자연어 수집, 처리 시스템 구현을 통한 한국형 감성사전 구축에 관한 연구)

  • Lee, Jong-Hwa
    • The Journal of Information Systems
    • /
    • v.29 no.3
    • /
    • pp.237-251
    • /
    • 2020
  • Purpose The research was studied the hierarchical Hangul emotion index by organizing all the emotions which SNS users are thinking. As a preliminary study by the researcher, the English-based Plutchick (1980)'s emotional standard was reinterpreted in Korean, and a hashtag with implicit meaning on SNS was studied. To build a multidimensional emotion dictionary and classify three-dimensional emotions, an emotion seed was selected for the composition of seven emotion sets, and an emotion word dictionary was constructed by collecting SNS hashtags derived from each emotion seed. We also want to explore the priority of each Hangul emotion index. Design/methodology/approach In the process of transforming the matrix through the vector process of words constituting the sentence, weights were extracted using TF-IDF (Term Frequency Inverse Document Frequency), and the dimension reduction technique of the matrix in the emotion set was NMF (Nonnegative Matrix Factorization) algorithm. The emotional dimension was solved by using the characteristic value of the emotional word. The cosine distance algorithm was used to measure the distance between vectors by measuring the similarity of emotion words in the emotion set. Findings Customer needs analysis is a force to read changes in emotions, and Korean emotion word research is the customer's needs. In addition, the ranking of the emotion words within the emotion set will be a special criterion for reading the depth of the emotion. The sentiment index study of this research believes that by providing companies with effective information for emotional marketing, new business opportunities will be expanded and valued. In addition, if the emotion dictionary is eventually connected to the emotional DNA of the product, it will be possible to define the "emotional DNA", which is a set of emotions that the product should have.

Statistical Voice Activity Detection Using Probabilistic Non-Negative Matrix Factorization (확률적 비음수 행렬 인수분해를 사용한 통계적 음성검출기법)

  • Kim, Dong Kook;Shin, Jong Won;Kwon, Kisoo;Kim, Nam Soo
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.41 no.8
    • /
    • pp.851-858
    • /
    • 2016
  • This paper presents a new statistical voice activity detection (VAD) based on the probabilistic interpretation of nonnegative matrix factorization (NMF). The objective function of the NMF using Kullback-Leibler divergence coincides with the negative log likelihood function of the data if the distribution of the data given the basis and encoding matrices is modeled as Poisson distributions. Based on this probabilistic NMF, the VAD is constructed using the likelihood ratio test assuming that speech and noise follow Poisson distributions. Experimental results show that the proposed approach outperformed the conventional Gaussian model-based and NMF-based methods at 0-15 dB signal-to-noise ratio simulation conditions.

A New Anchor Shot Detection System for News Video Indexing

  • Lee, Han-Sung;Im, Young-Hee;Park, Joo-Young;Park, Dai-Hee
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2007.11a
    • /
    • pp.217-220
    • /
    • 2007
  • In this paper, we present a new anchor shot detection system which is a core step of the preprocessing process for the news video analysis. The proposed system is composed of four modules and operates sequentially: 1) skin color detection module for reducing the candidate face regions; 2) face detection module for finding the key-frames with a facial data; 3) vector representation module for the key-frame images using a non-negative matrix factorization; 4) anchor shot detection module using a support vector data description. According to our computer experiments, the proposed system shows not only the comparable accuracy to the recent other results, but also more faster detection rate than others.

  • PDF

Dimension-Reduced Audio Spectrum Projection Features for Classifying Video Sound Clips

  • Kim, Hyoung-Gook
    • The Journal of the Acoustical Society of Korea
    • /
    • v.25 no.3E
    • /
    • pp.89-94
    • /
    • 2006
  • For audio indexing and targeted search of specific audio or corresponding visual contents, the MPEG-7 standard has adopted a sound classification framework, in which dimension-reduced Audio Spectrum Projection (ASP) features are used to train continuous hidden Markov models (HMMs) for classification of various sounds. The MPEG-7 employs Principal Component Analysis (PCA) or Independent Component Analysis (ICA) for the dimensional reduction. Other well-established techniques include Non-negative Matrix Factorization (NMF), Linear Discriminant Analysis (LDA) and Discrete Cosine Transformation (DCT). In this paper we compare the performance of different dimensional reduction methods with Gaussian mixture models (GMMs) and HMMs in the classifying video sound clips.

Language Model Adaptation Based on Topic Probability of Latent Dirichlet Allocation

  • Jeon, Hyung-Bae;Lee, Soo-Young
    • ETRI Journal
    • /
    • v.38 no.3
    • /
    • pp.487-493
    • /
    • 2016
  • Two new methods are proposed for an unsupervised adaptation of a language model (LM) with a single sentence for automatic transcription tasks. At the training phase, training documents are clustered by a method known as Latent Dirichlet allocation (LDA), and then a domain-specific LM is trained for each cluster. At the test phase, an adapted LM is presented as a linear mixture of the now trained domain-specific LMs. Unlike previous adaptation methods, the proposed methods fully utilize a trained LDA model for the estimation of weight values, which are then to be assigned to the now trained domain-specific LMs; therefore, the clustering and weight-estimation algorithms of the trained LDA model are reliable. For the continuous speech recognition benchmark tests, the proposed methods outperform other unsupervised LM adaptation methods based on latent semantic analysis, non-negative matrix factorization, and LDA with n-gram counting.