통합 검색 | Korea Science

Document Clustering Using Semantic Features and Fuzzy Relations

Kim, Chul-Won;Park, Sun
- Journal of information and communication convergence engineering
- /
- 제11권3호
- /
- pp.179-184
- /
- 2013
Traditional clustering methods are usually based on the bag-of-words (BOW) model. A disadvantage of the BOW model is that it ignores the semantic relationship among terms in the data set. To resolve this problem, ontology or matrix factorization approaches are usually used. However, a major problem of the ontology approach is that it is usually difficult to find a comprehensive ontology that can cover all the concepts mentioned in a collection. This paper proposes a new document clustering method using semantic features and fuzzy relations for solving the problems of ontology and matrix factorization approaches. The proposed method can improve the quality of document clustering because the clustered documents use fuzzy relation values between semantic features and terms to distinguish clearly among dissimilar documents in clusters. The selected cluster label terms can represent the inherent structure of a document set better by using semantic features based on non-negative matrix factorization, which is used in document clustering. The experimental results demonstrate that the proposed method achieves better performance than other document clustering methods.
https://doi.org/10.6109/jicce.2013.11.3.179 인용 PDF KSCI

오디오 컨텐츠를 위한 비음수 행렬 분해 기법 기반의 실시간 단일채널 배경 잡음 추출 기법 (Online Monaural Ambient Sound Extraction based on Nonnegative Matrix Factorization Method for Audio Contents)

이석진
- 방송공학회논문지
- /
- 제19권6호
- /
- pp.819-825
- /
- 2014
본 논문에서는 비음수 행렬 분해 (NMF) 기법을 이용하여 단일 채널에서 배경음 성분을 추출하는 알고리즘에 대해 서술한다. 이러한 배경음 성분 추출은 오디오 업믹싱 시스템을 고려하여 개발되었으며, 기존의 연구를 통하여 분리된 배경음 신호가 업믹싱 시스템에 적용될 경우 공간감을 향상시킬 수 있다는 사실이 이미 확인된 바 있다. 다만 기존의 기법은 음향 신호를 모두 축적하여 일괄적으로 처리해야 한다는 단점이 있어, 스트리밍 시스템이나 디지털 시그널 프로세서 (DSP) 등을 이용한 시스템에서 사용되기 어렵다. 본 논문에서는 이를 해소하기 위하여 실시간 비음수 행렬 분해 기법을 이용한 배경음 추출 시스템을 고안하여 실험하였다. 실험에서 처리된 음원을 스펙트럼 평활도를 이용하여 분석한 결과, 고안된 배경음 추출 시스템이 기존의 일괄 추출 시스템과 유사한 정도로 배경음 성분을 추출했음을 확인할 수 있었다.
https://doi.org/10.5909/JBE.2014.19.6.819 인용 PDF KSCI KPUBS HTML

내부점 선형계획법의 밀집열 분할에 대하여 (On dence column splitting in interial point methods of linear programming)

설동렬;박순달;정호원
- 경영과학
- /
- 제14권2호
- /
- pp.69-79
- /
- 1997
The computational speed of interior point method of linear programming depends on the speed of Cholesky factorization. If the coefficient matrix A has dense columns then the matrix A.THETA. $A^{T}$ becomes a dense matrix. This causes Cholesky factorization to be slow. We study an efficient implementation method of the dense column splitting among dense column resolving technique and analyze the relation between dense column splitting and order methods to improve the sparsity of Cholesky factoror.
PDF

비음수 행렬 분해와 군집의 응집도를 이용한 문서군집 (Document Clustering Method using Coherence of Cluster and Non-negative Matrix Factorization)

김철원;박선
- 한국정보통신학회논문지
- /
- 제13권12호
- /
- pp.2603-2608
- /
- 2009
문서군집은 정보검색의 많은 응용분야에 사용되는 중요한 문서 분석 방법이다. 본 논문은 비음수 행렬 분해 (NMF, non-negative matrix factorization)를 군집방법과 군집의 응집도(coherence of cluster)를 이용한 군집 내 문서들의 정제를 이용한 새로운 문서군집방법을 제안한다. 제안된 방법은 문서집합의 내부구조를 나타내는 의미특징행렬과 의미변수행렬 이용하여 문서군집의 성능을 높일 수 있고, 문장들 간의 유사도에 기반 한 군집의 응집도를 이용하여 군집내의 문서들을 정제하여서 재 할당함으로써 군집의 효율을 향상시킬 수 있다. 실험결과 제안방법을 적용한 문서군집방법이 다른 문서군집 방법에 비하여 좋은 성능을 보인다.
https://doi.org/10.6109/JKIICE.2009.13.12.2603 인용 PDF KSCI

비음수 행렬 분해와 K-means를 이용한 주제기반의 다중문서요약 (Topic-based Multi-document Summarization Using Non-negative Matrix Factorization and K-means)

박선;이주홍
- 한국정보과학회논문지:소프트웨어및응용
- /
- 제35권4호
- /
- pp.255-264
- /
- 2008
본 논문은 K-means과 비음수 행렬 분해(NMF)를 이용하여 주제기반의 다중문서를 요약하는 새로운 방법을 제안하였다. 제안방법은 비음수 행렬 분해를 이용하여 가중치가 부여된 용어-문장 행렬을 희소(Sparse)한 비음수 의미특징 행렬과 비음수 변수 행렬로 분해함으로써 직관적으로 이해할 수 있는 형태의 의미적 특징을 추출할 수 있고, 주제와 의미특징간의 유사도에 가중치를 부여하여 유사도는 높으나 실제 의미 없는 문장이 추출되는 것을 막는다. 또한 K-means 군집을 이용하여 문장에 포함된 노이즈를 제거함으로써 문서의 의미가 요약에 편향되게 반영하는 것을 피할 수 있고, 추출된 문장에 부여된 순위순서대로 정렬하여 보여 줌으로써 응집성을 높인다. 실험 결과 제안방법이 다른 방법에 비하여 좋은 성능을 보인다.
PDF KSCI

아이템 정보 기반 협업 필터링 추천 시스템 연구 (A Study on Collaborative Filtering Recommender system based on Item Knowledge)

양영욱;윤유동;임희석
- 한국정보처리학회:학술대회논문집
- /
- 한국정보처리학회 2017년도 춘계학술발표대회
- /
- pp.439-441
- /
- 2017
Matrix factorization은 사용자의 아이템 선호도를 통해 아이템을 추천해주는 성공적인 기술 중 하나이다. 이 기법은 사용자-아이템의 선호도 행렬을 채우는 것을 목표로 한다. 이 목표를 달성하기 위해 사용자-아이템의 선호도 행렬을 사용자 행렬(user latent factor)와 아이템 행렬(item latent factor)로 분해하고, 각 행렬에 대해 추론하여 완성된 사용자-아이템의 선호도 행렬을 추론한다. 하지만 Matrix factorization은 아이템의 수가 많고, 아이템에 대한 사용자들의 선호도 데이터가 적을 때 성능이 제한된다. 또한 새로운 아이템이 추가되었을 때, 새로운 아이템에 대한 사용자들의 선호도 정보가 없기 때문에 새로운 아이템이 추천되지 않는다는 문제를 가진다. 이를 해결하기 위해 본 논문에서는 아이템에 대한 부가적인 정보인 아이템 간의 유사도 정보와 아이템의 시나리오 정보의 유사도를 모델링하여 기존의 전통적인 Matrix factorization에 추가하는 아이템 정보 기반 추천 시스템을 제안한다.
https://doi.org/10.3745/PKIPS.y2017m04a.439 인용 PDF

Recovery of Lost Speech Segments Using Incremental Subspace Learning

Huang, Jianjun;Zhang, Xiongwei;Zhang, Yafei
- ETRI Journal
- /
- 제34권4호
- /
- pp.645-648
- /
- 2012
An incremental subspace learning scheme to recover lost speech segments online is presented. Our contributions in this work are twofold. First, the recovery problem is transformed into an interpolation problem of the time-varying gains via nonnegative matrix factorization. Second, incremental nonnegative matrix factorization is employed to allow online processing and track the evolution of speech statistics. The effectiveness of the proposed scheme is confirmed by the experiment results.
https://doi.org/10.4218/etrij.12.0211.0408 인용 PDF KSCI

Non-negative matrix factorization 을 이용한 마이크로어레이 데이터의 클러스터링 (Clustering gene expression data using Non -Negative matrix factorization)

Lee, Min-Young;Cho, Ji-Hoon;Lee, In-Beum
- 한국생물정보학회:학술대회논문집
- /
- 한국생물정보시스템생물학회 2004년도 The 3rd Annual Conference for The Korean Society for Bioinformatics Association of Asian Societies for Bioinformatics 2004 Symposium
- /
- pp.117-123
- /
- 2004
마이크로어레이 (microarray) 기술이 개발된 후로 연관된 유전자 클러스터 (cluster)를 찾는 문제는 깊이 연구되어왔다. 이 문제는 핵심적인 과제 중 하나는 생물학적으로 타당한 클러스터의 수를 결정하는 데 있다. 본 논문은 최적의 클러스터 수를 결정하는 기준을 제시하고, non-negative factorization (NMF)를 이용해 클러스터 centroid의 패턴을 찾는 방법을 제안한다. NMF에 의해 발견된 각각의 패턴은 생물학적 프로세스의 특정 부분으로 해석될 수 있다. NMF는 factor matrix의 entity를 non-negative로 제약 (constraint)하고, 이 제약은 오직 additive combination만 허용하기 때문에 이러한 부분적인 패턴을 찾아낼 수 있다. NMF의 유용성은 이미지 분석과 텍스트 분석에서 이미 입증되어 있다. 본 논문에서 제안한 방법에 의해 위의패턴과 유사한 발현 패턴을 갖는 유전자를 모을 수 있었다. 제안된 방법은 human fibroblast데이터와 yeast cell cycle 데이터에 적용해 성능을 입증하였다.
PDF

Vehicle Face Recognition Algorithm Based on Weighted Nonnegative Matrix Factorization with Double Regularization Terms

Shi, Chunhe;Wu, Chengdong
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제14권5호
- /
- pp.2171-2185
- /
- 2020
In order to judge that whether the vehicles in different images which are captured by surveillance cameras represent the same vehicle or not, we proposed a novel vehicle face recognition algorithm based on improved Nonnegative Matrix Factorization (NMF), different from traditional vehicle recognition algorithms, there are fewer effective features in vehicle face image than in whole vehicle image in general, which brings certain difficulty to recognition. The innovations mainly include the following two aspects: 1) we proposed a novel idea that the vehicle type can be determined by a few key regions of the vehicle face such as logo, grille and so on; 2) Through adding weight, sparseness and classification property constraints to the NMF model, we can acquire the effective feature bases that represent the key regions of vehicle face image. Experimental results show that the proposed algorithm not only achieve a high correct recognition rate, but also has a strong robustness to some non-cooperative factors such as illumination variation.
https://doi.org/10.3837/tiis.2020.05.017 인용 PDF KSCI HTML

Enhancing Text Document Clustering Using Non-negative Matrix Factorization and WordNet

Kim, Chul-Won;Park, Sun
- Journal of information and communication convergence engineering
- /
- 제11권4호
- /
- pp.241-246
- /
- 2013
A classic document clustering technique may incorrectly classify documents into different clusters when documents that should belong to the same cluster do not have any shared terms. Recently, to overcome this problem, internal and external knowledge-based approaches have been used for text document clustering. However, the clustering results of these approaches are influenced by the inherent structure and the topical composition of the documents. Further, the organization of knowledge into an ontology is expensive. In this paper, we propose a new enhanced text document clustering method using non-negative matrix factorization (NMF) and WordNet. The semantic terms extracted as cluster labels by NMF can represent the inherent structure of a document cluster well. The proposed method can also improve the quality of document clustering that uses cluster labels and term weights based on term mutual information of WordNet. The experimental results demonstrate that the proposed method achieves better performance than the other text clustering methods.
https://doi.org/10.6109/jicce.2013.11.4.241 인용 PDF KSCI

검색결과 305건 처리시간 0.027초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)