• Title/Summary/Keyword: additive clustering

Nonnegative Matrix Factorization with Orthogonality Constraints

  • Yoo, Ji-Ho;Choi, Seung-Jin
    • Journal of Computing Science and Engineering / v.4 no.2 / pp.97-109 / 2010
  • Nonnegative matrix factorization (NMF) is a popular method for multivariate analysis of nonnegative data that decomposes a data matrix into a product of two factor matrices with all entries restricted to be nonnegative. NMF has been shown to be useful for clustering (especially document clustering), but in some cases it produces results that are inappropriate for clustering problems. In this paper, we present an algorithm for orthogonal nonnegative matrix factorization, where an orthogonality constraint is imposed on the nonnegative decomposition of a term-document matrix. The result of orthogonal NMF can be clearly interpreted for clustering problems, and its clustering performance is usually better than that of standard NMF. We develop multiplicative updates directly from the true gradient on the Stiefel manifold, whereas existing algorithms consider additive orthogonality constraints. Experiments on several document data sets show that our orthogonal NMF algorithms perform better at clustering than standard NMF and an existing orthogonal NMF.
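The baseline that this abstract compares against, standard NMF used for document clustering, can be sketched roughly as follows. This is a minimal illustration using the well-known Lee-Seung multiplicative updates for the Frobenius-norm objective; it is not the orthogonal, Stiefel-manifold algorithm of the paper, whose update rules are not given in the abstract. The function name `nmf`, the rank `k`, the iteration count, and the toy term-document matrix are all assumptions made for illustration.

```python
import numpy as np

def nmf(X, k, n_iter=200, eps=1e-9, seed=0):
    """Standard NMF, X ~ W @ H, via Lee-Seung multiplicative updates.

    Assumption: Frobenius-norm objective, random positive initialization.
    """
    rng = np.random.default_rng(seed)
    n_terms, n_docs = X.shape
    W = rng.random((n_terms, k)) + eps
    H = rng.random((k, n_docs)) + eps
    for _ in range(n_iter):
        # Multiplicative updates keep every entry nonnegative.
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
    return W, H

# Hypothetical term-document matrix: 4 terms x 6 documents, two topics.
X = np.array([[3, 2, 4, 0, 0, 0],
              [2, 3, 3, 0, 0, 0],
              [0, 0, 0, 4, 3, 2],
              [0, 0, 0, 3, 4, 3]], dtype=float)

W, H = nmf(X, k=2)
# Assign each document to the component with the largest coefficient.
labels = H.argmax(axis=0)
```

Reading cluster labels off `H` with `argmax` is the usual informal way to use NMF for clustering. An orthogonality constraint on the factor makes this reading cleaner, since each document then tends to load on a single component.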

A Kernel based Possibilistic Approach for Clustering and Image Segmentation (클러스터링 및 영상 분할을 위한 커널 기반의 Possibilistic 접근 방법)

  • Choi, Kil-Soo;Choi, Byung-In;Rhee, Chung-Hoon
    • Journal of the Korean Institute of Intelligent Systems / v.14 no.7 / pp.889-894 / 2004
  • The fuzzy kernel c-means (FKCM) algorithm, which uses a kernel function, can obtain more desirable clustering results than fuzzy c-means (FCM) for both spherical and non-spherical data. However, like the FCM algorithm, it can be sensitive to noise. In this paper, a kernel function is applied to the possibilistic c-means (PCM) algorithm, and the resulting method is shown to be robust for data with additive noise. Several experimental results show that the proposed kernel possibilistic c-means (KPCM) algorithm outperforms the FKCM algorithm for general data with additive noise.
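To illustrate why a possibilistic formulation can be robust to noise, the sketch below computes only the typicality step for fixed prototypes. It assumes the standard PCM typicality formula with a Gaussian kernel-induced distance; the abstract does not state the exact formulation used in the paper, so the function names, the kernel choice, and the parameters `eta`, `m`, and `sigma` are assumptions. It is not the full iterative KPCM algorithm.

```python
import numpy as np

def gaussian_kernel(x, v, sigma=1.0):
    """Gaussian (RBF) kernel evaluated along the last axis."""
    return np.exp(-np.sum((x - v) ** 2, axis=-1) / (2.0 * sigma ** 2))

def kernel_typicality(X, V, eta, m=2.0, sigma=1.0):
    """Typicality of each point in X (n, d) to each prototype in V (c, d).

    Assumption: standard PCM typicality 1 / (1 + (d^2 / eta)^(1/(m-1)))
    with d^2 = K(x,x) - 2K(x,v) + K(v,v) = 2 * (1 - K(x,v)) for a Gaussian kernel.
    """
    K = gaussian_kernel(X[:, None, :], V[None, :, :], sigma)  # shape (n, c)
    d2 = 2.0 * (1.0 - K)
    return 1.0 / (1.0 + (d2 / eta) ** (1.0 / (m - 1.0)))

# Two points near the first prototype, one on the second, and one far outlier.
X = np.array([[0.0, 0.0], [0.1, -0.1], [5.0, 5.0], [20.0, -20.0]])
V = np.array([[0.0, 0.0], [5.0, 5.0]])
T = kernel_typicality(X, V, eta=np.array([0.5, 0.5]))
```

Unlike fuzzy memberships, the rows of `T` are not forced to sum to 1, so the outlier in the last row receives low typicality for every cluster instead of being shared between them.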

Adaptive Clustering based Sparse Representation for Image Denoising (적응 군집화 기반 희소 부호화에 의한 영상 잡음 제거)

  • Kim, Seehyun
    • Journal of IKEEE / v.23 no.3 / pp.910-916 / 2019
  • Non-local similarity of natural images is one of the most heavily exploited features in applications dealing with images. Distinctive edges, textures, and patterns are frequently repeated across an entire image. Once similar image blocks are grouped into a cluster, representative features of those blocks can be extracted from the cluster. The larger the cluster, the better the additive white noise can be separated. Denoising, the suppression of additive noise, is one of the major research topics in image processing. In this paper, a denoising algorithm is proposed that first clusters the noisy image blocks based on similarity, then extracts the features of each cluster, and finally recovers the original image. Experiments with several images under various noise strengths show that the proposed algorithm recovers image details such as edges, textures, and patterns while outperforming previous methods in terms of PSNR in removing additive Gaussian noise.

The Structure of Korean Consonants as Perceived by the Japanese (일본인이 지각하는 한국어 자음의 구조)

  • Bae, Moon-Jung;Kim, Jung-Oh
    • Korean Journal of Cognitive Science / v.19 no.2 / pp.163-175 / 2008
  • Twelve Japanese students living in South Korea were tested on their perceptual identification of the initial consonant in Korean syllables presented with or without white noise. The resulting confusion matrix was then subjected to additive clustering, individual difference scaling, and information transmission probability analyses, and the results were compared with those of South Koreans. The Japanese participants confused /다/ and /타/ most frequently, followed by /가/ and /카/, /자, 차, 짜/, /타/ and /따/, and so on. The additive clustering results for the Japanese participants differed significantly from those for the South Koreans. Individual difference scaling revealed sonorant, aspiration, and coronal dimensions. While the South Koreans showed binary values on the aspiration and tenseness dimensions, the Japanese participants showed continuous values on these dimensions. The information transmission probability analysis revealed that the Japanese participants perceived laryngeal features such as tenseness and aspiration less well than the South Korean participants. They did, however, perceive place-of-articulation features such as labial and coronal very well. The present results suggest that an approach dealing with the structure of base representations is important for understanding the phonological categories of languages.

The Clustering of Parts with Qualitative and Quantitative Quality Properties using λ-Fuzzy Measure (λ-퍼지측도를 사용한 질적, 양적혼합품질특성을 가진 부품의 군집화)

  • Kim, Jeong-Man;Lee, Sang-Do
    • Journal of Korean Society for Quality Management / v.24 no.1 / pp.126-136 / 1996
  • In multi-item production systems, group technology (GT) is used effectively to cluster various parts into groups. GT is based on clustering parts that have similar features, and these features are classified into two kinds: crisp (quantitative) features and fuzzy (qualitative) features. Many difficult problems arise when parts must be evaluated on crisp and fuzzy features together. In this approach, a single aggregate value is calculated for each part as the basis for determining the similarity between parts. However, because this aggregate value is obtained from a simple additive weighted sum, the approach has a problem in handling the combination effect. For these reasons, this paper proposes a method that represents the combination effect, using the λ-fuzzy measure and the fuzzy integral, in order to cluster parts that have both crisp and fuzzy properties into groups.

Compositional data analysis by the square-root transformation: Application to NBA USG% data

  • Jeseok Lee;Byungwon Kim
    • Communications for Statistical Applications and Methods / v.31 no.3 / pp.349-363 / 2024
  • Compositional data are data in which the component values sum to a constant, so the sample space is a simplex, which makes it impossible to directly apply statistical methods developed for the usual Euclidean vector space. A natural way to overcome this restriction is to apply an appropriate transformation that maps the sample space onto Euclidean space, and log-ratio type transformations, such as the additive log-ratio (ALR), centered log-ratio (CLR), and isometric log-ratio (ILR) transformations, have been used most often. However, in sparse scenarios, where certain components take exact zero values, these log-ratio type transformations may not be effective. In this work, we suggest an alternative, the square-root transformation, which maps the original sample space onto the directional space. We compare the square-root transformation with the log-ratio type transformations through a simulation study and a real data example. In the real data example, we applied both types of transformations to USG% data obtained from the NBA and used a density-based clustering method, DBSCAN (density-based spatial clustering of applications with noise), to show the results.
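The contrast between the two kinds of transformation can be made concrete with a short sketch. It assumes compositions are normalized to sum to 1, uses the standard definition of the CLR transform, and takes the square-root transform to be the elementwise square root, which places a composition on the unit sphere. The function names and the sample vectors are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def clr(x):
    """Centered log-ratio: log(x) minus the mean of log(x).

    Not finite when any component is exactly zero.
    """
    logx = np.log(x)
    return logx - logx.mean(axis=-1, keepdims=True)

def sqrt_transform(x):
    """Elementwise square root; a composition summing to 1 maps to a unit vector."""
    return np.sqrt(x)

dense = np.array([0.5, 0.3, 0.2])   # no zero components
sparse = np.array([0.7, 0.3, 0.0])  # one exact zero

z = clr(dense)               # finite, components sum to zero
s = sqrt_transform(sparse)   # finite, Euclidean norm equals 1

# CLR breaks down on the sparse composition because log(0) is -inf.
with np.errstate(divide="ignore", invalid="ignore"):
    z_sparse = clr(sparse)
```

Because the squared entries of `s` sum to the original composition total, `s` always lies on the nonnegative part of the unit sphere, zeros included, which is why directional methods become applicable.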

Orthogonal Nonnegative Matrix Factorization: Multiplicative Updates on Stiefel Manifolds (Stiefel 다양체에서 곱셈의 업데이트를 이용한 비음수 행렬의 직교 분해)

  • Yoo, Ji-Ho;Choi, Seung-Jin
    • Journal of KIISE:Software and Applications / v.36 no.5 / pp.347-352 / 2009
  • Nonnegative matrix factorization (NMF) is a popular method for multivariate analysis of nonnegative data, the goal of which is to decompose a data matrix into a product of two factor matrices with all entries restricted to be nonnegative. NMF has been shown to be useful for clustering (especially document clustering). In this paper, we present an algorithm for orthogonal nonnegative matrix factorization, where an orthogonality constraint is imposed on the nonnegative decomposition of a term-document matrix. We develop multiplicative updates directly from the true gradient on the Stiefel manifold, whereas existing algorithms consider additive orthogonality constraints. Experiments on several document data sets show that our orthogonal NMF algorithms perform better at clustering than standard NMF and an existing orthogonal NMF.

A Study of Design for Additive Manufacturing Method for Part Consolidation to Redesign IoT Device (IoT 기기 재설계를 위한 적층제조를 활용한 부품병합 설계 방법에 대한 연구)

  • Kim, Samyeon
    • Journal of Internet of Things and Convergence / v.8 no.2 / pp.55-59 / 2022
  • Recently, IoT technology has attracted great attention and plays a key role in the Fourth Industrial Revolution in designing customized products and services. Additive manufacturing (AM) is applied to fabricate IoT sensors directly or to fabricate structures with embedded IoT sensors. Design methods for AM are also being developed to consolidate the various parts of IoT devices. Part consolidation leads to reduced assembly time and cost, improved reliability, and lighter weight. Therefore, a design method is proposed to guide designers in consolidating parts. The method helps designers define a product architecture that consists of functions and function-part relations. The product architecture is converted to a network graph, and the Girvan-Newman algorithm is then applied to cluster the graph. Parts in the same cluster are candidates for part consolidation. To demonstrate the usefulness of the proposed design method, a case study was performed on an e-bike fabricated by additive manufacturing.
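The graph-clustering step described in this abstract can be sketched with the `girvan_newman` function from the NetworkX library. The part names and the edges below are hypothetical, and the abstract does not specify how function-part relations are converted into graph edges, so linking parts directly is an assumption made only to keep the example small.

```python
import networkx as nx
from networkx.algorithms.community import girvan_newman

# Hypothetical part-to-part links (assumed to come from shared functions).
edges = [
    ("sensor", "bracket"), ("bracket", "housing"), ("sensor", "housing"),
    ("battery", "controller"), ("controller", "cover"), ("battery", "cover"),
    ("housing", "battery"),  # single link between the two groups
]

G = nx.Graph()
G.add_edges_from(edges)

# girvan_newman yields successively finer partitions; take the first split,
# obtained by removing the edges with the highest betweenness.
first_split = next(girvan_newman(G))
clusters = sorted(sorted(community) for community in first_split)
```

In this toy graph the edge between `housing` and `battery` carries every shortest path between the two groups, so it is removed first and each remaining group of parts becomes a consolidation candidate.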

Clustering gene expression data using Non-Negative matrix factorization (Non-negative matrix factorization 을 이용한 마이크로어레이 데이터의 클러스터링)

  • Lee, Min-Young;Cho, Ji-Hoon;Lee, In-Beum
    • Proceedings of the Korean Society for Bioinformatics Conference / 2004.11a / pp.117-123 / 2004
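  • Since the development of microarray technology, the problem of finding clusters of related genes has been studied intensively. One of the key challenges in this problem is determining a biologically meaningful number of clusters. This paper presents a criterion for determining the optimal number of clusters and proposes a method for finding the patterns of cluster centroids using non-negative matrix factorization (NMF). Each pattern found by NMF can be interpreted as a specific part of a biological process. NMF constrains the entries of the factor matrices to be non-negative, and because this constraint allows only additive combinations, NMF can find such partial patterns. The usefulness of NMF has already been demonstrated in image analysis and text analysis. With the proposed method, genes whose expression patterns are similar to these patterns could be grouped together. The proposed method was applied to human fibroblast data and yeast cell cycle data to demonstrate its performance.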

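Thermodynamic Considerations in Molecular Beam Epitaxy Growth of III-V Ternary Compound Semiconductors (III-V 삼상 화합물 반도체의 분자선 결정성장법에서의 열역학적 고찰)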

  • O, Won-Ung;O, Jae-Eng;Baek, Su-Hyun
    • ETRI Journal / v.13 no.4 / pp.42-51 / 1991
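  • A growth model is presented that divides the growth process at the substrate surface during MBE growth into a kinetically controlled process and a thermodynamically controlled process, and the effect of thermodynamics at chemical equilibrium on the growth rate and composition of III-V compounds is analyzed by comparison with previously reported data. In particular, for III-V ternary compounds lying within the miscibility gap, alloy clustering, which affects the properties of the thin film and the characteristics of devices, was found to be determined by surface kinetics during low-temperature growth and by thermodynamic spinodal decomposition during high-temperature growth. The thermodynamic model takes into account the additive strain Gibbs free energy, which is a function of the lattice mismatch between the substrate and the layer and of the elastic coefficients of the material, and the excess Gibbs free energy due to mixing of the ternary alloy, assuming regular behavior of the ternary solid solution.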