• Title/Summary/Keyword: k-means

Search Result 17,898, Processing Time 0.039 seconds

K-Means Clustering in the PCA Subspace using an Unified Measure (통합 측도를 사용한 주성분해석 부공간에서의 k-평균 군집화 방법)

  • Yoo, Jae-Hung
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.17 no.4
    • /
    • pp.703-708
    • /
    • 2022
  • K-means clustering is a representative clustering technique. However, there is a limitation in not being able to integrate the performance evaluation scale and the method of determining the minimum number of clusters. In this paper, a method for numerically determining the minimum number of clusters is introduced. The explained variance is presented as an integrated measure. We propose that the k-means clustering method should be performed in the subspace of the PCA in order to simultaneously satisfy the minimum number of clusters and the threshold of the explained variance. It aims to present an explanation in principle why principal component analysis and k-means clustering are sequentially performed in pattern recognition and machine learning.

Reproducibility Assessment of K-Means Clustering and Applications (K-평균 군집화의 재현성 평가 및 응용)

  • 허명회;이용구
    • The Korean Journal of Applied Statistics
    • /
    • v.17 no.1
    • /
    • pp.135-144
    • /
    • 2004
  • We propose a reproducibility (validity) assessment procedure of K-means cluster analysis by randomly partitioning the data set into three parts, of which two subsets are used for developing clustering rules and one subset for testing consistency of clustering rules. Also, as an alternative to Rand index and corrected Rand index, we propose an entropy-based consistency measure between two clustering rules, and apply it to determination of the number of clusters in K-means clustering.

Proposal of Cluster Head Election Method in K-means Clustering based WSN (K-평균 군집화 기반 WSN에서 클러스터 헤드 선택 방법 제안)

  • Yun, Dai Yeol;Park, SeaYoung;Hwang, Chi-Gon
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2021.05a
    • /
    • pp.447-449
    • /
    • 2021
  • Various wireless sensor network protocols have been proposed to maintain the network for a long time by minimizing energy consumption. Using the K-means clustering algorithm takes longer to cluster than traditional hierarchical algorithms because the center point must be moved repeatedly until the final cluster is established. For K-means clustering-based protocols, only the residual energy of nodes or nodes near the center point of the cluster is considered when the cluster head is elected. In this paper, we propose a new wireless sensor network protocol based on K-means clustering to improve the energy efficiency while improving the aforementioned problems.

  • PDF

Efficient K-means Clustering for High-dimensional Large Data (고차원 대규모 데이터를 위한 효율적인 K-means 클러스터링)

  • Yoon, Tae-Sik;Shim, Kyu-Seok
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2011.06a
    • /
    • pp.33-36
    • /
    • 2011
  • 클러스터링은 데이터 포인트들을 그룹으로 묶어 데이터를 분석하는데 유용하다. 특히 K-means는 가장 널리 쓰이는 클러스터링 알고리즘으로 k개의 군집(Cluster)을 찾는다. 본 논문에서는 기존의 K-means 알고리즘과 비교해 고차원 대규모데이터에 대해서 효율적으로 동작하는 K-means 알고리즘을 제안한다. 제안된 알고리즘은 기존의 알고리즘에서와 같이 거리 정보를 이용해 불필요한 계산을 줄여나가며 또한 움직임 없는 군집들을 계산에서 제외하여 수행시간을 단축한다. 제안된 알고리즘은 기존의 관련연구에서 제안된 알고리즘에 비해 공간을 적게 쓰면서 동시에 빠르다. 실제 고차원 데이터 실험을 통해서 제안된 알고리즘의 효율성을 보였다.

Partial Discharge Distribution Analysis of Ultra High Voltage Cable using K-means clustering (K-means 클러스터링을 이용한 초고압 케이블 절연재료의 부분방전 분포 해석)

  • Lee, Kang-Won;Lee, Hyuk-Jin;Lee, Chung-Ho;Yeon, Kyu-Ho;Hong, Jin-Woong
    • Proceedings of the Korean Institute of Electrical and Electronic Material Engineers Conference
    • /
    • 2007.11a
    • /
    • pp.201-202
    • /
    • 2007
  • In this paper we investigated the partial discharge distribution using the K-means clustering according to the needle of tilt and void at the cross linked polyethylene(XLPE) insulators. As a result, the specimen with tilt $45^{\circ}$ has highest breakdown voltage and the specimen with air void has lower breakdown voltage than the specimen with on void. In K-menas clustering distribution of clusters concentrates at inception condition, but the distribution spreads widely at breakdown.

  • PDF

Automatic Dynamic Range Improvement Method using Histogram Modification and K-means Clustering (히스토그램 변형 및 K-means 분류 기반 동적 범위 개선 기법)

  • Cha, Su-Ram;Kim, Jeong-Tae;Kim, Min-Seok
    • Journal of Broadcast Engineering
    • /
    • v.16 no.6
    • /
    • pp.1047-1057
    • /
    • 2011
  • In this paper, we propose a novel tone mapping method that implements histogram modification framework on two local regions that are classified using K-means clustering algorithm. In addition, we propose automatic parameter tuning method for histogram modification. The proposed method enhances local details better than the global histogram method. Moreover, the proposed method is fully automatic in the sense that it does not require intervention from human to tune parameters that are involved for computing tone mapping functions. In simulations and experimental studies, the proposed method showed better performance than existing histogram modification method.

Isolated Words Recognition using K-means iteration without Initialization (초기화하지 않은 K-means iteration을 이용한 고립단어 인식)

  • Kim, Jin-Young;Sung, Keong-Mo
    • Proceedings of the KIEE Conference
    • /
    • 1988.07a
    • /
    • pp.7-9
    • /
    • 1988
  • K-means iteration method is generally used for creating the templates in speaker-independent isolated-word recognition system. In this paper the initialization method of initial centers is proposed. The concepts are sorting and trace segmentation. All the tokens are sorted and segmented by trace segmentation so that initial centers are decided. The performance of this method is evaluated by isolated-word recognition of Korean digits. The highest recognition rate is 97.6%.

  • PDF

Pre-Adjustment of Incomplete Group Variable via K-Means Clustering

  • Hwang, S.Y.;Hahn, H.E.
    • Journal of the Korean Data and Information Science Society
    • /
    • v.15 no.3
    • /
    • pp.555-563
    • /
    • 2004
  • In classification and discrimination, we often face with incomplete group variable arising typically from many missing values and/or incredible cases. This paper suggests the use of K-means clustering for pre-adjusting incompleteness and in turn classification based on generalized statistical distance is performed. For illustrating the proposed procedure, simulation study is conducted comparatively with CART in data mining and traditional techniques which are ignoring incompleteness of group variable. Simulation study manifests that our methodology out-performs.

  • PDF

A Study on the Effect of Happenstance Factors on the Perception of Means-Goal Relationship (우연 요인이 수단-목표 관계의 지각에 미치는 효과)

  • Lee, Byung-Kwan;Kim, Sung-Eun
    • Science of Emotion and Sensibility
    • /
    • v.19 no.1
    • /
    • pp.11-20
    • /
    • 2016
  • This study was performed to examine how intrinsic acceptance for happenstance(IAH) affects evaluation of the means to achieve additional goals when the additional goals are happened to be added to the original means-goal link. Specifically, participants were asked to evaluate the effectiveness of means(e.g., exercise and saving) to achieve additional goals which are either more or less relevant to the means. The result showed that participants with high IAH were more likely to think that the original means is effective to accomplish additional goals than those with low IAH. This pattern, however, was found only when the additional goals are relevant to the means. When the relevance of the additional goals to the means was low, there was no difference in evaluation of the effectiveness of the means to achieve additional goals between participants with high and low IAH. Future research and implications of this study for goal-systems theory and intrinsic acceptance for happenstance were discussed.

Initial Mode Decision Method for Clustering in Categorical Data

  • Yang, Soon-Cheol;Kang, Hyung-Chang;Kim, Chul-Soo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.18 no.2
    • /
    • pp.481-488
    • /
    • 2007
  • The k-means algorithm is well known for its efficiency in clustering large data sets. However, working only on numeric values prohibits it from being used to cluster real world data containing categorical values. The k-modes algorithm is to extend the k-means paradigm to categorical domains. The algorithm requires a pre-setting or random selection of initial points (modes) of the clusters. This paper improved the problem of k-modes algorithm, using the Max-Min method that is a kind of methods to decide initial values in k-means algorithm. we introduce new similarity measures to deal with using the categorical data for clustering. We show that the mushroom data sets and soybean data sets tested with the proposed algorithm has shown a good performance for the two aspects(accuracy, run time).

  • PDF