• 제목/요약/키워드: Clustering Problem

검색결과 708건 처리시간 0.027초

복수 목적함수를 갖는 새로운 형태의 집단분할 문제 (A New Type of Clustering Problem with Two Objectives)

  • 이재영
    • 대한산업공학회지
    • /
    • 제24권1호
    • /
    • pp.145-156
    • /
    • 1998
  • In a classical clustering problem, grouping is done on the basis of similarities or distances (dissimilarities) among the elements. Therefore, the objective is to minimize the variance within each group while maximizing the between-group variance among all groups. In this paper, however, a new class of clustering problem is introduced. We call this a laydown grouping problem (LGP). In LGP, the objective is to minimize both the within-group and between-group variances. Furthermore, the problem is expanded to a multi-dimensional case where the two-way minimization process must be considered for each dimension simultaneously for all measurement characteristics. At first, the problem is assessed by analyzing its variance structures and their complexities by conjecturing that LGP is NP-complete. Then, the simulated annealing (SA) algorithm is applied and the results are compared against that from others.

  • PDF

Semidefinite Programming을 통한 그래프의 동시 분할법 (K-Way Graph Partitioning: A Semidefinite Programming Approach)

  • Jaehwan, Kim;Seungjin, Choi;Sung-Yang, Bang
    • 한국정보과학회:학술대회논문집
    • /
    • 한국정보과학회 2004년도 가을 학술발표논문집 Vol.31 No.2 (1)
    • /
    • pp.697-699
    • /
    • 2004
  • Despite many successful spectral clustering algorithm (based on the spectral decomposition of Laplacian(1) or stochastic matrix(2) ) there are several unsolved problems. Most spectral clustering Problems are based on the normalized of algorithm(3) . are close to the classical graph paritioning problem which is NP-hard problem. To get good solution in polynomial time. it needs to establish its convex form by using relaxation. In this paper, we apply a novel optimization technique. semidefinite programming(SDP). to the unsupervised clustering Problem. and present a new multiple Partitioning method. Experimental results confirm that the Proposed method improves the clustering performance. especially in the Problem of being mixed with non-compact clusters compared to the previous multiple spectral clustering methods.

  • PDF

무선 애드 혹 네트워크에서 노드 클러스터링을 위한 유전 알고리즘 (A Genetic Algorithm for Clustering Nodes in Wireless Ad-hoc Networks)

  • 장길웅
    • 한국정보통신학회:학술대회논문집
    • /
    • 한국정보통신학회 2017년도 춘계학술대회
    • /
    • pp.649-651
    • /
    • 2017
  • 클러스터링 문제는 무선 애드 혹 네트워크의 네트워크 수명과 확장성을 향상시키는 문제 중 하나이다. 이 문제는 무선 애드 혹 네트워크의 설계 및 운영과 관련된 어려운 조합 최적화 문제이다. 본 논문에서는 네트워크 수명을 최대화하고 무선 애드 혹 네트워크의 확장성을 고려한 효율적인 클러스터링 알고리즘을 제안한다. 클러스터링 문제는 NP-hard 문제로 알려져 있습니다. 따라서 본 논문에서는 노드의 수가 많은 네트워크에서 합리적인 시간 내에 최적의 해를 효율적으로 얻을 수 있는 최적화 방식을 사용하여 문제를 해결한다. 제안된 알고리즘은 노드의 전력과 클러스터링 비용을 고려하여 클러스터 헤드를 선택하고 클러스터를 구성한다. 우리는 노드의 전송에너지 측면에서 시뮬레이션을 통해 성능을 평가한다. 시뮬레이션 결과는 제안된 알고리즘이 기존의 알고리즘보다 성능이 우수함을 보여 준다.

  • PDF

그래프분할문제 (The Graph Partition Problem)

  • 명영수
    • 한국경영과학회지
    • /
    • 제28권4호
    • /
    • pp.131-143
    • /
    • 2003
  • In this paper, we present a survey about the various graph partition problems including the clustering problem, the k-cut problem, the multiterminal cut problem, the multicut problem, the sparsest cut problem, the network attack problem, the network disconnection problem. We compare those problems focusing on the problem characteristics such as the objective function and the conditions that the partitioned clusters should satisfy. We also introduce the mathematical programming formulations, and the solution approaches developed for the problems.

The Effect of Bias in Data Set for Conceptual Clustering Algorithms

  • Lee, Gye Sung
    • International journal of advanced smart convergence
    • /
    • 제8권3호
    • /
    • pp.46-53
    • /
    • 2019
  • When a partitioned structure is derived from a data set using a clustering algorithm, it is not unusual to have a different set of outcomes when it runs with a different order of data. This problem is known as the order bias problem. Many algorithms in machine learning fields try to achieve optimized result from available training and test data. Optimization is determined by an evaluation function which has also a tendency toward a certain goal. It is inevitable to have a tendency in the evaluation function both for efficiency and for consistency in the result. But its preference for a specific goal in the evaluation function may sometimes lead to unfavorable consequences in the final result of the clustering. To overcome this bias problems, the first clustering process proceeds to construct an initial partition. The initial partition is expected to imply the possible range in the number of final clusters. We apply the data centric sorting to the data objects in the clusters of the partition to rearrange them in a new order. The same clustering procedure is reapplied to the newly arranged data set to build a new partition. We have developed an algorithm that reduces bias effect resulting from how data is fed into the algorithm. Experiment results have been presented to show that the algorithm helps minimize the order bias effects. We have also shown that the current evaluation measure used for the clustering algorithm is biased toward favoring a smaller number of clusters and a larger size of clusters as a result.

Identification of Plastic Wastes by Using Fuzzy Radial Basis Function Neural Networks Classifier with Conditional Fuzzy C-Means Clustering

  • Roh, Seok-Beom;Oh, Sung-Kwun
    • Journal of Electrical Engineering and Technology
    • /
    • 제11권6호
    • /
    • pp.1872-1879
    • /
    • 2016
  • The techniques to recycle and reuse plastics attract public attention. These public attraction and needs result in improving the recycling technique. However, the identification technique for black plastic wastes still have big problem that the spectrum extracted from near infrared radiation spectroscopy is not clear and is contaminated by noise. To overcome this problem, we apply Raman spectroscopy to extract a clear spectrum of plastic material. In addition, to improve the classification ability of fuzzy Radial Basis Function Neural Networks, we apply supervised learning based clustering method instead of unsupervised clustering method. The conditional fuzzy C-Means clustering method, which is a kind of supervised learning based clustering algorithms, is used to determine the location of radial basis functions. The conditional fuzzy C-Means clustering analyzes the data distribution over input space under the supervision of auxiliary information. The auxiliary information is defined by using k Nearest Neighbor approach.

준정부호 스펙트럼의 군집화 (Semidefinite Spectral Clustering)

  • 김재환;최승진
    • 한국정보과학회:학술대회논문집
    • /
    • 한국정보과학회 2005년도 한국컴퓨터종합학술대회 논문집 Vol.32 No.1 (A)
    • /
    • pp.892-894
    • /
    • 2005
  • Graph partitioning provides an important tool for data clustering, but is an NP-hard combinatorial optimization problem. Spectral clustering where the clustering is performed by the eigen-decomposition of an affinity matrix [1,2]. This is a popular way of solving the graph partitioning problem. On the other hand, semidefinite relaxation, is an alternative way of relaxing combinatorial optimization. issuing to a convex optimization[4]. In this paper we present a semidefinite programming (SDP) approach to graph equi-partitioning for clustering and then we use eigen-decomposition to obtain an optimal partition set. Therefore, the method is referred to as semidefinite spectral clustering (SSC). Numerical experiments with several artificial and real data sets, demonstrate the useful behavior of our SSC. compared to existing spectral clustering methods.

  • PDF

Optimal Base Station Clustering for a Mobile Communication Network Design

  • Hong, Jung-Man;Lee, Jong-Hyup;Lee, Soong-Hee
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제5권5호
    • /
    • pp.1069-1084
    • /
    • 2011
  • This paper considers an optimal base station clustering problem for designing a mobile (wireless) communication network. For a given network with a set of nodes (base stations), the problem is to optimally partition the set of nodes into subsets (each called a cluster) such that the associated inter-cluster traffic is minimized under certain topological constraints and cluster capacity constraints. In the problem analysis, the problem is formulated as an integer programming problem. The integer programming problem is then transformed into a binary integer programming problem, for which the associated linear programming relaxation is solved in a column generation approach assisted by a branch-and-bound procedure. For the column generation, both a heuristic algorithm and a valid inequality approach are exploited. Various numerical examples are solved to evaluate the effectiveness of the LP (Linear Programming) based branch-and-bound algorithm.

인자점수와 자기조직화지도를 이용한 희소한 문서데이터의 군집화 (Sparse Document Data Clustering Using Factor Score and Self Organizing Maps)

  • 전성해
    • 한국지능시스템학회논문지
    • /
    • 제22권2호
    • /
    • pp.205-211
    • /
    • 2012
  • 통계학과 기계학습의 다양한 기법을 이용하여 문서집합을 군집화하기 위해서는 우선 군집화분석에 적합한 데이터구조로 대상 문서집합을 변환해야 한다. 문서군집화를 위한 대표적인 구조가 문서-단어행렬이다. 각 문서에서 발생한 특정단어의 빈도값을 갖는 문서-단어행렬은 상당부분의 빈도값이 0인 희소성문제를 갖는다. 이 문제는 문서군집화의 성능에 직접적인 영향을 주어 군집화결과의 성능감소를 초래한다. 본 논문에서는 문서-단어행렬의 희소성문제를 해결하기 위하여 인자분석을 통한 인자점수를 이용하였다. 즉, 문서-단어행렬을 문서-인자점수행렬로 바꾸어 문서군집화의 입력데이터로 사용하였다. 대표적인 문서군집화 알고리즘인 자기조직화지도에 적용하여 문서-단어행렬과 문서-인자점수행렬에 대한 문서군집화의 결과들을 비교하였다.

대규모 무선 센서 네트워크에서 계층 기반의 동적 불균형 클러스터링 기법 (A Layer-based Dynamic Unequal Clustering Method in Large Scale Wireless Sensor Networks)

  • 김진수
    • 한국산학기술학회논문지
    • /
    • 제13권12호
    • /
    • pp.6081-6088
    • /
    • 2012
  • 불균형 클러스터링은 클러스터의 크기를 서로 다른 크기로 나누는 방법으로 균형 클러스터링에 비해 핫스팟 문제를 어느 정도 해결할 수 있으므로 전체 네트워크의 에너지 소모량을 줄인다. 본 논문에서는 불균형 클러스터링 모델을 이용하여 계층 기반의 동적 불균형 클러스터링을 제안한다. 이는 라운드별로 최적의 클러스터 수 및 클러스터 헤드 위치를 제공함으로써 전체 네트워크에 대한 에너지 소모를 균형 있게 하고 또한 네트워크 수명을 연장시킨다. 실험을 통하여 제안된 기법이 이전 클러스터링 기법보다 네트워크 수명이 연장되었음을 보였다.