• Title/Summary/Keyword: 클러스터링 계수

Search Result 67, Processing Time 0.029 seconds

Analysis of Assortativity in the Keyword-based Patent Network Evolution (키워드기반 특허 네트워크 진화에 따른 동종성 분석)

  • Choi, Jinho;Kim, Junguk
    • Journal of Internet Computing and Services
    • /
    • v.14 no.6
    • /
    • pp.107-115
    • /
    • 2013
  • Various networks can be observed in the world. Knowledge networks which are closely related with technology and research are especially important because these networks help us understand how knowledge is produced. Therefore, many studies regarding knowledge networks have been conducted. The assortativity coefficient represents the tendency of connections between nodes having a similar property as figures. The relevant characteristics of the assortativity coefficient help us understand how corresponding technologies have evolved in the keyword-based patent network which is considered to be a knowledge network. The relationships of keywords in a knowledge network where a node is depicted as a keyword show the structure of the technology development process. In this paper, we suggest two hypotheses basedon the previous research indicating that there exist core nodes in the keyword network and we conduct assortativity analysis to verify the hypotheses. First, the patents network based on the keyword represents disassortativity over time. Through our assortativity analysis, it is confirmed that the knowledge network shows disassortativity as the network evolves. Second, as the keyword-based patents network becomes disassortavie, clustering coefficients become lower. As the result of this hypothesis, weconfirm the clustering coefficient also becomes lower as the assortative coefficient of the network gets lower. Another interesting result concerning the second hypothesis is that, when the knowledge network is disassorativie, the tendency of decreasing of the clustering coefficient is much higher than when the network is assortative.

Threshold based User-centric Clustering for Cell-free MIMO Network (셀프리 다중안테나 네트워크를 위한 임계값 기반 사용자 중심 클러스터링)

  • Ryu, Jong Yeol;Lee, Woongsup;Ban, Tae-Won
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.26 no.1
    • /
    • pp.114-121
    • /
    • 2022
  • In this paper, we consider a user centric clustering in order to guarantee the performance of the users in cell free multiple-input multiple-output (MIMO) network. In the user centric clustering scheme, by using large scale fading coefficients of the connected access points (APs), each user decides own cluster with the APs having the higher the large scale fading coefficients than threshold value compared to the highest large scale fading coefficient. In the determined user centric clusters, the APs design the beamformers and power allocations in the distributed manner and the APs cooperatively transmit data to users by using beamformers and power allocations. In the simulation results, we verify the performance of user centric clustering in terms of the spectral efficiency and we also find the optimal threshold value in the given configuration.

Improving the Performance of Document Clustering with Distributional Similarities (분포유사도를 이용한 문헌클러스터링의 성능향상에 대한 연구)

  • Lee, Jae-Yun
    • Journal of the Korean Society for information Management
    • /
    • v.24 no.4
    • /
    • pp.267-283
    • /
    • 2007
  • In this study, measures of distributional similarity such as KL-divergence are applied to cluster documents instead of traditional cosine measure, which is the most prevalent vector similarity measure for document clustering. Three variations of KL-divergence are investigated; Jansen-Shannon divergence, symmetric skew divergence, and minimum skew divergence. In order to verify the contribution of distributional similarities to document clustering, two experiments are designed and carried out on three test collections. In the first experiment the clustering performances of the three divergence measures are compared to that of cosine measure. The result showed that minimum skew divergence outperformed the other divergence measures as well as cosine measure. In the second experiment second-order distributional similarities are calculated with Pearson correlation coefficient from the first-order similarity matrixes. From the result of the second experiment, secondorder distributional similarities were found to improve the overall performance of document clustering. These results suggest that minimum skew divergence must be selected as document vector similarity measure when considering both time and accuracy, and second-order similarity is a good choice for considering clustering accuracy only.

Effective Image Segmentation using a Locally Weighted Fuzzy C-Means Clustering (지역 가중치 적용 퍼지 클러스터링을 이용한 효과적인 이미지 분할)

  • Alamgir, Nyma;Kim, Jong-Myon
    • Journal of the Korea Society of Computer and Information
    • /
    • v.17 no.12
    • /
    • pp.83-93
    • /
    • 2012
  • This paper proposes an image segmentation framework that modifies the objective function of Fuzzy C-Means (FCM) to improve the performance and computational efficiency of the conventional FCM-based image segmentation. The proposed image segmentation framework includes a locally weighted fuzzy c-means (LWFCM) algorithm that takes into account the influence of neighboring pixels on the center pixel by assigning weights to the neighbors. Distance between a center pixel and a neighboring pixels are calculated within a window and these are basis for determining weights to indicate the importance of the memberships as well as to improve the clustering performance. We analyzed the segmentation performance of the proposed method by utilizing four eminent cluster validity functions such as partition coefficient ($V_{pc}$), partition entropy ($V_{pe}$), Xie-Bdni function ($V_{xb}$) and Fukuyama-Sugeno function ($V_{fs}$). Experimental results show that the proposed LWFCM outperforms other FCM algorithms (FCM, modified FCM, and spatial FCM, FCM with locally weighted information, fast generation FCM) in the cluster validity functions as well as both compactness and separation.

An Enhanced Spatial Fuzzy C-Means Algorithm for Image Segmentation (영상 분할을 위한 개선된 공간적 퍼지 클러스터링 알고리즘)

  • Truong, Tung X.;Kim, Jong-Myon
    • Journal of the Korea Society of Computer and Information
    • /
    • v.17 no.2
    • /
    • pp.49-57
    • /
    • 2012
  • Conventional fuzzy c-means (FCM) algorithms have achieved a good clustering performance. However, they do not fully utilize the spatial information in the image and this results in lower clustering performance for images that have low contrast, vague boundaries, and noises. To overcome this issue, we propose an enhanced spatial fuzzy c-means (ESFCM) algorithm that takes into account the influence of neighboring pixels on the center pixel by assigning weights to the neighbors in a $3{\times}3$ square window. To evaluate between the proposed ESFCM and various FCM based segmentation algorithms, we utilized clustering validity functions such as partition coefficient ($V_{pc}$), partition entropy ($V_{pe}$), and Xie-Bdni function ($V_{xb}$). Experimental results show that the proposed ESFCM outperforms other FCM based algorithms in terms of clustering validity functions.

Partially Evaluated Genetic Algorithm based on Fuzzy Clustering (퍼지 클러스터링 기반의 국소평가 유전자 알고리즘)

  • Yoo Si-Ho;Cho Sung-Bae
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.9
    • /
    • pp.1246-1257
    • /
    • 2004
  • To find an optimal solution with genetic algorithm, it is desirable to maintain the population sire as large as possible. In some cases, however, the cost to evaluate each individual is relatively high and it is difficult to maintain large population. To solve this problem we propose a novel genetic algorithm based on fuzzy clustering, which considerably reduces evaluation number without any significant loss of its performance by evaluating only one representative for each cluster. The fitness values of other individuals are estimated from the representative fitness values indirectly. We have used fuzzy c-means algorithm and distributed the fitness using membership matrix, since it is hard to distribute precise fitness values by hard clustering method to individuals which belong to multiple groups. Nine benchmark functions have been investigated and the results are compared to six hard clustering algorithms with Euclidean distance and Pearson correlation coefficients as fitness distribution method.

Verb concept clustering using Independent Component Analysis and Box-Cox transformation (독립성분분석과 Box-Cox 변환을 이용한 동사 개념 클러스터링)

  • Chagnaa, Altangerel;Lee, Chang-Beom;Ock, Cheol-Young
    • Annual Conference on Human and Language Technology
    • /
    • 2006.10e
    • /
    • pp.164-170
    • /
    • 2006
  • 본 논문에서는 한국어 동사의 개념적 클러스터링 방법을 제안하다. 사용되는 기법은 독립성분분석, Box-Cox 변환, 상관분석 등이다. 독립성분분석은 잠재적인 성분을 통계적 독립(statistical independence)에 기반하여 추출하는 분석 방법이다. 그런데, 독립성분분석에서는 mixture(동사)의 분포는 정규 분포(가우시안 분포)에 따른다고 가정한다. 따라서 동사의 분포를 보다 정규 분포화 할 필요가 있다. 이에 본 논문에서는 Box-Cox 변환을 이용하여 동사의 분포를 정규 분포에 근사한다. 또한, 독립성분분석에서는 추출할 적당한 성분의 개수를 결정할 수가 없다. 이에 본 논문에서는 주성분분석의 결과로 획득되는 고유치의 누적 기여율을 이용하여 독립성분의 수를 결정한다. 그리고, 추출된 독립성분 벡터와 동사 벡터간의 상관계수에 이용하여 독립성분(개념)에 밀접하게 관련 있는 동사들을 하나의 클러스터로 구성한다. 한국어 동사를 대상으로 클러스터링한 결과, Box-Cox 변환을 적용한 경우가 더 좋은 성능을 보였다.

  • PDF

Multi-FNN Identification by Means of HCM Clustering and ITs Optimization Using Genetic Algorithms (HCM 클러스터링에 의한 다중 퍼지-뉴럴 네트워크 동정과 유전자 알고리즘을 이용한 이의 최적화)

  • 오성권;박호성
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.10 no.5
    • /
    • pp.487-496
    • /
    • 2000
  • In this paper, the Multi-FNN(Fuzzy-Neural Networks) model is identified and optimized using HCM(Hard C-Means) clustering method and genetic algorithms. The proposed Multi-FNN is based on Yamakawa's FNN and uses simplified inference as fuzzy inference method and error back propagation algorithm as learning rules. We use a HCM clustering and Genetic Algorithms(GAs) to identify both the structure and the parameters of a Multi-FNN model. Here, HCM clustering method, which is carried out for the process data preprocessing of system modeling, is utilized to determine the structure of Multi-FNN according to the divisions of input-output space using I/O process data. Also, the parameters of Multi-FNN model such as apexes of membership function, learning rates and momentum coefficients are adjusted using genetic algorithms. A aggregate performance index with a weighting factor is used to achieve a sound balance between approximation and generalization abilities of the model. The aggregate performance index stands for an aggregate objective function with a weighting factor to consider a mutual balance and dependency between approximation and predictive abilities. According to the selection and adjustment of a weighting factor of this aggregate abjective function which depends on the number of data and a certain degree of nonlinearity, we show that it is available and effective to design an optimal Multi-FNN model. To evaluate the performance of the proposed model, we use the time series data for gas furnace and the numerical data of nonlinear function.

  • PDF

Optimization Method of Differential Evolution-based Radial Basis Function Neural Networks (차분 진화 알고리즘 기반 방사형 기저 함수 신경회로망 분류기의 최적화 방법)

  • Ma, Chang-Min;Oh, Sung-Kwun
    • Proceedings of the KIEE Conference
    • /
    • 2011.07a
    • /
    • pp.1962-1963
    • /
    • 2011
  • 본 연구에서는 패턴분류를 위해 최적화된 방사형 기저 함수 신경회로망(Radial Basis Function Neural Networks) 분류기를 제안한다. RBFNN은 입력층, 은닉층, 출력층의 3층 구조로 되어 있으며 Multi Dimension, Predictive ability, Robustness한 특징이 있다. RBFNN의 은닉층에는 기존의 활성함수가 아닌 Fuzzy C-means 클러스터링 알고리즘을 사용하여 입력 데이터의 특성을 고려한 적합도를 사용하였다. RBFNN은 은닉층의 노드수와 FCM 클러스터링의 퍼지화 계수, 연결가중치의 다항식 타입이 모델의 성능의 향상에 영향을 미치기 때문에 최적화가 필요하며 본 논문에서는 Differential Evolution(DE) 알고리즘을 사용하여 모델의 구조 및 파라미터를 최적화시켜 모델의 성능을 향상시켰다. 제안된 모델을 평가하기 위해 패턴분류에 많이 사용되는 Iris 데이터와 Wine 데이터를 이용하였다.

  • PDF

Genetically Optimization of Fuzzy C-Means Clustering based Fuzzy Neural Networks (FCM 기반 퍼지 뉴럴 네트워크의 진화론적 최적화)

  • Choi, Jeoung-Nae;Oh, Sung-Kwun
    • Proceedings of the KIEE Conference
    • /
    • 2007.10a
    • /
    • pp.405-406
    • /
    • 2007
  • 본 논문에서는 FCM 기반 퍼지 뉴럴네트워크 구조를 제안하고 진화 알고리즘을 이용한 FCM 기반 퍼지 뉴럴네트워크의 구조와 파라미터의 최적화 방법을 제시한다. 클러스터링 알고리즘은 퍼지 뉴럴 네트워크에서 멤버쉽함수의 중심점과 반경 등을 결정하는 학습에 일반적으로 사용된다. 제안된 FCM 기반 뉴럴 네트워크에서 멤버쉽함수는 가우시안, 삼각형 타입등의 정해진 형태를 사용하지 않고 데이터들 사이의 거리에 관계된 계산을 수행하는 FCM에 의해 결정된다. 후반부는 상수형, 선형, 2차식 등의 다양한 다항식 구조로 표현될 수 있으며 다항식의 계수는 LSE를 이용하여 결정한다. FCM 기반 퍼지 뉴럴 네트워크는 퍼지규칙의 수, 입력변수의 선택, 후반부 다항식의 차수, FCM의 퍼지화 계수의 결정은 성능에 많은 차이가 있으며 이러한 구조와 파라미터의 최적화가 요구된다. 본 논문에서는 유전자 알고리즘을 이용하여 FCM 기반 퍼지뉴럴네트워크의 구조에 관련된 입력변수의 수, 퍼지규칙의 수 그리고 후반부 다항식의 차수와 파라미터에 관련된 퍼지화 계수를 최적화 한다. 제안된 방법은 비선형 시스템의 모델링에 적용하여 성능을 분석하였다.

  • PDF