• 제목/요약/키워드: distance-based clustering algorithm

검색결과 130건 처리시간 0.025초

A Novel Image Segmentation Method Based on Improved Intuitionistic Fuzzy C-Means Clustering Algorithm

  • Kong, Jun;Hou, Jian;Jiang, Min;Sun, Jinhua
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제13권6호
    • /
    • pp.3121-3143
    • /
    • 2019
  • Segmentation plays an important role in the field of image processing and computer vision. Intuitionistic fuzzy C-means (IFCM) clustering algorithm emerged as an effective technique for image segmentation in recent years. However, standard fuzzy C-means (FCM) and IFCM algorithms are sensitive to noise and initial cluster centers, and they ignore the spatial relationship of pixels. In view of these shortcomings, an improved algorithm based on IFCM is proposed in this paper. Firstly, we propose a modified non-membership function to generate intuitionistic fuzzy set and a method of determining initial clustering centers based on grayscale features, they highlight the effect of uncertainty in intuitionistic fuzzy set and improve the robustness to noise. Secondly, an improved nonlinear kernel function is proposed to map data into kernel space to measure the distance between data and the cluster centers more accurately. Thirdly, the local spatial-gray information measure is introduced, which considers membership degree, gray features and spatial position information at the same time. Finally, we propose a new measure of intuitionistic fuzzy entropy, it takes into account fuzziness and intuition of intuitionistic fuzzy set. The experimental results show that compared with other IFCM based algorithms, the proposed algorithm has better segmentation and clustering performance.

Optimization study of a clustering algorithm for cosmic-ray muon scattering tomography used in fast inspection

  • Hou, Linjun;Huo, Yonggang;Zuo, Wenming;Yao, Qingxu;Yang, Jianqing;Zhang, Quanhu
    • Nuclear Engineering and Technology
    • /
    • 제53권1호
    • /
    • pp.208-215
    • /
    • 2021
  • Cosmic-ray muon scattering tomography (MST) technology is a new radiation imaging technology with unique advantages. As the performance of its image reconstruction algorithm has a crucial influence on the imaging quality, researches on this algorithm are of great significance to the development and application of this technology. In this paper, a fast inspection algorithm based on clustering analysis for the identification of the existence of nuclear materials is studied and optimized. Firstly, the principles of MST technology and a binned clustering algorithm were introduced, and then several simulation experiments were carried out using Geant4 toolkit to test the effects of exposure time, algorithm parameter, the size and structure of object on the performance of the algorithm. Based on these, we proposed two optimization methods for the clustering algorithm: the optimization of vertical distance coefficient and the displacement of sub-volumes. Finally, several sets of experiments were designed to validate the optimization effect, and the results showed that these two optimization methods could significantly enhance the distinguishing ability of the algorithm for different materials, help to obtain more details in practical applications, and was therefore of great importance to the development and application of the MST technology.

A Comparison of Clustering Algorithm in Data Mining

  • Lee, Yung-Seop;An, Mi-Young
    • Journal of the Korean Data and Information Science Society
    • /
    • 제14권4호
    • /
    • pp.725-736
    • /
    • 2003
  • To provide the information needed to make a decision, it is important to know the relationship or pattern between variables in database. Grouping objects which have similar characteristics of pattern is called as cluster analysis, one of data mining techniques. In this study, it is compared with several partitioning clustering algorithms, based on the statistical distance or total variance in each cluster.

  • PDF

Data Pattern Estimation with Movement of the Center of Gravity

  • Ahn Tae-Chon;Jang Kyung-Won;Shin Dong-Du;Kang Hak-Soo;Yoon Yang-Woong
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • 제6권3호
    • /
    • pp.210-216
    • /
    • 2006
  • In the rule based modeling, data partitioning plays crucial role be cause partitioned sub data set implies particular information of the given data set or system. In this paper, we present an empirical study result of the data pattern estimation to find underlying data patterns of the given data. Presented method performs crisp type clustering with given n number of data samples by means of the sequential agglomerative hierarchical nested model (SAHN). In each sequence, the average value of the sum of all inter-distance between centroid and data point. In the sequel, compute the derivation of the weighted average distance to observe a pattern distribution. For the final step, after overall clustering process is completed, weighted average distance value is applied to estimate range of the number of clusters in given dataset. The proposed estimation method and its result are considered with the use of FCM demo data set in MATLAB fuzzy logic toolbox and Box and Jenkins's gas furnace data.

발산거리 기반의 신경망에 의한 가우시안 확률 밀도 함수의 군집화 (Guassian pdfs Clustering Using a Divergence Measure-based Neural Network)

  • 박동철;권오현
    • 한국통신학회논문지
    • /
    • 제29권5C호
    • /
    • pp.627-631
    • /
    • 2004
  • 음성인식 모델상의 GPDFs(Gaussian Probability Density Functions)을 효율적으로 군집화 할 수 있는 알고리즘이 제안되었다. 제안된 알고리즘은 데이터 사이의 거리 척도로 발산 거리를 사용하는 새로운 형태의 CNN(Centroid Neural Network)으로, 제한된 자원을 가지는 H/W환경의 음성인식에서 메모리 사용량을 축소하는 응용에 대한 실험 결과, 음성인식 모델인 CDHMM(Continuous Density Hidden Markov Model)에서 기존의 Dk-means(Divergence-based k-means)알고리즘을 이용한 방법과 비교하여 인식 성능의 유지와 함께 약 31.3%의 GPDFs를 더 축소할 수 있었고, 군집화 알고리즘을 적용하지 자은 전체 GPDFs를 사용한 경우와 비교해서 인식 성능의 유지와 함께 약 61.8%의 GPDFs를 압축할 수 있었으며, SNR 10㏈ 잡음 데이터에 대한 성능평가에서도 인식 성능이 유지될 수 있었다.

The Energy Efficiency of Improved Routing Technique Based on The LEACH

  • Gauta, Ganesh;Cho, Seongsoo;Jung, Kyedong;Lee, Jong-Yong
    • International Journal of Internet, Broadcasting and Communication
    • /
    • 제7권1호
    • /
    • pp.49-56
    • /
    • 2015
  • As WSN is energy constraint so energy efficiency of nodes is important. Because avoiding long distance communication, clustering operating in rounds is an efficient algorithm for prolonging the lifetime of WSN and its performance depends on duration of a round. A short round time leads to frequent re-clustering while a long round time increases energy consume of cluster heads more. So existing clustering schemes determine proper round time, based on the parameters of initial WSN. But it is not appropriate to apply the round time according to initial value throughout the whole network time because WSN is very dynamic networks nodes can be added or vanished. In this paper we propose a new algorithm which calculates the round time relying on the alive node number to adapt the dynamic WSN. Simulation results validate the proposed algorithm has better performance in terms of energy consumption of nodes and loss rate of data.

웹 응용 재구성을 위한 폼 클러스터링 알고리즘 (A Form Clustering Algorithm for Web-based Application Reengineering)

  • 최상수;박학수;이강수
    • 한국전자거래학회지
    • /
    • 제8권2호
    • /
    • pp.77-98
    • /
    • 2003
  • 최근의 정보시스템은 웹기반 정보시스템이며 이의 개발과 유지보수 시에 "웹 위기" 현상이 발생하고 있다. 이를 해결하기 위해, 웹 공학 기술 중 웹기반 어플리케이션에 대한 소프트웨어 클러스터링 기술이 필요하다. 본 논문에서는 웹기반 정보시스템의 내부시스템 재구성을 위한 폼 클러스터링 알고리즘을 제시한다. 폼 클러스터링 알고리즘은 웹기반 정보시스템의 다양한 구조모델 중에서 웹의 특징이라 할 수 있는 페이지 모델에 초점을 맞춘다. 특히, 그래프 형태의 항해구조를 분석이 용이한 계층구조로 분석하기 위해 거리 척도 개념을 응용하고, 부하가 큰 핵심 기능객체를 파악하기 위하여 웹 로그분석 기술을 적용한다. 또한,2단계에 걸친 클러스터링 과정을 통해 재사용 성을 극대화하고 부하 균형화를 위한 하드웨어 할 당시에 사용할 수 있는 웹 소프트웨어 구조를 생성한다. 본 논문에서 제시한 폼 클러스터링 알고리즘은 웹기반 정보시스템의 신규 개발 또는 유지보수 시에 재사용 가능한 웹 컴포넌트 개발 및 부하균형화를 위한 하드웨어 할당 시에 적용할 수 있다.

  • PDF

무선 센서 네트워크의 수명연장을 위한 클러스터링 알고리즘 (Clustering Algorithm for Extending Lifetime of Wireless Sensor Networks)

  • 김선철;최승권;조용환
    • 한국컴퓨터정보학회논문지
    • /
    • 제20권4호
    • /
    • pp.77-85
    • /
    • 2015
  • 최근 무선 센서 네트워크는 다양한 분야에서 유비쿼터스 컴퓨팅 환경을 구현하기 위해 사용되고 있으며 센서 필드의 정보 수집을 위해 소형의 저전력, 저가격 센서를 사용한다. 본 논문은 이러한 센서 노드의 에너지 효율성을 고려한 클러스터링 알고리즘을 제안한다. 제안 알고리즘은 기존의 LEACH에 기반하며 슬립 기법과 전송 모드의 변경을 통해 네트워크와 노드의 수명을 연장할 수 있다. 제안 알고리즘에서 노드는 이웃 노드들과의 거리를 기반으로 자신의 클러스터링 참여 값을 구하고 자신의 주위에 노드들이 많이 존재한다면 슬립 모드로 대기하도록 한다. 또한 BS로의 데이터 전달에 기존의 싱글 홉 전송만 사용하는 것이 아니라 클러스터 헤드의 에너지 수준에 따라 멀티 홉 전송으로 변경할 수 있도록 하여 에너지를 보존하도록 한다 제안 기법의 성능을 확인하기 위해 LEACH와 비교 실험을 수행하였으며 특히 불균일하게 노드가 배치된 네트워크에서 제안기법의 성능이 우수함을 확인하였다.

Centralized Clustering Routing Based on Improved Sine Cosine Algorithm and Energy Balance in WSNs

  • Xiaoling, Guo;Xinghua, Sun;Ling, Li;Renjie, Wu;Meng, Liu
    • Journal of Information Processing Systems
    • /
    • 제19권1호
    • /
    • pp.17-32
    • /
    • 2023
  • Centralized hierarchical routing protocols are often used to solve the problems of uneven energy consumption and short network life in wireless sensor networks (WSNs). Clustering and cluster head election have become the focuses of WSNs. In this paper, an energy balanced clustering routing algorithm optimized by sine cosine algorithm (SCA) is proposed. Firstly, optimal cluster head number per round is determined according to surviving node, and the candidate cluster head set is formed by selecting high-energy node. Secondly, a random population with a certain scale is constructed to represent a group of cluster head selection scheme, and fitness function is designed according to inter-cluster distance. Thirdly, the SCA algorithm is improved by using monotone decreasing convex function, and then a certain number of iterations are carried out to select a group of individuals with the minimum fitness function value. From simulation experiments, the process from the first death node to 80% only needs about 30 rounds. This improved algorithm balances the energy consumption among nodes and avoids premature death of some nodes. And it greatly improves the energy utilization and extends the effective life of the whole network.

Microblog User Geolocation by Extracting Local Words Based on Word Clustering and Wrapper Feature Selection

  • Tian, Hechan;Liu, Fenlin;Luo, Xiangyang;Zhang, Fan;Qiao, Yaqiong
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제14권10호
    • /
    • pp.3972-3988
    • /
    • 2020
  • Existing methods always rely on statistical features to extract local words for microblog user geolocation. There are many non-local words in extracted words, which makes geolocation accuracy lower. Considering the statistical and semantic features of local words, this paper proposes a microblog user geolocation method by extracting local words based on word clustering and wrapper feature selection. First, ordinary words without positional indications are initially filtered based on statistical features. Second, a word clustering algorithm based on word vectors is proposed. The remaining semantically similar words are clustered together based on the distance of word vectors with semantic meanings. Next, a wrapper feature selection algorithm based on sequential backward subset search is proposed. The cluster subset with the best geolocation effect is selected. Words in selected cluster subset are extracted as local words. Finally, the Naive Bayes classifier is trained based on local words to geolocate the microblog user. The proposed method is validated based on two different types of microblog data - Twitter and Weibo. The results show that the proposed method outperforms existing two typical methods based on statistical features in terms of accuracy, precision, recall, and F1-score.