• Title/Summary/Keyword: Network clustering analysis

Search Result 397, Processing Time 0.026 seconds

A novel clustering method for examining and analyzing the intellectual structure of a scholarly field (지적 구조 분석을 위한 새로운 클러스터링 기법에 관한 연구)

  • Lee, Jae-Yun
    • Journal of the Korean Society for information Management
    • /
    • v.23 no.4 s.62
    • /
    • pp.215-231
    • /
    • 2006
  • Recently there are many bibliometric studies attempting to utilize Pathfinder networks(PFNets) for examining and analyzing the intellectual structure of a scholarly field. Pathfinder network scaling has many advantages over traditional multidimensional scaling, including its ability to represent local details as well as global intellectual structure. However there are some limitations in PFNets including very high time complexity. And Pathfinder network scaling cannot be combined with cluster analysis, which has been combined well with traditional multidimensional scaling method. In this paper, a new method named as Parallel Nearest Neighbor Clustering (PNNC) are proposed for complementing those weak points of PFNets. Comparing the clustering performance with traditional hierarchical agglomerative clustering methods shows that PNNC is not only a complement to PFNets but also a fast and powerful clustering method for organizing informations.

Data Clustering Using Hybrid Neural Network

  • Guan, Donghai;Gavrilov, Andrey;Yuan, Weiwei;Lee, Sung-Young;Lee, Young-Koo
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2007.05a
    • /
    • pp.457-458
    • /
    • 2007
  • Clustering plays an indispensable role for data analysis. Many clustering algorithms have been developed. However, most of them suffer poor performance of learning. To archive good clustering performance, we develop a hybrid neural network model. It is the combination of Multi-Layer Perceptron (MLP) and Adaptive Resonance Theory 2 (ART2). It inherits two distinct advantages of stability and plasticity from ART2. Meanwhile, by combining the merits of MLP, it improves the performance for clustering. Experiment results show that our model can be used for clustering with promising performance.

  • PDF

Twostep Clustering of Environmental Indicator Survey Data

  • Park, Hee-Chang
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 2005.10a
    • /
    • pp.59-69
    • /
    • 2005
  • Data mining technique is used to find hidden knowledge by massive data, unexpectedly pattern, relation to new rule. The methods of data mining are decision tree, association rules, clustering, neural network and so on. Clustering is the process of grouping the data into clusters so that objects within a cluster have high similarity in comparison to one another. It has been widely used in many applications, such that pattern analysis or recognition, data analysis, image processing, market research on off-line or on-line and so on. We analyze Gyeongnam social indicator survey data by 2001 using twostep clustering technique for environment information. The twostep clustering is classified as a partitional clustering method. We can apply these twostep clustering outputs to environmental preservation and improvement.

  • PDF

Social Network Analysis using Common Neighborhood Subgraph Density (공통 이웃 그래프 밀도를 사용한 소셜 네트워크 분석)

  • Kang, Yoon-Seop;Choi, Seung-Jin
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.16 no.4
    • /
    • pp.432-436
    • /
    • 2010
  • Finding communities from network data including social networks can be done by clustering the nodes of the network as densely interconnected groups, where keeping interconnection between groups sparse. To exploit a clustering algorithm for community detection task, we need a well-defined similarity measure between network nodes. In this paper, we propose a new similarity measure named "Common Neighborhood Sub-graph density" and combine the similarity with affinity propagation, which is a recently devised clustering algorithm.

CACHE:Context-aware Clustering Hierarchy and Energy efficient for MANET (CACHE:상황인식 기반의 계층적 클러스터링 알고리즘에 관한 연구)

  • Mun, Chang-min;Lee, Kang-Hwan
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2009.10a
    • /
    • pp.571-573
    • /
    • 2009
  • Mobile Ad-hoc Network(MANET) needs efficient node management because the wireless network has energy constraints. Mobility of MANET would require the topology change frequently compared with a static network. To improve the routing protocol in MANET, energy efficient routing protocol would be required as well as considering the mobility would be needed. Previously proposed a hybrid routing CACH prolong the network lifetime and decrease latency. However the algorithm has a problem when node density is increase. In this paper, we propose a new method that the CACHE(Context-aware Clustering Hierarchy and Energy efficient) algorithm. The proposed analysis could not only help in defining the optimum depth of hierarchy architecture CACH utilize, but also improve the problem about node density.

  • PDF

In-silico inferences for expression data using IGAM: Applied to Fuzzy-Clustering & Regulatory Network Modeling (연판 지식을 이용한 유전자 발현 데이터 분석: 퍼지 플러스링과 조절 네트웍 모델링에의 응용)

  • Lee, Philhyone;Hojeong Nam;Lee, Doheon;Lee, Kwang H.
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2004.04a
    • /
    • pp.273-276
    • /
    • 2004
  • Genome-scale expression data provides us with valuable insights about organisms, but the biological validation of in-silico analysis is difficult and often controversial. Here we present a new approach for integrating previously established knowledge with computational analysis. Based on the known biological evidences, IGAM (Integrated Gene Association Matrix) automatically estimates the relatedness between a pair of genes. We combined this association knowledge to the regulatory network modeling and fuzzy clustering in yeast 5. Cerevisiae. The result was found to be more effective for extracting biological meanings from in-silico inferences for gene expression data.

  • PDF

A Study on Technology Forecasting based on Co-occurrence Network of Keyword in Multidisciplinary Journals (다학제 분야 학술지의 주제어 동시발생 네트워크를 활용한 기술예측 연구)

  • Kim, Hyunuk;Ahn, Sang-Jin;Jung, Woo-Sung
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.40 no.4
    • /
    • pp.49-63
    • /
    • 2015
  • Keyword indexed in multidisciplinary journals show trends about science and technology innovation. Nature and Science were selected as multidisciplinary journals for our analysis. In order to reduce the effect of plurality of keyword, stemming algorithm were implemented. After this process, we fitted growth curve of keyword (stem) following bass model, which is a well-known model in diffusion process. Bass model is useful for expressing growth pattern by assuming innovative and imitative activities in innovation spreading. In addition, we construct keyword co-occurrence network and calculate network measures such as centrality indices and local clustering coefficient. Based on network metrics and yearly frequency of keyword, time series analysis was conducted for obtaining statistical causality between these measures. For some cases, local clustering coefficient seems to Granger-cause yearly frequency of keyword. We expect that local clustering coefficient could be a supportive indicator of emerging science and technology.

Twostep Clustering of Environmental Indicator Survey Data

  • Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.1
    • /
    • pp.1-11
    • /
    • 2006
  • Data mining technique is used to find hidden knowledge by massive data, unexpectedly pattern, relation to new rule. The methods of data mining are decision tree, association rules, clustering, neural network and so on. Clustering is the process of grouping the data into clusters so that objects within a cluster have high similarity in comparison to one another. It has been widely used in many applications, such that pattern analysis or recognition, data analysis, image processing, market research on off-line or on-line and so on. We analyze Gyeongnam social indicator survey data by 2001 using twostep clustering technique for environment information. The twostep clustering is classified as a partitional clustering method. We can apply these twostep clustering outputs to environmental preservation and improvement.

  • PDF

Development of Educational Simulator for Novel Network Reduction (송전망 축약을 위한 교육용 시뮬레이터 개발)

  • Kim, Hyun-Houng;Lee, Woo-Nam;Kim, Wook;Park, Jong-Bae;Shin, Joong-Rin
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.58 no.10
    • /
    • pp.1902-1910
    • /
    • 2009
  • This paper presents a graphical windows-based program for the education and training for novel network reduction. The object of developed simulator is to provide users with a simple and useable tool for gaining an intuitive feel for power system analysis. The developed simulator consists of the main module (MMI,GUI), the location marginal price module (LMP), the clustering module and network reduction module. Each module has a separate graphical and interactive interfacing window. The developed simulator needs with the PSS/E input data format, generator cost function, location information. Line admittances of reduced network was determined by using the power flow method(Newton-Raphson). So line flow of reduced network is almost same to original power system. Results of reduced network are compared on the window in the tabular format. Therefore, the developed simulator can be utilized as a useful tool for effective education and training for power system analysis.

A Multi-Dimensional Issue Clustering from the Perspective Consumers' Interests and R&D (소비자 선호 이슈 및 R&D 관점에서의 다차원 이슈 클러스터링)

  • Hyun, Yoonjin;Kim, Namgyu;Cho, Yoonho
    • Journal of Information Technology Services
    • /
    • v.14 no.1
    • /
    • pp.237-249
    • /
    • 2015
  • The volume of unstructured text data generated by various social media has been increasing rapidly; therefore, use of text mining to support decision making has also been increasing. Especially, issue Clustering-determining a new relation with various issues through clustering-has gained attention from many researchers. However, traditional issue clustering methods can only be performed based on the co-occurrence frequency of issue keywords in many documents. Therefore, an association between issues that have a low co-occurrence frequency cannot be discovered using traditional issue clustering methods, even if those issues are strongly related in other perspectives. Therefore, issue clustering that fits each of criteria needs to be performed by the perspective of analysis and the purpose of use. In this study, a multi-dimensional issue clustering is proposed to overcome the limitation of traditional issue clustering. We assert, specifically in this study, that issue clustering should be performed for a particular purpose. We analyze the results of applying our methodology to two specific perspectives on issue clustering, (i) consumers' interests, and (ii) related R&D terms.