• 제목/요약/키워드: community clustering

검색결과 137건 처리시간 0.026초

커뮤니티 검출기법을 이용한 소프트웨어 아키텍쳐 모듈 뷰 복원 (Recovering Module View of Software Architecture using Community Detection Algorithm)

  • 김정민;이찬근
    • 소프트웨어공학소사이어티 논문지
    • /
    • 제25권4호
    • /
    • pp.69-74
    • /
    • 2012
  • 본 논문은 소프트웨어 클러스터링 기법과 커뮤니티 검출 기법의 비교를 통하여 아키텍쳐 모듈 복원 프로세스에 커뮤니티 검출 알고리즘의 적용가능성을 제시한다. 또한, 대표적인 클러스터링 알고리즘과 커뮤니티 검출 알고리즘의 값과 나눠진 모듈간의 상관관계와 차이점을 분석한다. 이를 통하여 커뮤니티 검출 알고리즘이 소프트웨어 아키텍쳐 모듈 뷰 복원에 활용되어질 수 있다는 몇 가지 근거를 제시하였고, 기존의 클러스터링 결과와 커뮤니티 알고리즘의 결과치를 비교함으로써, 서로의 결과 데이터가 어떠한 연관성을 가지는지 제시하였다.

  • PDF

영양서비스 개발을 위한 대구지역의 인슐린저항성증후군 패턴의 인구학적 특성 분석 (Patterns of Insulin Resistance Syndrome in the Taegu Community for the Development of Nutritional Service Improvement Programs)

  • 이희자;윤진숙;신동훈
    • 대한지역사회영양학회지
    • /
    • 제6권1호
    • /
    • pp.97-107
    • /
    • 2001
  • The clustering of insulin resistance with hypertension, glucose intolerance, hyperinsulinemia, increased triglyceride and decreased HDL cholesterol levels, and central and overall obesity has been called syndrome X, or the insulin resistance syndrome(IRS). To develop a nutrition service for IRS, this study was performed to evaluate the prevalence of each component of the metabolic abnormalities of IRS and analyze the clustering pattern of IRS among subjects living in the Taegu community. Participants in this study were 9234(mean age ; M/F 48/40yrs);63.5% were men, 24.4% were obese, 13.3% had hypertension. 3.7% had hyperglycemia, and 32.4% had hyperlipidemia. The IRS was defined as the coexistence of two or more components among metabolic abnormalities; obesity, hypertension. hyperglucemia and hyperlipidemia. The prevalence of IRS in Taegu was 19.2%(M/F:20.8%/16.4%), the clustering of these fisk variables was higher in advanced age group. Among the subjects of IRS having two of more diseases, 75.6% were obese, the pattern were similar in men and women. The younger, the higher the prevalence of obesity associated clustering patterns. The prevalence of obesity associated patterns among the hyperglycemia associated clustering patterns was 44.5%. The samples of the representative clustering patterns were obesity and hyperlipidemia (8.0%), hypertension and hyperlipidemia(3.2%), hypertension, obesity and hyperlipiemia(3.1%), hypertension and obesity(2.3%), and hyperglycemia and hyperlipidemia(0.8%). The clustering of obesity and hyperlipidemia until 50 year old groups, and the clustering of hypertension and hyperlipidemia in the 60 and 70 age groups were the most prevalent. We concluded that insulin resistance syndrome was a relatively common disorder in the Taegu community, and prevalence and the characteristics of the intervention strategies for IRS are desired, an effective improvement will be achieved.

  • PDF

Discovering Community Interests Approach to Topic Model with Time Factor and Clustering Methods

  • Ho, Thanh;Thanh, Tran Duy
    • Journal of Information Processing Systems
    • /
    • 제17권1호
    • /
    • pp.163-177
    • /
    • 2021
  • Many methods of discovering social networking communities or clustering of features are based on the network structure or the content network. This paper proposes a community discovery method based on topic models using a time factor and an unsupervised clustering method. Online community discovery enables organizations and businesses to thoroughly understand the trend in users' interests in their products and services. In addition, an insight into customer experience on social networks is a tremendous competitive advantage in this era of ecommerce and Internet development. The objective of this work is to find clusters (communities) such that each cluster's nodes contain topics and individuals having similarities in the attribute space. In terms of social media analytics, the method seeks communities whose members have similar features. The method is experimented with and evaluated using a Vietnamese corpus of comments and messages collected on social networks and ecommerce sites in various sectors from 2016 to 2019. The experimental results demonstrate the effectiveness of the proposed method over other methods.

도시마을 커뮤니티 활성화를 위한 전통마을 공간 구조 특성 분석 (Analysis of the Spatial Structure of Traditional Villages for Revitalization of the Community in Urban Villages)

  • 문지원;김주현;하재명
    • 한국주거학회논문집
    • /
    • 제19권6호
    • /
    • pp.85-93
    • /
    • 2008
  • This study analyzes areas, traffic lines and characteristics of block of traditional villages in order to suggest how to build urban village in the way that can solve problems occurring in residential areas these days. The study showed the following results: 1) Traditional villages have definite boundary and entrance, and the community area for the villages is close to the entrance to encourage community activities of villagers. 2) With an access in the form of a blind alley branched from the main road, traditional villages form a small-sized clustering and encourage community activities in a natural way. 3) Formed of block with a pattern of net, blind alley or standing in a line on both sides, traditional villages help residents to form close relations between. These findings suggest that for building desirable urban villages, 1) they should have definite boundary, 2) size and location of community area should be determined in the way to activate community activities of residents, 3) roads inside the village should have branched form rather than standardized check pattern so that small-sized clustering could be formed along the branched inner roads, and 4) clustering in villages should be arranged in a line on both sides or in the form of a blind alley giving consideration to the length and width of roads. The roads should be also of a closed type so that residents could create strong bonds with their neighbors.

Community Detection using Closeness Similarity based on Common Neighbor Node Clustering Entropy

  • Jiang, Wanchang;Zhang, Xiaoxi;Zhu, Weihua
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제16권8호
    • /
    • pp.2587-2605
    • /
    • 2022
  • In order to efficiently detect community structure in complex networks, community detection algorithms can be designed from the perspective of node similarity. However, the appropriate parameters should be chosen to achieve community division, furthermore, these existing algorithms based on the similarity of common neighbors have low discrimination between node pairs. To solve the above problems, a noval community detection algorithm using closeness similarity based on common neighbor node clustering entropy is proposed, shorted as CSCDA. Firstly, to improve detection accuracy, common neighbors and clustering coefficient are combined in the form of entropy, then a new closeness similarity measure is proposed. Through the designed similarity measure, the closeness similar node set of each node can be further accurately identified. Secondly, to reduce the randomness of the community detection result, based on the closeness similar node set, the node leadership is used to determine the most closeness similar first-order neighbor node for merging to create the initial communities. Thirdly, for the difficult problem of parameter selection in existing algorithms, the merging of two levels is used to iteratively detect the final communities with the idea of modularity optimization. Finally, experiments show that the normalized mutual information values are increased by an average of 8.06% and 5.94% on two scales of synthetic networks and real-world networks with real communities, and modularity is increased by an average of 0.80% on the real-world networks without real communities.

범주형 값들이 순서를 가지고 있는 데이터들의 클러스터링 기법 (Clustering Algorithm for Sequences of Categorical Values)

  • 오승준;김재련
    • 산업경영시스템학회지
    • /
    • 제26권1호
    • /
    • pp.17-21
    • /
    • 2003
  • We study clustering algorithm for sequences of categorical values. Clustering is a data mining problem that has received significant attention by the database community. Traditional clustering algorithms deal with numerical or categorical data points. However, there exist many important databases that store categorical data sequences. In this paper, we introduce new similarity measure and develop a hierarchical clustering algorithm. An experimental section shows performance of the proposed approach.

범주형 값들이 순서를 가지고 있는 데이터들의 클러스터링 기법 (Clustering Algorithm for Sequences of Categorical Values)

  • 오승준;김재련
    • 한국산업경영시스템학회:학술대회논문집
    • /
    • 한국산업경영시스템학회 2002년도 춘계학술대회
    • /
    • pp.125-132
    • /
    • 2002
  • We study clustering algorithm for sequences of categorical values. Clustering is a data mining problem that has received significant attention by the database community. Traditional clustering algorlthms deal with numerical or categorical data points. However, there exist many important databases that store categorical data sequences. In this paper we introduce new similarity measure and develope a hierarchical clustering algorithm. An experimental section shows performance of the proposed approach.

  • PDF

외부 커뮤니티 연관도를 이용한 소셜 네트워크 분석 (Social Networks Analysis using External Community Relationship)

  • 이현진;지태창
    • 디지털콘텐츠학회 논문지
    • /
    • 제12권1호
    • /
    • pp.69-75
    • /
    • 2011
  • 소셜 네트워크로부터 커뮤니티를 발견하기 위해서는 네트워크의 노드들에 대한 군집화 과정이 필요하다. 일반적인 군집화 알고리즘은 커뮤니티의 개수를 미리 설정해 주어야 한다는 약점을 가지고 있다. 커뮤니티의 개수에 의해서 군집화의 결과가 달라질 수 있기 때문에 커뮤니티의 개수 선정은 매우 중요하다. 본 논문에서는 커뮤니티 간의 구별을 위한 외부 커뮤니티 연관도를 정의하고, 이를 기존의 군집화 알고리즘과 결합하여 커뮤니티의 개수를 동적으로 발견하는 방법을 제안한다. 제안하는 방법은 커뮤니티 개수의 정확도와 커뮤니티 평균 순도를 기준으로 기존의 방법들과 비교하였다. 실험 결과 기존의 방법들에 비하여 제안하는 방법의 우수한 성능 확인할 수 있었다.

Cost-Traffic Tradeoff를 고려한 통신망 설계의 Clustering 알고리듬 개발 (Development of Clustering Algorithm for the Design of Telecommunication Network Considering Cost-Traffic Tradeoff)

  • 박영준;이홍철;김승권
    • 한국경영과학회지
    • /
    • 제22권3호
    • /
    • pp.23-36
    • /
    • 1997
  • In the design of telecommunication network, the network configuration using hubbing topology is useful for designing and managing the network efficiently : i. e. all of central offices (COs) are grouped into clusters. Each cluster has one hub consisting of large-scale transmission facilities like digital cross-connect systems and ATMs. In clustering process, the community of interest and geographical factor should be considered. However, there exists a tradeoff between two factors. One is to minimize total link costs for geographical factor and the other is to maximize the total intra-cluster traffics for community of interest. Hence, this can be solved by multiobjective linear programming techniques. In this paper, the problem under considerations is formulated as two p-median subproblems taking into considerations total costs and total intra-traffics, respectively. Then we propose the algorithm to solve the problem based on the concept of cost-traffic tradeoff. The algorithm enables to identify efficient cost-traffic tradeoff pairs. An illustration is also presented.

  • PDF

공통 이웃 그래프 밀도를 사용한 소셜 네트워크 분석 (Social Network Analysis using Common Neighborhood Subgraph Density)

  • 강윤섭;최승진
    • 한국정보과학회논문지:컴퓨팅의 실제 및 레터
    • /
    • 제16권4호
    • /
    • pp.432-436
    • /
    • 2010
  • 소셜 네트워크를 비롯한 네트워크로부터 커뮤니티를 발견하려면 네트워크의 노드를 그룹 내에서는 서로 조밀하게 연결되고 그룹 간에는 연결의 밀도가 낮은 그룹들로 군집화하는 과정이 꼭 필요하다. 군집화 알고리즘의 성능을 위해서는 군집화의 기준이 되는 유사도 기준이 잘 정의되어야 한다. 이 논문에서는 네트워크 내의 커뮤니티 발견을 위해 유사도 기준을 정의하고, 정의한 유사도를 유사도 전파(affinity propagation) 알고리즘과 결합하여 만든 방법을 기존의 방법들과 비교한다.