• 제목/요약/키워드: Hierarchical Cluster analysis

검색결과 297건 처리시간 0.023초

군집분석을 이용한 수도권 도시의 유형화에 관한 연구 (Charaterization of Cities in Seoul Metropolitan Area by Cluster Analysis)

  • 송민경;장훈
    • 대한공간정보학회지
    • /
    • 제18권1호
    • /
    • pp.83-88
    • /
    • 2010
  • 본 논문은 수도권 내 지역을 도시적 특성에 따라 군집 분석하여 수도권의 지역유형을 구분하고 그 특성을 파악하고자 하였다. 수도권 도시의 유형화를 위해 2005년을 시점으로 도시의 구성요소인 인구, 활동, 토지 및 시설을 나타내는 지표 10가지를 선정하였으며, 주성분분석을 통해 변수들의 공통된 특성이나 유사한 성질에 따라 공통인자를 추출하였다. 추출된 인자에 대한 인자득점을 이용하여 군집분석을 행하였고, 군집분석의 방법은 계층적 군집방법인 최단연결법, 최장연결법, 평균연결법, 와드법을 이용하였다. 그 결과 수도권을 5개의 군집으로 유형화 할 수 있었다.

기온과 강수량의 수치모델 격자자료를 이용한 기상관측지점의 월별 군집화 (Cluster analysis by month for meteorological stations using a gridded data of numerical model with temperatures and precipitation)

  • 김희경;김광섭;이재원;이영섭
    • Journal of the Korean Data and Information Science Society
    • /
    • 제28권5호
    • /
    • pp.1133-1144
    • /
    • 2017
  • 기상자료를 이용한 군집분석은 기상 특성에 근거한 기상 지역의 세분화를 가능하게 하고 군집을 이루는 지형별 기상 특성의 파악을 용이하게 한다. 이때 기상관측자료를 이용한 군집분석은 관측지점의 밀도가 다르기 때문에 우리나라의 기상특성이 고르게 반영되지 못할 수 있다. 반면 수치모델 격자자료는 $5km{\times}5km$ 간격으로 조밀하고 고른 자료의 생산이 가능하므로 우리나라의 기상 특성을 고르게 반영할 수 있다. 본 연구에서는 기온과 강수량의 수치모델 격자자료를 이용하여 군집분석을 수행하고, 그 결과를 바탕으로 기상관측지점에 대한 군집을 결정하였다. 기상 특성이 월별로 상이할 수 있기 때문에 군집분석은 월별로 수행하였으며, K-Means 군집분석 방법의 단점을 보완하고자 계층적 군집분석 방법인 Ward 방법과 결합하여 적용하였다. 그 결과 우리나라 기상관측지점들에 대해 시 공간적으로 세분화된 군집화가 이루어졌다.

전국자연환경조사를 활용한 포유류 서식지 유형의 분류 (The Habitat Classification of mammals in Korea based on the National Ecosystem Survey)

  • 이화진;하정욱;차진열;이중효;윤희남;정철운;오홍식;배소연
    • 환경영향평가
    • /
    • 제26권2호
    • /
    • pp.160-170
    • /
    • 2017
  • 본 연구는 2006년부터 2012년까지 수행된 제3차 전국자연환경조사 포유류 데이터(70,562개)를 활용하여 국내에서 서식하는 포유류의 서식지 유형을 클러스터링하고 서식지 유형에 나타나는 종의 특징을 파악하고자 하였다. 제3차 전국자연환경조사의 야장에 기록된 서식지 유형 중에서 15개의 키워드를 뽑아 재분류하여 포유류 서식지유형을 통계 분석하였다. 서식지 유형 군집분석에서는 30회 이상 기록된 14개의 서식지 유형을 대상으로 비계층적 클러스터 분석(k 평균 클러스터 분석), 계층적 클러스터 분석, 비계량형 다차원척도법을 시행하였다. 2006년에서 2012년까지 전국에서 수집된 제3차 전국자연환경조사를 통해 확인된 포유류는 총 7목 16과 39종이었다. 서식지 유형에 대한 분류는 11개로 클러스터를 분류했을 때 단순구조지수가 가장 높았다(ssi = 0.07). 계층적 클러스터 분석으로 서식지 유형들 간의 유사성과 위계를 확인해 본 결과, 포유류에게는 주거지가 가장 차별된 서식지 유형이었고, 그 다음은 하천과 해안이 병합된 클러스터였다. 비계량형 다차원척도 분석 결과, 포유류에게 가장 차별된 서식지유형인 주거지의 경우 생쥐와 집쥐 두 종이 제한적으로 나타났으며, 해안과 하천의 경우 수달이 제한적으로 나타났다. 연구결과를 종합해보면, 포유류의 서식지 유형은 크게 산림을 주요 서식지와 이동경로로 이용하는 산림형과, 물을 주요 서식지로 이용하는 하천형, 주거지 인근에서 서식하는 주거형, 곡류나 씨앗을 주 먹이원으로 하는 저지대형 등 4가지로 구분할 수 있다.

AHP - 군집분석을 이용한 주요어종의 자원감소 원인 비교분석에 관한 연구 (The Comparative Analysis of the Reasons for Decreases in Marin Fishery Resources Based on AHP & duster Analysis)

  • 박철형;이상고
    • 수산경영론집
    • /
    • 제40권3호
    • /
    • pp.127-146
    • /
    • 2009
  • This study is to estimate the factor weights of the reasons for decreases in marine fishery resources using the Analytical Hierarchy Process. Furthermore, it classifies 20 fishes under a fishery resource recovery plan into various groups of fishes according to these factor weights using the non-hierarchial cluster analysis. The factors of decreases in marine fishery resources are identified as bio-ecological, technology-system, economic-business, and fishing village-society factors. Two of the most important factors of decreases in resource are turned out to be the economic-business and bio-ecological factors, estimated as 31% and 30% respectively. The technology-system and fishing village-society factors are estimated as 21% and 18% respectively. The study utilizes non-hierarchical cluster analysis in order to classify 20 fishes into 2, 3, and 4 groups. K-means cluster analysis is applied for grouping in conjunction with ANOVA to identify statistical differences in factors. Once again, the economic-business and bio-economic factors play main role in grouping 2-groups of fishes case. The third group of fishes in addition to the previous 2 groups of fishes appears as those 4 factors of decrease evenly play about the same role at a 3-groups of fishes case. Finally, the economic-business and bio-economic factors are turned out to be evenly important in the 4th group once there are 4-groups of fishes.

  • PDF

합성곱 오토인코더 기반의 응집형 계층적 군집 분석 (Agglomerative Hierarchical Clustering Analysis with Deep Convolutional Autoencoders)

  • 박노진;고한석
    • 한국멀티미디어학회논문지
    • /
    • 제23권1호
    • /
    • pp.1-7
    • /
    • 2020
  • Clustering methods essentially take a two-step approach; extracting feature vectors for dimensionality reduction and then employing clustering algorithm on the extracted feature vectors. However, for clustering images, the traditional clustering methods such as stacked auto-encoder based k-means are not effective since they tend to ignore the local information. In this paper, we propose a method first to effectively reduce data dimensionality using convolutional auto-encoder to capture and reflect the local information and then to accurately cluster similar data samples by using a hierarchical clustering approach. The experimental results confirm that the clustering results are improved by using the proposed model in terms of clustering accuracy and normalized mutual information.

PCR Analysis for Genetic Distances of Two Charybdis Crab Populations

  • Yoon, Jong-Man
    • 한국발생생물학회지:발생과생식
    • /
    • 제26권2호
    • /
    • pp.91-98
    • /
    • 2022
  • Genomic DNA (gDNA) set apart from two populations of Korean Charybdis crab (Charybdis japonica) was augmented by PCR experiments. The five oligonucleotides primers (ONT-primers) were spent to yield the number of unique loci shared to each crab population (ULSECP) and number of loci shared by the two crab populations (LSTCP). 305 fragments (FRAGs) were identified in the Charybdis crab population A (CCPA), and 344 in the Charybdis crab population B (CCPB): 44 number of ULSECP (14.43%) in the CCPA and 110 (31.98%) in the CCPB. 44 number of LSTCP, with an average of 8.8 per primer, were detected in the two crab populations. The bandsharing (BS) value between entity's no. 01 and no. 10 was the lowest (0.371) between the two CCPs. The average bandsharing (ABS) values of individuals in the CCPA (0.575±0.014) were lesser than in those originated from the CCPB (0.705±0.011) (p < 0.05). The polar hierarchical dendrogram (PHD) achieved by the five ONT-primers denotes three genetic clusters (GCs): cluster I (CHARYBCRAB 01, 04, 05, 06, and 08), cluster II (CHARYBCRAB 02, 03, 07, 09, 10, and 11) and cluster III (CHARYBCRAB 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, and 22). The shortest genetic distance (GD) displaying significant molecular difference (MD) was between individuals CHARYBCRAB no. 18 and CHARYBCRAB no. 17 (0.055).

Evaluation of Shopping Items: Focused on Purchase of Foreign Tourists in South Korea

  • Jeong, Dong-Bin
    • 동아시아경상학회지
    • /
    • 제7권2호
    • /
    • pp.21-30
    • /
    • 2019
  • Purpose - In this work, we categorize the 21 shopping items which foreign tourists purchase in South Korea and monitor the level of dissimilarity (or similarity) between each item by utilizing distance matrix, and both hierarchical and k-means cluster analyses, respectively, based on several purpose of visit attributes in 2017. In addition, multidimensional scaling (MDS) method is applied for mining visual appearance of proximities among shopping items based on purpose of visit attributes. Research design and methodology - This study is carried out in 2017 by Ministry of Culture, Sports and Tourism and conduct a face-to-face survey of foreign tourists from 20 countries who purchase shopping items in South Korea. CLUSTER, PROXIMITIES and ALSCAL modules in IBM SPSS 23.0 are used to perform this work. Results - We ascertain that 21 shopping items can be classified into five similar groups which have homogeneous traits by going through two-step cluster analysis. We can position homogeneous places of cluster and shopping items joining each cluster. Conclusions - We can relatively assess patterns and characteristics of each shopping item, come by useful information in activating shopping tour based on the actual state of recognition of foreign tourists and practically apply to each tourism industry on underlying results.

사상체질판별(四象體質判別) 검사지(檢査紙) 문항(問項)의 타당성(妥當性)과 신뢰성(信賴性) 및 응답자(應答者) 개체분석(應答者)에 관한 연구(硏究) (The Studies on the Statistical Reliability and Significancy of the Questionnaire for the Sasang Constitution)

  • 이화섭;안탁원
    • 혜화의학회지
    • /
    • 제12권2호
    • /
    • pp.177-197
    • /
    • 2004
  • 1. The values of Cronbach's alpha for the Taeyang, Taeum, Soyang and Soeum questionnaire were 0.7955, 0.7776, 0.8545, and 0.8601 respectively. These results indicate a highly satisfactory level of internal consistency for the questionnaire. 2. If the deletion of an item increases Cronbach's alpha then what that means is that the deletion of that item improves reliability. Therefore, any items that result in substantially greater values of alpha than the overall alpha may need to be deleted from the questionnaire to improve its reliability. 3. Factor analysis was performed on the 81 questionnaires. Based on the scree plot and the number and decrement of eigen values greater than one, three to four factor solution was most significant. 4. The hierarchical cluster analysis was performed on the 81 Sasang constitution questionnaire. These results suggested that two or four clusters identified with homogeneous groups 5. The hierarchical cluster analysis was performed on the 1046 responders. These results suggested that two, three, or four clusters might identified with homogeneous groups. Furthemore, there were statistically significant difference among the each group by ANOVA(P<0.0001).

  • PDF

커피전문점 방문동기유형에 따른 시장세분화 (Market Segmentation Based on Types of Motivations to Visit Coffee Shops)

  • 이용숙;김은정;박흥진
    • 한국프랜차이즈경영연구
    • /
    • 제7권1호
    • /
    • pp.21-29
    • /
    • 2016
  • Purpose - The primary purpose of this study is to employ effective marketing methods using market segmentation of coffee shops by determining how motivations to visit coffee shops have different impacts on demographic profile of visitors and characteristics of coffee shop visits, so as to draw out a better understanding of customers of coffee market. Research design, data, and methodology - Data were collected using surveys of self-administered questionnaires toward coffee shop users in Daejeon, Korea. A number of samples used in data analysis were 253 excluding unusable responses. The data were analyzed through frequency, reliability, and factor analysis using SPSS 20.0. Factor analysis was conducted through the principal component analysis and varimax rotation method to derive factors of one or more eigen values. In addition, the cluster analysis, multivariate ANOVA, and cross-tab analysis were used for the market segmentation based on the types of motivation for coffee shop visits. The process of the cluster analysis is as follows. Four clusters were derived through hierarchical clustering, and k-means cluster analysis was then carried out using mean value of the four clusters as the initial seed value. Result - The factor analysis delineated four dimensions of motivation to visit coffee shops: ostentation motivation, hedonic motivation, esthetic motivation, utility motivation. The cluster analysis yielded four clusters: utility and esthetic seekers, hedonic seekers, utility seekers, ostentation seekers. In order to further specify the profile of four clusters, each cluster was cross tabulated with socio-demographics and characteristics of coffee shop visits. Four clusters are significantly different from each other by four types of motivations for coffee shop visits. Conclusions - This study has empirically examined the difference in demographic profile of visitors and characteristics of coffee shop visits by motivation to visit coffee shops. There are significant differences according to age, education background, marital status, occupation and monthly income. In addition, coffee shops use pattern characterization in frequency of visits to coffee shops, relationships with companion, purpose of visit, information sources, brand type, average expense per visit, important elements of selection attribute were significantly different depending on motivations for coffee shop visits.

남성복의 치수규격을 위한 체형분류(제3보) -사진자료에 의한 동체부의 분류- (Classification of Bodytype on Adult Male for the Apparel Sizing System (Part 3) -Bodytype of Trunk from the Photoqraphic Data-)

  • 김구자
    • 한국의류학회지
    • /
    • 제19권6호
    • /
    • pp.924-932
    • /
    • 1995
  • Concept of the comfort and fitness has become a major concern in the basic function of the ready.made clothes. Until now ready-made clothes were not made by on the basis of the bodytype, but by the body size only This research was performed to classify and characterize the bodytypes of Korean adult males. Sample size was 1290 subjects and their age range was from 19 to 54 years old. 25 variables from the photographic data were applied to analyze the bodytype of trunk. Data were analyzed by the multivariate method, especially factor and cluster analysis. The groups forming a cluster can be subdivided into 5 sets by crosstabulation extracted by the hierarchical cluster analysis. 5 bodytypes classified by the photographic sources could be combined with the anthropometric data and were demonstrated with 5 silhouette. Type 3 and 4 in trunk were dominant and were composed of the majority of 55.6% of the subjects. Bodytypes of Korean males were influenced by the degree of posture erectness and of curvature of the front side of the body in waist and abdomen.

  • PDF