• Title/Summary/Keyword: cluster analysis.

Search Result 5,219, Processing Time 0.041 seconds

Categorization of Traffic Type According to Seoul-City Administrative District Using Cluster Analysis (군집분석을 이용한 서울시 행정구역별 교통유형 분류)

  • Han, Mahn-Seob;Oh, Heung-Un
    • International Journal of Highway Engineering
    • /
    • v.14 no.4
    • /
    • pp.133-140
    • /
    • 2012
  • PURPOSES : Traffic situation of Seoul City is different each administrative district. because each administrative district population, average travel speed, etc are different. thus, regionally differentiated policy is necessary. METHODS : In this study, first, it is to implement the cluster analysis using the traffic factor of twenty-five administrative districts in Seoul, categorize it into the cluster and understand the properties. second, related factors of speed were derived. and method to increase the speed was investigated. we choose the eleven traffic factors such as the number of traffic accident cases, total length, speed, the number of cross section, the number of cross section per km, the rate of roads, registered cars, population attending office and school, population density, area. RESULTS : In the results, first, we could categorize the Seoul-City administrative district into three clusters. in order to find Factors associated with speed a simple regression analysis was performed. and the number of intersections per km is closely related to the speed. CONCLUSIONS : Through this study, transportation policies reflecting local traffic-related characteristics are required.

A Strategy Through Segmentation Using Factor and Cluster Analysis: focusing on corporations having a special status (요인분석과 군집분석을 통한 세분화 및 전략방향 제시: 특수법인 사례를 중심으로)

  • Cho, Yong-Jun;Kim, Yeong-Hwa
    • The Korean Journal of Applied Statistics
    • /
    • v.20 no.1
    • /
    • pp.23-38
    • /
    • 2007
  • Corporations adopt a segmentation depends on the existence of target variables, in general. In this paper, for the case of no target variables, a strategy through segmentation is proposed for corporations having a special status based on the management index. In case of segmentation using cluster analysis, however, if one classify according to many variables then he will be in face of difficulties in characterizing. Therefore, after extracting representative factors by factor analysis, a segmentation method through 2 step cluster analysis is employed on the basis of these representative factors. As a result, six segmentation groups are found and the resulting strategy is proposed which strengthens prominent factors and makes up defective factors for each group.

Impact Analysis of Partition Utility Score in Cluster Analysis (군집분석의 분할 유용도 점수의 영향 분석)

  • Lee, Gye Sung
    • The Journal of the Convergence on Culture Technology
    • /
    • v.7 no.3
    • /
    • pp.481-486
    • /
    • 2021
  • Machine learning algorithms adopt criterion function as a key component to measure the quality of their model derived from data. Cluster analysis also uses this function to rate the clustering result. All the criterion functions have in general certain types of favoritism in producing high quality clusters. These clusters are then described by attributes and their values. Category utility and partition utility play an important role in cluster analysis. These are fully analyzed in this research particularly in terms of how they are related to the favoritism in the final results. In this research, several data sets are selected and analyzed to show how different results are induced from these criterion functions.

A Study on FIFA Partner Adidas of 2022 Qatar World Cup Using Big Data Analysis

  • Kyung-Won, Byun
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.15 no.1
    • /
    • pp.164-170
    • /
    • 2023
  • The purpose of this study is to analyze the big data of Adidas brand participating in the Qatar World Cup in 2022 as a FIFA partner to understand useful information, semantic connection and context from unstructured data. Therefore, this study collected big data generated during the World Cup from Adidas participating in sponsorship as a FIFA partner for the 2022 Qatar World Cup and collected data from major portal sites to understand its meaning. According to text mining analysis, 'Adidas' was used the most 3,340 times based on the frequency of keyword appearance, followed by 'World Cup', 'Qatar World Cup', 'Soccer', 'Lionel Messi', 'Qatar', 'FIFA', 'Korea', and 'Uniform'. In addition, the TF-IDF rankings were 'Qatar World Cup', 'Soccer', 'Lionel Messi', 'World Cup', 'Uniform', 'Qatar', 'FIFA', 'Ronaldo', 'Korea', and 'Nike'. As a result of semantic network analysis and CONCOR analysis, four groups were formed. First, Cluster A named it 'Qatar World Cup Sponsor' as words such as 'Adidas', 'Nike', 'Qatar World Cup', 'Sponsor', 'Sponsor Company', 'Marketing', 'Nation', 'Launch', 'Official', 'Commemoration' and 'National Team' were formed into groups. Second, B Cluster named it 'Group stage' as words such as 'Qatar', 'Uruguay', 'FIFA' and 'group stage' were formed into groups. Third, C Cluster named it 'Winning' as words such as 'World Cup Winning', 'Champion', 'France', 'Argentina', 'Lionel Messi', 'Advertising' and 'Photograph' formed a group. Fourth, D Cluster named it 'Official Ball' as words such as 'Official Ball', 'World Cup Official Ball', 'Soccer Ball', 'All Times', 'Al Rihla', 'Public', 'Technology' was formed into groups.

The Analysis of Children's Torso using Photographic Anthropometry(II):A Classification of Clusters by Principal Component Score (사진 계측에 의한 아동의 동체 형상 분석(II): 주성분 점수에 의한 군집 유형의 분류)

  • Jeon, Eun-Kyung;Kwon, Sook-Hee
    • Korean Journal of Human Ecology
    • /
    • v.8 no.2
    • /
    • pp.313-325
    • /
    • 1999
  • This study aimed to classify the data of children's bodies into several clusters by principal component scores that were extracted through the factor analysis in the former study, and to describe the distribution and body characteristics of the clusters. The sample was 308 elementary school children aged from 6 to 8 and the anthropometric measurements were performed indirectly from the photographs of the subjects, which was the same as the first analysis. The data were analysed statistically using SPSSWIN Ver. 8.0. Through the statistical analysis, 3 clusters were obtained from the data. The first cluster distributed more in the children aged 7 and 8 than in the children aged 6. The somatotype of this group was the tallest among the three groups, and they were the most developed group compared to the two other groups in lateral component as well as in linear component. The second cluster group wasn't well developed in lateral components, and had lowest level in Rohrer Index, so this group had thin figures compared to the other groups. The third cluster revealed dominant distribution in the group aged 6, and the group had the least developed linear components but higher level in Rohrer Index. Each cluster group revealed peculiar somatotype that was dominant in one group but rarely in other cluster groups. Lateral views of these characteristics were showed using the average of the measurements of clusters.

  • PDF

Parallel Nonlinear Analysis of Prestressed Concrete Frame on Cluster System (클러스터 시스템에서 프리스트레스트 콘크리트 프레임의 병렬 비선형해석)

  • 이재석;최규천
    • Journal of the Computational Structural Engineering Institute of Korea
    • /
    • v.14 no.3
    • /
    • pp.287-298
    • /
    • 2001
  • Analysis of nonlinear behavior of prestressed concrete frame structures on PC is a time-consuming computing job if the problem size increase to a certain degree. Cluster system has emerged as one of promising computing environments due to its good extendibility, portability, and cost-effectiveness, comparing it with high-end work-stations or servers. In this paper, a parallel nonlinear analysis procedure of prestressed concrete frame structure is presented using cluster computing. Cluster system is configured with readily available pentium III class PCs under Win98 or Linux and fast ethernet. Parallel computing algorithms on element-wise processing parts including the calculation of stiffness matrix, element stresses and determination of material states, check of material failure and calculation of unbalanced loads are developed using MPL. Validity of the method is discussed through typical numerical examples. For the case of 4 node system, maximum speedup is 3.15 and 3.74 for Win98 and Linux, respectively. Important issues for the efficient use of cluster computing system based un PCs and ethernet are addressed.

  • PDF

An Optimal Cluster Analysis Method with Fuzzy Performance Measures (퍼지 성능 측정자를 결합한 최적 클러스터 분석방법)

  • 이현숙;오경환
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.6 no.3
    • /
    • pp.81-88
    • /
    • 1996
  • Cluster analysis is based on partitioning a collection of data points into a number of clusters, where the data points in side a cluster have a certain degree of similarity and it is a fundamental process of data analysis. So, it has been playing an important role in solving many problems in pattern recognition and image processing. For these many clustering algorithms depending on distance criteria have been developed and fuzzy set theory has been introduced to reflect the description of real data, where boundaries might be fuzzy. If fuzzy cluster analysis is tomake a significant contribution to engineering applications, much more attention must be paid to fundamental questions of cluster validity problem which is how well it has identified the structure that is present in the data. Several validity functionals such as partition coefficient, claasification entropy and proportion exponent, have been used for measuring validity mathematically. But the issue of cluster validity involves complex aspects, it is difficult to measure it with one measuring function as the conventional study. In this paper, we propose four performance indices and the way to measure the quality of clustering formed by given learning strategy.

  • PDF

Interspecific Relationship of Polygonatum Species Collected from Gyeongnam Area Using Cluster Analysis (경남지역 둥굴레속의 Cluster 분석에 의한 종간 유연관계)

  • Shim, Jae-Suk;Park, Jeong-Min;Jeon, Byong-Sam;Kang, Jin-Ho
    • Korean Journal of Medicinal Crop Science
    • /
    • v.13 no.1
    • /
    • pp.30-34
    • /
    • 2005
  • Polygonatum species are a useful medical crop in Korea but basic study in the species was not well conducted. The study was carried out to analyse genetic diversity and intraspecific relationship of 47 Polygonatum species collected from Gyeongnam province. Their analysis was done through principle component analysis and average linkage cluster analysis with their twelve morphological traits. The result of principle component analysis showed the Prin 1, Prin 2 and Prin 3 represented 79% of total variation. By the 0.7 average distance of the cluster analysis and the calculated Euclidian distance, the 47 collected species were grouped into five groups. Group I included 22 collected species representing P. ordoratum var. pluriflorum, group II did 5 ones representing P. involucratum, group III was divided into two subclasses, 2 species including P. inflatum and 7 species including P. thunbergii, group IV also consisted of 2 subclasses, a species similar to P. thunbergii and P.involucratum, respectively, and finally group V included 8 species representing P.lasianthum var. coreanum. meaning that the useful germplasms can be collected from relatively small area.

S-RCSA : Efficiency Analysis of Sectored Random Cluster Header Selection Algorithm (섹터화된 랜덤 클러스터 헤더 선출 알고리즘 효율성 분석)

  • Kim, Min-Je;Lee, Doo-Wan;Jang, Kyung-Sik
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2011.10a
    • /
    • pp.831-834
    • /
    • 2011
  • LEACH(One of the leading algorithms in the field of WSN) for the life of the system, even by the number of all nodes to ensure that the cluster header. However, each round does not guarantee a certain number of cluster header. So sometimes cluster header is elected of small number or not elected. If cluster header number is to small, takes a heavy load on cluster header. And empty cluster is occur depending on the location of the cluster header. The algorithm proposed in this paper, the area of interest is divided into sectors. And randomly, cluster header be elected one the in each sector. When clustering the sensor nodes will belong to the nearest cluster header. So clustering is independent of the sector. This algorithm is guarantee a certain number of cluster header in each round. And has prevent occurrence of empty cluster.

  • PDF

Combining cluster analysis and neural networks for the classification problem

  • Kim, Kyungsup;Han, Ingoo
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 1996.10a
    • /
    • pp.31-34
    • /
    • 1996
  • The extensive researches have compared the performance of neural networks(NN) with those of various statistical techniques for the classification problem. The empirical results of these comparative studies have indicated that the neural networks often outperform the traditional statistical techniques. Moreover, there are some efforts that try to combine various classification methods, especially multivariate discriminant analysis with neural networks. While these efforts improve the performance, there exists a problem violating robust assumptions of multivariate discriminant analysis that are multivariate normality of the independent variables and equality of variance-covariance matrices in each of the groups. On the contrary, cluster analysis alleviates this assumption like neural networks. We propose a new approach to classification problems by combining the cluster analysis with neural networks. The resulting predictions of the composite model are more accurate than each individual technique.

  • PDF