• Title/Summary/Keyword: 군집 수 결정

Search Result 365, Processing Time 0.032 seconds

Magnifying Block Diagonal Structure for Spectral Clustering (스펙트럼 군집화에서 블록 대각 형태의 유사도 행렬 구성)

  • Heo, Gyeong-Yong;Kim, Kwang-Baek;Woo, Young-Woon
    • Journal of Korea Multimedia Society
    • /
    • v.11 no.9
    • /
    • pp.1302-1309
    • /
    • 2008
  • Traditional clustering methods, like k-means or fuzzy clustering, are prototype-based methods which are applicable only to convex clusters. On the other hand, spectral clustering tries to find clusters only using local similarity information. Its ability to handle concave clusters has gained the popularity recent years together with support vector machine (SVM) which is a kernel-based classification method. However, as is in SVM, the kernel width plays an important role and has a great impact on the result. Several methods are proposed to decide it automatically, it is still determined based on heuristics. In this paper, we proposed an adaptive method deciding the kernel width based on distance histogram. The proposed method is motivated by the fact that the affinity matrix should be formed into a block diagonal matrix to generate the best result. We use the tradition Euclidean distance together with the random walk distance, which make it possible to form a more apparent block diagonal affinity matrix. Experimental results show that the proposed method generates more clear block structured affinity matrix than the existing one does.

  • PDF

Comparison between at-site frequency analysis and regional frequency analysis at Gangwon Province (강원도에서의 지점빈도분석과 지역빈도분석의 비교)

  • Seo, Dong Il;Kim, Sang Ug;Jeon, Young Il;Han, Jae Wook
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2023.05a
    • /
    • pp.205-205
    • /
    • 2023
  • 지역 빈도 분석과 점 빈도 분석은 하천 기본계획 및 수공 구조물의 설계에 있어 재현기간 별 확률강우량을 산정하기 위한 방법이다. 점 빈도 분석은 자료의 수가 부족하여 높은 재현기간에 대한 확률강우량을 산정하기에 어려운 점이 있다. 2019년도부터 사용되고 있는 지역빈도분석 방법은 이러한 점을 보완해주고 있다. 지역빈도분석을 수행하기 위해서는 지역의 동질성을 확인하는 과정이 가장 중요한 과정이다. 이러한 동질성을 판단하기 위하여 K-means등의 군집분석과 L-moment 법 등을 사용하고 있다. 이러한 차이점으로 인해 두 방법 간의 정확성은 비교가 어려우나 서로 간의 장점, 단점과 결과 간의 차이를 기반으로 산간지역이 많은 강원도와 같은 지역에 대한 확률강우량 산정의 적절한 방법을 판단해보고자 본 연구를 진행하였다. 지역 빈도 분석은 강원도에 위치한 48개 관측소의 강우 자료 수집 후 고도, 위치, 지속시간 별 강우량을 변수로 지정하고 K-means 분석을 통해 6개의 군집으로 구분하여 수행되었다. 이질성 척도는 관측 자료와 500번의 모의 수행을 통해 결정하였다. 이후 분석된 군집이 동질한 경우 확률분포형에 적합시켜 확률강우량을 산정하였다. 점 빈도 분석은 지역 빈도 분석에서 결정된 군집에서의 최대 강우량과 최소 강우량 관측소의 자료를 이용하여 수행하였다. 본 연구에서는 점빈도분석과 지역빈도분석의 결과를 비교하였으며, 두 가지 분석 방법에 따른 차이의 발생원인 및 특성을 결론으로 제시하였다.

  • PDF

Decision Tree Based Context Clustering with Cross Likelihood Ratio for HMM-based TTS (HMM 기반의 TTS를 위한 상호유사도 비율을 이용한 결정트리 기반의 문맥 군집화)

  • Jung, Chi-Sang;Kang, Hong-Goo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.32 no.2
    • /
    • pp.174-180
    • /
    • 2013
  • This paper proposes a decision tree based context clustering algorithm for HMM-based speech synthesis systems using the cross likelihood ratio with a hierarchical prior (CLRHP). Conventional algorithms tie the context-dependent HMM states that have similar statistical characteristics, but they do not consider the statistical similarity of split child nodes, which does not guarantee the statistical difference between the final leaf nodes. The proposed CLRHP algorithm improves the reliability of model parameters by taking a criterion of minimizing the statistical similarity of split child nodes. Experimental results verify the superiority of the proposed approach to conventional ones.

Classification of Magnetic Resonance Imagery Using Deterministic Relaxation of Neural Network (신경망의 결정론적 이완에 의한 자기공명영상 분류)

  • 전준철;민경필;권수일
    • Investigative Magnetic Resonance Imaging
    • /
    • v.6 no.2
    • /
    • pp.137-146
    • /
    • 2002
  • Purpose : This paper introduces an improved classification approach which adopts a deterministic relaxation method and an agglomerative clustering technique for the classification of MRI using neural network. The proposed approach can solve the problems of convergency to local optima and computational burden caused by a large number of input patterns when a neural network is used for image classification. Materials and methods : Application of Hopfield neural network has been solving various optimization problems. However, major problem of mapping an image classification problem into a neural network is that network is opt to converge to local optima and its convergency toward the global solution with a standard stochastic relaxation spends much time. Therefore, to avoid local solutions and to achieve fast convergency toward a global optimization, we adopt MFA to a Hopfield network during the classification. MFA replaces the stochastic nature of simulated annealing method with a set of deterministic update rules that act on the average value of the variable. By minimizing averages, it is possible to converge to an equilibrium state considerably faster than standard simulated annealing method. Moreover, the proposed agglomerative clustering algorithm which determines the underlying clusters of the image provides initial input values of Hopfield neural network. Results : The proposed approach which uses agglomerative clustering and deterministic relaxation approach resolves the problem of local optimization and achieves fast convergency toward a global optimization when a neural network is used for MRI classification. Conclusion : In this paper, we introduce a new paradigm to classify MRI using clustering analysis and deterministic relaxation for neural network to improve the classification results.

  • PDF

Spatial Clustering Method Via Generalized Lasso (Generalized Lasso를 이용한 공간 군집 기법)

  • Song, Eunjung;Choi, Hosik;Hwang, Seungsik;Lee, Woojoo
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.4
    • /
    • pp.561-575
    • /
    • 2014
  • In this paper, we propose a penalized likelihood method to detect local spatial clusters associated with disease. The key computational algorithm is based on genlasso by Tibshirani and Taylor (2011). The proposed method has two main advantages over Kulldorff's method which is popoular to detect local spatial clusters. First, it is not needed to specify a proper cluster size a priori. Second, any type of covariate can be incorporated and, it is possible to find local spatial clusters adjusted for some demographic variables. We illustrate our proposed method using tuberculosis data from Seoul.

Cluster analysis by month for meteorological stations using a gridded data of numerical model with temperatures and precipitation (기온과 강수량의 수치모델 격자자료를 이용한 기상관측지점의 월별 군집화)

  • Kim, Hee-Kyung;Kim, Kwang-Sub;Lee, Jae-Won;Lee, Yung-Seop
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.5
    • /
    • pp.1133-1144
    • /
    • 2017
  • Cluster analysis with meteorological data allows to segment meteorological region based on meteorological characteristics. By the way, meteorological observed data are not adequate for cluster analysis because meteorological stations which observe the data are located not uniformly. Therefore the clustering of meteorological observed data cannot reflect the climate characteristic of South Korea properly. The clustering of $5km{\times}5km$ gridded data derived from a numerical model, on the other hand, reflect it evenly. In this study, we analyzed long-term grid data for temperatures and precipitation using cluster analysis. Due to the monthly difference of climate characteristics, clustering was performed by month. As the result of K-Means cluster analysis is so sensitive to initial values, we used initial values with Ward method which is hierarchical cluster analysis method. Based on clustering of gridded data, cluster of meteorological stations were determined. As a result, clustering of meteorological stations in South Korea has been made spatio-temporal segmentation.

A new cluster validity index based on connectivity in self-organizing map (자기조직화지도에서 연결강도에 기반한 새로운 군집타당성지수)

  • Kim, Sangmin;Kim, Jaejik
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.5
    • /
    • pp.591-601
    • /
    • 2020
  • The self-organizing map (SOM) is a unsupervised learning method projecting high-dimensional data into low-dimensional nodes. It can visualize data in 2 or 3 dimensional space using the nodes and it is available to explore characteristics of data through the nodes. To understand the structure of data, cluster analysis is often used for nodes obtained from SOM. In cluster analysis, the optimal number of clusters is one of important issues. To help to determine it, various cluster validity indexes have been developed and they can be applied to clustering outcomes for nodes from SOM. However, while SOM has an advantage in that it reflects the topological properties of original data in the low-dimensional space, these indexes do not consider it. Thus, we propose a new cluster validity index for SOM based on connectivity between nodes which considers topological properties of data. The performance of the proposed index is evaluated through simulations and it is compared with various existing cluster validity indexes.

Scene Change Detection with 3-Step Process (3단계 과정의 장면 전환검출)

  • Yoon, Shin-Seong;Won, Rhee-Yang
    • Journal of the Korea Society of Computer and Information
    • /
    • v.13 no.6
    • /
    • pp.147-154
    • /
    • 2008
  • First, this paper compute difference value between frames using the composed method of $X^2$ histogram and color histogram and the normalization. Next, cluster representative frame was decided by using the clustering for distance and the k-mean grouping. Finally, representative frame of group was decided by using the likelihood ratio. Proposed method can be known by experiment as outstanding of detection rather than other methods, due to computing of difference value, clustering and grouping, and detecting of representative frame.

  • PDF

Factor Analysis Affecting on Chartering Decision-making in the Dry Bulk Shipping Market (부정기 건화물선 시장에서 용선 의사결정에 영향을 미치는 요인 분석)

  • Lee, Choong-Ho;Park, Keun-Sik
    • Journal of Korea Port Economic Association
    • /
    • v.40 no.1
    • /
    • pp.151-163
    • /
    • 2024
  • This study sought to confirm the impact of analytical methods and behavioral economic theory factors on decision-making when making chartering decisions in the dry bulk shipping market. This study on chartering decision-making model was began to verify why shipping companies do not make rational decision-making and behavior based on analytical methods such as freight prediction and process of alternative selection in the same market situation. To understand the chartering decision-making model, it is necessary to study the impact of behavioral economic theory such as heuristics, loss aversion, and herding behavior on chartering decision-making. Through AHP analysis, the importance of the method factors relied upon in chartering decision-making. The dependence of the top factors in chartering decision-making was in the following order: market factors, heuristics, internal factors, herding behavior, and loss aversion. Market factors, heuristics, and internal factors. As for detailed factors, spot freight index and empirical intuition were confirmed as the most important factors relied on when making decisions. It was confirmed that empirical intuition is more important than internal analysis, which is an analytical method. This study can be said to be meaningful in that it academically researched and proved the bounded rationality of humans, which cannot be fully rational, and sometimes relies on experience or psychological tendencies, by applying it to the chartering decision-making model in the dry bulk shipping market. It also suggests that in the dry bulk shipping market, which is uncertain and has a high risk of loss due to decision-making, the experience and insight of decision makers have a very important impact on the performance and business profits of the operation part of shipping companies. Even though chartering are a decision-making field that requires judgment and intuition based on heuristics, decision-makers need to be aware of this decision-making model in order to reduce repeated mistakes of deciding contrary to market situation. It also suggests that there is a need to internally research analytical methods and procedures that can complement heuristics such as empirical intuition.

Using cluster analysis and genetic algorithm to develop portfolio investment strategy based on investor information (군집분석과 유전자 알고리즘을 활용한 투자자 거래정보 기반 포트폴리오 투자전략)

  • Cheong, Donghyun;Oh, Kyong Joo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.1
    • /
    • pp.107-117
    • /
    • 2014
  • The main purpose of this study is to propose a portfolio investment strategy based on investor types information. For improvement of investment performance, artificial intelligence techniques are used to construct a portfolio. Among many artificial intelligence techniques, cluster analysis is applied to select securities and genetic algorithm is applied to assign the respective weight within the portfolio. Empirical experiments in the Korean stock market show that proposed portfolio investment strategy is practicable and superior strategy. This result implies that analysis of investor's trading behavior may assist investors to make an investment decision and to get superior performance.