• Title/Summary/Keyword: Non-linear Clustering

Search Result 51, Processing Time 0.022 seconds

Design of Hierarchically Structured Clustering Algorithm and its Application

  • Bang, Young-Keun;Park, Ha-Yong;Lee, Chul-Heui
    • Journal of Industrial Technology / v.29 no.B / pp.17-23 / 2009
  • Clustering algorithms are widely used to extract and discover useful information from non-linear data, and they strongly affect the performance of systems that handle such data. This paper presents a new approach, a hierarchically structured clustering algorithm, and applies it to a prediction system for non-linear time series data. The proposed algorithm (HCKA: Hierarchical Cross-correlation and K-means clustering Algorithms) combines cross-correlation clustering with k-means clustering, so it can capture the correlation of non-linear time series as well as their statistical characteristics. First, the optimal differences of the data are generated, which suitably reveal the characteristics of the non-linear time series. Second, the generated differences are classified into upper clusters for their predictors by the cross-correlation clustering algorithm, and each group of classified differences is then classified again into lower fuzzy sets by the k-means clustering algorithm. As a result, the proposed method gives an efficient classification and improves prediction performance. Finally, we demonstrate the effectiveness of the proposed HCKA on typical time series examples.
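
The two-level idea in this abstract can be illustrated with a minimal sketch. It is not the authors' HCKA implementation: the greedy grouping rule, the 0.8 threshold, the quantile initialization, and all function names are assumptions made for illustration, and the lower level uses plain 1-D k-means rather than fuzzy sets.

```python
from statistics import mean


def difference(series, lag=1):
    """Lagged difference d[t] = x[t] - x[t - lag]."""
    return [series[t] - series[t - lag] for t in range(lag, len(series))]


def correlation(a, b):
    """Zero-lag normalized cross-correlation of two sequences.

    Sequences of unequal length are aligned at their ends, because
    difference series with different lags end at the same time index.
    """
    n = min(len(a), len(b))
    a, b = a[-n:], b[-n:]
    ma, mb = mean(a), mean(b)
    num = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    den = (sum((x - ma) ** 2 for x in a) * sum((y - mb) ** 2 for y in b)) ** 0.5
    return num / den if den else 0.0


def group_by_correlation(candidates, threshold=0.8):
    """Upper level (assumed rule): greedily put each series into the first
    group whose first member correlates with it at or above the threshold."""
    groups = []
    for name, seq in candidates.items():
        for group in groups:
            if correlation(candidates[group[0]], seq) >= threshold:
                group.append(name)
                break
        else:
            groups.append([name])
    return groups


def kmeans_1d(values, k, iters=50):
    """Lower level: plain 1-D k-means with quantile initialization.
    Returns the cluster centers in ascending order."""
    s = sorted(values)
    centers = [s[(2 * i + 1) * len(s) // (2 * k)] for i in range(k)]
    for _ in range(iters):
        buckets = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda j: abs(v - centers[j]))
            buckets[nearest].append(v)
        new_centers = [mean(b) if b else c for b, c in zip(buckets, centers)]
        if new_centers == centers:
            break
        centers = new_centers
    return sorted(centers)


# Hypothetical series: first- and second-lag differences of t^2 are both linear in t.
x = [t * t for t in range(10)]
candidates = {"d1": difference(x, 1), "d2": difference(x, 2)}
groups = group_by_correlation(candidates)
centers = kmeans_1d(candidates["d1"], k=2)
```

In a predictor built on this idea, each upper-level group would feed one predictor, and the centers found per group would define that predictor's input partitions.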

The extension of the largest generalized-eigenvalue based distance metric $D_{ij}(\gamma_1)$ in arbitrary feature spaces to classify composite data points

  • Daoud, Mosaab
    • Genomics & Informatics / v.17 no.4 / pp.39.1-39.20 / 2019
  • Analyzing patterns in data points embedded in linear and non-linear feature spaces is a common research problem across areas such as data mining, machine learning, pattern recognition, and multivariate analysis. In this paper, data points are heterogeneous sets of biosequences (composite data points). A composite data point is a set of ordinary data points (e.g., a set of feature vectors). We theoretically extend the derivation of the largest generalized eigenvalue-based distance metric $D_{ij}(\gamma_1)$ to any linear or non-linear feature space. We prove that $D_{ij}(\gamma_1)$ is a metric under any linear or non-linear feature transformation function. We show the sufficiency and efficiency of the decision rule $\bar{\delta}_{\Xi i}$ (i.e., the mean of $D_{ij}(\gamma_1)$) in classifying heterogeneous sets of biosequences, compared with the decision rules $\min_{\Xi i}$ and $\mathrm{median}_{\Xi i}$. We analyze the impact of linear and non-linear transformation functions on classifying and clustering collections of heterogeneous sets of biosequences. We also show empirically how the length of a sequence in a simulated heterogeneous sequence-set affects the classification and clustering results in linear and non-linear feature spaces. We propose a new concept: the limiting dispersion map of the existing clusters in heterogeneous sets of biosequences embedded in linear and non-linear feature spaces, which is based on the limiting distribution of nucleotide compositions estimated from real data sets. Finally, empirical conclusions and scientific evidence are drawn from the experiments to support the theoretical results stated in this paper.

A Non-linear Variant of Global Clustering Using Kernel Methods

  • Heo, Gyeong-Yong;Kim, Seong-Hoon;Woo, Young-Woon
    • Journal of the Korea Society of Computer and Information / v.15 no.4 / pp.11-18 / 2010
  • Fuzzy c-means (FCM) is a simple but efficient clustering algorithm using the concept of a fuzzy set that has been proved to be useful in many areas. There are, however, several well known problems with FCM, such as sensitivity to initialization, sensitivity to outliers, and limitation to convex clusters. In this paper, global fuzzy c-means (G-FCM) and kernel fuzzy c-means (K-FCM) are combined to form a non-linear variant of G-FCM, called kernel global fuzzy c-means (KG-FCM). G-FCM is a variant of FCM that uses an incremental seed selection method and is effective in alleviating sensitivity to initialization. There are several approaches to reduce the influence of noise and accommodate non-convex clusters, and K-FCM is one of them. K-FCM is used in this paper because it can easily be extended with different kernels. By combining G-FCM and K-FCM, KG-FCM can resolve the shortcomings mentioned above. The usefulness of the proposed method is demonstrated by experiments using artificial and real world data sets.
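
As a rough illustration of the kernel half of this combination, the sketch below runs fuzzy c-means with a Gaussian kernel, keeping the centers in input space (a common kernel FCM formulation). It is not the paper's KG-FCM: the incremental seed selection of G-FCM is omitted, the initial centers are supplied by the caller, and the function names, `sigma`, and the sample points are assumptions.

```python
import math


def gaussian_kernel(x, v, sigma):
    """K(x, v) = exp(-||x - v||^2 / (2 sigma^2))."""
    d2 = sum((a - b) ** 2 for a, b in zip(x, v))
    return math.exp(-d2 / (2.0 * sigma ** 2))


def kernel_fcm(points, init_centers, m=2.0, sigma=1.0, iters=100, tol=1e-6):
    """Kernel fuzzy c-means with a Gaussian kernel.

    Returns (centers, U), where U[k][i] is the membership of point k in
    cluster i computed in the last iteration.
    """
    centers = [list(c) for c in init_centers]
    U = []
    for _ in range(iters):
        # Membership update: u_ik is proportional to (1 - K(x_k, v_i))^(-1/(m-1)).
        U = []
        for x in points:
            w = [max(1.0 - gaussian_kernel(x, v, sigma), 1e-12) ** (-1.0 / (m - 1.0))
                 for v in centers]
            total = sum(w)
            U.append([wi / total for wi in w])
        # Center update: mean of the points weighted by u_ik^m * K(x_k, v_i).
        new_centers = []
        for i, v in enumerate(centers):
            coef = [U[k][i] ** m * gaussian_kernel(x, v, sigma)
                    for k, x in enumerate(points)]
            total = sum(coef)
            if total == 0.0:
                new_centers.append(v)  # no point has kernel weight; keep the center
                continue
            new_centers.append(
                [sum(c * x[d] for c, x in zip(coef, points)) / total
                 for d in range(len(v))]
            )
        shift = max(abs(a - b)
                    for nv, v in zip(new_centers, centers)
                    for a, b in zip(nv, v))
        centers = new_centers
        if shift < tol:
            break
    return centers, U


# Hypothetical data: two well-separated groups of three points each.
pts = [(0, 0), (0, 1), (1, 0), (5, 5), (5, 6), (6, 5)]
centers, U = kernel_fcm(pts, init_centers=[(0, 0), (5, 5)], sigma=2.0)
```

Because the kernel weight decays with distance, far-away points contribute little to a center, which is one way a kernel variant reduces the influence of outliers.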

The Alcock-Paczynski effect via clustering shells

  • Sabiu, Cristiano G.;Lee, Seokcheon;Park, Changbom
    • The Bulletin of The Korean Astronomical Society / v.38 no.2 / pp.58.2-58.2 / 2013
  • Both peculiar velocities and errors in the assumed redshift-distance relation ("Alcock-Paczynski effect") generate correlations between clustering amplitude and orientation with respect to the line-of-sight. In this talk we propose a novel technique to extract the Alcock-Paczynski, geometric, distortion information from the anisotropic clustering of galaxies in 3-dimensional redshift space while minimizing non-linear clustering and peculiar velocity effects. We capitalize on the recent, large dataset from the Sloan Digital Sky Survey III (SDSS-III), which provides a large comoving sample of the universe out to high redshift. We focus our analysis on the Baryon Oscillation Spectroscopic Survey (BOSS) constant mass (CMASS) sample of 549,005 bright galaxies in the redshift range 0.43

Nonlinear structural finite element model updating with a focus on model uncertainty

  • Mehrdad, Ebrahimi;Reza Karami, Mohammadi;Elnaz, Nobahar;Ehsan Noroozinejad, Farsangi
    • Earthquakes and Structures / v.23 no.6 / pp.549-580 / 2022
  • This paper assesses the influence of modeling assumptions and uncertainties on the performance of the non-linear finite element (FE) model updating procedure and the model clustering method. The results of a shaking table test on a four-story steel moment-resisting frame are employed for both calibration and clustering of the FE models. In the first part, non-linear FE models of the test frame, ranging from simple to detailed, are calibrated to minimize the difference between various data features of the models and the structure. To investigate the effect of the chosen data feature, four features have been considered: acceleration, displacement, hysteretic energy, and instantaneous features of the responses. In the last part of the work, a model-based clustering approach is introduced that groups models of the four-story frame with similar behavior in order to detect abnormal ones. The approach combines property derivation, outlier removal based on k-nearest neighbors, and K-means clustering using the specified data features. The clustering results showed correlations among similar models and also helped to identify the best strategy for modeling different structural components.
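
The outlier-removal and clustering steps named in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the outlier score (mean distance to the k nearest neighbors), the cutoff of three times the median score, the farthest-point initialization, and the toy feature vectors are all hypothetical and are not taken from the paper.

```python
import math
from statistics import median


def knn_scores(points, k):
    """Outlier score per point: mean distance to its k nearest neighbors."""
    scores = []
    for i, p in enumerate(points):
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        scores.append(sum(dists[:k]) / k)
    return scores


def remove_outliers(points, k=2, factor=3.0):
    """Keep points whose score is at most factor * median score (assumed rule)."""
    scores = knn_scores(points, k)
    cutoff = factor * median(scores)
    return [p for p, s in zip(points, scores) if s <= cutoff]


def kmeans(points, k, iters=100):
    """Plain K-means with deterministic farthest-point initialization.
    Returns (labels, centers)."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(
            max(points, key=lambda p: min(math.dist(p, c) for c in centers))
        )
    labels = []
    for _ in range(iters):
        labels = [min(range(k), key=lambda j: math.dist(p, centers[j]))
                  for p in points]
        new_centers = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centers.append(
                    tuple(sum(c) / len(members) for c in zip(*members))
                )
            else:
                new_centers.append(centers[j])  # empty cluster: keep old center
        if new_centers == centers:
            break
        centers = new_centers
    return labels, centers


# Hypothetical feature vectors for nine candidate models; the last one is abnormal.
features = [(0, 0), (0, 1), (1, 0), (1, 1),
            (10, 10), (10, 11), (11, 10), (11, 11),
            (50, 50)]
kept = remove_outliers(features, k=2, factor=3.0)
labels, centers = kmeans(kept, k=2)
```

Removing outliers before K-means matters because a single distant point can otherwise capture its own cluster and distort the grouping of the remaining models.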

The clustering of critical points in the evolving cosmic web

  • Shim, Junsup;Codis, Sandrine;Pichon, Christophe;Pogosyan, Dmitri;Cadiou, Corentin
    • The Bulletin of The Korean Astronomical Society / v.46 no.1 / pp.47.2-47.2 / 2021
  • Focusing on both small separations and baryonic acoustic oscillation scales, the cosmic evolution of the clustering properties of peak, void, wall, and filament-type critical points is measured using two-point correlation functions in ΛCDM dark matter simulations as a function of their relative rarity. A qualitative comparison to the corresponding theory for Gaussian random fields allows us to understand the following observed features: (i) the appearance of an exclusion zone at small separation, whose size depends both on rarity and signature (i.e. the number of negative eigenvalues) of the critical points involved; (ii) the amplification of the baryonic acoustic oscillation bump with rarity and its reversal for cross-correlations involving negatively biased critical points; (iii) the orientation-dependent small-separation divergence of the cross-correlations of peaks and filaments (respectively voids and walls) that reflects the relative loci of such points in the filament's (respectively wall's) eigenframe. The (cross-) correlations involving the most non-linear critical points (peaks, voids) display significant variation with redshift, while those involving less non-linear critical points seem mostly insensitive to redshift evolution, which should prove advantageous to model. The ratios of distances to the maxima of the peak-to-wall and peak-to-void over that of the peak-to-filament cross-correlation are $\sim\sqrt{2}$ and $\sim\sqrt{3}$, respectively, which could be interpreted as the cosmic crystal being on average close to a cubic lattice. The insensitivity to redshift evolution suggests that the absolute and relative clustering of critical points could become a topologically robust alternative to standard clustering techniques when analysing upcoming surveys such as Euclid or Large Synoptic Survey Telescope (LSST).

Cosmic Distances Probed Using The BAO Ring

  • Sabiu, Cristiano G.;Song, Yong-Seon
    • The Bulletin of The Korean Astronomical Society / v.41 no.1 / pp.39.1-39.1 / 2016
  • The cosmic distance can be precisely determined using a 'standard ruler' imprinted by primordial baryon acoustic oscillation (hereafter BAO) in the early Universe. The BAO at the targeted epoch is observed by analyzing galaxy clustering in redshift space (hereafter RSD), whose theoretical formulation is not yet fully understood, which makes this methodology unsatisfactory. The BAO analysis through full RSD modeling is contaminated by systematic uncertainty due to non-linear smearing effects, such as non-linear corrections and the uncertainty caused by the random virial velocities of galaxies. However, BAO can be probed independently of RSD contamination using the BAO peak positions located in the 2D anisotropic correlation function. A new methodology is presented to measure the peak positions, to test whether they are also contaminated by the same systematics as RSD, and to provide the radial and transverse cosmic distances determined by the 2D BAO peak positions. We find that our model-independent anisotropic clustering analysis yields constraints of about 2% and 5% on $D_A$ and $H^{-1}$, respectively, with current BOSS data, which is competitive with other analyses.

Implementation of unsupervised clustering methods for measurement gases using artificial olfactory sensing system

  • 최지혁;함유경;최찬석;김정도;변형기
    • 제어로봇시스템학회:학술대회논문집 / 2000.10a / pp.405-405 / 2000
  • We designed an artificial olfactory sensing system (electronic nose) using a MOS-type sensor array for recognizing and analyzing odours. The responses of the individual sensors in the array, each possessing a slightly different response to the sample volatiles, can provide enough information to discriminate between sample odours. In this paper, we applied dimension-reduction methods for clustering: linear projection mapping (the PCA method), non-linear mapping (the Sammon mapping method), and a combination of PCA and Sammon mapping, which has better discriminating ability. The odours used are VOCs (volatile organic compounds) and toxic gases.
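
For context, Sammon mapping looks for a low-dimensional layout that minimizes a stress measure comparing pairwise distances before and after projection. The sketch below only evaluates that standard stress for a given projection and does not perform the optimization or the PCA step. The function name and the sample points are assumptions, not sensor data from the paper.

```python
import math


def sammon_stress(high, low):
    """Sammon stress between points `high` and their projections `low`.

    E = (1 / sum d*_ij) * sum (d*_ij - d_ij)^2 / d*_ij over pairs i < j,
    where d* are distances in the original space and d in the projection.
    Pairs that coincide in the original space are skipped.
    """
    total = 0.0
    err = 0.0
    n = len(high)
    for i in range(n):
        for j in range(i + 1, n):
            d_star = math.dist(high[i], high[j])
            if d_star == 0.0:
                continue
            d = math.dist(low[i], low[j])
            total += d_star
            err += (d_star - d) ** 2 / d_star
    return err / total if total else 0.0


# Hypothetical 3-D responses that lie in a plane, so a 2-D projection is exact.
high = [(0, 0, 0), (1, 0, 0), (0, 1, 0)]
exact = sammon_stress(high, [(0, 0), (1, 0), (0, 1)])   # distances preserved
lossy = sammon_stress(high, [(0,), (1,), (0,)])         # two points collapse
```

A linear projection such as PCA can serve as the starting layout, with the stress then reduced iteratively, which is one plausible reading of the combined PCA and Sammon approach.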

DYNAMICAL AND STATISTICAL ASPECTS OF GRAVITATIONAL CLUSTERING IN THE UNIVERSE

  • SAHNI V.
    • Journal of The Korean Astronomical Society / v.29 no.spc1 / pp.19-21 / 1996
  • We apply topological measures of clustering, such as percolation and genus curves (PC & GC), and shape statistics to a set of scale-free N-body simulations of large-scale structure. Both genus and percolation curves evolve with time, reflecting the growth of non-Gaussianity in the N-body density field. The amplitude of the genus curve decreases with epoch due to non-linear mode coupling, the decrease being more noticeable for spectra with small-scale power. Plotted against the filling factor, GC shows very little evolution, a surprising result, since the percolation curve shows significant evolution for the same data. Our results indicate that both PC and GC could be used to discriminate between rival models of structure formation and in the analysis of CMB maps. Using shape-sensitive statistics, we find that there is a strong tendency for objects in our simulations to be filament-like, with the degree of filamentarity increasing with epoch.

Alcock-Paczynski Test with the Evolution of Redshift-Space Galaxy Clustering Anisotropy: Understanding the Systematics

  • Park, Hyunbae;Park, Changbom;Tonegawa, Motonari;Zheng, Yi;Sabiu, Cristiano G.;Li, Xiao-dong;Hong, Sungwook E.;Kim, Juhan
    • The Bulletin of The Korean Astronomical Society / v.44 no.1 / pp.78.2-78.2 / 2019
  • We develop an Alcock-Paczynski (AP) test method that uses the evolution of the redshift-space two-point correlation function (2pCF) of galaxies. The method improves the AP test proposed by Li et al. (2015) in that it uses the full two-dimensional shape of the correlation function. Similarly to the original method, the new one uses the 2pCF in redshift space with its amplitude normalized. Cosmological constraints can be obtained by examining the redshift dependence of the normalized 2pCF. This is because the 2pCF should not change apart from the expected small non-linear evolution if galaxy clustering is not distorted by an incorrect choice of the cosmology used to convert redshift to comoving distance. Our new method decomposes the redshift difference of the two-dimensional correlation function into Legendre polynomials whose amplitudes are modelled by radial fitting functions. The shape of the normalized 2pCF suffers from small intrinsic time evolution due to non-linear gravitational evolution and the change of galaxy type between different redshifts. It can be accurately measured by using state-of-the-art cosmological simulations. We use a set of our Multiverse simulations to find that the systematic effects on the shape of the normalized 2pCF are quite insensitive to changes of cosmology over $\Omega_m = 0.21$ to $0.31$ and $w = -0.5$ to $-1.5$. Thanks to this finding, we can now apply our method for the AP test using the non-linear systematics measured from a single simulation of the fiducial cosmological model.
