Tree-Dependent Components of Gene Expression Data for Clustering

유전자발현데이터의 군집분석을 위한 나무 의존 성분 분석

  • Kim Jong-Kyoung (Department of Computer Science Pohang University of Science and Technology) ;
  • Choi Seung-Jin (Department of Computer Science Pohang University of Science and Technology)
  • 김종경 (포항공과대학교 컴퓨터공학과) ;
  • 최승진 (포항공과대학교 컴퓨터공학과)
  • Published : 2006.06.01

Abstract

Tree-dependent component analysis (TCA) is a generalization of independent component analysis (ICA), the goal of which is to model the multivariate data by a linear transformation of latent variables, while latent variables fit by a tree-structured graphical model. In contrast to ICA, TCA allows dependent structure of latent variables and also consider non-spanning trees (forests). In this paper, we present a TCA-based method of clustering gene expression data. Empirical study with yeast cell cycle-related data, yeast metaboiic shift data, and yeast sporulation data, shows that TCA is more suitable for gene clustering, compared to principal component analysis (PCA) as well as ICA.

Keywords