Proceedings of the Korean Information Science Society Conference (한국정보과학회:학술대회논문집)
- 2006.06a
- /
- Pages.4-6
- /
- 2006
- /
- 1598-5164(pISSN)
Tree-Dependent Components of Gene Expression Data for Clustering
유전자발현데이터의 군집분석을 위한 나무 의존 성분 분석
- Kim Jong-Kyoung (Department of Computer Science Pohang University of Science and Technology) ;
- Choi Seung-Jin (Department of Computer Science Pohang University of Science and Technology)
- Published : 2006.06.01
Abstract
Tree-dependent component analysis (TCA) is a generalization of independent component analysis (ICA), the goal of which is to model the multivariate data by a linear transformation of latent variables, while latent variables fit by a tree-structured graphical model. In contrast to ICA, TCA allows dependent structure of latent variables and also consider non-spanning trees (forests). In this paper, we present a TCA-based method of clustering gene expression data. Empirical study with yeast cell cycle-related data, yeast metaboiic shift data, and yeast sporulation data, shows that TCA is more suitable for gene clustering, compared to principal component analysis (PCA) as well as ICA.
Keywords