http://dx.doi.org/10.7232/iems.2010.9.2.131

A Biclustering Method for Time Series Analysis  

Lee, Jeong-Hwa (Department of Industrial and Management Engineering, Pohang University of Science and Technology)
Lee, Young-Rok (Department of Industrial and Management Engineering, Pohang University of Science and Technology)
Jun, Chi-Hyuck (Department of Industrial and Management Engineering, Pohang University of Science and Technology)
Publication Information
Industrial Engineering and Management Systems / v.9, no.2, 2010, pp. 131-140
Abstract
Biclustering simultaneously finds meaningful subsets of objects and attributes that traditional clustering methods may not detect. It is widely used to analyze microarray data, which represent the expression levels of genes under different conditions. Biclustering algorithms usually do not consider the sequential relation between attributes. For time series data, however, a bicluster solution should preserve the time sequence. This paper proposes a new biclustering algorithm for time series data by modifying the plaid model. The proposed algorithm introduces a parameter that controls the interval between two selected time points. In addition, the pruning step, which prevents over-fitting, is modified so that it eliminates only starting or ending points. Results on artificial data sets show that the proposed method is more suitable for extracting biclusters from time series data sets. Moreover, applying the proposed method to real-world time-course microarray data sets and to apartment price data sets in metropolitan areas yields some interesting observations.
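The two time-series constraints named in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: the abstract does not define the interval parameter, so the code assumes it is a maximum allowed gap between consecutive selected time points, and the names `gaps_ok`, `prune_ends`, `max_gap`, and `keep` are hypothetical.

```python
from typing import Callable, List


def gaps_ok(times: List[int], max_gap: int) -> bool:
    """Return True if consecutive selected time indices are at most max_gap apart.

    Assumption: the interval parameter is read as an upper bound on the gap.
    """
    ordered = sorted(times)
    return all(b - a <= max_gap for a, b in zip(ordered, ordered[1:]))


def prune_ends(times: List[int], keep: Callable[[int], bool]) -> List[int]:
    """Drop time points only from the start or the end of the selection.

    Interior points are never removed, so pruning cannot open new holes
    in the time sequence.
    """
    ordered = sorted(times)
    lo, hi = 0, len(ordered)
    while lo < hi and not keep(ordered[lo]):          # trim from the start
        lo += 1
    while hi > lo and not keep(ordered[hi - 1]):      # trim from the end
        hi -= 1
    return ordered[lo:hi]


selected = [2, 3, 5, 6, 9]
weak = {2, 5, 9}  # hypothetical time points that fit the bicluster poorly

print(gaps_ok(selected, max_gap=2))                    # False: 6 -> 9 is a gap of 3
print(prune_ends(selected, lambda t: t not in weak))   # [3, 5, 6]
```

In this example, time point 5 is marked as weak but survives because it lies in the interior of the selection, whereas an unrestricted pruning step would remove it and break the sequence.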
Keywords
Biclustering; Time-series Data; Plaid Model; Binary Least Square