An Iterative Normalization Algorithm for cDNA Microarray Medical Data Analysis

  • Kim, Yoonhee (Department of Biostatistics and Epidemiology, School of Public Health, and Institute of Health and Environment, Seoul National University) ;
  • Park, Woong-Yang (Department of Biochemistry Seoul National University College of Medicine) ;
  • Kim, Ho (Department of Biostatistics and Epidemiology, School of Public Health, and Institute of Health and Environment, Seoul National University)
  • Published : 2004.06.01

Abstract

A cDNA microarray experiment is one of the most useful high-throughput experiments in medical informatics for monitoring gene expression levels. Statistical analysis with a cDNA microarray medical data requires a normalization procedure to reduce the systematic errors that are impossible to control by the experimental conditions. Despite the variety of normalization methods, this. paper suggests a more general and synthetic normalization algorithm with a control gene set based on previous studies of normalization. Iterative normalization method was used to select and include a new control gene set among the whole genes iteratively at every step of the normalization calculation initiated with the housekeeping genes. The objective of this iterative normalization was to maintain the pattern of the original data and to keep the gene expression levels stable. Spatial plots, M&A (ratio and average values of the intensity) plots and box plots showed a convergence to zero of the mean across all genes graphically after applying our iterative normalization. The practicability of the algorithm was demonstrated by applying our method to the data for the human photo aging study.

Keywords

References

  1. Chen, Y., Dougherty, E. R., and Bittner, M. (1997). Ratiobased decisions and the quantitative analysis of cDNA microarray images. J. Biomed.Opt. 2,364-374 https://doi.org/10.1117/12.281504
  2. Cleveland, W. S. and Devlin, S .G. (1988). Locally weighted regression: an approach to regression analysis by local fitting. J. Am. Stat. Assoc. 83, 596-610 https://doi.org/10.2307/2289282
  3. David, J., Duggan, M. B, Yidong, C., Paul, M., and Jeffrey, M. T. (1999). Expression profiling using cDNA microarrays. Nature genetics supplement. 21, 10-14 https://doi.org/10.1038/4434
  4. Dudoit, S., Fridlyand, J., and Speed., T.P.(2000). Comparison of Discrimination Methods for the Classification of Tumors Using Gene Experssion Data. Dept. of Statistics, University of California at Berkeley, Technical reports
  5. Dudoit, S., Yang, Y. H., Callow, M. J., and Speed., T. P. (2000). Statistical methods for identifying differentially expressed genes in replicated cDNA microarray experiments. Dept. of Statistics, University of California at Berkeley, Technical reports
  6. Golub, T. R., Slonim, D. K., Tamayo, P., Huard, C., Gaasenbeek, M., Mesirov, J. P., Coller, H., Loh, M .L., Downing, J. R., Caligiuri, M. A., Bloomfield, C. D., and Lander, E. S. (1999). Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286(5439), 531-7 https://doi.org/10.1126/science.286.5439.531
  7. Hedenfalk, L. et al. (2001). Gene-expression profiles in hereditary breast cancer. N. Engl. J. Med. 344, 539-548 https://doi.org/10.1056/NEJM200102223440801
  8. Kepler, T. B., Crosby, L., and Morgan, K. T. (2002). Normalization and analysis of DNA microarray data by self-consistency and local regression. Genome Biol. 3(7), RESEARCH0037
  9. Kim, J. H., Shin, D. M., and Lee, Y. S. (2002). Effect of local background intensities in thenormalization of cDNA microarray data with a skewed expression profiles. Exp MoI Med. 34, 224-232 https://doi.org/10.1038/emm.2002.31
  10. Michael, B., Eisen, P., and Brown, O. (2000). NA arrays for analysis of Gene Expression. Stanford University School of Medicine, Stanford, CA, Technical reports
  11. Quackenbush, J. (2001). Computional Analysis of micorarray data. Nature 418-427
  12. Schuchhardt, J., Beule, D., Malik, A., Wolski, E., Eickhoff, H., Lehrach, H., and Herzel, H. (2000). Normalization strategies for cDNA microarrays. Nucleic Acids Res. 28(10)E47 https://doi.org/10.1093/nar/28.10.e47
  13. Tran, P. H., Peiffer, D. A., Shin, Y., Meek, L. M., Brody, J. P., and Cho, K. W. (2002). Microarray optimizations: increasing spot accuracy and automated identification of true microarray signals. Nucleic Acids Res. 30, e54 https://doi.org/10.1093/nar/gnf053
  14. Tseng, G. C., Oh, M. K., Rohlin, L., Liao, J. C., and Wong, W. H. (2001). Issues in cDNA microarray analysis: quality filtering, channel normalization, models of variations and assessment of gene effects. Nucleic Acids Res. 29, 2549-57 https://doi.org/10.1093/nar/29.12.2549
  15. Tsodikov, A., Szabo, A., and Jones, D. (2002). Adjustments and measures of differential expression for microarray data. Bioinformatics 18, 251-60 https://doi.org/10.1093/bioinformatics/18.2.251
  16. Wang, Y., Lu, J., Lee, R., Gu, Z., and Clarke, R. (2002). Iterative normalization of cDNA microarray data. IEEE Trans Inf. Technol. Biomed. 6, 29-37 https://doi.org/10.1109/4233.992159
  17. Yang, Y. H., Dudoit, S., Luu, P., Lin, D. M., Peng ,V., Ngai, J., and Speed, T. P. (2002). Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation. Nucleic Acids Res. 30, e15 https://doi.org/10.1093/nar/30.4.e15
  18. Zien, A., Aigner, T., Zimmer, R., and Lengauer, T. (2001). Centralization: a new method for the normalization of gene expression data. Bioinformatics 17(SuppI. 1), S323-31 https://doi.org/10.1093/bioinformatics/17.suppl_1.S323