• Title/Summary/Keyword: expression microarray

Search Result 982, Processing Time 0.026 seconds

Analysis and Subclass Classification of Microarray Gene Expression Data Using Computational Biology (전산생물학을 이용한 마이크로어레이의 유전자 발현 데이터 분석 및 유형 분류 기법)

  • Yoo, Chang-Kyoo;Lee, Min-Young;Kim, Young-Hwang;Lee, In-Beum
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.11 no.10
    • /
    • pp.830-836
    • /
    • 2005
  • Application of microarray technologies which monitor simultaneously the expression pattern of thousands of individual genes in different biological systems results in a tremendous increase of the amount of available gene expression data and have provided new insights into gene expression during drug development, within disease processes, and across species. There is a great need of data mining methods allowing straightforward interpretation, visualization and analysis of the relevant information contained in gene expression profiles. Specially, classifying biological samples into known classes or phenotypes is an important practical application for microarray gene expression profiles. Gene expression profiles obtained from tissue samples of patients thus allowcancer classification. In this research, molecular classification of microarray gene expression data is applied for multi-class cancer using computational biology such gene selection, principal component analysis and fuzzy clustering. The proposed method was applied to microarray data from leukemia patients; specifically, it was used to interpret the gene expression pattern and analyze the leukemia subtype whose expression profiles correlated with four cases of acute leukemia gene expression. A basic understanding of the microarray data analysis is also introduced.

Cancer Genomics Object Model: An Object Model for Cancer Research Using Microarray

  • Park, Yu-Rang;Lee, Hye-Won;Cho, Sung-Bum;Kim, Ju-Han
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2005.09a
    • /
    • pp.29-34
    • /
    • 2005
  • DNA microarray becomes a major tool for the investigation of global gene expression in all aspects of cancer and biomedical research. DNA microarray experiment generates enormous amounts of data and they are meaningful only in the context of a detailed description of microarrays, biomaterials, and conditions under which they were generated. MicroArray Gene Expression Data (MGED) society has established microarray standard for structured management of these diverse and large amount data. MGED MAGE-OM (MicroArray Gene Expression Object Model) is an object oriented data model, which attempts to define standard objects for gene expression. To assess the relevance of DNA microarray analysis of cancer research it is required to combine clinical and genomics data. MAGE-OM, however, does not have an appropriate structure to describe clinical information of cancer. For systematic integration of gene expression and clinical data, we create a new model, Cancer Genomics Object Model.

  • PDF

Gene Expression Analysis of Acetaminophen-induced Liver Toxicity in Rat (아세트아미노펜에 의해 간손상이 유발된 랫드의 유전자 발현 분석)

  • Chung, Hee-Kyoung
    • Toxicological Research
    • /
    • v.22 no.4
    • /
    • pp.323-328
    • /
    • 2006
  • Global gene expression profile was analyzed by microarray analysis of rat liver RNA after acute acetaminophen (APAP) administration. A single dose of 1g/kg body weight of APAP was given orally, and the liver samples were obtained after 24, 48 h, and 2 weeks. Histopathologic and biochemical studies enabled the classification of the APAP effect into injury (24 and 48 h) and regeneration (2 weeks) stages. The expression levels of 4900 clones on a custom rat gene microarray were analyzed and 484 clones were differentially expressed with more than a 1.625-fold difference(which equals 0.7 in log2 scale) at one or more time points. Two hundred ninety seven clones were classified as injury-specific clones, while 149 clones as regeneration-specific ones. Characteristic gene expression profiles could be associated with APAP-induced gene expression changes in lipid metabolism, stress response, and protein metabolism. We established a global gene expression profile utilizing microarray analysis in rat liver upon acute APAP administration with a full chronological profile that not only covers injury stage but also later point of regeneration stage.

Finding Informative Genes From Microarray Gene Expression Data Using FIGER-test

  • Choi, Kyoung-Oak;Chung, Hwan-Mook
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.17 no.5
    • /
    • pp.707-711
    • /
    • 2007
  • Microarray gene expression data is believed to show the functions of living organism through the gene expression values. We have studied a method to get the informative genes from the microarray gene expression data. There are several ways for this. In recent researches to get more sophisticated and detailed results, it has used the intelligence information theory like fuzzy theory. Some methods are to add fudge factors to the significance test for more refined results. In this paper, we suggest a method to get informative genes from microarray gene expression data. We combined the difference of means between two groups and the fuzzy membership degree which reflects the variance of the gene expression data. We have called our significance test the Fuzzy Information method for Gene Expression data(FIGER). The FIGER calculates FIGER variation ratio and FIGER membership degree to show how strongly each object belongs to the each group and then it results in the significance degree of each gene. The FIGER is focused on the variation and distribution of the data set to adjust the significance level. Out simulation shows that the FIGER-test is an effective and useful significance test.

Genes expression monitoring using cDNA microarray: Protocol and Application

  • Muramatsu Masa-aki
    • Proceedings of the Korean Society of Toxicology Conference
    • /
    • 2000.11a
    • /
    • pp.31-41
    • /
    • 2000
  • The major issue in the post genome sequencing era is determination of gene expression patterns in variety of biological systems. A microarray system is a powerful technology for analyzing the expression profile of thousands of genes at one experiment. In this study, we constructed cDNA microarray which carries 2,304 cDNAS derived from oligo-capped mouse cDNA library. Using this hand-made microarray we determined gene expression in various biological systems. To determine tissue specific genes, we compared Nine genes were highly-expressed in adult mouse brain compared to kidney, liver, and skeletal muscle. Tissue distribution analysis using DNA microarray extracted 9 genes that were predominantly expressed in the brain. A database search showed that five of the 9 genes, MBP, SC1, HiAT3, S100 protein-beta, and SNAP25, were previously known to be expressed at high level in the brain and in the nervous system. One gene was highly sequence similar to rat S-Rex-s/human NSP-C, suggesting that the gene is a mouse homologue. The remaining three genes did not match to known genes in the GenBank/EMBL database, indicating that these are novel genes highly-expressed in the brain. Our DNA microarray was also used to detect differentiation specific genes, hormone dependent genes, and transcription-factor-induced genes. We conclude that DNA microarray is an excellent tool for identifying differentially expressed genes.

  • PDF

Veri cation of Improving a Clustering Algorith for Microarray Data with Missing Values

  • Kim, Su-Young
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.2
    • /
    • pp.315-321
    • /
    • 2011
  • Gene expression microarray data often include multiple missing values. Most gene expression analysis (including gene clustering analysis); however, require a complete data matric as an input. In ordinary clustering methods, just a single missing value makes one abandon the whole data of a gene even if the rest of data for that gene was intact. The quality of analysis may decrease seriously as the missing rate is increased. In the opposite aspect, the imputation of missing value may result in an artifact that reduces the reliability of the analysis. To clarify this contradiction in microarray clustering analysis, this paper compared the accuracy of clustering with and without imputation over several microarray data having different missing rates. This paper also tested the clustering efficiency of several imputation methods including our propose algorithm. The results showed it is worthwhile to check the clustering result in this alternative way without any imputed data for the imperfect microarray data.

Xperanto: A Web-Based Integrated System for DNA Microarray Data Management and Analysis

  • Park, Ji Yeon;Park, Yu Rang;Park, Chan Hee;Kim, Ji Hoon;Kim, Ju Ha
    • Genomics & Informatics
    • /
    • v.3 no.1
    • /
    • pp.39-42
    • /
    • 2005
  • DNA microarray is a high-throughput biomedical technology that monitors gene expression for thousands of genes in parallel. The abundance and complexity of the gene expression data have given rise to a requirement for their systematic management and analysis to support many laboratories performing microarray research. On these demands, we developed Xperanto for integrated data management and analysis using user-friendly web-based interface. Xperanto provides an integrated environment for management and analysis by linking the computational tools and rich sources of biological annotation. With the growing needs of data sharing, it is designed to be compliant to MGED (Microarray Gene Expression Data) standards for microarray data annotation and exchange. Xperanto enables a fast and efficient management of vast amounts of data, and serves as a communication channel among multiple researchers within an emerging interdisciplinary field.

Clustering Approaches to Identifying Gene Expression Patterns from DNA Microarray Data

  • Do, Jin Hwan;Choi, Dong-Kug
    • Molecules and Cells
    • /
    • v.25 no.2
    • /
    • pp.279-288
    • /
    • 2008
  • The analysis of microarray data is essential for large amounts of gene expression data. In this review we focus on clustering techniques. The biological rationale for this approach is the fact that many co-expressed genes are co-regulated, and identifying co-expressed genes could aid in functional annotation of novel genes, de novo identification of transcription factor binding sites and elucidation of complex biological pathways. Co-expressed genes are usually identified in microarray experiments by clustering techniques. There are many such methods, and the results obtained even for the same datasets may vary considerably depending on the algorithms and metrics for dissimilarity measures used, as well as on user-selectable parameters such as desired number of clusters and initial values. Therefore, biologists who want to interpret microarray data should be aware of the weakness and strengths of the clustering methods used. In this review, we survey the basic principles of clustering of DNA microarray data from crisp clustering algorithms such as hierarchical clustering, K-means and self-organizing maps, to complex clustering algorithms like fuzzy clustering.

Quality Control Usage in High-Density Microarrays Reveals Differential Gene Expression Profiles in Ovarian Cancer

  • Villegas-Ruiz, Vanessa;Moreno, Jose;Jacome-Lopez, Karina;Zentella-Dehesa, Alejandro;Juarez-Mendez, Sergio
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.17 no.5
    • /
    • pp.2519-2525
    • /
    • 2016
  • There are several existing reports of microarray chip use for assessment of altered gene expression in different diseases. In fact, there have been over 1.5 million assays of this kind performed over the last twenty years, which have influenced clinical and translational research studies. The most commonly used DNA microarray platforms are Affymetrix GeneChip and Quality Control Software along with their GeneChip Probe Arrays. These chips are created using several quality controls to confirm the success of each assay, but their actual impact on gene expression profiles had not been previously analyzed until the appearance of several bioinformatics tools for this purpose. We here performed a data mining analysis, in this case specifically focused on ovarian cancer, as well as healthy ovarian tissue and ovarian cell lines, in order to confirm quality control results and associated variation in gene expression profiles. The microarray data used in our research were downloaded from ArrayExpress and Gene Expression Omnibus (GEO) and analyzed with Expression Console Software using RMA, MAS5 and Plier algorithms. The gene expression profiles were obtained using Partek Genomics Suite v6.6 and data were visualized using principal component analysis, heat map, and Venn diagrams. Microarray quality control analysis showed that roughly 40% of the microarray files were false negative, demonstrating over- and under-estimation of expressed genes. Additionally, we confirmed the results performing second analysis using independent samples. About 70% of the significant expressed genes were correlated in both analyses. These results demonstrate the importance of appropriate microarray processing to obtain a reliable gene expression profile.

A modified partial least squares regression for the analysis of gene expression data with survival information

  • Lee, So-Yoon;Huh, Myung-Hoe;Park, Mira
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.5
    • /
    • pp.1151-1160
    • /
    • 2014
  • In DNA microarray studies, the number of genes far exceeds the number of samples and the gene expression measures are highly correlated. Partial least squares regression (PLSR) is one of the popular methods for dimensional reduction and known to be useful for the classifications of microarray data by several studies. In this study, we suggest a modified version of the partial least squares regression to analyze gene expression data with survival information. The method is designed as a new gene selection method using PLSR with an iterative procedure of imputing censored survival time. Mean square error of prediction criterion is used to determine the dimension of the model. To visualize the data, plot for variables superimposed with samples are used. The method is applied to two microarray data sets, both containing survival time. The results show that the proposed method works well for interpreting gene expression microarray data.