• Title/Summary/Keyword: microarray experiment

Large-Circular Single-stranded Sense and Antisense DNA for Identification of Cancer-Related Genes (장환형 단일가닥 DNA를 이용한 암세포 성장 억제 유전자 발굴)

  • Bae, Yun-Ui;Moon, Ik-Jae;Seu, Young-Bae;Doh, Kyung-Oh
    • Microbiology and Biotechnology Letters / v.38 no.1 / pp.70-76 / 2010
  • Single-stranded large circular (LC)-sense DNA molecules were used as probes for DNA chip experiments. A microarray experiment using LC-sense DNA probes identified genes differentially expressed in A549 cells compared with WI38VA13 cells, and the microarray data correlated well with data from quantitative real-time RT-PCR. A 5K LC-sense DNA microarray was prepared, and repeated experiments and a dye-swap test showed consistent expression patterns. Subsequent functional analysis using an LC-antisense library of the overexpressed genes identified several genes involved in A549 cell growth. These experiments demonstrated that LC-sense molecules are suitable as probe DNA for microarrays, and that combining LC-sense microarrays with antisense libraries is potentially useful for effective functional validation of genes.

Detecting survival related gene sets in microarray analysis (마이크로어레이 자료에서 생존과 유의한 관련이 있는 유전자집단 검색)

  • Lee, Sun-Ho;Lee, Kwang-Hyun
    • Journal of the Korean Data and Information Science Society / v.23 no.1 / pp.1-11 / 2012
  • When microarray experiments were first developed, the main interest was limited to detecting differentially expressed genes associated with a phenotype of interest. However, because human diseases are thought to arise through the interactions of multiple genes within the same functional category, the unit of analysis in microarray experiments has expanded to sets of genes. For the phenotype of censored survival time, Gene Set Enrichment Analysis (GSEA), the global test, and Wald-type tests are widely used. In this paper, we modify the Wald-type test by applying a normal score transformation to the gene expression values and develop a parametric test that requires much less computation than the others. The proposed method is compared with other methods using a real ovarian cancer data set and a simulated data set.
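
The normal score transformation mentioned in the abstract above can be illustrated with a minimal sketch. The abstract does not say which variant the authors used, so the rank offset (r - 0.5)/n, the average-rank handling of ties, and the function name `normal_scores` are all assumptions made for illustration.

```python
from statistics import NormalDist


def normal_scores(values):
    """Replace each value by the standard-normal quantile of its rank.

    Assumptions (not from the paper): ties receive their average rank,
    and rank r out of n is mapped to the probability (r - 0.5) / n.
    """
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    ranks = [0.0] * n
    i = 0
    while i < n:
        # Find the run of tied values starting at sorted position i.
        j = i
        while j + 1 < n and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # average 1-based rank of the run
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    std_normal = NormalDist()
    return [std_normal.inv_cdf((r - 0.5) / n) for r in ranks]


# Hypothetical expression values for one gene across three samples.
scores = normal_scores([5.0, 1.0, 3.0])
```

The transformed values depend only on the ranks, so extreme expression values cannot dominate a downstream parametric test.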

Simple Method to Correct Gene-Specific Dye Bias from Partial Dye Swap Information of a DNA Microarray Experiment

  • KIM BYUNG SOO;KANG SOO-JIN;LEE SAET-BYUL;HWANG WON;KIM KUN-SOO
    • Journal of Microbiology and Biotechnology / v.15 no.6 / pp.1377-1383 / 2005
  • In a cDNA microarray experiment using Cy3 and Cy5 as labeling agents, particularly in the direct design, cDNAs from some genes incorporate one dye more efficiently than the other; this is referred to as gene-specific dye bias. Dye swaps, in which the two dyes are switched on replicate arrays, are commonly used to control gene-specific dye bias. We developed a simple procedure to extract gene-specific dye bias information from a partial dye-swap experiment. We detected gene-specific dye bias by identifying outliers in an X-Y plane, where the X axis represents the average log-ratio from two sets of dye-swap pairs and the Y axis represents the average log-ratio of four forward-labeled arrays. We used this information to detect differentially expressed genes; the additionally detected genes were validated by real-time RT-PCR.
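
The X-Y outlier idea in the abstract above can be sketched as follows. This is an illustration under stated assumptions, not the authors' procedure: it assumes a dye-swap pair is summarized as (forward - reverse)/2, so that an additive dye bias cancels in X but remains in the forward-only average Y, and it uses a median/MAD cutoff as a stand-in outlier rule. The function names and the cutoff `k` are hypothetical.

```python
from statistics import median


def swap_pair_log_ratio(forward, reverse):
    """Dye-bias-free log-ratio from one dye-swap pair (assumed form).

    With true effect t and additive dye bias b, forward = t + b and
    reverse = -t + b, so (forward - reverse) / 2 = t.
    """
    return (forward - reverse) / 2


def flag_dye_bias(x, y, k=3.0):
    """Flag genes whose residual Y - X is an outlier.

    x: per-gene average log-ratio from the dye-swap pairs.
    y: per-gene average log-ratio of the forward-labeled arrays.
    The median/MAD rule and the cutoff k are assumptions, not taken
    from the paper.
    """
    residuals = [yi - xi for xi, yi in zip(x, y)]
    center = median(residuals)
    mad = median(abs(r - center) for r in residuals)
    scale = 1.4826 * mad  # makes the MAD comparable to a normal SD
    if scale == 0:
        return [False] * len(residuals)
    return [abs(r - center) > k * scale for r in residuals]


# Hypothetical per-gene averages; the last gene has a large Y - X gap.
x = [0.0, 1.0, -1.0, 0.5, 2.0, 0.2]
y = [0.1, 0.9, -1.1, 0.6, 2.0, 2.2]
flags = flag_dye_bias(x, y)
```

Under these assumptions, genes without dye bias lie near the line Y = X, and the residual Y - X estimates the gene-specific bias.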

Building a Classifier for Integrated Microarray Datasets through Two-Stage Approach (2 단계 접근법을 통한 통합 마이크로어레이 데이타의 분류기 생성)

  • Yoon, Young-Mi;Lee, Jong-Chan;Park, Sang-Hyun
    • Journal of KIISE:Databases / v.34 no.1 / pp.46-58 / 2007
  • Because microarray data capture tens of thousands of gene expression values simultaneously, they can be very useful for identifying disease phenotypes. However, analyses of several microarray datasets that were generated independently with the same biological objective can yield different results. One of the main reasons is the limited number of samples in a single microarray experiment. To increase classification accuracy, it is desirable to enlarge the sample size by integrating independently conducted microarray datasets and making maximal use of them. In this paper, we propose a novel two-stage approach that first integrates individual microarray datasets to overcome the limited sample size and identifies informative genes, and then builds a classifier using only the informative genes. The classifier built from the larger sample obtained by integrating independent microarray datasets achieves high accuracy (up to a 24.19% increase over the comparison methods), sensitivity, and specificity on an independent test dataset.

A Graph Model and Analysis Algorithm for cDNA Microarray Image (cDNA 마이크로어레이 이미지를 위한 그래프 모델과 분석 알고리즘)

  • Jung, Ho-Youl;Hwang, Mi-Nyeong;Yu, Young-Jung;Cho, Hwan-Gue
    • Journal of KIISE:Computer Systems and Theory / v.29 no.7 / pp.411-421 / 2002
  • In this paper we propose a new image analysis algorithm for microarray processing and a method to locate the position of each grid cell using the topology of the grid spots. A microarray is a device that enables parallel experiments on 10 to 100 thousand test genes in order to measure gene expression. Because of the huge amount of data produced by an experiment, automated image analysis is needed. The final output of a microarray experiment is a set of 16-bit gray-level image files consisting of grid-structured spots. We propose an algorithm that locates the address of each spot (spot indices) using a graph structure built from the image data, and a method that determines the precise location and shape of each spot by measuring the inclination of the grid structure. Several experiments on real data sets are presented.

Effect of missing values in detecting differentially expressed genes in a cDNA microarray experiment

  • Kim, Byung-Soo;Rha, Sun-Young
    • Bioinformatics and Biosystems / v.1 no.1 / pp.67-72 / 2006
  • The aim of this paper is to discuss the effect of missing values on the detection of differentially expressed genes in a cDNA microarray experiment in the context of a one-sample problem. We conducted a cDNA microarray experiment to detect differentially expressed genes for the metastasis of colorectal cancer, based on twenty patients who underwent liver resection due to liver metastasis from colorectal cancer. Total RNAs from metastatic liver tumor and adjacent normal liver tissue from a single patient were labeled with Cy5 and Cy3, respectively, and competitively hybridized to a cDNA microarray with 7775 human genes. We used $M=\log_2(R/G)$ for the signal evaluation, where R and G denote the fluorescent intensities of the Cy5 and Cy3 dyes, respectively. The statistical problem is a one-sample test of E(M)=0 for each gene and involves multiple testing. The twenty cDNA microarrays would form a matrix of dimension 7775 by 20 if there were no missing values. However, missing values occur for various reasons. For each gene, the no missing proportion (NMP) was defined as the proportion of non-missing values out of twenty. In detecting differentially expressed (DE) genes, we first used the genes whose NMP was greater than or equal to 0.4 and then increased the NMP threshold sequentially by 0.1 to investigate its effect on the detection of DE genes. For each fixed NMP, we imputed the missing values with the K-nearest neighbor method (K=10) and applied the nonparametric t-test of Dudoit et al. (2002), SAM by Tusher et al. (2001), and the empirical Bayes procedure of Lönnstedt and Speed (2002) to assess the effect of missing values on the final outcome. These three procedures yielded substantially consistent results in detecting DE genes. Of the three, we used SAM to explore the acceptable NMP level. The optimum NMP for this data set turned out to be 80%. It is more desirable to find the optimum NMP level for each data set by applying the method described in this note when the plot of (NMP, number of overlapping genes) shows a turning point.
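
The two quantities defined in the abstract above, the log-ratio M and the no missing proportion (NMP), can be sketched in a few lines. This is a minimal illustration, not the authors' code: the function names, the use of `None` to mark a missing spot, and the rule that non-positive intensities count as missing are assumptions.

```python
import math


def log_ratio(red, green):
    """M = log2(R / G), or None when the spot is missing.

    Treating None or non-positive intensities as missing is an
    assumption made for this sketch.
    """
    if red is None or green is None or red <= 0 or green <= 0:
        return None
    return math.log2(red / green)


def no_missing_proportion(m_values):
    """Proportion of non-missing M values for one gene across arrays."""
    return sum(v is not None for v in m_values) / len(m_values)


def filter_genes_by_nmp(m_matrix, threshold):
    """Keep genes whose NMP is greater than or equal to the threshold."""
    return {
        gene: row
        for gene, row in m_matrix.items()
        if no_missing_proportion(row) >= threshold
    }


# Hypothetical M values for two genes across five arrays.
m_matrix = {
    "gene_a": [0.5, None, 1.0, 0.2, None],   # NMP = 0.6
    "gene_b": [None, None, None, 0.1, None],  # NMP = 0.2
}
kept = filter_genes_by_nmp(m_matrix, threshold=0.4)
```

Repeating the filter with thresholds 0.4, 0.5, 0.6 and so on, then imputing and testing at each level, mirrors the sequential scheme the abstract describes; the imputation and testing steps are not shown here.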

Print-tip Normalization for DNA Microarray Data (DNA 마이크로어레이 자료의 PRINT-TIP별 표준화(NORMALIZATION) 방법)

  • Yi Sung-Gon;Park Taesung;Kang Sung Hyun;Lee Seung-Yeaun;Lee Yang Sung
    • The Korean Journal of Applied Statistics / v.18 no.1 / pp.115-127 / 2005
  • DNA microarray experiments allow us to study the expression of thousands of genes simultaneously. Normalization is the process of removing noise that arises during a microarray experiment. The print-tip is regarded as one of the main sources of noise. In this paper, we review the normalization methods most commonly used in microarray experiments. In particular, we investigate the effects of print-tips using simulated data sets.
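
To make the idea of print-tip normalization concrete, here is a deliberately simplified sketch that centers log-ratios within each print-tip group by subtracting the group median. The abstract does not list the methods it reviews, so this median-centering rule is a stand-in chosen for brevity (intensity-dependent loess fits per print-tip are a common alternative), and the function name is hypothetical.

```python
from collections import defaultdict
from statistics import median


def print_tip_median_normalize(log_ratios, print_tips):
    """Subtract each print-tip group's median from its log-ratios.

    log_ratios: one log-ratio per spot.
    print_tips: the print-tip identifier of each spot, same order.
    Median centering is an assumption for this sketch, not a method
    taken from the paper.
    """
    groups = defaultdict(list)
    for m, tip in zip(log_ratios, print_tips):
        groups[tip].append(m)
    tip_median = {tip: median(vals) for tip, vals in groups.items()}
    return [m - tip_median[tip] for m, tip in zip(log_ratios, print_tips)]


# Hypothetical spots: print-tip 1 reads systematically higher than tip 2.
log_ratios = [1.0, 2.0, 3.0, -1.0, 0.0, 1.0]
print_tips = [1, 1, 1, 2, 2, 2]
normalized = print_tip_median_normalize(log_ratios, print_tips)
```

After centering, both print-tip groups share a median of zero, which removes a constant print-tip offset but not any intensity-dependent trend.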

Standard-based Integration of Heterogeneous Large-scale DNA Microarray Data for Improving Reusability

  • Jung, Yong;Seo, Hwa-Jeong;Park, Yu-Rang;Kim, Ji-Hun;Bien, Sang Jay;Kim, Ju-Han
    • Genomics & Informatics / v.9 no.1 / pp.19-27 / 2011
  • Gene Expression Omnibus (GEO) holds the largest collection of gene-expression microarray data, which has grown exponentially. Microarray data in GEO have been generated in many different formats and often lack standardized annotation and documentation. It is hard to know whether preprocessing has been applied to a dataset and, if so, in what way. Standard-based integration of heterogeneous data formats and metadata is necessary for comprehensive data query, analysis, and mining. We attempted to integrate the heterogeneous microarray data in GEO based on the Minimum Information About a Microarray Experiment (MIAME) standard. We unified the data fields of the GEO Data table and mapped the attributes of GEO metadata onto MIAME elements. We also distinguished non-preprocessed raw datasets from processed datasets and others by using a two-step classification method. Most of the procedures were developed as semi-automated algorithms with some use of text-mining techniques. We localized 2,967 Platforms, 4,867 Series, and 103,590 Samples covering 279 organisms, integrated them into a standard-based relational schema, and developed a comprehensive query interface to extract them. Our tool, GEOQuest, is available at http://www.snubi.org/software/GEOQuest/.

Statistical Analysis of Gene Expression Data

  • Park, Taesung (박태성)
    • Proceedings of the Korean Society for Bioinformatics Conference / 2001.10a / pp.97-115 / 2001
  • cDNA microarray technology allows the monitoring of expression levels for thousands of genes simultaneously. Many statistical analysis tools have become widely applicable to the analysis of cDNA microarray data. In this talk, we consider a two-way ANOVA model to distinguish genes that have high variability from those that do not. Using this model, we detect genes that have different expression profiles among experimental groups. The two-way ANOVA model is illustrated using cDNA microarrays of 3,800 genes obtained in an experiment to search for changes in gene expression profiles during neuronal differentiation of cortical stem cells.

A Bayesian Multiple Testing of Detecting Differentially Expressed Genes in Two-sample Comparison Problem

  • Oh Hyun-Sook;Yang Wan-Youn
    • Communications for Statistical Applications and Methods / v.13 no.1 / pp.39-47 / 2006
  • The Bayesian approach to multiple testing for the one-sample testing problem proposed by Scott and Berger (2003) is extended to the two-sample comparison problem in microarray experiments. The prior distribution of each gene's mean for one sample is specified conditionally on the corresponding gene's mean for the other sample. Posterior distributions of the parameters of interest are derived and estimated using an importance sampling method. A simulated example is given for illustration.