• Title/Summary/Keyword: Gene analysis

Correlation Analysis between Regulatory Sequence Motifs and Expression Profiles by Kernel CCA

  • Rhee, Je-Keun;Joung, Je-Gun;Chang, Jeong-Ho;Zhang, Byoung-Tak
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2005.09a
    • /
    • pp.63-68
    • /
    • 2005
  • Transcription factors regulate gene expression by binding to the upstream regions of genes, and each transcription factor has a specific binding site in the promoter region. Analysis of gene upstream sequences is therefore necessary for understanding the regulatory mechanisms of genes, under the plausible assumption that DNA sequence motif profiles are closely related to the expression behavior of the corresponding genes. Here, we present an effective approach to analyzing the relation between gene expression profiles and gene upstream sequences on the basis of kernel canonical correlation analysis (kernel CCA). Kernel CCA is a useful method for finding relationships underlying two different data sets. In an application to a yeast cell cycle data set, the gene upstream sequence profile is shown to be closely related to gene expression patterns in terms of canonical correlation scores. By further analyzing the weights that sequence motifs contribute to the construction of a pair of sequence motif profiles and expression profiles, we show that the proposed method can identify significant DNA sequence motifs involved in specific gene expression patterns during the yeast cell cycle, including both well-known and putative motifs.

Genetic characterization and phylogenetic analysis of Clostridium chauvoei isolated from Hanwoo in Jeonbuk (전북지역 한우에서 분리한 기종저 균의 유전학적 특성 규명)

  • Kim, Chul-Min;Jeong, Jae-Myong;Choi, Ki-Young
    • Korean Journal of Veterinary Service
    • /
    • v.37 no.3
    • /
    • pp.157-164
    • /
    • 2014
  • Clostridium chauvoei is the etiologic agent of blackleg, a high-mortality disease that mainly infects cattle. In the present study, partial sequences of the 16S rRNA and flagellin genes of C. chauvoei isolated in Jeonbuk, Korea were determined and compared with those of reference strains. Oligonucleotide primers were designed to amplify an 811 bp fragment of the 16S rRNA gene and a 1229 bp fragment of the flagellin gene. Sequence analysis of the 16S rRNA gene showed high homology to the reference strains, ranging from 82.3% to 100%, while the flagellin gene differed from those of published foreign clostridia, showing 72.0% to 98.7% nucleotide sequence homology. Phylogenetic analysis based on the 16S rRNA gene revealed a close phylogenetic relationship between C. chauvoei and C. septicum in cluster I, which includes C. carnis, C. tertium, C. quinii, C. celatum, C. perfringens, C. absonum, and C. botulinum B. Phylogenetic analysis also revealed that the flagellin gene formed a single cluster with C. chauvoei, C. septicum, C. novyi A, C. novyi B, C. tyrobutylicum, and C. acetobutylicum. The genetic information obtained from this study could be useful for the molecular study of C. chauvoei.

Application of Crossover Analysis-logistic Regression in the Assessment of Gene-environmental Interactions for Colorectal Cancer

  • Wu, Ya-Zhou;Yang, Huan;Zhang, Ling;Zhang, Yan-Qi;Liu, Ling;Yi, Dong;Cao, Jia
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.13 no.5
    • /
    • pp.2031-2037
    • /
    • 2012
  • Background: Analysis of gene-gene and gene-environment interactions for complex multifactorial human disease faces challenges regarding statistical methodology. One major difficulty is partly due to the limitations of parametric statistical methods in detecting gene effects that depend solely or partially on interactions with other genes or environmental exposures. In our previous case-control study in Chongqing, China, we found an increased risk of colorectal cancer in individuals carrying a novel homozygous TT genotype at locus rs1329149 and the known homozygous AA genotype at locus rs671. Methods: In this study, we propose a statistical method, crossover analysis in combination with a logistic regression model, to further analyze our data and to assess gene-environmental interactions for colorectal cancer. Results: The crossover analysis showed possible multiplicative interactions of loci rs671 and rs1329149 with alcohol consumption. Multifactorial logistic regression analysis also confirmed that loci rs671 and rs1329149 each exhibited a multiplicative interaction with alcohol consumption. Moreover, the crossover analysis revealed additive interactions between every pair of the four risk factors (gene loci rs671 and rs1329149, age, and alcohol consumption), which were not evident in logistic regression. Conclusions: The method based on crossover analysis-logistic regression is successful in assessing additive and multiplicative gene-environment interactions, and in revealing synergistic effects of gene loci rs671 and rs1329149 with alcohol consumption in the pathogenesis and development of colorectal cancer.
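
The abstract above does not spell out how its interaction measures are computed. As a rough illustration only, the sketch below uses the standard epidemiological measures of additive interaction (RERI, attributable proportion, synergy index) and a simple multiplicative ratio, computed from the odds ratios of a four-cell crossover table. The counts, the function names, and the choice of measures are all assumptions made for this example, not values or methods taken from the study.

```python
def odds_ratio(cases, controls, ref_cases, ref_controls):
    """Odds ratio of one exposure group against the unexposed reference group."""
    return (cases * ref_controls) / (controls * ref_cases)


def interaction_measures(or10, or01, or11):
    """Interaction measures from three odds ratios (reference group OR = 1).

    or10: risk genotype only, or01: exposure only, or11: both factors present.
    The synergy index is undefined when or10 + or01 == 2.
    """
    reri = or11 - or10 - or01 + 1                 # relative excess risk due to interaction
    ap = reri / or11                              # attributable proportion due to interaction
    s = (or11 - 1) / ((or10 - 1) + (or01 - 1))    # synergy index
    mult = or11 / (or10 * or01)                   # > 1 suggests multiplicative interaction
    return {"RERI": reri, "AP": ap, "S": s, "multiplicative": mult}


# Hypothetical (cases, controls) counts for the four cells of a crossover table.
table = {
    "neither": (50, 100),
    "genotype_only": (40, 40),
    "alcohol_only": (60, 40),
    "both": (80, 20),
}
ref = table["neither"]
or10 = odds_ratio(*table["genotype_only"], *ref)
or01 = odds_ratio(*table["alcohol_only"], *ref)
or11 = odds_ratio(*table["both"], *ref)
result = interaction_measures(or10, or01, or11)
```

With these made-up counts, a RERI above 0 would point to additive interaction and a multiplicative ratio above 1 to multiplicative interaction; a real analysis would also need confidence intervals and adjustment for covariates.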

Identification of key genes and functional enrichment analysis of liver fibrosis in nonalcoholic fatty liver disease through weighted gene co-expression network analysis

  • Yue Hu;Jun Zhou
    • Genomics & Informatics
    • /
    • v.21 no.4
    • /
    • pp.45.1-45.11
    • /
    • 2023
  • Nonalcoholic fatty liver disease (NAFLD) is a common type of chronic liver disease, with severity levels ranging from nonalcoholic fatty liver to nonalcoholic steatohepatitis (NASH). The extent of liver fibrosis indicates the severity of NASH and the risk of liver cancer. However, the mechanism underlying NASH development, which is important for early screening and intervention, remains unclear. Weighted gene co-expression network analysis (WGCNA) is a useful method for identifying hub genes and screening specific targets for diseases. In this study, we utilized an mRNA dataset of the liver tissues of patients with NASH and conducted WGCNA for various stages of liver fibrosis. Subsequently, we employed two additional mRNA datasets for validation purposes. Gene set enrichment analysis (GSEA) was conducted to analyze gene function enrichment. Through WGCNA and subsequent analyses, complemented by validation using two additional datasets, we identified five genes (BICC1, C7, EFEMP1, LUM, and STMN2) as hub genes. GSEA analysis indicated that gene sets associated with liver metabolism and cholesterol homeostasis were uniformly downregulated. BICC1, C7, EFEMP1, LUM, and STMN2 were identified as hub genes of NASH, and were all related to liver metabolism, NAFLD, NASH, and related diseases. These hub genes might serve as potential targets for the early screening and treatment of NASH.
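
To make the idea of a hub gene concrete, the following minimal sketch computes the soft-thresholded adjacency commonly used in unsigned WGCNA networks, a_ij = |cor(x_i, x_j)|^β, and sums it per gene to obtain connectivity. The expression values, the gene names, and β = 6 are assumptions for illustration; module detection, topological overlap, and the trait correlations that a full WGCNA analysis involves are not shown.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def connectivity(expr, beta=6):
    """Sum of soft-thresholded adjacencies |r|**beta to all other genes."""
    genes = list(expr)
    return {
        g: sum(abs(pearson(expr[g], expr[h])) ** beta for h in genes if h != g)
        for g in genes
    }


# Hypothetical expression values across five samples.
expr = {
    "geneA": [1.0, 2.0, 3.0, 4.0, 5.0],
    "geneB": [2.0, 4.0, 6.0, 8.0, 10.0],
    "geneC": [5.0, 3.0, 4.0, 1.0, 2.0],
}
k = connectivity(expr)
ranked = sorted(k, key=k.get, reverse=True)  # most connected genes first
```

Genes at the top of `ranked` are the hub candidates in this toy network.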

Gene Expression Analysis of Acetaminophen-induced Liver Toxicity in Rat (아세트아미노펜에 의해 간손상이 유발된 랫드의 유전자 발현 분석)

  • Chung, Hee-Kyoung
    • Toxicological Research
    • /
    • v.22 no.4
    • /
    • pp.323-328
    • /
    • 2006
  • The global gene expression profile of rat liver RNA after acute acetaminophen (APAP) administration was analyzed by microarray. A single dose of 1 g/kg body weight of APAP was given orally, and liver samples were obtained after 24 h, 48 h, and 2 weeks. Histopathologic and biochemical studies enabled the classification of the APAP effect into injury (24 and 48 h) and regeneration (2 weeks) stages. The expression levels of 4900 clones on a custom rat gene microarray were analyzed, and 484 clones were differentially expressed with more than a 1.625-fold difference (which equals 0.7 on a log2 scale) at one or more time points. Two hundred ninety-seven clones were classified as injury-specific and 149 as regeneration-specific. Characteristic gene expression profiles could be associated with APAP-induced changes in lipid metabolism, stress response, and protein metabolism. Using microarray analysis, we established a global gene expression profile of rat liver after acute APAP administration, with a full chronological profile that covers not only the injury stage but also the later regeneration stage.
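
The cutoff quoted above can be checked directly: log2(1.625) is about 0.70. The short sketch below applies that threshold to a few hypothetical clones. The clone names and log2 ratios are invented, and treating both up- and down-regulation as "differential" is an assumption, since the abstract does not state the direction.

```python
import math

FOLD_CUTOFF = 1.625
LOG2_CUTOFF = math.log2(FOLD_CUTOFF)  # about 0.70


def is_differential(log2_ratios, cutoff=LOG2_CUTOFF):
    """True if the clone exceeds the cutoff in either direction at any time point."""
    return any(abs(r) > cutoff for r in log2_ratios)


# Hypothetical log2 expression ratios at 24 h, 48 h, and 2 weeks.
clones = {
    "clone_1": [0.9, 0.4, 0.1],
    "clone_2": [0.2, -0.3, 0.1],
    "clone_3": [-0.1, 0.0, -1.2],
}
selected = [name for name, ratios in clones.items() if is_differential(ratios)]
```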

A Method for Gene Group Analysis and Its Application (유전자군 분석의 방법론과 응용)

  • Lee, Tae-Won;Delongchamp, Robert R.
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.2
    • /
    • pp.269-277
    • /
    • 2012
  • In microarray data analysis, recent efforts have focused on discovering gene sets from a pathway or from functional categories such as Gene Ontology terms (GO terms), rather than on individual gene function, because gene sets allow direct interpretation of genome-wide expression data. We introduce a meta-analysis method that combines the $p$-values for the change of each gene in the group. The method measures the significance of the overall treatment-induced change in a gene group. An application of the method to a real data set demonstrates that it has benefits over other statistical methods such as Fisher's exact test and permutation methods. The method is implemented in a SAS program, which is available on the author's homepage (http://cafe.daum.net/go.analysis).
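
The abstract does not say which rule is used to combine the per-gene $p$-values. As a stand-in only, the sketch below uses Fisher's combined probability method (not to be confused with Fisher's exact test mentioned above): the statistic X = -2 Σ ln p_i follows a chi-square distribution with 2k degrees of freedom when the k $p$-values are independent and uniform under the null. The function name is hypothetical, and the independence assumption rarely holds exactly for co-regulated genes.

```python
import math


def fisher_combined_p(p_values):
    """Combine independent p-values (each in (0, 1]) with Fisher's method.

    For even degrees of freedom 2k, the chi-square survival function has the
    closed form exp(-x/2) * sum_{i<k} (x/2)**i / i!, so no SciPy is needed.
    """
    k = len(p_values)
    half_stat = -sum(math.log(p) for p in p_values)  # equals X / 2
    term, total = 1.0, 1.0
    for i in range(1, k):
        term *= half_stat / i
        total += term
    return math.exp(-half_stat) * total


# Hypothetical per-gene p-values for one gene group.
group_p = fisher_combined_p([0.04, 0.20, 0.01, 0.35])
```

A single $p$-value is returned unchanged, which is a convenient sanity check for the closed form.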

Verification of Improving a Clustering Algorithm for Microarray Data with Missing Values

  • Kim, Su-Young
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.2
    • /
    • pp.315-321
    • /
    • 2011
  • Gene expression microarray data often include multiple missing values. Most gene expression analyses (including gene clustering analysis), however, require a complete data matrix as input. In ordinary clustering methods, a single missing value forces one to abandon all the data for a gene even if the rest of the data for that gene are intact. The quality of the analysis may decrease seriously as the missing rate increases. On the other hand, imputation of missing values may introduce artifacts that reduce the reliability of the analysis. To clarify this contradiction in microarray clustering analysis, this paper compared the accuracy of clustering with and without imputation over several microarray data sets with different missing rates. This paper also tested the clustering efficiency of several imputation methods, including our proposed algorithm. The results showed that, for imperfect microarray data, it is worthwhile to check the clustering result in this alternative way, without any imputed data.
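
One simple way to cluster without imputing is to compute distances only over the positions observed in both profiles. The sketch below shows that idea; it is an illustrative assumption, not the algorithm proposed in the paper, and the function name and the use of `None` for missing values are choices made for this example.

```python
import math


def pairwise_distance(a, b):
    """Root-mean-square difference over positions observed in both profiles.

    Missing values are marked with None. Returns None when the two profiles
    share no observed position, because the distance is then undefined.
    """
    shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
    if not shared:
        return None
    return math.sqrt(sum((x - y) ** 2 for x, y in shared) / len(shared))


# Hypothetical expression profiles; the second array of gene_1 is missing.
gene_1 = [1.0, None, 3.0]
gene_2 = [1.0, 2.0, 5.0]
d = pairwise_distance(gene_1, gene_2)
```

Dividing by the number of shared positions keeps distances comparable between gene pairs with different amounts of missing data, so a gene with one missing value can still enter the distance matrix.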

Nucleotide Sequence Analysis of Movement Protein Gene from Tobacco Mosaic Virus Korean Pepper (TMV-KP) Strain (담배 모자이크 바이러스 한국고추계통에서 분리한 이동 단백질 유전자의 염기서열 분석)

  • 이재열;정동수;장무웅;최장경
    • Korean Journal Plant Pathology
    • /
    • v.11 no.1
    • /
    • pp.87-90
    • /
    • 1995
  • Complementary DNA of the movement protein (MP) gene of the tobacco mosaic virus Korean pepper strain (TMV-KP) was synthesized from purified TMV-KP RNA using reverse transcription and the polymerase chain reaction (PCR). The synthesized double-stranded cDNA was cloned into the plasmid pUC9 and transformed into Escherichia coli JM110. The MP gene of TMV-KP from the selected clones was sequenced by Sanger's dideoxy chain termination method. The complete sequence of the viral MP gene from the TMV-KP strain was 807 nucleotides long. The MP gene of TMV-KP has thirteen and two nucleotide differences from those of the TMV vulgare (TMV-OM) and Korean (TMV-K) strains, respectively. Thus, the nucleotide sequence of the TMV-KP MP gene showed the higher homology, 99%, with that of the TMV-K MP gene.
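
The difference counts above translate into percent identity with simple arithmetic. The sketch below assumes the differences are substitutions in an ungapped alignment spanning all 807 nucleotides, which the abstract does not state explicitly; the helper names are hypothetical.

```python
def count_differences(seq_a, seq_b):
    """Number of mismatched positions between two aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for a, b in zip(seq_a, seq_b) if a != b)


def percent_identity(length, differences):
    """Percentage of identical positions over the aligned length."""
    return 100.0 * (length - differences) / length


MP_LENGTH = 807  # length of the TMV-KP MP gene reported in the abstract
identity_k = percent_identity(MP_LENGTH, 2)    # versus TMV-K
identity_om = percent_identity(MP_LENGTH, 13)  # versus TMV-OM
```

Under these assumptions, two differences correspond to roughly 99.75% identity and thirteen differences to roughly 98.39%.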

Developing a Parametric Method for Testing the Significance of Gene Sets in Microarray Data Analysis (마이크로어레이 자료분석에서 모수적 방법을 이용한 유전자군의 유의성 검정)

  • Lee, Sun-Ho;Lee, Seung-Kyu;Lee, Kwang-Hyun
    • Communications for Statistical Applications and Methods
    • /
    • v.16 no.3
    • /
    • pp.397-408
    • /
    • 2009
  • The development of microarray technology makes it possible to analyze many thousands of genes simultaneously. While it is important to test whether each gene shows changes in expression associated with a phenotype, human diseases are thought to occur through the interactions of multiple genes within the same functional category. Recent research aims to directly test the behavior of sets of functionally related genes instead of focusing on single genes. Gene set enrichment analysis (GSEA), significance analysis of microarray to gene-set analysis (SAM-GS), and parametric analysis of gene set enrichment (PAGE) have been widely applied as tools for gene-set analysis. We describe their problems and propose an alternative method based on a parametric analysis that adopts a normal score transformation of gene expression values. The performance of the newly derived method is compared with that of previous methods on three real microarray datasets.
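
A normal score transformation replaces each value with the standard normal quantile of its rank. The sketch below uses the (rank - 0.5)/n plotting position and a simple gene-set statistic, the sum of member scores divided by the square root of the set size. Both choices are assumptions for illustration: the abstract does not give the authors' exact transformation or test statistic, and ties are not handled here.

```python
import math
from statistics import NormalDist


def normal_scores(values):
    """Rank-based normal scores using the (rank - 0.5) / n plotting position.

    Assumes there are no tied values.
    """
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    scores = [0.0] * n
    for rank, idx in enumerate(order, start=1):
        scores[idx] = NormalDist().inv_cdf((rank - 0.5) / n)
    return scores


def gene_set_z(scores, members):
    """Sum of member scores scaled by sqrt(set size).

    Roughly standard normal if the scores behave like independent N(0, 1) draws.
    """
    return sum(scores[i] for i in members) / math.sqrt(len(members))


# Hypothetical per-gene statistics; genes 1 and 3 form the gene set of interest.
stats = [0.3, 2.1, -0.7, 1.8, -1.2, 0.1]
scores = normal_scores(stats)
z = gene_set_z(scores, members=[1, 3])
```

Because the transformed scores are approximately normal by construction, a parametric test on `z` does not depend on the original distribution of the expression statistics.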

Investigation of gene-gene interactions of clock genes for chronotype in a healthy Korean population

  • Park, Mira;Kim, Soon Ae;Shin, Jieun;Joo, Eun-Jeong
    • Genomics & Informatics
    • /
    • v.18 no.4
    • /
    • pp.38.1-38.9
    • /
    • 2020
  • Chronotype is an important moderator of psychiatric illnesses and seems to be controlled in part by genetic factors. Clock genes are the most relevant genes for chronotype. In addition to the roles of individual genes, gene-gene interactions of clock genes substantially contribute to chronotype. We investigated genetic associations and gene-gene interactions of the clock genes BHLHB2, CLOCK, CSNK1E, NR1D1, PER1, PER2, PER3, and TIMELESS for chronotype in 1,293 healthy Korean individuals. Regression analysis was conducted to find associations between single nucleotide polymorphisms (SNPs) and chronotype. For gene-gene interaction analyses, the quantitative multifactor dimensionality reduction (QMDR) method, a nonparametric model-free method for quantitative phenotypes, was performed. No individual SNP or haplotype showed a significant association with chronotype in either the regression analysis or the single-locus model of QMDR. QMDR analysis identified NR1D1 rs2314339 and TIMELESS rs4630333 as the best SNP pair among two-locus interaction models associated with chronotype (cross-validation consistency [CVC] = 8/10, p = 0.041). For the three-locus interaction model, the SNP combination of NR1D1 rs2314339, TIMELESS rs4630333, and PER3 rs228669 showed the best results (CVC = 4/10, p < 0.001). However, because the mean differences between genotype combinations were minor, the clinical roles of clock gene interactions are unlikely to be critical.