• 제목/요약/키워드: gene interaction networks

검색결과 37건 처리시간 0.031초

CONSTRUCTING GENE REGULATORY NETWORK USING FREQUENT GENE EXPRESSION PATTERN MINING AND CHAIN RULES

  • Park, Hong-Kyu;Lee, Heon-Gyu;Cho, Kyung-Hwan;Ryu, Keun-Ho
    • 대한원격탐사학회:학술대회논문집
    • /
    • 대한원격탐사학회 2006년도 Proceedings of ISRS 2006 PORSEC Volume II
    • /
    • pp.623-626
    • /
    • 2006
  • Group of genes controls the functioning of a cell by complex interactions. These interacting gene groups are called Gene Regulatory Networks (GRNs). Two previous data mining approaches, clustering and classification have been used to analyze gene expression data. While these mining tools are useful for determining membership of genes by homology, they don't identify the regulatory relationships among genes found in the same class of molecular actions. Furthermore, we need to understand the mechanism of how genes relate and how they regulate one another. In order to detect regulatory relationships among genes from time-series Microarray data, we propose a novel approach using frequent pattern mining and chain rule. In this approach, we propose a method for transforming gene expression data to make suitable for frequent pattern mining, and detect gene expression patterns applying FP-growth algorithm. And then, we construct gene regulatory network from frequent gene patterns using chain rule. Finally, we validated our proposed method by showing that our experimental results are consistent with published results.

  • PDF

Gene annotation by the "interactome"analysis in KEGG

  • Kanehisa, Minoru
    • 한국생물정보학회:학술대회논문집
    • /
    • 한국생물정보시스템생물학회 2000년도 International Symposium on Bioinformatics
    • /
    • pp.56-58
    • /
    • 2000
  • Post-genomics may be defined in different ways depending on how one views the challenges after the genome. A popular view is to follow the concept of the central dogma in molecular biology, namely from genome to transcriptome to proteome. Projects are going on to analyze gene expression profiles both at the mRNA and protein levels and to catalog protein 3D structure families, which will no doubt help the understanding of information in the genome. However complete, such catalogs of genes, RNAs, and proteins only tell us about the building blocks of life. They do not tell us much about the wiring (interaction) of building blocks, which is essential for uncovering systemic functional behaviors of the cell or the organism. Thus, an alternative view of post-genomics is to go up from the molecular level to the cellular level, and to understand, what I call, the "interactome"or a complete picture of molecular interactions in the cell. KEGG (http://www.genome.ad.jp/kegg/) is our attempt to computerize current knowledge on various cellular processes as a collection of "generalized"protein-protein interaction networks, to develop new graph-based algorithms for predicting such networks from the genome information, and to actually reconstruct the interactomes for all the completely sequenced genomes and some partial genomes. During the reconstruction process, it becomes readily apparent that certain pathways and molecular complexes are present or absent in each organism, indicating modular structures of the interactome. In addition, the reconstruction uncovers missing components in an otherwise complete pathway or complex, which may result from misannotation of the genome or misrepresentation of the KEGG pathway. When combined with additional experimental data on protein-protein interactions, such as by yeast two-hybrid systems, the reconstruction possibly uncovers unknown partners for a particular pathway or complex. Thus, the reconstruction is tightly coupled with the annotation of individual genes, which is maintained in the GENES database in KEGG. We are also trying to expand our literature surrey to include in the GENES database most up-to-date information about gene functions.

  • PDF

Sequence-based 5-mers highly correlated to epigenetic modifications in genes interactions

  • Salimi, Dariush;Moeini, Ali;Masoudi?Nejad, Ali
    • Genes and Genomics
    • /
    • 제40권12호
    • /
    • pp.1363-1371
    • /
    • 2018
  • One of the main concerns in biology is extracting sophisticated features from DNA sequence for gene interaction determination, receiving a great deal of researchers' attention. The epigenetic modifications along with their patterns have been intensely recognized as dominant features affecting on gene expression. However, studying sequenced-based features highly correlated to this key element has remained limited. The main objective in this research was to propose a new feature highly correlated to epigenetic modifications capable of classification of genes. In this paper, classification of 34 genes in PPAR signaling pathway associated with muscle fat tissue in human was performed. Using different statistical outlier detection methods, we proposed that 5-mers highly correlated to epigenetic modifications can correctly categorize the genes involved in the same biological pathway or process. Thirty-four genes in PPAR signaling pathway were classified via applying a proposed feature, 5-mers strongly associated to 17 different epigenetic modifications. For this, diverse statistical outlier detection methods were applied to specify the group of thoroughly correlated genes. The results indicated that these 5-mers can appropriately identify correlated genes. In addition, our results corresponded to GeneMania interaction information, leading to support the suggested method. The appealing findings imply that not only epigenetic modifications but also their highly correlated 5-mers can be applied for reconstructing gene regulatory networks as supplementary data as well as other applications like physical interaction, genes prioritization, indicating some sort of data fusion in this analysis.

Construction of a Protein-Protein Interaction Network for Chronic Myelocytic Leukemia and Pathway Prediction of Molecular Complexes

  • Zhou, Chao;Teng, Wen-Jing;Yang, Jing;Hu, Zhen-Bo;Wang, Cong-Cong;Qin, Bao-Ning;Lv, Qing-Liang;Liu, Ze-Wang;Sun, Chang-Gang
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제15권13호
    • /
    • pp.5325-5330
    • /
    • 2014
  • Background: Chronic myelocytic leukemia is a disease that threatens both adults and children. Great progress has been achieved in treatment but protein-protein interaction networks underlining chronic myelocytic leukemia are less known. Objective: To develop a protein-protein interaction network for chronic myelocytic leukemia based on gene expression and to predict biological pathways underlying molecular complexes in the network. Materials and Methods: Genes involved in chronic myelocytic leukemia were selected from OMIM database. Literature mining was performed by Agilent Literature Search plugin and a protein-protein interaction network of chronic myelocytic leukemia was established by Cytoscape. The molecular complexes in the network were detected by Clusterviz plugin and pathway enrichment of molecular complexes were performed by DAVID online. Results and Discussion: There are seventy-nine chronic myelocytic leukemia genes in the Mendelian Inheritance In Man Database. The protein-protein interaction network of chronic myelocytic leukemia contained 638 nodes, 1830 edges and perhaps 5 molecular complexes. Among them, complex 1 is involved in pathways that are related to cytokine secretion, cytokine-receptor binding, cytokine receptor signaling, while complex 3 is related to biological behavior of tumors which can provide the bioinformatic foundation for further understanding the mechanisms of chronic myelocytic leukemia.

StrokeBase: A Database of Cerebrovascular Disease-related Candidate Genes

  • Kim, Young-Uk;Kim, Il-Hyun;Bang, Ok-Sun;Kim, Young-Joo
    • Genomics & Informatics
    • /
    • 제6권3호
    • /
    • pp.153-156
    • /
    • 2008
  • Complex diseases such as stroke and cancer have two or more genetic loci and are affected by environmental factors that contribute to the diseases. Due to the complex characteristics of these diseases, identifying candidate genes requires a system-level analysis of the following: gene ontology, pathway, and interactions. A database and user interface, termed StrokeBase, was developed; StrokeBase provides queries that search for pathways, candidate genes, candidate SNPs, and gene networks. The database was developed by using in silico data mining of HGNC, ENSEMBL, STRING, RefSeq, UCSC, GO, HPRD, KEGG, GAD, and OMIM. Forty candidate genes that are associated with cerebrovascular disease were selected by human experts and public databases. The networked cerebrovascular disease gene maps also were developed; these maps describe genegene interactions and biological pathways. We identified 1127 genes, related indirectly to cerebrovascular disease but directly to the etiology of cerebrovascular disease. We found that a protein-protein interaction (PPI) network that was associated with cerebrovascular disease follows the power-law degree distribution that is evident in other biological networks. Not only was in silico data mining utilized, but also 250K Affymetrix SNP chips were utilized in the 320 control/disease association study to generate associated markers that were pertinent to the cerebrovascular disease as a genome-wide search. The associated genes and the genes that were retrieved from the in silico data mining system were compared and analyzed. We developed a well-curated cerebrovascular disease-associated gene network and provided bioinformatic resources to cerebrovascular disease researchers. This cerebrovascular disease network can be used as a frame of systematic genomic research, applicable to other complex diseases. Therefore, the ongoing database efficiently supports medical and genetic research in order to overcome cerebrovascular disease.

Functional characterization of ABA signaling components using transient gene expression in rice protoplasts

  • Song, In-Sik;Moon, Seok-Jun;Kim, Jin-Ae;Yoon, Insun;Kwon, Taek-Ryoun;Kim, Beom-Gi
    • 한국작물학회:학술대회논문집
    • /
    • 한국작물학회 2017년도 9th Asian Crop Science Association conference
    • /
    • pp.109-109
    • /
    • 2017
  • The core components of ABA-dependent gene expression signaling have been identified in Arabidopsis and rice. This signaling pathway consists of four major components; group A OsbZIPs, SAPKs, subclass A OsPP2Cs and OsPYL/RCARs in rice. These might be able to make thousands of combinations through interaction networks resulting in diverse signaling responses. We tried to characterize those gene functions using transient gene expression for rice protoplasts (TGERP) because it is instantaneous and convenient system. Firstly, in order to monitor the ABA signaling output, we developed reporter system named pRab16A-fLUC which consists of Rab16A promoter of rice and luciferase gene. It responses more rapidly and sensitively to ABA than pABRC3-fLUC that consists of ABRC3 of HVA1 promoter in TGERP. We screened the reporter responses for over-expression of each signaling components from group A OsbZIPs to OsPYL/RCARs with or without ABA in TGERP. OsbZIP46 induced reporter most strongly among OsbZIPs tested in the presence of ABA. SAPKs could activate the OsbZIP46 even in the ABA independence. Subclass A OsPP2C6 and -8 almost completely inhibited the OsbZIP46 activity in the different degree through the SAPK9. Lastly, OsPYL/RCAR2 and -5 rescued the OsbZIP46 activity in the presence of SAPK9 and OsPP2C6 dependent on ABA concentration and expression level. By using TGERP, we could characterize successfully the effects of ABA dependent gene expression signaling components in rice. In conclusion, TGERP represents very useful technology to study systemic functional genomics in rice or other monocots.

  • PDF

Identification and Functional Analysis of Differentially Expressed Genes Related to Metastatic Osteosarcoma

  • Niu, Feng;Zhao, Song;Xu, Chang-Yan;Chen, Lin;Ye, Long;Bi, Gui-Bin;Tian, Gang;Gong, Ping;Nie, Tian-Hong
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제15권24호
    • /
    • pp.10797-10801
    • /
    • 2015
  • Background: To explore the molecular mechanisms of metastatic osteosarcoma (OS) by using the microarray expression profiles of metastatic and non-metastatic OS samples. Materials and Methods: The gene expression profile GSE37552 was downloaded from Gene Expression Omnibus database, including 2 human metastatic OS cell line models and 2 two non-metastatic OS cell line models. The differentially expressed genes (DEGs) were identified by Multtest package in R language. In addition, functional enrichment analysis of the DEGs was performed by WebGestalt, and the protein-protein interaction (PPI) networks were constructed by Hitpredict, then the signal pathways of the genes involved in the networks were performed by Kyoto Encyclopaedia of Genes and Genomes (KEGG) automatic annotation server (KAAS). Results: A total of 237 genes were classified as DEGs in metastatic OS. The most significant up- and down-regulated genes were A2M (alpha-2-macroglobulin) and BCAN (brevican). The DEGs were significantly related to the response to hormone stimulus, and the PPI network of A2M contained IL1B (interleukin), LRP1 (low-density lipoprotein receptor-related protein 1) and PDGF (platelet-derived growth factor). Furthermore, the MAPK signaling pathway and focal adhesion were significantly enriched. Conclusions: A2M and its interactive proteins, such as IL1B, LRP1 and PDGF may be candidate target molecules to monitor, diagnose and treat metastatic OS. The response to hormone stimulus, MAPK signaling pathway and focal adhesion may play important roles in metastatic OS.

Bile Ductal Transcriptome Identifies Key Pathways and Hub Genes in Clonorchis sinensis-Infected Sprague-Dawley Rats

  • Yoo, Won Gi;Kang, Jung-Mi;Le, Huong Giang;Pak, Jhang Ho;Hong, Sung-Jong;Sohn, Woon-Mok;Na, Byoung-Kuk
    • Parasites, Hosts and Diseases
    • /
    • 제58권5호
    • /
    • pp.513-525
    • /
    • 2020
  • Clonorchis sinensis is a food-borne trematode that infects more than 15 million people. The liver fluke causes clonorchiasis and chronical cholangitis, and promotes cholangiocarcinoma. The underlying molecular pathogenesis occurring in the bile duct by the infection is little known. In this study, transcriptome profile in the bile ducts infected with C. sinensis were analyzed using microarray methods. Differentially expressed genes (DEGs) were 1,563 and 1,457 at 2 and 4 weeks after infection. Majority of the DEGs were temporally dysregulated at 2 weeks, but 519 DEGs showed monotonically changing expression patterns that formed seven distinct expression profiles. Protein-protein interaction (PPI) analysis of the DEG products revealed 5 sub-networks and 10 key hub proteins while weighted co-expression network analysis (WGCNA)-derived gene-gene interaction exhibited 16 co-expression modules and 13 key hub genes. The DEGs were significantly enriched in 16 Kyoto Encyclopedia of Genes and Genomes pathways, which were related to original systems, cellular process, environmental information processing, and human diseases. This study uncovered a global picture of gene expression profiles in the bile ducts infected with C. sinensis, and provided a set of potent predictive biomarkers for early diagnosis of clonorchiasis.

MNNG 처리에 의해 조절되는 암발생 유발 유전자의 조사 (MNNG-Regulated Differentially Expressed Genes that Contribute to Cancer Development in Stomach Cells)

  • 김태진;김명관;정동주
    • 대한임상검사과학회지
    • /
    • 제53권4호
    • /
    • pp.353-362
    • /
    • 2021
  • 암은 전 세계적인 건강문제이다. 암의 종류는 다양하나 암의 발생과정에서의 유사성은 상당히 높다. 모든 암은 유전자의 발현 변화가 있다는 점에서 공통점을 갖는다. 암 발생과정에서 변화되는 유전자의 발현을 이해하는 것은 암치료제나 암백신 개발에 큰 도움을 줄 것이다. 이번 연구에서는 잘 알려진 암발생 물질인 MNNG를 이용하여 정상 위세포에서 암발생 시 변화되는 유전자 발현을 분석하였다. MNNG 처리에 의해 정상 세포와는 다르게 발현되는 유전자인 DEG들을 찾고 이들을 암세포에서 발현되는 DEG들과 비교하였다. 여기에 더해 DEG의 단백질 결과물들의 기능과 단백질들 간의 결합을 분석하여 단순히 유전자의 발현뿐만 아니라 이들 단백질의 신호전달 과정에서의 연관성도 함께 분석하였다. 이 결과 위암 발생과정에서 관여하는 유전자들과 이들 유전자의 단백질 결과물들의 상호 작용을 밝히게 되었다.

동적인 개념을 적용한 알츠하이머 질병 네트워크의 특성 분석 (Characterization of the Alzheimer's disease-related network based on the dynamic network approach)

  • 김만선;김정래
    • 한국지능시스템학회논문지
    • /
    • 제25권6호
    • /
    • pp.529-535
    • /
    • 2015
  • 지금까지 생체 네트워크 분석 연구는 정적(static)인 개념으로만 다루어졌다. 그러나 실제 생명현상이 발생하는 세포 내에서는 세포의 상태 및 외부 환경에 따라 일부 단백질과 그 상호작용만이 선택적으로 활성화된다. 따라서 생체 네트워크의 구조가 시간의 흐름에 따라 변화하는 동적(dynamic)인 개념이 적용되어야 하며, 이런 개념은 질병의 진행 추이를 분석하는데 효율적이다. 본 논문에서는 동적인 네트워크 방법을 알츠하이머 질병에 적용하여 질병이 진행되는 단계에 따라 변화하는 단백질 상호작용 네트워크의 구조적, 기능적 특징에 대하여 분석하고자 한다. 우선, 유전자 발현데이터를 기반으로 각 질병의 진행 상태에 따른 부분 네트워크(정상, 초기, 중기, 말기)를 구축하였다. 이를 기반으로, 네트워크의 구조적 특성 분석을 수행하였다. 또한 기능적 특성 분석을 위해 유전자 군집(module)을 탐색하고, 군집별 유전자 기능(Gene Ontology) 분석을 수행했다. 그 결과, 네트워크의 특성들은 각 질병의 단계와 잘 대응되며, 동적 네트워크 분석법이 중요한 생물학적 이벤트를 설명하는데 이용될 수 있음을 보였다. 결론적으로 제안된 연구 방법을 통하여 그동안 알려지지 않았던 질병유발에 관련된 주요 네트워크 변화를 관측할 수 있고, 질병에 관여하는 복잡한 분자 수준의 발생 기작과 진행 과정을 이해하는데 중요한 정보를 획득할 수 있다.