• 제목/요약/키워드: Gene ontology

검색결과 303건 처리시간 0.02초

FCAnalyzer: A Functional Clustering Analysis Tool for Predicted Transcription Regulatory Elements and Gene Ontology Terms

  • Kim, Sang-Bae;Ryu, Gil-Mi;Kim, Young-Jin;Heo, Jee-Yeon;Park, Chan;Oh, Berm-Seok;Kim, Hyung-Lae;Kimm, Ku-Chan;Kim, Kyu-Won;Kim, Young-Youl
    • Genomics & Informatics
    • /
    • 제5권1호
    • /
    • pp.10-18
    • /
    • 2007
  • Numerous studies have reported that genes with similar expression patterns are co-regulated. From gene expression data, we have assumed that genes having similar expression pattern would share similar transcription factor binding sites (TFBSs). These function as the binding regions for transcription factors (TFs) and thereby regulate gene expression. In this context, various analysis tools have been developed. However, they have shortcomings in the combined analysis of expression patterns and significant TFBSs and in the functional analysis of target genes of significantly overrepresented putative regulators. In this study, we present a web-based A Functional Clustering Analysis Tool for Predicted Transcription Regulatory Elements and Gene Ontology Terms (FCAnalyzer). This system integrates microarray clustering data with similar expression patterns, and TFBS data in each cluster. FCAnalyzer is designed to perform two independent clustering procedures. The first process clusters gene expression profiles using the K-means clustering method, and the second process clusters predicted TFBSs in the upstream region of previously clustered genes using the hierarchical biclustering method for simultaneous grouping of genes and samples. This system offers retrieved information for predicted TFBSs in each cluster using $Match^{TM}$ in the TRANSFAC database. We used gene ontology term analysis for functional annotation of genes in the same cluster. We also provide the user with a combinatorial TFBS analysis of TFBS pairs. The enrichment of TFBS analysis and GO term analysis is statistically by the calculation of P values based on Fisher’s exact test, hypergeometric distribution and Bonferroni correction. FCAnalyzer is a web-based, user-friendly functional clustering analysis system that facilitates the transcriptional regulatory analysis of co-expressed genes. This system presents the analyses of clustered genes, significant TFBSs, significantly enriched TFBS combinations, their target genes and TFBS-TF pairs.

Gene-set based genome-wide association analysis for the speed of sound in two skeletal sites of Korean women

  • Kwon, Ji-Sun;Kim, Sangsoo
    • BMB Reports
    • /
    • 제47권6호
    • /
    • pp.348-353
    • /
    • 2014
  • The speed of sound (SOS) value is an indicator of bone mineral density (BMD). Previous genome-wide association (GWA) studies have identified a number of genes, whose variations may affect BMD levels. However, their biological implications have been elusive. We re-analyzed the GWA study dataset for the SOS values in skeletal sites of 4,659 Korean women, using a gene-set analysis software, GSA-SNP. We identified 10 common representative GO terms, and 17 candidate genes between these two traits (PGS < 0.05). Implication of these GO terms and genes in the bone mechanism is well supported by the literature survey. Interestingly, the significance levels of some member genes were inversely related, in several gene-sets that were shared between two skeletal sites. This implies that biological process, rather than SNP or gene, is the substantial unit of genetic association for SOS in bone. In conclusion, our findings may provide new insights into the biological mechanisms for BMD.

조건(암, 정상)에 따라 특이적 관계를 나타내는 유전자 쌍으로 구성된 유전자 모듈을 이용한 독립샘플의 클래스예측 (Class prediction of an independent sample using a set of gene modules consisting of gene-pairs which were condition(Tumor, Normal) specific)

  • 정현이;윤영미
    • 한국컴퓨터정보학회논문지
    • /
    • 제15권12호
    • /
    • pp.197-207
    • /
    • 2010
  • 대용량(High-throughput) 형태로 얻어진 cDNA 마이크로어레이 데이터에 다양한 데이터 마이닝 기법을 적용하면 서로 다른 조직에서 추출한 유전자의 발현정도를 비교할 수 있고 정상세포와 암세포에서 발현량의 차이를 보이는 DEG(Differently Expression Gene) 유전자를 추출할 수 있다. 이들을 이용하여 병을 진단할 수 있을 뿐만 아니라, 암의 진행 단계(Cancer Stage)에 따른 치료 방법을 결정할 수 있다. 마이크로어레이를 기반으로 한 대부분의 암 분류자는 기계학습 기법을 이용하여 암 관련 유전자를 추출하여, 이들 유전자를 총체적으로 이용하여 독립 샘플의 클래스(암, 정상)를 판정한다. 하지만 유전자의 발현량의 차이뿐만 아니라 유전자와 유전자의 상관관계의 변화가 질병 진단에 활용될 수 있다. 대부분의 질병은 단독 유전자의 변이에 의한 것이 아니라 유전자의 모듈로 이루어진 유전자조절네트워크의 변이에 의한 것이기 때문이다. 본 논문에서는 조건에 따라 특이적 관계를 나타내는 유전자 쌍을 식별하여, 이들 유전자 쌍을 이용한 유전자 분류 모듈을 생성한다. 분류 모듈을 이용한 암 분류 방법이 기존의 암 분류 방법보다 높은 정확도로 암과정상 샘플을 분류함을 보여주고 있다. 분류 모듈을 구성하는 유전자의 수가 상대적으로 적으므로 임상키트로의 개발도 고려할 수 있다. 향후 분류 모듈에 속하는 유전자의 기능적 검증을, GO(Gene Ontology)를 활용함으로서, 밝혀지지 않은 새로운 암 관련 유전자를 식별하고, 분류 모듈을 확대하여 암 특이적 유전자조절네트워크 구성에 활용할 계획이다.

Functional Gene Analysis to Identify Potential Markers Induced by Benzene in Two Different Cell Lines, HepG2 and HL-60

  • Kim, Youn-Jung;Song, Mi-Kyung;Sarma, Sailendra Nath;Choi, Han-Saem;Ryu, Jae-Chun
    • Molecular & Cellular Toxicology
    • /
    • 제4권3호
    • /
    • pp.183-191
    • /
    • 2008
  • Volatile organic compounds (VOCs) are common constituents of cleaning and degreasing agents, paints, pesticides, personal care products, gasoline and solvents. And VOCs are evaporated at room temperature and most of them exhibit acute and chronic toxicity to human. Benzene is the most widely used prototypical VOC and the toxic mechanisms of them are still unclear. The multi-step process of toxic mechanism can be more fully understood by characterizing gene expression changes induced in cells by toxicants. In this study, DNA microarray was used to monitor the expression levels of genes in HepG2 cells and HL-60 cells exposed to the benzene on IC20 and IC50 dose respectively. In the clustering analysis of gene expression profiles, although clusters of HepG2 and HL-60 cells by benzene were divided differently, expression pattern of many genes observed similarly. We identified 916 up-regulated genes and 1,144 down-regulated genes in HepG2 cells and also 1,002 up-regulated genes and 919 down-regulated genes in HL-60 cells. The gene ontology analysis on genes expressed by benzene in HepG2 and HL-60 cells, respectively, was performed. Thus, we found some principal pathways, such as, focal adhesion, gap junction and signaling pathway in HepG2 cells and toll-like receptor signaling pathway, MAPK signaling pathway, p53 signaling pathway and neuroactive ligand-receptor interaction in HL-60 cells. And we also found 16 up-regulated and 14 down-regulated commonly expressed total 30 genes that belong in the same biological process like inflammatory response, cell cycle arrest, cell migration, transmission of nerve impulse and cell motility in two cell lines. In conclusion, we suggest that this study is meaningful because these genes regarded as strong potential biomarkers of benzene independent of cell type.

Differential Gene Expression Induced by Naphthalene in Two Human Cell Line, HepG2 and HL-60

  • Kim, Youn-Jung;Song, Mee;Song, Mi-Kyung;Youk, Da-Young;Choi, Han-Saem;Sarma, Sailendra Nath;Ryu, Jae-Chun
    • Molecular & Cellular Toxicology
    • /
    • 제5권2호
    • /
    • pp.99-107
    • /
    • 2009
  • Naphthalene is bicyclic aromatic compound that is widely used in various domestic and commercial applications including lavatory scent disks, soil fumigants and moth balls. Exposure to naphthalene results in the development of bronchiolar damage, cataracts and hemolytic anemia in humans and laboratory animals. However, little information is available regarding the mechanism of naphthalene toxicity. We investigated gene expression profiles and potential signature genes in human hepatocellular carcinoma HepG2 cells and human promyelocytic leukemia HL-60 cells after 3 h and 48 h incubation with the IC$_{20}$ and IC$_{50}$ of naphthalene by using 44 k agilent whole human genome oligomicroarray and operon human whole 35 k oligomicroarray, respectively. We identified 616 up-regulated genes and 2,088 down-regulated genes changed by more than 2-fold by naphthalene in HepG2 cells. And in HL-60, we identified 138 up-regulated genes and 182 down-regulated genes changed by more than 2-fold. This study identified several interesting targets and functions in relation to naphthalene-induced toxicity through a gene ontology analysis method. Apoptosis and cell cycle related genes are more commonly expressed than other functional genes in both cell lines. In summary, the use of in vitro models with global expression profiling emerges as a relevant approach toward the identification of biomarkers associated with toxicity after exposure to a variety of environmental toxicants.

Transcriptome profiling of the coffee (C. arabica L.) seedlings under salt stress condition

  • Haile, Mesfin;Kang, Won Hee
    • Journal of Plant Biotechnology
    • /
    • 제45권1호
    • /
    • pp.45-54
    • /
    • 2018
  • This research was conducted to study the gene expression of coffee (Coffea arabica L.) seedlings under salt stress condition. A solution of five percent ($2.3dS\;m^{-1}$) deep sea water was used for the salt treatment, and it was thereby compared to normal irrigation water ($0.2dS\;m^{-1}$) used for the control treatment. The mRNA was extracted from the leaves of the coffee seedlings for a comprehensive analysis. In this study, a total of 19,581 genes were identified and aligned to the reference sequences available in the coffee genome database. The gene ontology analysis was performed to estimate the number of genes associated with the identified biological processes, cellular components and molecular functions. Among the 19,581 genes, 7369 (37.64%) were associated with biological processes, 5909 (30.18%) with cellular components, and 5325 (27.19%) with molecular functions. The remaining 978 (4.99%) genes were therefore grouped as unclassified. A differential gene expression analysis was performed using the DESeq2 package to identify the genes that were differentially expressed between the treatments based on fold changes and p-values. Namely, a total of 611 differentially expressed genes were identified (treatment/control) in that case. Among these, 336 genes were up-regulated while 275 of the genes were down-regulated. Of the differentially expressed genes, 60 genes showed statistically significant (p < 0.05) expression, 44 of which were up-regulated and 16 which were down-regulated. We also identified 11 differentially expressed transcription factor genes, 6 of which were up-regulated and rest 5 genes were down-regulated. The data generated from this study will help in the continued interest and understanding of the responses of coffee seedlings genes associated with salinity stress, in particular. This study will also provide important resources for further functional genomics studies.

마이크로어레이 데이터와 PPI 데이터를 이용한 에스트로겐 수용체 음성 유방암 환자의 예후 특이 네트워크 식별 및 예후 예측 (Identification of prognosis-specific network and prediction for estrogen receptor-negative breast cancer using microarray data and PPI data)

  • 황유현;오민;윤영미
    • 한국컴퓨터정보학회논문지
    • /
    • 제20권2호
    • /
    • pp.137-147
    • /
    • 2015
  • 본 논문에서는 유전자 네트워크를 기반으로 유방암 환자의 예후를 예측하는 알고리듬을 제안한다. 유방암 환자의 마이크로어레이 데이터와 PPI(Protein-protein interaction)데이터를 이용하여 알고리듬의 분류자로 사용될 예후 특이 네트워크(Prognosis specific gene network)를 추출한다. PPI에 속한 모든 유전자 네트워크에 대하여 각각의 네트워크가 예후 좋음과 나쁨을 잘 구분하는지에 대한 점수를 피어슨 상관계수(Pearson's correlation coefficient)와 마이크로어레이 데이터를 이용하여 계산한다. 이들 중 가장 예후에 유의한 네트워크를 식별하고, 이 네트워크를 분류자로 사용하여 에스트로겐 수용체 음성 유방암 환자의 예후를 분류 분석 한다. 본 연구와 기존 연구의 알고리듬 정확도를 비교 분석 하기 위하여 독립 실험을 진행하고, 본 연구에서 제안된 알고리듬의 성능이 더 우수함을 보인다. 또한, Gene Ontology 데이터베이스를 활용하여 식별된 예후 특이 네트워크를 기능적으로 검증 한다.

인간 간암세포주 HepG2에서 김 분획물의 항산화 활성을 통한 증식 억제 및 유전자 발현 양상 (Anti-proliferating Effects and Gene Expression Profiles through Antioxidant Activity of Porphyra yezoensis Fractions on Human HepG2 Cell Lines)

  • 오윤정;김정민;방인석
    • 생명과학회지
    • /
    • 제28권2호
    • /
    • pp.176-186
    • /
    • 2018
  • 김(Porphyra yezoensis, Laver)의 MeOH 추출에 의한 유기용매 별 분획물에서 폴리페놀 함량과 항산화 활성 및 간암세포주 HepG2의 세포증식 억제효과를 확인하였다. $CHCl_3$ 분획물의 폴리페놀 함량은 $10.34{\mu}g/mg$으로 물 분획물의 $13.08{\mu}g/mg$ 보다는 다소 적게 나타났으나, DPPH 자유라디칼 소거에 의한 전자공여능(EDA)에서 나타난 $ED_{50}$$16.96{\mu}g/ml$로 가장 높게 나타났다. $CHCl_3$과 EtOAc 분획물은 농도의존적으로 HepG2 세포의 증식을 억제하였으며, 특히 $900{\mu}g/ml$$CHCl_3$ 분획물을 24시간 동안 처리하여 90%의 세포증식이 억제되었다. 한편 $CHCl_3$ 분획물이 처리된 HepG2 세포의 유전자 발현 양상을 microarray로 확인하였다. P. yezoensis의 효능과 연관지은 gene ontology 분석으로 비타민 D 합성 과정, 항균작용에 대한 반응 및 영양물질에 대한 반응에 관련된 유의 유전자들을 탐색하였다. 유의 유전자로 IL6R와 CYP1A1를 선정하였고, 이들 유전자의 상위 조절자는 ARNT 유전자가 선정되었다. 또한 50 및 $100{\mu}g/ml$$CHCl_3$ 분획물이 처리된 HepG2 세포에서 IL6R와 CYP1A1 단백질의 발현과 상위 조절자인 ARNT의 활성을 Western blotting으로 확인하였다.

영하의 저온에 노출된 'Campbell Early'와 'Muscat Bailey A' 포도나무 신초의 전사체 비교 (Transcriptomic analysis of 'Campbell Early' and 'Muscat Bailey A' grapevine shoots exposed to freezing cold stress)

  • 김선애;윤해근
    • Journal of Plant Biotechnology
    • /
    • 제43권2호
    • /
    • pp.204-212
    • /
    • 2016
  • 환경스트레스 중의 하나인 저온에 대한 생육기의 포도나무의 반응을 분석하고자 -$2^{\circ}C$에서 4일 동안 저온처리 한두 품종('Campbell Early'와 'Muscat Baily A')의 포도나무잎을 이용하여 전사체를 분석하였고 특이발현유전자(differentially expressed genes, DEGs)를 검색하였다. 영하의 저온에 반응한 'Campbell Early'의 DEG를 기능별로 분석한 결과 생물대사에서 17,424개, 세포구성에서 28,954개, 분자기능에서는 6,972개의 유전자와 관련이 있었다. 발현이 유도되는 유전자로는 dehydrin xero 1, K-box region and MADS-box transcription factor family protein과 MYB domain protein 36이 있으며, 억제되는 유전자로는 light-harvesting chlorophyll B-binding protein 3, FASCICLIN-like arabinoogalactan 9와 pectin methylesterase 61 등이 있었다. 'Muscat Baily A'의 DEG는 생물대사에서 1,157개, 세포구성에서 1,350개, 분자기능에서는 431개의 유전자와 관련이 있었다. 발현이 유도되는 유전자로는 NB-ARC domain-containing disease resistance protein, fatty acid hydrozylase syperfamily와 isopentenyltransferase 3이 있으며, 억제되는 유전자로는 binding, IAP-like protein 1과 pentatricopeptide repeat superfamily protein 등이 있었다. Real-time PCR을 이용하여 영하의 저온에서 특이적으로 발현하는 유전자들을 검정하였으며, InterPro Scan을 통해 단백질 도메인을 분석한 결과 두 품종 모두에서 ubiquitin-protein ligase가 가장 많았다. 영하의 저온에 노출된 신초의 전사체 정보를 바탕으로 포도나무에서 저온 내성을 발현하는 기작을 연하는 데에 분자수준의 정보를 제공하고, 내한성 포도를 육종하는데 이용될 수 있을 것이다.

Analysis of Expressed Sequence Tags from the Red Alga Griffithsia okiensis

  • Lee, Hyoung-Seok;Lee, Hong-Kum;An, Gyn-Heung;Lee, Yoo-Kyung
    • Journal of Microbiology
    • /
    • 제45권6호
    • /
    • pp.541-546
    • /
    • 2007
  • Red algae are distributed globally, and the group contains several commercially important species. Griffithsia okiensis is one of the most extensively studied red algal species. In this study, we conducted expressed sequence tag (ESTs) analysis and synonymous codon usage analysis using cultured G. okiensis samples. A total of 1,104 cDNA clones were sequenced using a cDNA library made from samples collected from Dolsan Island, on the southern coast of Korea. The clustering analysis of these sequences allowed for the identification of 1,048 unigene clusters consisting of 36 consensus and 1,012 singleton sequences. BLASTX searches generated 532 significant hits (E-value <$10^{-4}$) and via further Gene Ontology analysis, we constructed a functional classification of 434 unigenes. Our codon usage analysis showed that unigene clusters with more than three ESTs had higher GC contents (76.5%) at the third position of the codons than the singletons. Also, the majority of the optimal codons of G. okiensis and Chondrus crispus belonging to Bangiophycidae were G-ending, whereas those of Porphyra yezoensis belonging to Florideophycidae were G-ending. An orthologous gene search for the P. yezoensis EST database resulted in the identification of 39 unigenes commonly expressed in two rhodophytes, which have putative functions for structural proteins, protein degradation, signal transduction, stress response, and physiological processes. Although experiments have been conducted on a limited scale, this study provides a material basis for the development of microarrays useful for gene expression studies, as well as useful information for the comparative genomic analysis of red algae.