• 제목/요약/키워드: Gene Set Enrichment Analysis (GSEA)

검색결과 15건 처리시간 0.019초

NGSEA: Network-Based Gene Set Enrichment Analysis for Interpreting Gene Expression Phenotypes with Functional Gene Sets

  • Han, Heonjong;Lee, Sangyoung;Lee, Insuk
    • Molecules and Cells
    • /
    • 제42권8호
    • /
    • pp.579-588
    • /
    • 2019
  • Gene set enrichment analysis (GSEA) is a popular tool to identify underlying biological processes in clinical samples using their gene expression phenotypes. GSEA measures the enrichment of annotated gene sets that represent biological processes for differentially expressed genes (DEGs) in clinical samples. GSEA may be suboptimal for functional gene sets; however, because DEGs from the expression dataset may not be functional genes per se but dysregulated genes perturbed by bona fide functional genes. To overcome this shortcoming, we developed network-based GSEA (NGSEA), which measures the enrichment score of functional gene sets using the expression difference of not only individual genes but also their neighbors in the functional network. We found that NGSEA outperformed GSEA in identifying pathway gene sets for matched gene expression phenotypes. We also observed that NGSEA substantially improved the ability to retrieve known anti-cancer drugs from patient-derived gene expression data using drug-target gene sets compared with another method, Connectivity Map. We also repurposed FDA-approved drugs using NGSEA and experimentally validated budesonide as a chemical with anti-cancer effects for colorectal cancer. We, therefore, expect that NGSEA will facilitate both pathway interpretation of gene expression phenotypes and anti-cancer drug repositioning. NGSEA is freely available at www.inetbio.org/ngsea.

Fisher Criterion을 이용한 Gene Set Enrichment Analysis 기반 유의 유전자 집합의 검출 방법 연구 (Identifying Statistically Significant Gene-Sets by Gene Set Enrichment Analysis Using Fisher Criterion)

  • 김재영;신미영
    • 전자공학회논문지CI
    • /
    • 제45권4호
    • /
    • pp.19-26
    • /
    • 2008
  • Gene set enrichment analysis (GSEA)는 두 개의 클래스를 가지는 마이크로어레이 실험 데이터 분석을 위해 생물학적 특징을 기반으로 구성된 다양한 유전자-집합 중에서 두 클래스의 발현값들이 통계적으로 중요한 차이를 나타내는 유의한 유전자-집합을 추출하기 위한 분석 방법이다. 특히, 유전자에 대한 다양한 생물학적인 정보를 지닌 유전자 주석 데이터베이스(Cytogenetic Band, KEGG pathway, Gene Ontology 등)를 이용하여 마이크로어레이 실험에 사용된 전체 유전자 중 특정 기능을 가지는 유전자들을 그룹화하여 다양한 유전자-집합을 발굴하고, 각 유전자-집합 내에서 두 클래스간에 발현값의 차이를 참조하여 유의한 유전자들을 결정하여, 이를 기반으로 통계적으로 유의한 유전자-집합들을 최종 검출하는 방법이다. 본 논문에서는 GSEA 분석 과정에서 현재 주로 사용되고 있는 signal-to-noise ratio 기반 유전자 서열화(gene ranking) 방법 대신에, Fisher criterion을 이용한 유전자 서열화 방법을 적용함으로써 기존의 GSEA 방법에서 추출하지 못한 생물학적으로 의미 있는 새로운 유의 유전자-집합을 추출하는 방법을 제안하고자 한다. 또한, 제안한 방법의 성능을 고찰하기 위하여 공개된 Leukemia 관련 마이크로어레이 실험 데이터 분석에 적용하였으며, 기존의 알려진 결과와 비교 분석함으로써 제안한 방법의 유용성을 검증하고자 하였다.

Discovery of Cellular RhoA Functions by the Integrated Application of Gene Set Enrichment Analysis

  • Chun, Kwang-Hoon
    • Biomolecules & Therapeutics
    • /
    • 제30권1호
    • /
    • pp.98-116
    • /
    • 2022
  • The small GTPase RhoA has been studied extensively for its role in actin dynamics. In this study, multiple bioinformatics tools were applied cooperatively to the microarray dataset GSE64714 to explore previously unidentified functions of RhoA. Comparative gene expression analysis revealed 545 differentially expressed genes in RhoA-null cells versus controls. Gene set enrichment analysis (GSEA) was conducted with three gene set collections: (1) the hallmark, (2) the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway, and (3) the Gene Ontology Biological Process. GSEA results showed that RhoA is related strongly to diverse pathways: cell cycle/growth, DNA repair, metabolism, keratinization, response to fungus, and vesicular transport. These functions were verified by heatmap analysis, KEGG pathway diagramming, and direct acyclic graphing. The use of multiple gene set collections restricted the leakage of information extracted. However, gene sets from individual collections are heterogenous in gene element composition, number, and the contextual meaning embraced in names. Indeed, there was a limit to deriving functions with high accuracy and reliability simply from gene set names. The comparison of multiple gene set collections showed that although the gene sets had similar names, the gene elements were extremely heterogeneous. Thus, the type of collection chosen and the analytical context influence the interpretation of GSEA results. Nonetheless, the analyses of multiple collections made it possible to derive robust and consistent function identifications. This study confirmed several well-described roles of RhoA and revealed less explored functions, suggesting future research directions.

Identification of key genes and functional enrichment analysis of liver fibrosis in nonalcoholic fatty liver disease through weighted gene co-expression network analysis

  • Yue Hu;Jun Zhou
    • Genomics & Informatics
    • /
    • 제21권4호
    • /
    • pp.45.1-45.11
    • /
    • 2023
  • Nonalcoholic fatty liver disease (NAFLD) is a common type of chronic liver disease, with severity levels ranging from nonalcoholic fatty liver to nonalcoholic steatohepatitis (NASH). The extent of liver fibrosis indicates the severity of NASH and the risk of liver cancer. However, the mechanism underlying NASH development, which is important for early screening and intervention, remains unclear. Weighted gene co-expression network analysis (WGCNA) is a useful method for identifying hub genes and screening specific targets for diseases. In this study, we utilized an mRNA dataset of the liver tissues of patients with NASH and conducted WGCNA for various stages of liver fibrosis. Subsequently, we employed two additional mRNA datasets for validation purposes. Gene set enrichment analysis (GSEA) was conducted to analyze gene function enrichment. Through WGCNA and subsequent analyses, complemented by validation using two additional datasets, we identified five genes (BICC1, C7, EFEMP1, LUM, and STMN2) as hub genes. GSEA analysis indicated that gene sets associated with liver metabolism and cholesterol homeostasis were uniformly downregulated. BICC1, C7, EFEMP1, LUM, and STMN2 were identified as hub genes of NASH, and were all related to liver metabolism, NAFLD, NASH, and related diseases. These hub genes might serve as potential targets for the early screening and treatment of NASH.

Comparison of Invariant NKT Cells with Conventional T Cells by Using Gene Set Enrichment Analysis (GSEA)

  • Oh, Sae-Jin;Ahn, Ji-Ye;Chung, Doo-Hyun
    • IMMUNE NETWORK
    • /
    • 제11권6호
    • /
    • pp.406-411
    • /
    • 2011
  • Background: Invariant Natural killer T (iNKT) cells, a distinct subset of CD1d-restricted T cells with invariant $V{\alpha}{\beta}$ TCR, functionally bridge innate and adaptive immunity. While iNKT cells share features with conventional T cells in some functional aspects, they simultaneously produce large amount of Th1 and Th2 cytokines upon T-cell receptor (TCR) ligation. However, gene expression pattern in two types of cells has not been well characterized. Methods: we performed comparative microarray analyses of gene expression in murine iNKT cells and conventional $CD4^+CD25^-$ ${\gamma}{\delta}TCR^-$ T cells by using Gene Set Enrichment Analysis (GSEA) method. Results: Here, we describe profound differences in gene expression pattern between iNKT cells and conventional $CD4^+CD25^-$ ${\gamma}{\delta}TCR^-$ T cells. Conclusion: Our results provide new insights into the functional competence of iNKT cells and a better understanding of their various roles during immune responses.

Analysis of gene expression during odontogenic differentiation of cultured human dental pulp cells

  • Seo, Min-Seock;Hwang, Kyung-Gyun;Kim, Hyong-Bum;Baek, Seung-Ho
    • Restorative Dentistry and Endodontics
    • /
    • 제37권3호
    • /
    • pp.142-148
    • /
    • 2012
  • Objectives: We analyzed gene-expression profiles after 14 day odontogenic induction of human dental pulp cells (DPCs) using a DNA microarray and sought candidate genes possibly associated with mineralization. Materials and Methods: Induced human dental pulp cells were obtained by culturing DPCs in odontogenic induction medium (OM) for 14 day. Cells exposed to normal culture medium were used as controls. Total RNA was extracted from cells and analyzed by microarray analysis and the key results were confirmed selectively by reverse-transcriptase polymerase chain reaction (RT-PCR). We also performed a gene set enrichment analysis (GSEA) of the microarray data. Results: Six hundred and five genes among the 47,320 probes on the BeadChip differed by a factor of more than two-fold in the induced cells. Of these, 217 genes were upregulated, and 388 were down-regulated. GSEA revealed that in the induced cells, genes implicated in Apoptosis and Signaling by wingless MMTV integration (Wnt) were significantly upregulated. Conclusions: Genes implicated in Apoptosis and Signaling by Wnt are highly connected to the differentiation of dental pulp cells into odontoblast.

마이크로어레이 자료분석에서 모수적 방법을 이용한 유전자군의 유의성 검정 (Developing a Parametric Method for Testing the Significance of Gene Sets in Microarray Data Analysis)

  • 이선호;이승규;이광현
    • Communications for Statistical Applications and Methods
    • /
    • 제16권3호
    • /
    • pp.397-408
    • /
    • 2009
  • 마이크로어레이 기술은 수만 개 유전자의 발현 패턴을 동시에 관찰하는 것을 가능하게 하였고, 이들을 하나씩 검정하여 찾아낸 특이발현 현상을 보이는 유전자를 중심으로 질병의 진단, 치료법 정립과 신약 개발을 위한 기본 정보를 확립하였다. 그러나 개별 유전자분석의 여러 문제점이 발견되면서 유전자들을 생물학적 대사경로나 염색체 위치가 같은 것끼리 묶은 집단을 분석하여 질병의 발생이나 생존에 영향을 미치는 집단을 찾는 방법이 제시되었다. 이러한 유전자 집단의 유의성에 대한 연구는 2002년에 MIT에서 비롯되어 GSEA, SAM-GS와 중심극한 정리의 개념을 이용한 모수적 방법인 PAGE 등이 사용되고 있다. 본 논문에서는 이들 통계량의 구조적 한계를 극복하고 계산이 간단한 새로운 모수적 방법을 제안하고 자료 분석을 통하여 효율성을 보였다.

Deep Learning Approach Based on Transcriptome Profile for Data Driven Drug Discovery

  • Eun-Ji Kwon;Hyuk-Jin Cha
    • Molecules and Cells
    • /
    • 제46권1호
    • /
    • pp.65-67
    • /
    • 2023
  • SMILES (simplified molecular-input line-entry system) information of small molecules parsed by one-hot array is passed to a convolutional neural network called black box. Outputs data representing a gene signature is then matched to the genetic signature of a disease to predict the appropriate small molecule. Efficacy of the predicted small molecules is examined by in vivo animal models. GSEA, gene set enrichment analysis.

HPAI-resistant Ri chickens exhibit elevated antiviral immune-related gene expression

  • Thi Hao Vu;Jubi Heo;Yeojin Hong;Suyeon Kang;Ha Thi Thanh Tran;Hoang Vu Dang;Anh Duc Truong;Yeong Ho Hong
    • Journal of Veterinary Science
    • /
    • 제24권1호
    • /
    • pp.13.1-13.11
    • /
    • 2023
  • Background: Highly pathogenic avian influenza viruses (HPAIVs) is an extremely contagious and high mortality rates in chickens resulting in substantial economic impact on the poultry sector. Therefore, it is necessary to elucidate the pathogenic mechanism of HPAIV for infection control. Objective: Gene set enrichment analysis (GSEA) can effectively avoid the limitations of subjective screening for differential gene expression. Therefore, we performed GSEA to compare HPAI-infected resistant and susceptible Ri chicken lines. Methods: The Ri chickens Mx(A)/BF2(B21) were chosen as resistant, and the chickens Mx(G)/BF2(B13) were selected as susceptible by genotyping the Mx and BF2 genes. The tracheal tissues of HPAIV H5N1 infected chickens were collected for RNA sequencing followed by GSEA analysis to define gene subsets to elucidate the sequencing results. Results: We identified four differentially expressed pathways, which were immune-related pathways with a total of 78 genes. The expression levels of cytokines (IL-1β, IL-6, IL-12), chemokines (CCL4 and CCL5), type interferons and their receptors (IFN-β, IFNAR1, IFNAR2, and IFNGR1), Jak-STAT signaling pathway genes (STAT1, STAT2, and JAK1), MHC class I and II and their co-stimulatory molecules (CD80, CD86, CD40, DMB2, BLB2, and B2M), and interferon stimulated genes (EIF2AK2 and EIF2AK1) in resistant chickens were higher than those in susceptible chickens. Conclusions: Resistant Ri chickens exhibit a stronger antiviral response to HPAIV H5N1 compared with susceptible chickens. Our findings provide insights into the immune responses of genetically disparate chickens against HPAIV.

마이크로어레이 자료에서 생존과 유의한 관련이 있는 유전자집단 검색 (Detecting survival related gene sets in microarray analysis)

  • 이선호;이광현
    • Journal of the Korean Data and Information Science Society
    • /
    • 제23권1호
    • /
    • pp.1-11
    • /
    • 2012
  • 환자의 생존시간과 함께 유전자 마이크로어레이 자료가 주어진 경우 생존에 유의한 영향을 미치는 대사경로를 찾는 방법을 연구하였다. 기존의 방법인 유전자 집합 농축도 분석, 글로벌 검정과 왈드 형태 검정을 비교 분석하였고, 치환을 통하여 p값을 구하는 단점을 개선한 수정된 왈드 형태 검정을 제안하였다. 모의실험과 실제자료 분석을 이용하여 새로운 방법의 적용 가능성을 보였다.