• Title/Summary/Keyword: Bioconductor

Search Result 6, Processing Time 0.017 seconds

Design of Web-Bioconductor System for DNA chip data analysis (DNA chip 데이터 분석을 위한 Web-Bioconductor System 설계)

  • 신동훈;박준형;강병철;신창진;김철민
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2004.04a
    • /
    • pp.251-254
    • /
    • 2004
  • Web-Bioconductor System은 유전자 분석에 대한 통계적 모듈과 그래픽 환경을 제공하는 R언어와 DNA chip 데이터의 분석을 수행하는 Bioconductor 패키지를 이용하여 웹으로 DNA chip 데이터를 분석할 수 있도록 설계한 시스템이다. 본 시스템은 DNA chip 데이터의 분석을 위해 사용자 계정 모듈, 데이터 입력 모듈, 전 처리 모듈, 유전자 차등 발현 분석 모듈, 결과 출력 모듈로 구성되어 있으며, 분석된 결과물은 HTML, 이미지, XLS 파일 형태로 제공된다. 웹을 이용하여 DNA chip 분석을 수행함으로써 인터넷이 가능한 곳이면 시간과 장소의 구분이 없이 DNA chip 데이터 분석이 가능하며, 인터넷으로 DNA chip 데이터 분석 자료를 공유할 수 있음으로 연구자들의 상호 의견 교환을 바탕으로 효율적인 분석이 가능할 것이다. 또한 기존의 R언어와 Bioconductor가 전산 지식이 부족한 사람들에게는 접근하기 어려운 점을 웹 인터페이스로 간단하게 구현함으로써 DNA chip 데이터 분석에 있어 용이성과 효율성을 중대하고 있다.

  • PDF

A ChIP-Seq Data Analysis Pipeline Based on Bioconductor Packages

  • Park, Seung-Jin;Kim, Jong-Hwan;Yoon, Byung-Ha;Kim, Seon-Young
    • Genomics & Informatics
    • /
    • v.15 no.1
    • /
    • pp.11-18
    • /
    • 2017
  • Nowadays, huge volumes of chromatin immunoprecipitation-sequencing (ChIP-Seq) data are generated to increase the knowledge on DNA-protein interactions in the cell, and accordingly, many tools have been developed for ChIP-Seq analysis. Here, we provide an example of a streamlined workflow for ChIP-Seq data analysis composed of only four packages in Bioconductor: dada2, QuasR, mosaics, and ChIPseeker. 'dada2' performs trimming of the high-throughput sequencing data. 'QuasR' and 'mosaics' perform quality control and mapping of the input reads to the reference genome and peak calling, respectively. Finally, 'ChIPseeker' performs annotation and visualization of the called peaks. This workflow runs well independently of operating systems (e.g., Windows, Mac, or Linux) and processes the input fastq files into various results in one run. R code is available at github: https://github.com/ddhb/Workflow_of_Chipseq.git.

Web-based microarray analysis using the virtual chip viewer and bioconductor. (MicroArray의 직관적 시각적 분석을 위한 웹 기반 분석 도구)

  • Lee, Seung-Won;Park, Jun-Hyung;Kim, Hyun-Jin;Kang, Byeong-Chul;Park, Hee-Kyung;Kim, In-Ju;Kim, Cheol-Min
    • Proceedings of the Korea Inteligent Information System Society Conference
    • /
    • 2005.05a
    • /
    • pp.198-201
    • /
    • 2005
  • DNA microarray 칩은 신약 개발, 유전적 질환 진단, Bio-molecular 상호작용 연구, 유전자의 기능연구 등 폭넓게 사용되고 있다. 이 논문은 cDNA mimcroarray 데이터를 분석하기 위한 웹형태의 시스템 개발에 대한 내용을 다룬다. 하나의 cDNA microarray에는 수 백에서 수 만개의 유전자가 심어져 있으며, 데이터를 분석할 때 대량의 데이터와 다양한 형태의 오류로 인해서 데이터간의 차이를 보정하는 분석 도구와 통계적 기법들이 사용되어야 한다. 본 논문에서는 가상 칩 뷰어를 이용하여 실제 microarray 데이터의 foreground intensity에서 백그라운드의 intensity를 제거하여 일반화된 칩 이미지를 생성한다. 이 가상 칩 뷰어는 여러 가지 필터효과와 서로 다른 두 형광의 차이를 조정하는 global normalization 기법을 사용하여 발현 유전자 분석을 시각적으로 할 수 있고, 중복된 마이크로어레이 칩 데이터를 통하여 시간이 많이 걸리는 분석전 칩의 유효성을 검토할 수 있다. 칩 데이터의 normalization을 위한 통계 방법으로 R 통계 도구와 linear 모델을 사용하여 microarray 칩의 유전자 발현 양상을 분석한다. 통계적 방법을 사용하지 않은 데이터를 추출, 이 데이터의 패턴 그래프 그리고 발현 레벨을 분류하여 마이크로어레이의 각 스팟의 유효성 검토의 정확성을 높였다. 이 시스템은 칩의 유효성 검토, 스팟의 유효성 검토, 유전자 선정에 대해 분석의 용이성과 정확성을 높일 수 있었다.

  • PDF

TRAPR: R Package for Statistical Analysis and Visualization of RNA-Seq Data

  • Lim, Jae Hyun;Lee, Soo Youn;Kim, Ju Han
    • Genomics & Informatics
    • /
    • v.15 no.1
    • /
    • pp.51-53
    • /
    • 2017
  • High-throughput transcriptome sequencing, also known as RNA sequencing (RNA-Seq), is a standard technology for measuring gene expression with unprecedented accuracy. Numerous bioconductor packages have been developed for the statistical analysis of RNA-Seq data. However, these tools focus on specific aspects of the data analysis pipeline, and are difficult to appropriately integrate with one another due to their disparate data structures and processing methods. They also lack visualization methods to confirm the integrity of the data and the process. In this paper, we propose an R-based RNA-Seq analysis pipeline called TRAPR, an integrated tool that facilitates the statistical analysis and visualization of RNA-Seq expression data. TRAPR provides various functions for data management, the filtering of low-quality data, normalization, transformation, statistical analysis, data visualization, and result visualization that allow researchers to build customized analysis pipelines.

Bayesian Survival Analysis of High-Dimensional Microarray Data for Mantle Cell Lymphoma Patients

  • Moslemi, Azam;Mahjub, Hossein;Saidijam, Massoud;Poorolajal, Jalal;Soltanian, Ali Reza
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.17 no.1
    • /
    • pp.95-100
    • /
    • 2016
  • Background: Survival time of lymphoma patients can be estimated with the help of microarray technology. In this study, with the use of iterative Bayesian Model Averaging (BMA) method, survival time of Mantle Cell Lymphoma patients (MCL) was estimated and in reference to the findings, patients were divided into two high-risk and low-risk groups. Materials and Methods: In this study, gene expression data of MCL patients were used in order to select a subset of genes for survival analysis with microarray data, using the iterative BMA method. To evaluate the performance of the method, patients were divided into high-risk and low-risk based on their scores. Performance prediction was investigated using the log-rank test. The bioconductor package "iterativeBMAsurv" was applied with R statistical software for classification and survival analysis. Results: In this study, 25 genes associated with survival for MCL patients were identified across 132 selected models. The maximum likelihood estimate coefficients of the selected genes and the posterior probabilities of the selected models were obtained from training data. Using this method, patients could be separated into high-risk and low-risk groups with high significance (p<0.001). Conclusions: The iterative BMA algorithm has high precision and ability for survival analysis. This method is capable of identifying a few predictive variables associated with survival, among many variables in a set of microarray data. Therefore, it can be used as a low-cost diagnostic tool in clinical research.

Reconstruction and Exploratory Analysis of mTORC1 Signaling Pathway and Its Applications to Various Diseases Using Network-Based Approach

  • Buddham, Richa;Chauhan, Sweety;Narad, Priyanka;Mathur, Puniti
    • Journal of Microbiology and Biotechnology
    • /
    • v.32 no.3
    • /
    • pp.365-377
    • /
    • 2022
  • Mammalian target of rapamycin (mTOR) is a serine-threonine kinase member of the cellular phosphatidylinositol 3-kinase (PI3K) pathway, which is involved in multiple biological functions by transcriptional and translational control. mTOR is a downstream mediator in the PI3K/Akt signaling pathway and plays a critical role in cell survival. In cancer, this pathway can be activated by membrane receptors, including the HER (or ErbB) family of growth factor receptors, the insulin-like growth factor receptor, and the estrogen receptor. In the present work, we congregated an electronic network of mTORC1 built on an assembly of data using natural language processing, consisting of 470 edges (activations/interactions and/or inhibitions) and 206 nodes representing genes/proteins, using the Cytoscape 3.6.0 editor and its plugins for analysis. The experimental design included the extraction of gene expression data related to five distinct types of cancers, namely, pancreatic ductal adenocarcinoma, hepatic cirrhosis, cervical cancer, glioblastoma, and anaplastic thyroid cancer from Gene Expression Omnibus (NCBI GEO) followed by pre-processing and normalization of the data using R & Bioconductor. ExprEssence plugin was used for network condensation to identify differentially expressed genes across the gene expression samples. Gene Ontology (GO) analysis was performed to find out the over-represented GO terms in the network. In addition, pathway enrichment and functional module analysis of the protein-protein interaction (PPI) network were also conducted. Our results indicated NOTCH1, NOTCH3, FLCN, SOD1, SOD2, NF1, and TLR4 as upregulated proteins in different cancer types highlighting their role in cancer progression. The MCODE analysis identified gene clusters for each cancer type with MYC, PCNA, PARP1, IDH1, FGF10, PTEN, and CCND1 as hub genes with high connectivity. MYC for cervical cancer, IDH1 for hepatic cirrhosis, MGMT for glioblastoma and CCND1 for anaplastic thyroid cancer were identified as genes with prognostic importance using survival analysis.