• Title/Summary/Keyword: Omics data analysis

Search Result 45, Processing Time 0.02 seconds

Bioinformatics Resources of the Korean Bioinformation Center (KOBIC)

  • Lee, Byung-Wook;Chu, In-Sun;Kim, Nam-Shin;Lee, Jin-Hyuk;Kim, Seon-Yong;Kim, Wan-Kyu;Lee, Sang-Hyuk
    • Genomics & Informatics
    • /
    • v.8 no.4
    • /
    • pp.165-169
    • /
    • 2010
  • The Korean Bioinformation Center (KOBIC) is a national bioinformatics research center in Korea. We developed many bioinformatics algorithms and applications to facilitate the biological interpretation of OMICS data. Here we present an introduction to major bioinformatics resources of databases and tools developed at KOBIC. These resources are classified into three main fields: genome, proteome, and literature. In the genomic resources, we constructed several pipelines for next generation sequencing (NGS) data processing and developed analysis algorithms and web-based database servers including miRGator, ESTpass, and CleanEST. We also built integrated databases and servers for microarray expression data such as MDCDP. As for the proteome data, VnD database, WDAC, Localizome, and CHARMM_HM web servers are available for various purposes. We constructed IntoPub server and Patome database in the literature field. We continue constructing and maintaining the bioinformatics infrastructure and developing algorithms.

IdBean: a Java GUI application for conversion of biological identifiers

  • Lee, Sang-Hyuk;Kim, Bum-Jin;Kim, Hyeon-Jin;Lee, Hook-Eun;Yu, Ung-Sik
    • BMB Reports
    • /
    • v.44 no.2
    • /
    • pp.107-112
    • /
    • 2011
  • We have developed a biologist-friendly, stand-alone Java GUI application, IdBean, for ID conversion. Our tool integrated most of the widely used ID conversion services that provide programmatic access. It is the first GUI ID conversion application that supports the direct merging as well as comparison of conversion results from multiple ID conversion services without manual effort. This tool will greatly help biologists who handle multiple ID types for the analyses of gene or gene product lists. By referring to multiple conversion services, the number of failed IDs can be reduced. By accessing ID conversion service online, it will potentially provide the most up-to-date conversion results. The application was developed in modular form; however, it can be re-packaged into plug-in form. For the development of a bioinformatics analysis tool, the module can be used as a built-in ID conversion component. It is available at http://neon.gachon.ac.kr/IdBean/.

Comprehensive Analysis of Proteomic Differences between Escherichia coli K-12 and B Strains Using Multiplexed Isobaric Tandem Mass Tag (TMT) Labeling

  • Han, Mee-Jung
    • Journal of Microbiology and Biotechnology
    • /
    • v.27 no.11
    • /
    • pp.2028-2036
    • /
    • 2017
  • The Escherichia coli K-12 and B strains are among the most frequently used bacterial hosts for scientific research and biotechnological applications. However, omics analyses have revealed that E. coli K-12 and B exhibit notably different genotypic and phenotypic attributes, even though they were derived from the same ancestor. In a previous study, we identified a limited number of proteins from the two strains using two-dimensional gel electrophoresis and tandem mass spectrometry (MS/MS). In this study, an in-depth analysis of the physiological behavior of the E. coli K-12 and B strains at the proteomic level was performed using six-plex isobaric tandem mass tag-based quantitative MS. Additionally, the best lysis buffer for increasing the efficiency of protein extraction was selected from three tested buffers prior to the quantitative proteomic analysis. This study identifies the largest number of proteins in the two E. coli strains reported to date and is the first to show the dynamics of these proteins. Notable differences in proteins associated with key cellular properties, including some metabolic pathways, the biosynthesis and degradation of amino acids, membrane integrity, cellular tolerance, and motility, were found between the two representative strains. Compared with previous studies, these proteomic results provide a more holistic view of the overall state of E. coli cells based on a single proteomic study and reveal significant insights into why the two strains show distinct phenotypes. Additionally, the resulting data provide in-depth information that will help fine-tune processes in the future.

Systems-Level Analysis of Genome-Scale In Silico Metabolic Models Using MetaFluxNet

  • Lee, Sang-Yup;Woo, Han-Min;Lee, Dong-Yup;Choi, Hyun-Seok;Kim, Tae-Yong;Yun, Hong-Seok
    • Biotechnology and Bioprocess Engineering:BBE
    • /
    • v.10 no.5
    • /
    • pp.425-431
    • /
    • 2005
  • The systems-level analysis of microbes with myriad of heterologous data generated by omics technologies has been applied to improve our understanding of cellular function and physiology and consequently to enhance production of various bioproducts. At the heart of this revolution resides in silico genome-scale metabolic model, In order to fully exploit the power of genome-scale model, a systematic approach employing user-friendly software is required. Metabolic flux analysis of genome-scale metabolic network is becoming widely employed to quantify the flux distribution and validate model-driven hypotheses. Here we describe the development of an upgraded MetaFluxNet which allows (1) construction of metabolic models connected to metabolic databases, (2) calculation of fluxes by metabolic flux analysis, (3) comparative flux analysis with flux-profile visualization, (4) the use of metabolic flux analysis markup language to enable models to be exchanged efficiently, and (5) the exporting of data from constraints-based flux analysis into various formats. MetaFluxNet also allows cellular physiology to be predicted and strategies for strain improvement to be developed from genome-based information on flux distributions. This integrated software environment promises to enhance our understanding on metabolic network at a whole organism level and to establish novel strategies for improving the properties of organisms for various biotechnological applications.

Metagenomic and Proteomic Analyses of a Mangrove Microbial Community Following Green Macroalgae Enteromorpha prolifera Degradation

  • Wu, Yijing;Zhao, Chao;Xiao, Zheng;Lin, Hetong;Ruan, Lingwei;Liu, Bin
    • Journal of Microbiology and Biotechnology
    • /
    • v.26 no.12
    • /
    • pp.2127-2137
    • /
    • 2016
  • A mangrove microbial community was analyzed at the gene and protein levels using metagenomic and proteomic methods with the green macroalgae Enteromorpha prolifera as the substrate. Total DNA was sequenced on the Illumina HiSeq 2000 PE-100 platform. Two-dimensional gel electrophoresis in combination with liquid chromatography tandem mass spectrometry was used for proteomic analysis. The metagenomic data revealed that the orders Pseudomonadales, Rhizobiales, and Sphingomonadales were the most prevalent in the mangrove microbial community. By monitoring changes at the functional level, proteomic analyses detected ATP synthase and transporter proteins, which were expressed mainly by members of the phyla Proteobacteria and Bacteroidetes. Members of the phylum Proteobacteria expressed a high number of sugar transporters and demonstrated specialized and efficient digestion of various glycans. A few glycoside hydrolases were detected in members of the phylum Firmicutes, which appeared to be the main cellulose-degrading bacteria. This is the first report of multiple "omics" analysis of E. prolifera degradation. These results support the fact that key enzymes of glycoside hydrolase family were expressed in large quantities, indicating the high metabolic activity of the community.

From genome sequencing to the discovery of potential biomarkers in liver disease

  • Oh, Sumin;Jo, Yeeun;Jung, Sungju;Yoon, Sumin;Yoo, Kyung Hyun
    • BMB Reports
    • /
    • v.53 no.6
    • /
    • pp.299-310
    • /
    • 2020
  • Chronic liver disease progresses through several stages, fatty liver, steatohepatitis, cirrhosis, and eventually, it leads to hepatocellular carcinoma (HCC) over a long period of time. Since a large proportion of patients with HCC are accompanied by cirrhosis, it is considered to be an important factor in the diagnosis of liver cancer. This is because cirrhosis leads to an irreversible harmful effect, but the early stages of chronic liver disease could be reversed to a healthy state. Therefore, the discovery of biomarkers that could identify the early stages of chronic liver disease is important to prevent serious liver damage. Biomarker discovery at liver cancer and cirrhosis has enhanced the development of sequencing technology. Next generation sequencing (NGS) is one of the representative technical innovations in the biological field in the recent decades and it is the most important thing to design for research on what type of sequencing methods are suitable and how to handle the analysis steps for data integration. In this review, we comprehensively summarized NGS techniques for identifying genome, transcriptome, DNA methylome and 3D/4D chromatin structure, and introduced framework of processing data set and integrating multi-omics data for uncovering biomarkers.

Implementation of Proteomics for Cancer Research: Past, Present, and Future

  • Karimi, Parisa;Shahrokni, Armin;Nezami Ranjbar, Mohammad R.
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.15 no.6
    • /
    • pp.2433-2438
    • /
    • 2014
  • Cancer is the leading cause of the death, accounts for about 13% of all annual deaths worldwide. Many different fields of science are collaborating together studying cancer to improve our knowledge of this lethal disease, and find better solutions for diagnosis and treatment. Proteomics is one of the most recent and rapidly growing areas in molecular biology that helps understanding cancer from an omics data analysis point of view. The human proteome project was officially initiated in 2008. Proteomics enables the scientists to interrogate a variety of biospecimens for their protein contents and measure the concentrations of these proteins. Current necessary equipment and technologies for cancer proteomics are mass spectrometry, protein microarrays, nanotechnology and bioinformatics. In this paper, we provide a brief review on proteomics and its application in cancer research. After a brief introduction including its definition, we summarize the history of major previous work conducted by researchers, followed by an overview on the role of proteomics in cancer studies. We also provide a list of different utilities in cancer proteomics and investigate their advantages and shortcomings from theoretical and practical angles. Finally, we explore some of the main challenges and conclude the paper with future directions in this field.

C4orf47 is a Novel Prognostic Biomarker and Correlates with Infiltrating Immune Cells in Hepatocellular Carcinoma

  • Hye-Ran Kim;Choong Won Seo;Sang Jun Han;Jongwan Kim
    • Biomedical Science Letters
    • /
    • v.29 no.1
    • /
    • pp.11-25
    • /
    • 2023
  • In hepatocellular carcinoma (HCC), chromosome 4 open-reading frame 47 (C4orf47) has not been so far investigated for its prognostic value or association with infiltrating immune cells. We performed bioinformatics analysis on HCC data and analyzed the data using online databases such as TIMER, UALCAN, Kaplan-Meier plotter, LinkedOmics, and GEPIA2. We found that C4orf47 expression in HCC was higher compared to normal tissues. High C4orf47 expression was associated with a worse prognosis in HCC. The correlation between C4orf47 and infiltrating immune cells is positively associated with CD4+T cells, B cells, neutrophils, macrophages, and dendritic cells in HCC. Moreover, high C4orf47 expression was correlated with a poor prognosis of infiltrating immune cells. Analysis of C4orf47 gene co-expression networks revealed that 12501 genes were positively correlated with C4orf47, whereas 7200 genes were negatively correlated. The positively related genes of C4orf47 are associated with a high hazard ratio in different types of cancer, including HCC. Regarding the biological functions of C4orf47 gene, it mainly regulates RNA metabolic process, DNA replication, and cell cycle. The C4orf47 gene may play a prognostic role by regulating the global transcriptome process in HCC. Our findings demonstrate that high C4orf47 expression correlates with poor prognosis and tumor-infiltrating immune cells in HCC. We suggest that C4orf47 is a novel prognostic biomarker and potential immune therapeutic target for HCC.

Eco-toxicogenomics Research with Fish

  • Park, Kyeong-Seo;Kim, Han-Na;Gu, Man-Bock
    • Molecular & Cellular Toxicology
    • /
    • v.1 no.1
    • /
    • pp.17-25
    • /
    • 2005
  • There are some critical drawbacks in the use of biomarkers for a global assessment of the toxicological impacts many chemicals and environmental pollutants have, primarily due to an individual biomarker's specificity for an explicit chemical or toxicant. In other words, the biomarker-based assessment methodology used to analyze toxicological effects lacks a high-throughput capability. Therefore, eco-toxicogenomics, or the study of toxicogenomics with organisms present within a given environmental locale, has recently been introduced with the advent of the so-called "-omics" era, which began with the creation of microarray technologies. Fish are comparable with humans in their toxicological responses and thus data from toxicogenomic studies performed with fish could be applied, with appropriate tools and implementation protocols, to the evaluation of environments where human or animal health is of concern. At present, there have been very active research streams for developing expression sequence tag (EST) databases (DBs) for zebra fish and rainbow trout. Even though few reports involve toxicogenomic studies with fish, a few groups have successfully fabricated and used cDNA microarrays or oligo DNA chips when studying the toxicological impacts of hypoxia or some toxicants with fish. Furthermore, it is strongly believed that this technology can also be implemented with non-model fish. With the standardization of DNA microarray technologies and ample progress in bioinformatics and proteomic technologies, data obtained from DNA microarray technologies offer not only multiple biomarker assays or an analysis of gene expression profiles, but also a means of elucidating gene networking, gene-gene relations, chemical-gene interactions, and chemical-chemical relationships. Accordingly, the ultimate target of eco-toxicogenomics should be to predict and map the pathways of stress propagation within an organism and to analyze stress networking.

Application of Toxicogenomic Technology for the Improvement of Risk Assessment

  • Hwang, Myung-Sil;Yoon, Eun-Kyung;Kim, Ja-Young;Son, Bo-Kyung;Jang, Dong-Deuk;Yoo, Tae-Moo
    • Molecular & Cellular Toxicology
    • /
    • v.4 no.3
    • /
    • pp.260-266
    • /
    • 2008
  • Recently, there has been scientific discussion on the utility of -omics techniques such as genomics, proteomics, and metabolomics within toxicological research and mechanism-based risk assessment. Toxicogenomics is a novel approach integrating the expression analysis of genes (genomic) or proteins (proteomic) with traditional toxicological methods. Since 1999, the toxicogenomic approach has been extensively applied for regulatory purposes in order to understand the potential toxic mechanisms that result from chemical compound exposures. Therefore, this article's purpose was to consider the utility of toxicogenomic profiles for improved risk assessment, explore the current limitations in applying toxicogenomics to regulation, and finally, to rationalize possible avenues to resolve some of the major challenges. Based on many recent works, the significant impact toxicogenomic techniques would have on human health risk assessment is better identification of toxicity pathways or mode-of-actions (MOAs). In addition, the application of toxicogenomics in risk assessment and regulation has proven to be cost effective in terms of screening unknown toxicants prior to more extensive and costly experimental evaluation. However, to maximize the utility of these techniques in regulation, researchers and regulators must resolve many parallel challenges with regard to data collection, integration, and interpretation. Furthermore, standard guidance has to be prepared for researchers and assessors on the scientifically appropriate use of toxicogenomic profiles in risk assessment. The National Institute of Toxicological Research (NITR) looks forward to an ongoing role as leader in addressing the challenges associated with the scientifically sound use of toxicogenomics data in risk assessment.