• Title/Abstract/Keywords: Next generation sequencing (NGS)

De novo assembly of a large volume of genome using NGS data

  • 원정임;홍상균;공진화;허선;윤지희
    • Korean Institute of Information Scientists and Engineers (KIISE): Conference Proceedings
    • /
    • Proceedings of the KIISE Korea Computer Congress 2012, Vol.39 No.1(C)
    • /
    • pp.25-27
    • /
    • 2012
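  • De novo assembly reconstructs reads, without a reference sequence, into a sequence presumed to be the original sequence, using only the base sequence information of the reads. Recent NGS (Next Generation Sequencing) technology can generate large volumes of reads far more easily and at low cost, and much research has made use of it. However, research on de novo assembly using NGS read data remains very limited both in Korea and abroad. The reason is that de novo assembly with NGS read data requires a great deal of time and space because of the large data volume and the complex data structures and processing involved, and because diverse analysis tools and know-how have not yet been sufficiently developed. In this study, we verify the effectiveness and accuracy of assembly using NGS read data. We also propose using read alignment against a similar species to address the time and space overhead of de novo assembly.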
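
As a rough illustration of what reference-free reconstruction of reads means, the following is a minimal sketch of a greedy overlap-merge assembler. It is not the method of the paper above; the function names `overlap` and `greedy_assemble`, the `min_len` threshold, and the toy reads are all assumptions made for this example, and real assemblers must also handle sequencing errors, repeats, and reverse complements.

```python
def overlap(a: str, b: str, min_len: int = 3) -> int:
    """Length of the longest suffix of `a` that equals a prefix of `b`.

    Returns 0 when no overlap of at least `min_len` bases exists.
    """
    for k in range(min(len(a), len(b)), min_len - 1, -1):
        if a[-k:] == b[:k]:
            return k
    return 0


def greedy_assemble(reads: list[str], min_len: int = 3) -> list[str]:
    """Repeatedly merge the pair of sequences with the largest overlap."""
    contigs = list(dict.fromkeys(reads))  # drop exact duplicates, keep order
    while True:
        best_len, best_i, best_j = 0, None, None
        for i, a in enumerate(contigs):
            for j, b in enumerate(contigs):
                if i != j:
                    k = overlap(a, b, min_len)
                    if k > best_len:
                        best_len, best_i, best_j = k, i, j
        if best_len == 0:
            return contigs  # nothing left to merge
        merged = contigs[best_i] + contigs[best_j][best_len:]
        contigs = [c for idx, c in enumerate(contigs)
                   if idx not in (best_i, best_j)] + [merged]


# Hypothetical toy reads sampled from the sequence ATGGCGTACCTGA
reads = ["ATGGCGT", "GCGTACC", "TACCTGA"]
contigs = greedy_assemble(reads)
```

The all-pairs comparison in each round is quadratic in the number of reads, which hints at why the abstract describes time and space overhead as the main obstacle for large NGS data sets.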

Digestion efficiency differences of restriction enzymes frequently used for genotype-by-sequencing technology

  • Chung, Yong Suk;Jun, Taehwan;Kim, Changsoo
    • Korean Journal of Agricultural Science
    • /
    • Vol. 44, No. 3
    • /
    • pp.318-324
    • /
    • 2017
  • With the development of next-generation sequencing (NGS), a cutting-edge technology, genotype-by-sequencing (GBS) became available at a low cost per sample. GBS makes it possible to customize the process of library preparation to obtain high-quality single nucleotide polymorphisms (SNPs) in the most efficient way. However, a GBS library is hard to construct because the concentration of each reagent and the set-up must be fine-tuned. The major reason for this is the presence of undigested genomic DNA (gDNA), because the efficiency of restriction enzymes differs among species for unknown reasons. Therefore, this proof-of-concept study demonstrates the unpredictable patterns of enzyme digestion in various plants in order to make readers aware of the caution needed when choosing restriction enzymes for their GBS library preparations. Indeed, no pattern was found between the digestibility of gDNA samples and the restriction enzymes in the current study. We suggest that more data should be accumulated on this matter to help researchers who want to apply GBS technologies in a variety of genetic approaches.

Comparison of Distributed and Parallel NGS Data Analysis Methods based on Cloud Computing

  • Kang, Hyungil;Kim, Sangsoo
    • International Journal of Contents
    • /
    • Vol. 14, No. 1
    • /
    • pp.34-38
    • /
    • 2018
  • With the rapid growth of genomic data, new requirements have emerged that are difficult to handle with existing big data storage and analysis techniques. Regardless of the size of an organization performing genomic data analysis, it is becoming increasingly difficult for an institution to build a computing environment for storing and analyzing genomic data. Recently, cloud computing has emerged as a computing environment that meets these new requirements. In this paper, we analyze and compare existing distributed and parallel NGS (Next Generation Sequencing) analysis methods based on cloud computing environments for future research.

Current Status and Prospect of Wheat Functional Genomics using Next Generation Sequencing

  • 최창현;윤영미;손재한;조성우;강천식
    • Korean Journal of Breeding Science
    • /
    • Vol. 50, No. 4
    • /
    • pp.364-377
    • /
    • 2018
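  • The application of next-generation sequencing technology is rapidly expanding knowledge of plant genomics and thereby advancing functional gene research. In particular, progress in wheat functional genomics seemed infeasible with conventional sequencing technologies. However, advances in NGS have enabled not only the completion of a high-quality RefSeq for common wheat but also the resequencing of diverse wheat lines. With the high-quality genetic information thus obtained and genetic resources whose genetic polymorphisms have been characterized, wheat functional genomics research is now entering a new phase. Advances in NGS technology and reverse genetics will make it possible to analyze the genetic diversity of wild and cultivated wheat lines spread across the world, and will greatly help deepen understanding of the genetics and evolution of wheat. The combination of NGS technology and bioinformatics will accelerate wheat functional genomics research, which has lagged behind that of other crops. The era of wheat breeding based on functional genomics research is approaching, as it has for Arabidopsis and rice.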

Application of genotyping-by-sequencing (GBS) in plant genome using bioinformatics pipeline

  • Lee, Yun Gyeong;Kang, Chon-Sik;Kim, Changsoo
    • Korean Society of Crop Science: Conference Proceedings
    • /
    • Korean Society of Crop Science 2017, 9th Asian Crop Science Association conference
    • /
    • pp.58-58
    • /
    • 2017
  • The advent of next generation sequencing technology has made plenty of sequencing data available for agriculturally relevant plant species. For most crop species, it is too expensive to obtain whole genome sequence data with sufficient coverage. Thus, many approaches have been developed to bring down the cost of NGS. Genotyping-by-sequencing (GBS) is a cost-effective genotyping method for complex genetic populations. GBS can be used for genomic selection (GS), genome-wide association studies (GWAS), and the construction of haplotype and genetic linkage maps in a variety of plant species. For dealing efficiently with plant GBS data, the TASSEL-GBS pipeline is one of the most popular choices among researchers. TASSEL-GBS is a Java-based software package for obtaining genotyping data from raw GBS sequences. Here, we describe the application of GBS and the TASSEL-GBS bioinformatics pipeline for analyzing plant genetics data.

Microbial Forensics: Comparison of MLVA Results According to NGS Methods, and Forensic DNA Analysis Using MLVA

  • 윤형석;이승호;임승현;이대상;구세훈;김정은;정주환;김성주;허경행;송동현
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • Vol. 27, No. 4
    • /
    • pp.507-515
    • /
    • 2024
  • Microbial forensics is a scientific discipline for analyzing evidence related to biological crimes by identifying the origin of microorganisms. Multiple locus variable number tandem repeat analysis (MLVA) is one of the microbiological analysis methods used to specify subtypes within a species based on the number of tandem repeats in the genome, and advances in next generation sequencing (NGS) technology have enabled in silico analysis of full-length whole genome sequences. In this paper, we analyzed unknown samples provided by the Robert Koch Institute (RKI) through the external quality assessment exercise (EQAE) project of the United Nations Secretary-General's Mechanism (UNSGM), in which we officially participated in 2023. We confirmed that the 3 unknown samples were B. anthracis through nucleic acid isolation and genetic sequence analysis. MLVA results for 32 loci of B. anthracis were analysed using genome sequences obtained from NGS (NextSeq and MinION) and Sanger sequencing. MLVA typing on the short-read NGS platform (NextSeq) showed a high probability of assembly errors when the size of the tandem repeats was greater than 200 bp, while the long-read NGS platform (MinION) showed higher accuracy than NextSeq, although insertions and deletions were observed. We also showed that hybrid assembly can correct most indel errors caused by MinION. Based on the MLVA results, genetic identification was performed by comparison against the 2,975 published MLVA databases of B. anthracis, and the MLVA results of 10 strains were identical to those of the 3 unknown samples. Whole genome alignment of the 10 strains and the 3 unknown samples identified all samples as B. anthracis strain A4564, which is associated with injectional anthrax isolates in heroin users.
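
The idea of in silico MLVA typing, counting tandem repeat copies per locus and comparing the resulting profiles, can be sketched as follows. This is only an illustration under simplifying assumptions: the locus names, repeat units, and sequences are made up, the helper names `max_tandem_copies` and `mlva_profile` are hypothetical, and the sketch takes the longest uninterrupted run of a repeat unit rather than locating loci by flanking primers as practical MLVA schemes typically do.

```python
import re


def max_tandem_copies(sequence: str, unit: str) -> int:
    """Copy number of the longest uninterrupted run of `unit` in `sequence`."""
    run = re.compile(f"(?:{re.escape(unit)})+")
    longest = max((len(m.group(0)) for m in run.finditer(sequence)), default=0)
    return longest // len(unit)


def mlva_profile(sequence: str, loci: dict[str, str]) -> dict[str, int]:
    """Map each locus name to the repeat copy number found in `sequence`."""
    return {name: max_tandem_copies(sequence, unit) for name, unit in loci.items()}


# Hypothetical loci and toy sequences (not real B. anthracis data)
loci = {"locusA": "ACGTTA", "locusB": "GGC"}
sample = "TTGA" + "ACGTTA" * 4 + "CATCAT" + "GGC" * 3 + "TTAA"
reference = "CCTA" + "ACGTTA" * 4 + "TCAT" + "GGC" * 3 + "AAT"

sample_profile = mlva_profile(sample, loci)
same_type = sample_profile == mlva_profile(reference, loci)
```

Because the copy number is read directly from the assembled sequence, any assembly error inside a long repeat array changes the profile, which is consistent with the abstract's observation about short-read assemblies of repeats longer than 200 bp.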

IVAG: An Integrative Visualization Application for Various Types of Genomic Data Based on R-Shiny and the Docker Platform

  • Lee, Tae-Rim;Ahn, Jin Mo;Kim, Gyuhee;Kim, Sangsoo
    • Genomics & Informatics
    • /
    • Vol. 15, No. 4
    • /
    • pp.178-182
    • /
    • 2017
  • Next-generation sequencing (NGS) technology has become a trend in the genomics research area. There are many software programs and automated pipelines to analyze NGS data, which can ease the pain for traditional scientists who are not familiar with computer programming. However, downstream analyses, such as finding differentially expressed genes or visualizing linkage disequilibrium maps and genome-wide association study (GWAS) data, still remain a challenge. Here, we introduce a dockerized web application written in R using the Shiny platform to visualize pre-analyzed RNA sequencing and GWAS data. In addition, we have integrated a genome browser based on the JBrowse platform and an automated intermediate parsing process required for custom track construction, so that users can easily build and navigate their personal genome tracks with in-house datasets. This application will help scientists perform a series of downstream analyses and obtain a more integrative understanding of various types of genomic data by interactively visualizing them with customizable options.

Characterization of Microbial Community Changes in Process Affected by Physicochemical Parameters During Liquid Fertilization of Swine Waste

  • Shin, Mi-Na;Kim, Jin-Won;Shim, Jaehong;Koo, Heung-Hoe;Lee, Jai-Young;Cho, Min;Oh, Byung-Taek
    • Korean Journal of Soil Science and Fertilizer
    • /
    • Vol. 46, No. 3
    • /
    • pp.173-181
    • /
    • 2013
  • Livestock wastes are considered major environmental pollutants because they contain high concentrations of organic materials. In 2001, the Environmental Department reported that the number of livestock farmers was increasing by 5.1% per year, which resulted in a gradual increase in livestock waste generation. The direct disposal of livestock wastes creates several environmental problems. Thus, several countries have banned the disposal of livestock wastes in the environment, including aquatic systems. Recently, aeration-based liquid fertilization has been considered a potential way to dispose of livestock wastes. In this study, next generation sequencing (NGS) analysis was used to understand the microbial community changes during liquid fertilization of livestock wastes. The microbial community was compared with physicochemical analyses of the liquid fertilizer, such as $BOD_5$, $COD_{Mn}$, pH, N (Nitrogen), P (Phosphorus), and K (Potassium). The physicochemical parameters and bacterial community results pave the way for producing effective livestock-based fertilizer. By comparing the physical characteristics of the manure with microbial community changes, it is possible to optimize the conditions for producing effective fertilizer.

From genome sequencing to the discovery of potential biomarkers in liver disease

  • Oh, Sumin;Jo, Yeeun;Jung, Sungju;Yoon, Sumin;Yoo, Kyung Hyun
    • BMB Reports
    • /
    • Vol. 53, No. 6
    • /
    • pp.299-310
    • /
    • 2020
  • Chronic liver disease progresses through several stages (fatty liver, steatohepatitis, and cirrhosis) and eventually leads to hepatocellular carcinoma (HCC) over a long period of time. Since a large proportion of patients with HCC also have cirrhosis, cirrhosis is considered an important factor in the diagnosis of liver cancer. This is because cirrhosis causes irreversible harm, whereas the early stages of chronic liver disease can be reversed to a healthy state. Therefore, the discovery of biomarkers that can identify the early stages of chronic liver disease is important for preventing serious liver damage. Biomarker discovery in liver cancer and cirrhosis has advanced with the development of sequencing technology. Next generation sequencing (NGS) is one of the representative technical innovations in biology in recent decades, and in designing research it is most important to decide which types of sequencing methods are suitable and how to handle the analysis steps for data integration. In this review, we comprehensively summarize NGS techniques for profiling the genome, transcriptome, DNA methylome, and 3D/4D chromatin structure, and introduce a framework for processing data sets and integrating multi-omics data to uncover biomarkers.

Bacterial Diversity in Soil Surround Subterranean Termites-Damaged Wooden Buildings in Seonamsa Temple and Effect of the Termites on Bacterial Diversity in Humus Soil

  • Kim, Young Hee;Lim, Boa;Lee, Jeung Min;Hong, Jin Young;Kim, Soo Ji;Park, Ji Hee
    • Journal of Conservation Science
    • /
    • Vol. 37, No. 4
    • /
    • pp.357-361
    • /
    • 2021
  • In order to determine the changes in the microbial community due to termites, soil microorganisms surrounding the termites were investigated. First, bacterial communities from soil with termites collected at Seonamsa temple, Suncheon city, Korea were compared by next-generation sequencing (NGS, Illumina MiSeq). The bacterial composition of soil from Daeungjeon without termites and the soil from Josadang, Palsangjeon, and Samjeon with termites were compared. Next, the bacterial composition of these soils was also compared with that of humus soil cultured with termites. A total of 71,942 and 72,429 high-quality sequence reads were identified in Seonamsa temple's soil and humus soil, respectively. The dominant phyla in the collected Seonamsa temple's soil were Proteobacteria (27%), Firmicutes (24%) and Actinobacteria (21%), whereas those in the humus soil were Bacteroidetes (56%) and Proteobacteria (37%). A two-dimensional principal coordinate analysis plot of the operational taxonomic unit compositions of the soil samples confirmed that the samples separated into soil with and without termites, and in particular that the Proteobacteria phylum was more abundant in humus soil with termites than in humus soil without termites.