• Title/Summary/Keyword: NGS

Search Result 273, Processing Time 0.027 seconds

Digestion efficiency differences of restriction enzymes frequently used for genotype-by-sequencing technology

  • Chung, Yong Suk;Jun, Taehwan;Kim, Changsoo
    • Korean Journal of Agricultural Science
    • /
    • v.44 no.3
    • /
    • pp.318-324
    • /
    • 2017
  • With the development of next-generation sequencing (NGS), a cutting-edge technology, genotype-by-sequencing (GBS) became available at a low cost per sample. GBS makes it possible to customize the process of library preparation to obtain high-quality single nucleotide polymorphisms (SNPs) in the most efficient way. However, a GBS library is hard to construct due to fine-tuning of concentration of each reagent and set-up. The major reason for this is the presence of undigested genomic DNA (gDNA) owing to the efficiency of different restriction enzymes for different species with unknown reasons. Therefore, this proof-concept study is to demonstrate the unpredictable patterns of enzyme digestion from various plants in order to make the reader aware of the caution needed when choosing restriction enzymes for their GBS library preparations. Indeed, no pattern was found for the digestibility of gDNA samples and restriction enzymes in the current study. We suggest that more data should be accumulated on this matter to help researchers who want to apply GBS technologies in a variety of genetic approaches.

Genome-Wide SNP Calling Using Next Generation Sequencing Data in Tomato

  • Kim, Ji-Eun;Oh, Sang-Keun;Lee, Jeong-Hee;Lee, Bo-Mi;Jo, Sung-Hwan
    • Molecules and Cells
    • /
    • v.37 no.1
    • /
    • pp.36-42
    • /
    • 2014
  • The tomato (Solanum lycopersicum L.) is a model plant for genome research in Solanaceae, as well as for studying crop breeding. Genome-wide single nucleotide polymorphisms (SNPs) are a valuable resource in genetic research and breeding. However, to do discovery of genome-wide SNPs, most methods require expensive high-depth sequencing. Here, we describe a method for SNP calling using a modified version of SAMtools that improved its sensitivity. We analyzed 90 Gb of raw sequence data from next-generation sequencing of two resequencing and seven transcriptome data sets from several tomato accessions. Our study identified 4,812,432 non-redundant SNPs. Moreover, the workflow of SNP calling was improved by aligning the reference genome with its own raw data. Using this approach, 131,785 SNPs were discovered from transcriptome data of seven accessions. In addition, 4,680,647 SNPs were identified from the genome of S. pimpinellifolium, which are 60 times more than 71,637 of the PI212816 transcriptome. SNP distribution was compared between the whole genome and transcriptome of S. pimpinellifolium. Moreover, we surveyed the location of SNPs within genic and intergenic regions. Our results indicated that the sufficient genome-wide SNP markers and very sensitive SNP calling method allow for application of marker assisted breeding and genome-wide association studies.

A Fragmentation and Search Method of Query Document for Partially Plagiarized Section Detection (부분표절구간 검출을 위한 질의문서의 분할 및 탐색 기법)

  • Ock, Chang-Seok;Seo, Jong-Kyu;Cho, Hwan-Gue
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2012.11a
    • /
    • pp.586-589
    • /
    • 2012
  • 표절과 관련된 이슈가 주목받고 있는 상황에서 표절을 검출하는 방법에 대한 연구가 활발히 진행되고 있다. 일반적으로 표절구간 검출을 위해 복잡한 자연어처리와 같은 의미론적 접근방법이 아닌 비교적 단순한 어휘기반의 문자열 처리 방법을 사용한다. 대표적인 방법으로는 지문법 (Fingerprinting)과 서열정렬 (Sequence alignment) 등이 있다. 하지만 이 방법들을 이용하여 대용량 문서에 대한 표절검사를 수행하기에는 시공간적 복잡도의 문제가 발생한다. 본 논문에서는 이러한 단점을 극복하기 위해 NGS (Next Generation Sequencing)에서 사용하는 BWT (Burrows-Wheeler Transform)[1]를 이용한 탐색방법을 응용한다. 또한 부분표절구간을 검출하고 정확도를 향상시키기 위해 질의문서를 분할하여 작은 조각으로 만든 뒤, 조각들에 대한 질의탐색을 수행한다. 본 논문에서는 질의문서를 분할하는 두 가지 방법을 소개한다. 두 가지 방법은 k-mer analysis를 이용한 방법과 random-split analysis를 이용한 방법으로, 각 방법의 장단점을 실험을 통해 분석하고 실제 부분표절구간의 검출 정확도를 측정하였다.

Identification of pathways and genes associated with cerebral palsy

  • Zhu, Qingwen;Ni, Yufei;Wang, Jing;Yin, Honggang;Zhang, Qin;Zhang, Lingli;Bian, Wenjun;Liang, Bo;Kong, Lingyin;Xuan, Liming;Lu, Naru
    • Genes and Genomics
    • /
    • v.40 no.12
    • /
    • pp.1339-1349
    • /
    • 2018
  • Cerebral palsy (CP) is a non-progressive neurological disease, of which susceptibility is linked to genetic and environmental risk factors. More and more studies have shown that CP might be caused by multiple genetic factors, similar to other neurodevelopmental disorders. Due to the high genetic heterogeneity of CP, we focused on investigating related molecular pathways. Ten children with CP were collected for whole-exome sequencing by next-generation sequencing (NGS) technology. Customized processes were used to identify potential pathogenic pathways and variants. Three pathways (axon guidance, transmission across chemical synapses, protein-protein interactions at synapses) with twenty-three genes were identified to be highly correlated with CP. This study showed that the three pathways associated with CP might be the molecular mechanism of pathogenesis. These findings could provide useful clues for developing pathway-based pharmacotherapies. Further studies are required to confirm potential roles for these pathways in the pathogenesis of CP.

PCR-based markers developed by comparison of complete chloroplast genome sequences discriminate Solanum chacoense from other Solanum species

  • Kim, Soojung;Park, Tae-Ho
    • Journal of Plant Biotechnology
    • /
    • v.46 no.2
    • /
    • pp.79-87
    • /
    • 2019
  • One of wild diploid Solanum species, Solanum chacoense, is one of the excellent resources for potato breeding because it is resistant to several important pathogens, but the species is not sexually compatible with potato (S. tuberosum) causing the limitation of sexual hybridization between S. tuberosum and S. chacoense. Therefore, diverse traits regarding resistance from the species can be introgressed into potato via somatic hybridization. After cell fusion, the identification of fusion products is crucial with molecular markers. In this study, S. chacoense specific markers were developed by comparing the chloroplast genome (cpDNA) sequence of S. chacoense obtained by NGS (next-generation sequencing) technology with those of five other Solanum species. A full length of the cpDNA sequence is 155,532 bp and its structure is similar to other Solanum species. Phylogenetic analysis resulted that S. chacoense is most closely located with S. commersonii. Sequence alignment with cpDNA sequences of six other Solanum species identified two InDels and 37 SNPs specific sequences in S. chacoense. Based on these InDels and SNPs regions, four markers for distingushing S. chacoense from other Solanum species were developed. These results obtained in this research could help breeders select breeding lines and facilitate breeding using S. chacoense in potato breeding.

Deep sequencing of B cell receptor repertoire

  • Kim, Daeun;Park, Daechan
    • BMB Reports
    • /
    • v.52 no.9
    • /
    • pp.540-547
    • /
    • 2019
  • Immune repertoire is a collection of enormously diverse adaptive immune cells within an individual. As the repertoire shapes and represents immunological conditions, identification of clones and characterization of diversity are critical for understanding how to protect ourselves against various illness such as infectious diseases and cancers. Over the past several years, fast growing technologies for high throughput sequencing have facilitated rapid advancement of repertoire research, enabling us to observe the diversity of repertoire at an unprecedented level. Here, we focus on B cell receptor (BCR) repertoire and review approaches to B cell isolation and sequencing library construction. These experiments should be carefully designed according to BCR regions to be interrogated, such as heavy chain full length, complementarity determining regions, and isotypes. We also highlight preprocessing steps to remove sequencing and PCR errors with unique molecular index and bioinformatics techniques. Due to the nature of massive sequence variation in BCR, caution is warranted when interpreting repertoire diversity from error-prone sequencing data. Furthermore, we provide a summary of statistical frameworks and bioinformatics tools for clonal evolution and diversity. Finally, we discuss limitations of current BCR-seq technologies and future perspectives on advances in repertoire sequencing.

Bacterial Diversity in Soil Surround Subterranean Termites-Damaged Wooden Buildings in Seonamsa Temple and Effect of the Termites on Bacterial Diversity in Humus Soil

  • Kim, Young Hee;Lim, Boa;Lee, Jeung Min;Hong, Jin Young;Kim, Soo Ji;Park, Ji Hee
    • Journal of Conservation Science
    • /
    • v.37 no.4
    • /
    • pp.357-361
    • /
    • 2021
  • In order to determine the changes in microbial community due to termites, soil microorganisms surrounding the termites were investigated. First, bacterial communities from soil with termites collected at Seonamsa temple, Suncheon city, Korea were compared by next-generation sequencing (NGS, Illumina Miseq). The bacterial composition of soil from Daeungjeon without termites and the soil from Josadang, Palsangjeon, and Samjeon with termites were compared. Next, the bacterial composition of these soils was also compared with that of humus soil cultured with termites. A total high-quality sequences of 71,942 and 72,429 reads were identified in Seonamsa temple's soil and humus soil, respectively. The dominant phyla in the collected Seonamsa temple's soil were Proteobacteria (27%), Firmicutes (24%) and Actinobacteria (21%), whereas those in the humus soil were Bacteriodetes (56%) and Proteobacteria (37%). Using a two-dimensional plot to explain the principal coordinate analysis of operational taxonomic unit compositions of the soil samples, it was confirmed that the samples were divided into soil with and without termites, and it was especially confirmed that the Proteobacteria phylum was increased in humus soil with termites than in humus soil without termites.

Community Structure of Ectomycorrhizal Fungal communities Colonizing Quercus spp. in Limestone Areas of Korea (석회암 지대 참나무속 식물에 공생하는 외생균근균의 군집구조)

  • Lee, Jong-Chul;Park, Hyeok;Eom, Ahn-Heum
    • The Korean Journal of Mycology
    • /
    • v.49 no.1
    • /
    • pp.109-118
    • /
    • 2021
  • In this study, we analyzed the diversity of ectomycorrhizal (ECM) fungal communities of Quercus spp. roots in the limestone area. We identified 45 generd of ECM using next generation sequencing (NGS) analysis. Soil chemical composition analysis confirmed soil pH, substitution calcium concentration, total nitrogen content, organic phosphate, and organic matter content. Shannon's Index was calculated according to the changes in soil chemical composition. The results of cluster analysis showed that Sebacina, Tomentella, Tuber, Densospora, Inocybe, Suillus, and Piloderma were the main genera of symbiotic ECM fungi that thrived in soil with high pH and calcium content.

High-accuracy quantitative principle of a new compact digital PCR equipment: Lab On An Array

  • Lee, Haeun;Lee, Cherl-Joon;Kim, Dong Hee;Cho, Chun-Sung;Shin, Wonseok;Han, Kyudong
    • Genomics & Informatics
    • /
    • v.19 no.3
    • /
    • pp.34.1-34.6
    • /
    • 2021
  • Digital PCR (dPCR) is the third-generation PCR that enables real-time absolute quantification without reference materials. Recently, global diagnosis companies have developed new dPCR equipment. In line with the development, the Lab On An Array (LOAA) dPCR analyzer (Optolane) was launched last year. The LOAA dPCR is a semiconductor chip-based separation PCR type equipment. The LOAA dPCR includes Micro Electro Mechanical System that can be injected by partitioning the target gene into 56 to 20,000 wells. The amount of target gene per wells is digitized to 0 or 1 as the number of well gradually increases to 20,000 wells because its principle follows Poisson distribution, which allows the LOAA dPCR to perform precise absolute quantification. LOAA determined region of interest first prior to dPCR operation. To exclude invalid wells for the quantification, the LOAA dPCR has applied various filtering methods using brightness, slope, baseline, and noise filters. As the coronavirus disease 2019 has now spread around the world, needs for diagnostic equipment of point of care testing (POCT) are increasing. The LOAA dPCR is expected to be suitable for POCT diagnosis due to its compact size and high accuracy. Here, we describe the quantitative principle of the LOAA dPCR and suggest that it can be applied to various fields.

A parallel SNP detection algorithm for RNA-Seq data (RNA 시퀀싱 데이터를 이용한 병렬 SNP 추출 알고리즘)

  • Kim, Deok-Keun;Lee, Deok-Hae;Kong, Jin-Hwa;Lee, Un-Joo;Yoon, Jee-Hee
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2011.04a
    • /
    • pp.1260-1263
    • /
    • 2011
  • 최근 차세대 시퀀싱 (Next Generation Sequencing, NGS) 기술이 발전하면서 DNA, RNA 등의 시퀀싱 데이터를 이용한 유전체 분석 방식에 관한 연구가 활발히 이루어지고 있다. 차세대 시퀀싱 데이터를 이용한 유전체 분석 방식은 마이크로어레이 혹은 EST/cDNA 데이터를 이용한 기존의 분석 방식에 비하여 비용이 적게 들고 정확한 결과를 얻을 수 있다는 장점이 있다. 그러나 이 들 DNA, RNA 시퀀싱 데이터는 각 시퀀스의 길이가 짧고 전체 용량은 매우 커서 이 들 데이터로부터 정확한 분석 결과를 추출하는 데에 많은 어려움이 있다. 본 연구에서는 클라우드 컴퓨팅 기술을 기반으로 하여 대용량의 RNA 시퀀싱 데이터를 고속으로 처리하는 병렬 SNP 추출 알고리즘을 제안한다. 전체 게놈 데이터 중 유전자 영역만을 high coverage로 시퀀싱하여 얻어지는 RNA 시퀀싱 데이터는 유전자 변이 추출을 목적으로 분석되며, SNP(Single Nucleotide Polymorphism)와 같은 유전자 변이는 질병의 원인 규명 및 치료법 개발에 직접 이용된다. 제안된 알고리즘은 동시에 실행되는 다수의 Map/Reduce 함수에 의해서 대규모 RNA 시퀀스를 병렬로 처리하며, 레퍼런스 시퀀스에 매핑된 각 염기의 출현 빈도와 품질점수를 이용하여 SNP를 추출한다. 또한 이 들 SNP 추출 결과에 대한 시각적 분석 도구를 제공하여 SNP 추출 과정 및 근거를 시각적으로 확인/검증할 수 있도록 지원한다.