• 제목/요약/키워드: draft genome sequencing

검색결과 29건 처리시간 0.026초

개 회충 게놈 응용 사례에서 공개용 분석 툴을 사용한 드래프트 게놈 어셈블리 생성 (Workflow for Building a Draft Genome Assembly using Public-domain Tools: Toxocara canis as a Case Study)

  • 원정임;공진화;허선;윤지희
    • 정보과학회 컴퓨팅의 실제 논문지
    • /
    • 제20권9호
    • /
    • pp.513-518
    • /
    • 2014
  • NGS 기술의 발달로 시퀀싱 비용이 급격히 하락됨에 따라 대규모 크기의 유전체 염기 서열해독을 소규모의 실험실에서 수행할 수 있게 되었다. 디노버 어셈블리는 표준 유전체가 없는 새로운 종을 시퀀싱하는 경우 리드들의 염기 서열 정보를 이용하여 재구성함으로써 원래의 전체 시퀀스를 복원하는 것이다. 최근 이와 관련된 많은 연구 결과가 보고되고 있으나, 충분한 분석 노하우와 명확한 가이드라인 등이 공개되어 있지 않기 때문에 이들 연구에서 제시하는 동일한 어셈블리 수행 과정 및 분석 툴들을 사용하더라도 만족할만한 수준의 어셈블리 결과를 얻지 못하는 경우가 발생한다. 본 연구에서는 이러한 문제점을 해결하기 위하여 NGS 기술과 디노버 어셈블리 기술을 이용하여 아직 밝혀지지 않은 생물체의 전체 DNA의 염기 서열을 밝히기 위한 일련의 과정들을 단계별로 소개하고, 각 단계에서 필요로 하는 공개용 분석 툴의 장단점을 분석하여 제시한다. 이러한 과정별 단계를 구체적으로 설명하기 위하여 본 연구에서는 350Mbp 크기의 개 회충 게놈을 응용 사례로 사용한다. 또한 디노버 어셈블리 과정을 통해 새롭게 어셈블리된 시퀀스와 다른 유사 종과의 상동성 분석을 수행하여 어셈블리된 시퀀스에서의 유전자 영역 추출과 추출된 유전자의 기능을 예측한다.

Draft Genome of Toxocara canis, a Pathogen Responsible for Visceral Larva Migrans

  • Kong, Jinhwa;Won, Jungim;Yoon, Jeehee;Lee, UnJoo;Kim, Jong-Il;Huh, Sun
    • Parasites, Hosts and Diseases
    • /
    • 제54권6호
    • /
    • pp.751-758
    • /
    • 2016
  • This study aimed at constructing a draft genome of the adult female worm Toxocara canis using next-generation sequencing (NGS) and de novo assembly, as well as to find new genes after annotation using functional genomics tools. Using an NGS machine, we produced DNA read data of T. canis. The de novo assembly of the read data was performed using SOAPdenovo. RNA read data were assembled using Trinity. Structural annotation, homology search, functional annotation, classification of protein domains, and KEGG pathway analysis were carried out. Besides them, recently developed tools such as MAKER, PASA, Evidence Modeler, and Blast2GO were used. The scaffold DNA was obtained, the N50 was 108,950 bp, and the overall length was 341,776,187 bp. The N50 of the transcriptome was 940 bp, and its length was 53,046,952 bp. The GC content of the entire genome was 39.3%. The total number of genes was 20,178, and the total number of protein sequences was 22,358. Of the 22,358 protein sequences, 4,992 were newly observed in T. canis. Following proteins previously unknown were found: E3 ubiquitin-protein ligase cbl-b and antigen T-cell receptor, zeta chain for T-cell and B-cell regulation; endoprotease bli-4 for cuticle metabolism; mucin 12Ea and polymorphic mucin variant C6/1/40r2.1 for mucin production; tropomodulin-family protein and ryanodine receptor calcium release channels for muscle movement. We were able to find new hypothetical polypeptides sequences unique to T. canis, and the findings of this study are capable of serving as a basis for extending our biological understanding of T. canis.

Draft Genome Analysis of Antimicrobial Streptomyces Isolated from Himalayan Lichen

  • Kim, Byeollee;Han, So-Ra;Lamichhane, Janardan;Park, Hyun;Oh, Tae-Jin
    • Journal of Microbiology and Biotechnology
    • /
    • 제29권7호
    • /
    • pp.1144-1154
    • /
    • 2019
  • There have been several studies regarding lichen-associated bacteria obtained from diverse environments. Our screening process identified 49 bacterial species in two lichens from the Himalayas: 17 species of Actinobacteria, 19 species of Firmicutes, and 13 species of Proteobacteria. We discovered five types of strong antimicrobial agent-producing bacteria. Although some strains exhibited weak antimicrobial activity, NP088, NP131, NP132, NP134, and NP160 exhibited strong antimicrobial activity against all multidrug-resistant strains. Polyketide synthase (PKS) fingerprinting revealed results for 69 of 148 strains; these had similar genes, such as fatty acid-related PKS, adenylation domain genes, PfaA, and PksD. Although the association between antimicrobial activity and the PKS fingerprinting results is poorly resolved, NP160 had six types of PKS fingerprinting genes, as well as strong antimicrobial activity. Therefore, we sequenced the draft genome of strain NP160, and predicted its secondary metabolism using antiSMASH version 4.2. NP160 had 46 clusters and was predicted to produce similar secondary metabolites with similarities of 5-100%. Although NP160 had 100% similarity with the alkylresorcinol biosynthetic gene cluster, our results showed low similarity with existing members of this biosynthetic gene cluster, and most have not yet been revealed. In conclusion, we expect that lichen-associated bacteria from the Himalayas can produce new secondary metabolites, and we found several secondary metabolite-related biosynthetic gene clusters to support this hypothesis.

Study of Modern Human Evolution via Comparative Analysis with the Neanderthal Genome

  • Ahmed, Musaddeque;Liang, Ping
    • Genomics & Informatics
    • /
    • 제11권4호
    • /
    • pp.230-238
    • /
    • 2013
  • Many other human species appeared in evolution in the last 6 million years that have not been able to survive to modern times and are broadly known as archaic humans, as opposed to the extant modern humans. It has always been considered fascinating to compare the modern human genome with that of archaic humans to identify modern human-specific sequence variants and figure out those that made modern humans different from their predecessors or cousin species. Neanderthals are the latest humans to become extinct, and many factors made them the best representatives of archaic humans. Even though a number of comparisons have been made sporadically between Neanderthals and modern humans, mostly following a candidate gene approach, the major breakthrough took place with the sequencing of the Neanderthal genome. The initial genome-wide comparison, based on the first draft of the Neanderthal genome, has generated some interesting inferences regarding variations in functional elements that are not shared by the two species and the debated admixture question. However, there are certain other genetic elements that were not included or included at a smaller scale in those studies, and they should be compared comprehensively to better understand the molecular make-up of modern humans and their phenotypic characteristics. Besides briefly discussing the important outcomes of the comparative analyses made so far between modern humans and Neanderthals, we propose that future comparative studies may include retrotransposons, pseudogenes, and conserved non-coding regions, all of which might have played significant roles during the evolution of modern humans.

고구마 유전체 연구현황 및 전망 (Current status of sweetpotato genomics research)

  • 윤웅한;정재철;곽상수;양정욱;김태호;이형운;남상식;한장호
    • Journal of Plant Biotechnology
    • /
    • 제42권3호
    • /
    • pp.161-167
    • /
    • 2015
  • 고구마는 척박한 환경에서도 생육이 가능한 세계 7대 농작물로 식량뿐만 아니라 사료용, 전분 등의 산업용으로도 중요하다. 최근 고구마는 항산화물질, 식이섬유질 등을 고함유하는 건강식품으로 각광을 받고 있다. 그러나 고구마 유전체 해독에 관한 연구는 고구마의 중요도에 비해 많이 이루어지지 않고 있다. 본 총설의 목적은 고구마 유전체 연구 동향분석을 통하여 유전체 해독 연구의 효율성 증대 및 유용형질 유전자의 실용화 연구를 위한 기반구축을 모색하는데 있다. 최근 NGS 분석을 통한 동식물 유전체해독이 급진적으로 많이 이루어지고 있다. 고구마 유전체 해독의 경우는 다배수성 문제와 이질유전체 문제로 유전체 완전해독 연구가 이루어지지 않고 있으며 반면 전사체 분석 연구는 활발히 이루어지고 있는 실정이다. 최근 2015년 일본 연구자들에 의해 2배체 고구마의 유전체 해독 초안이 보고되었다. 한중일 고구마 연구협의회(Trilateral Research Association of Sweetpotato, TRAS)에 의해 6배체 고구마 Xushu 18의 유전자지도 작성 및 유전체 해독 연구가 2014년부터 이루어지고 있다. 빌게이츠재단(Bill & Melinda Gates Foundation)은 사하라사막 남쪽 아프리카지역의 기근과 영양문제를 해결하기 위해 고구마 유전체 기반 분자육종을 위한 분자도구 개발에 관한 프로젝트를 미국을 중심으로 한 컨소시엄을 구성하여 출범하였다. 고구마 유전체 해독과정 중에 분석된 고구마 엽록체 유전체 분석을 통하여 진화학적 해석연구가 이루어지고 있다. 본 총설을 통하여 고구마 유전체 해독 연구동향을 살펴보았다. 이러한 연구 동향 분석은 고구마의 생산성 및 기능성 향상 등의 실용화 연구를 수행하는 연구자들에게 최근의 연구현황을 제공할 수 있을 것이며 세계적인 식량, 에너지, 환경문제의 해결에 크게 기여 할 것으로 생각된다.

카로티노이드 생산 Sphingobacteriaceae SH-48 균주의 유전체 염기서열 분석 (Genome sequence of carotenoid producing Sphingobacteriaceae bacterium SH-48 isolated from freshwater in Korea)

  • 최아영;정유진;남영호;최강국
    • 미생물학회지
    • /
    • 제53권4호
    • /
    • pp.347-350
    • /
    • 2017
  • 그람 음성이며 막대모양의 Sphingobacteriaceae bacterium SH-48은 삼척 소한천에서 분리하였다. SH-48에 대한 유전체 분석을 실시하였으며, G + C 비율이 38.4%인 5,650,162 bp 크기의 염기서열을 얻었다. 유전체 특징은 카로티노이드 생합성 유전자인 crt 유전자 클러스터를 보유하고 있어 균주의 잠재적 중요성을 보여준다. 이러한 유전체 정보는 카로티노이드 생합성 경로에 대한 새로운 정보를 제공한다.

박과 작물에 과일썩음병을 일으키는 Acidovorax citrulli 검출을 위한 nested-PCR 검사법 개발 (Development of Nested-PCR Assay to Detect Acidovorax citrulli, a Causal Agent of Bacterial Fruit Blotch at Cucurbitaceae)

  • 김영탁;박경수;김혜성;이혁인;차재순
    • 식물병연구
    • /
    • 제21권2호
    • /
    • pp.74-81
    • /
    • 2015
  • 박과 작물에서 과일썩음병(bacterial fruit blotch)을 일으키는 Acidovorax citrulli를 종자로부터 검출하기 위한 특이적이고 민감한 nested-PCR 방법을 개발하였다. 본 연구에서는 Next Generation Sequencing을 이용하여 draft genome sequencing을 얻은 후 이를 분석하여 PCR 프라이머를 디자인하였고, 이들 프라이머의 A. citrulli에 대한 특이성을 확인하여 Ac-ORF 21F/Ac-ORF 21R의 nested PCR 프라이머를 최종 선발하였다. Ac-ORF 21F/Ac-ORF 21R는 오직 A. citrulli에서만 특이적으로 140bp 크기의 DNA를 증폭하였으며, 그 검출민감도는 1차 PCR 검출한계(10 ng genomic DNA/PCR)보다 검출한계를 10,000배 증가시켰다. 개발된 nested-PCR 방법을 통해 병원균을 인공접종한 수박 종자의 외부검사에서 $10^1cfu/ml$까지 인공 접종 한 모든 종자 시료에서 병원균을 검출하였고, 병원균을 인공접종 한 수박 종자의 내부검사에서는 병원균이 검출되지 않았다. 자연 감염 수박 종자의 외부검사에서는 10개의 반복 시료 중 2개에서, 그리고 종자 내부검사에서는 10개의 반복 시료 중 5개에서 A. citrulli를 검출하였다. 본 연구에서 개발한 nested-PCR은 특이성과 민감도가 높고 인공접종과 자연감염 수박 종자에서도 병원균의 검출이 가능하여 박과 작물의 종자로부터 A. citrulli를 검출하는데 효과적으로 사용될 수 있을 것으로 생각된다.

Draft Genome Assembly and Annotation for Cutaneotrichosporon dermatis NICC30027, an Oleaginous Yeast Capable of Simultaneous Glucose and Xylose Assimilation

  • Wang, Laiyou;Guo, Shuxian;Zeng, Bo;Wang, Shanshan;Chen, Yan;Cheng, Shuang;Liu, Bingbing;Wang, Chunyan;Wang, Yu;Meng, Qingshan
    • Mycobiology
    • /
    • 제50권1호
    • /
    • pp.66-78
    • /
    • 2022
  • The identification of oleaginous yeast species capable of simultaneously utilizing xylose and glucose as substrates to generate value-added biological products is an area of key economic interest. We have previously demonstrated that the Cutaneotrichosporon dermatis NICC30027 yeast strain is capable of simultaneously assimilating both xylose and glucose, resulting in considerable lipid accumulation. However, as no high-quality genome sequencing data or associated annotations for this strain are available at present, it remains challenging to study the metabolic mechanisms underlying this phenotype. Herein, we report a 39,305,439 bp draft genome assembly for C. dermatis NICC30027 comprised of 37 scaffolds, with 60.15% GC content. Within this genome, we identified 524 tRNAs, 142 sRNAs, 53 miRNAs, 28 snRNAs, and eight rRNA clusters. Moreover, repeat sequences totaling 1,032,129 bp in length were identified (2.63% of the genome), as were 14,238 unigenes that were 1,789.35 bp in length on average (64.82% of the genome). The NCBI non-redundant protein sequences (NR) database was employed to successfully annotate 11,795 of these unigenes, while 3,621 and 11,902 were annotated with the Swiss-Prot and TrEMBL databases, respectively. Unigenes were additionally subjected to pathway enrichment analyses using the Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), Cluster of Orthologous Groups of proteins (COG), Clusters of orthologous groups for eukaryotic complete genomes (KOG), and Non-supervised Orthologous Groups (eggNOG) databases. Together, these results provide a foundation for future studies aimed at clarifying the mechanistic basis for the ability of C. dermatis NICC30027 to simultaneously utilize glucose and xylose to synthesize lipids.

Genomic Analysis of the Carrot Bacterial Blight Pathogen Xanthomonas hortorum pv. carotae in Korea

  • Mi-Hyun Lee;Sung-Jun Hong;Dong Suk Park;Hyeonheui Ham;Hyun Gi Kong
    • The Plant Pathology Journal
    • /
    • 제39권4호
    • /
    • pp.409-416
    • /
    • 2023
  • Bacterial leaf blight of carrots caused by Xanthomonas hortorum pv. carotae (Xhc) is an important worldwide seed-borne disease. In 2012 and 2013, symptoms similar to bacterial leaf blight were found in carrot farms in Jeju Island, Korea. The phenotypic characteristics of the Korean isolation strains were similar to the type strain of Xhc. Pathogenicity showed symptoms on the 14th day after inoculation on carrot plants. Identification by genetic method was multi-position sequencing of the isolated strain JJ2001 was performed using four genes (danK, gyrB, fyuA, and rpoD). The isolated strain was confirmed to be most similar to Xhc M081. Furthermore, in order to analyze the genetic characteristics of the isolated strain, whole genome analysis was performed through the next-generation sequencing method. The draft genome size of JJ2001 is 5,443,372 bp, which contains 63.57% of G + C and has 4,547 open reading frames. Specifically, the classification of pathovar can be confirmed to be similar to that of the host lineage. Plant pathogenic factors and determinants of the majority of the secretion system are conserved in strain JJ2001. This genetic information enables detailed comparative analysis in the pathovar stage of pathogenic bacteria. Furthermore, these findings provide basic data for the distribution and diagnosis of Xanthomonas hortorum pv. carotae, a major plant pathogen that infects carrots in Korea.

Comparative Genome Analysis and Evaluation of Probiotic Characteristics of Lactobacillus plantarum Strain JDFM LP11

  • Heo, Jaeyoung;Shin, Donghyun;Chang, Sung Yong;Bogere, Paul;Park, Mi Ri;Ryu, Sangdon;Lee, Woong Ji;Yun, Bohyun;Lee, Hak Kyo;Kim, Younghoon;Oh, Sangnam
    • 한국축산식품학회지
    • /
    • 제38권5호
    • /
    • pp.878-888
    • /
    • 2018
  • In the current study, the probiotic potential of approximately 250 strains of lactic acid bacteria (LAB) isolated from piglet fecal samples were investigated; among them Lactobacillus plantarum strain JDFM LP11, which possesses significant probiotic potential, with enhanced acid/bile tolerance, attachment to porcine intestinal epithelial cells (IPEC-J2), and antimicrobial activity. The genetic characteristics of strain JDFM LP11 were explored by performing whole genome sequencing (WGS) using a PacBio system. The circular draft genome have a total length of 3,206,883 bp and a total of 3,021 coding sequences were identified. Phylogenetically, three genes, possibly related to survival and metabolic activity in the porcine host, were identified. These genes encode p60, lichenan permease IIC component, and protein TsgA, which are a putative endopeptidase, a component of the phosphotransferase system (PTS), and a major facilitator in the gut environment, respectively. Our findings suggest that understanding the functional and genetic characteristics of L. plantarum strain JDFM LP11, with its candidate genes for gut health, could provide new opportunities and insights into applications in the animal food and feed additive industries.