• Title/Summary/Keyword: De novo assembly

Search Result 55, Processing Time 0.024 seconds

High-throughput identification of chrysanthemum gene function and expression: An overview and an effective proposition

  • Nguyen, Toan Khac;Lim, Jin Hee
    • Journal of Plant Biotechnology
    • /
    • v.48 no.3
    • /
    • pp.139-147
    • /
    • 2021
  • Since whole-genome duplication (WGD) of diploid Chrysanthemum nankingense and de novo assembly whole-genome of C. seticuspe have been obtained, they have afforded to perceive the diversity evolution and gene discovery in the improved investigation of chrysanthemum breeding. The robust tools of high-throughput identification and analysis of gene function and expression produce their vast importance in chrysanthemum genomics. However, the gigantic genome size and heterozygosity are also mentioned as the major obstacles preventing the chrysanthemum breeding practices and functional genomics analysis. Nonetheless, some of technological contemporaries provide scientific efficient and promising solutions to diminish the drawbacks and investigate the high proficient methods for generous phenotyping data obtaining and system progress in future perspectives. This review provides valuable strategies for a broad overview about the high-throughput identification, and molecular analysis of gene function and expression in chrysanthemum. We also contribute the efficient proposition about specific protocols for considering chrysanthemum genes. In further perspective, the proper high-throughput identification will continue to advance rapidly and advertise the next generation in chrysanthemum breeding.

Genome-Wide Comparison of Carbohydrate-Active Enzymes (CAZymes) Repertoire of Flammulina ononidis

  • Park, Young-Jin;Kong, Won-Sik
    • Mycobiology
    • /
    • v.46 no.4
    • /
    • pp.349-360
    • /
    • 2018
  • Whole-genome sequencing of Flammulina ononidis, a wood-rotting basidiomycete, was performed to identify genes associated with carbohydrate-active enzymes (CAZymes). A total of 12,586 gene structures with an average length of 2009 bp were predicted by the AUGUSTUS tool from a total 35,524,258 bp length of de novo genome assembly (49.76% GC). Orthologous analysis with other fungal species revealed that 7051 groups contained at least one F. ononidis gene. In addition, 11,252 (89.5%) of 12,586 genes for F. ononidis proteins had orthologs among the Dikarya, and F. ononidis contained 8 species-specific genes, of which 5 genes were paralogous. CAZyme prediction revealed 524 CAZyme genes, including 228 for glycoside hydrolases, 21 for polysaccharide lyases, 87 for glycosyltransferases, 61 for carbohydrate esterases, 87 with auxiliary activities, and 40 for carbohydrate-binding modules in the F. ononidis genome. This genome information including CAZyme repertoire will be useful to understand lignocellulolytic machinery of this white rot fungus F. ononidis.

Transcriptome analysis of internal and external stress mechanisms in Aster spathulifolius Maxim.

  • Sivagami, Jean Claude;Park, SeonJoo
    • Proceedings of the Plant Resources Society of Korea Conference
    • /
    • 2019.04a
    • /
    • pp.35-35
    • /
    • 2019
  • Aster spathulifolius Maxim. is belongs to the Asteraceae family which is distributed only in Korea and Japan. It is recognize as a traditionally medicinal plants and economically valuable in ornamental field. However, among the Asteraceae family, the Aster genus, which is lacks in genomic resources and information of molecular function. Therefore, we used high throughput RNA-sequencing transcriptome data of the A. spathulifolius to know molecular level function. DeNovo assembly produced 98,660 unigene with N50 value 1126 bp. Unigenes was performed to analyses the functional annotation against NCBI database like plant database of nucleotide (Nt) and non-redundant protein (Nr), Pfam, Uniprot, KEGG and Transcriptional factor (TF). In addition, Distribution of SSR markers also analyzed for future perfectives. Further, Comparing with other two Asteraceae family species like, Karelinia caspica and Chrysanthemum morifolium to the A. spathulifolius shows the number of gene that regulated in internal and external stress respectively salt-tolerant and heat and drought stress to understand the molecular basis related to the different environments stress.

  • PDF

The complete chloroplast genome of Polygonatum falcatum (Asparagaceae)

  • CHOI, Tae-Young;YUN, Se-Hyun;LEE, Soo-Rang
    • Korean Journal of Plant Taxonomy
    • /
    • v.52 no.1
    • /
    • pp.80-83
    • /
    • 2022
  • Polygonatum falcatum is a perennial herb distributed in East Asia. We determined the characteristics of the complete chloroplast genome in P. falcatum for the first time, with a de novo assembly strategy. The chloroplast genome was 154,579bp in length harboring 87 protein coding genes, 38 tRNA genes and eight rRNA genes. It exhibits typical quadripartite structure comprising a large single-copy (LSC) (83,528bp), a small single-copy (SSC) (18,457bp) and a pair of inverted repeats (IRs) (26,297bp). Phylogenetic analysis of 16 chloroplast genomes from Asparagaceae reveals that the genus Polygonatum is a monophyletic group and that P. falcatum is clustered together with the congener, P. odoratum.

De novo Assembly and Analysis of Amur Sturgeon (Acipenser schrenckii) Transcriptome in Response to Mycobacterium Marinum Infection to Identify Putative Genes Involved in Immunity

  • Zhang, Qianqian;Wang, Xiehao;Zhang, Defeng;Long, Meng;Wu, Zhenbing;Feng, Yuqing;Hao, Jingwen;Wang, Shuyi;Liao, Qian;Li, Aihua
    • Journal of Microbiology and Biotechnology
    • /
    • v.29 no.8
    • /
    • pp.1324-1334
    • /
    • 2019
  • Fish mycobacteriosis is a common bacterial disease in many species of freshwater and marine fish and has caused severe loss of fish production. Mycobacterium marinum has been the most prevalent pathogen observed in several outbreaks of mycobacteriosis of farmed sturgeons in China. However, the immune responses and pathology of sturgeons in mycobacterial infection are rarely studied. Therefore, we used the Illumina RNA-seq method to analyze the transcriptome profile of Acipenser schrenckii challenged with Mycobacterium marinum. To begin, 168,220 non-redundant contigs were acquired from the infection and control groups, and among these, 33,225 contigs have acquired annotations. A total of 4,043 differently expressed (DE) contigs between the two groups were identified, and among these, 2479 were up-regulated and 1564 were down-regulated in the infected fish. A total of 1,340 DE contigs with acquired annotations in KEGG were enriched for 124 pathways including the TNF signaling pathway, and the Toll-like receptor signaling pathway. The roles of DE genes involved in significant pathways and other processes were discussed. The 2,209 DE contigs that have yet to acquire proper annotation may represent candidate genes associated with infection in sturgeons and are expected to serve as immunogenetic resources for further study. To our best knowledge, this is the first transcriptome study on sturgeons under bacterial infection.

Genome-wide Copy Number Variation in a Korean Native Chicken Breed (한국 토종닭의 전장 유전체 복제수변이(CNV) 발굴)

  • Cho, Eun-Seok;Chung, Won-Hyong;Choi, Jung-Woo;Jang, Hyun-Jun;Park, Mi-Na;Kim, Namshin;Kim, Tae-Hun;Lee, Kyung-Tai
    • Korean Journal of Poultry Science
    • /
    • v.41 no.4
    • /
    • pp.305-311
    • /
    • 2014
  • Copy number variation (CNV) is a form of structural variation that shows various numbers of copies in segments of the DNA. It has been shown to account for phenotypic variations in human diseases and agricultural production traits. Currently, most of chicken breeds in the poultry industry are based on European-origin breeds that have been mostly provided from several international breeding companies. Therefore, National Institute of Animal Science, RDA has been trying to restore and improve Korean native chicken breeds (12 lines of 5 breeds) for about 20 years. Thanks to the recent advance of sequencing technologies, genome-wide CNV can be accessed in the higher resolution throughout the genome of species of interest. However, there is no systematic study available to dissect the CNV in the native chicken breed in Korea. Here, we report genome-wide copy number variations identified from a genome of Korean native chicken (Line L) by comparing between the chicken reference sequence assembly (Gallus gallus) and a de novo sequencing assembly of the Korean native chicken (Line L). Throughout all twenty eight chicken autosomes, we identified a total of 501 CNVs; defined as gain and loss of duplication and deletion respectively. Furthermore, we performed gene ontology (GO) analysis for the putative CNVs using DAVID, leading to 68 GO terms clustered independently. Of the clustered GO terms, genes related to transcription and gene regulation were mainly detected. This study provides useful genomic resource to investigate potential biological implications of CNVs with traits of interest in the Korean native chicken.

PAIVS: prediction of avian influenza virus subtype

  • Park, Hyeon-Chun;Shin, Juyoun;Cho, Sung-Min;Kang, Shinseok;Chung, Yeun-Jun;Jung, Seung-Hyun
    • Genomics & Informatics
    • /
    • v.18 no.1
    • /
    • pp.5.1-5.5
    • /
    • 2020
  • Highly pathogenic avian influenza (HPAI) viruses have caused severe respiratory disease and death in poultry and human beings. Although most of the avian influenza viruses (AIVs) are of low pathogenicity and cause mild infections in birds, some subtypes including hemagglutinin H5 and H7 subtype cause HPAI. Therefore, sensitive and accurate subtyping of AIV is important to prepare and prevent for the spread of HPAI. Next-generation sequencing (NGS) can analyze the full-length sequence information of entire AIV genome at once, so this technology is becoming a more common in detecting AIVs and predicting subtypes. However, an analysis pipeline of NGS-based AIV sequencing data, including AIV subtyping, has not yet been established. Here, in order to support the pre-processing of NGS data and its interpretation, we developed a user-friendly tool, named prediction of avian influenza virus subtype (PAIVS). PAIVS has multiple functions that support the pre-processing of NGS data, reference-guided AIV subtyping, de novo assembly, variant calling and identifying the closest full-length sequences by BLAST, and provide the graphical summary to the end users.

Genome Sequencing and Genome-Wide Identification of Carbohydrate-Active Enzymes (CAZymes) in the White Rot Fungus Flammulina fennae

  • Lee, Chang-Soo;Kong, Won-Sik;Park, Young-Jin
    • Microbiology and Biotechnology Letters
    • /
    • v.46 no.3
    • /
    • pp.300-312
    • /
    • 2018
  • Whole-genome sequencing of the wood-rotting fungus, Flammulina fennae, was carried out to identify carbohydrate-active enzymes (CAZymes). De novo genome assembly (31 kmer) of short reads by next-generation sequencing revealed a total genome length of 32,423,623 base pairs (39% GC). A total of 11,591 gene models in the assembled genome sequence of F. fennae were predicted by ab initio gene prediction using the AUGUSTUS tool. In a genome-wide comparison, 6,715 orthologous groups shared at least one gene with F. fennae and 10,667 (92%) of 11,591 genes for F. fennae proteins had orthologs among the Dikarya. Additionally, F. fennae contained 23 species-specific genes, of which 16 were paralogous. CAZyme identification and annotation revealed 513 CAZymes, including 82 auxiliary activities, 220 glycoside hydrolases, 85 glycosyltransferases, 20 polysaccharide lyases, 57 carbohydrate esterases, and 45 carbohydrate binding-modules in the F. fennae genome. The genome information of F. fennae increases the understanding of this basidiomycete fungus. CAZyme gene information will be useful for detailed studies of lignocellulosic biomass degradation for biotechnological and industrial applications.

De novo assembly of a large volume of genome using NGS data (NGS 데이터를 이용한 대용량 게놈의 디노버 어셈블리)

  • Won, Jung-Im;Hong, Sang-Kyoon;Kong, Jin-Hwa;Huh, Sun;Yoon, Jee-Hee
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2012.06c
    • /
    • pp.25-27
    • /
    • 2012
  • 디노버 어셈블리는 레퍼런스 시퀀스 없이 리드의 염기 서열 정보를 이용하여 원래의 전체 시퀀스(original sequence)로 추정되는 시퀀스로 리드들을 재구성하는 방식이다. 최근의 NGS(Next Generation Sequencing) 기술은 대용량 리드를 훨씬 쉽게 저비용으로 생성할 수 있다는 장점이 있어, 이를 이용한 많은 연구가 이루어지고 있다. 그러나 NGS 리드 데이터를 이용한 디노버 어셈블리에 관한 연구는 국내외적으로 매우 미흡한 실정이다. 그 이유는 NGS 리드 데이터를 이용하여 디노버 어셈블리를 수행하는 경우 대용량 데이터, 복잡한 데이터 구조 및 처리 과정 등으로 인하여 매우 많은 시간과 공간이 소요될 뿐만 아니라 아직까지 다양한 분석 툴과 노하우 등이 충분히 개발되어 있지 않기 때문이다. 본 연구에서는 NGS 리드 데이터를 이용한 어셈블리의 실효성과 정확성을 검증한다. 또한 디노버 어셈블리의 처리 시간 및 공간 오버헤드를 해결하기 위하여 유사 종과의 리드 정렬을 활용하는 방안을 제안한다.

The comparative gene expression concern to the seed pigmentation in maize (Zea mays L.)

  • Sa, Kyu Jin;Choi, Ik-Young;Lee, Ju Kyong
    • Genomics & Informatics
    • /
    • v.18 no.3
    • /
    • pp.29.1-29.11
    • /
    • 2020
  • Maize seed pigmentation is one of the important issue to develop maize seed breeding. The differently gene expression was characterized and compared for three inbred lines, such as the pigment accumulated seed (CM22) and non-pigmented seed (CM5 and CM19) at 10 days after pollination. We obtained a total of 63,870, 82,496, and 54,555 contigs by de novo assembly to identify gene expression in the CM22, CM5, and CM19, respectably. In differentially expressed gene analysis, it was revealed that 7,044 genes were differentially expressed by at least two-fold, with 4,067 upregulated in colored maize inbred lines and 2,977 upregulated in colorless maize inbred lines. Of them,18 genes were included to the anthocyanin biosynthesis pathways, while 15 genes were upregulated in both CM22/5 and CM22/19. Additionally, 37 genes were detected in the metabolic pathway concern to the seed pigmentation by BINs analysis using MAPMAN software. Finally, these differently expressed genes may aid in the research on seed pigmentation in maize breeding programs.