• 제목/요약/키워드: De novo assembly

검색결과 55건 처리시간 0.034초

High-throughput identification of chrysanthemum gene function and expression: An overview and an effective proposition

  • Nguyen, Toan Khac;Lim, Jin Hee
    • Journal of Plant Biotechnology
    • /
    • 제48권3호
    • /
    • pp.139-147
    • /
    • 2021
  • Since whole-genome duplication (WGD) of diploid Chrysanthemum nankingense and de novo assembly whole-genome of C. seticuspe have been obtained, they have afforded to perceive the diversity evolution and gene discovery in the improved investigation of chrysanthemum breeding. The robust tools of high-throughput identification and analysis of gene function and expression produce their vast importance in chrysanthemum genomics. However, the gigantic genome size and heterozygosity are also mentioned as the major obstacles preventing the chrysanthemum breeding practices and functional genomics analysis. Nonetheless, some of technological contemporaries provide scientific efficient and promising solutions to diminish the drawbacks and investigate the high proficient methods for generous phenotyping data obtaining and system progress in future perspectives. This review provides valuable strategies for a broad overview about the high-throughput identification, and molecular analysis of gene function and expression in chrysanthemum. We also contribute the efficient proposition about specific protocols for considering chrysanthemum genes. In further perspective, the proper high-throughput identification will continue to advance rapidly and advertise the next generation in chrysanthemum breeding.

Genome-Wide Comparison of Carbohydrate-Active Enzymes (CAZymes) Repertoire of Flammulina ononidis

  • Park, Young-Jin;Kong, Won-Sik
    • Mycobiology
    • /
    • 제46권4호
    • /
    • pp.349-360
    • /
    • 2018
  • Whole-genome sequencing of Flammulina ononidis, a wood-rotting basidiomycete, was performed to identify genes associated with carbohydrate-active enzymes (CAZymes). A total of 12,586 gene structures with an average length of 2009 bp were predicted by the AUGUSTUS tool from a total 35,524,258 bp length of de novo genome assembly (49.76% GC). Orthologous analysis with other fungal species revealed that 7051 groups contained at least one F. ononidis gene. In addition, 11,252 (89.5%) of 12,586 genes for F. ononidis proteins had orthologs among the Dikarya, and F. ononidis contained 8 species-specific genes, of which 5 genes were paralogous. CAZyme prediction revealed 524 CAZyme genes, including 228 for glycoside hydrolases, 21 for polysaccharide lyases, 87 for glycosyltransferases, 61 for carbohydrate esterases, 87 with auxiliary activities, and 40 for carbohydrate-binding modules in the F. ononidis genome. This genome information including CAZyme repertoire will be useful to understand lignocellulolytic machinery of this white rot fungus F. ononidis.

Transcriptome analysis of internal and external stress mechanisms in Aster spathulifolius Maxim.

  • Sivagami, Jean Claude;Park, SeonJoo
    • 한국자원식물학회:학술대회논문집
    • /
    • 한국자원식물학회 2019년도 춘계학술대회
    • /
    • pp.35-35
    • /
    • 2019
  • Aster spathulifolius Maxim. is belongs to the Asteraceae family which is distributed only in Korea and Japan. It is recognize as a traditionally medicinal plants and economically valuable in ornamental field. However, among the Asteraceae family, the Aster genus, which is lacks in genomic resources and information of molecular function. Therefore, we used high throughput RNA-sequencing transcriptome data of the A. spathulifolius to know molecular level function. DeNovo assembly produced 98,660 unigene with N50 value 1126 bp. Unigenes was performed to analyses the functional annotation against NCBI database like plant database of nucleotide (Nt) and non-redundant protein (Nr), Pfam, Uniprot, KEGG and Transcriptional factor (TF). In addition, Distribution of SSR markers also analyzed for future perfectives. Further, Comparing with other two Asteraceae family species like, Karelinia caspica and Chrysanthemum morifolium to the A. spathulifolius shows the number of gene that regulated in internal and external stress respectively salt-tolerant and heat and drought stress to understand the molecular basis related to the different environments stress.

  • PDF

The complete chloroplast genome of Polygonatum falcatum (Asparagaceae)

  • CHOI, Tae-Young;YUN, Se-Hyun;LEE, Soo-Rang
    • 식물분류학회지
    • /
    • 제52권1호
    • /
    • pp.80-83
    • /
    • 2022
  • Polygonatum falcatum is a perennial herb distributed in East Asia. We determined the characteristics of the complete chloroplast genome in P. falcatum for the first time, with a de novo assembly strategy. The chloroplast genome was 154,579bp in length harboring 87 protein coding genes, 38 tRNA genes and eight rRNA genes. It exhibits typical quadripartite structure comprising a large single-copy (LSC) (83,528bp), a small single-copy (SSC) (18,457bp) and a pair of inverted repeats (IRs) (26,297bp). Phylogenetic analysis of 16 chloroplast genomes from Asparagaceae reveals that the genus Polygonatum is a monophyletic group and that P. falcatum is clustered together with the congener, P. odoratum.

De novo Assembly and Analysis of Amur Sturgeon (Acipenser schrenckii) Transcriptome in Response to Mycobacterium Marinum Infection to Identify Putative Genes Involved in Immunity

  • Zhang, Qianqian;Wang, Xiehao;Zhang, Defeng;Long, Meng;Wu, Zhenbing;Feng, Yuqing;Hao, Jingwen;Wang, Shuyi;Liao, Qian;Li, Aihua
    • Journal of Microbiology and Biotechnology
    • /
    • 제29권8호
    • /
    • pp.1324-1334
    • /
    • 2019
  • Fish mycobacteriosis is a common bacterial disease in many species of freshwater and marine fish and has caused severe loss of fish production. Mycobacterium marinum has been the most prevalent pathogen observed in several outbreaks of mycobacteriosis of farmed sturgeons in China. However, the immune responses and pathology of sturgeons in mycobacterial infection are rarely studied. Therefore, we used the Illumina RNA-seq method to analyze the transcriptome profile of Acipenser schrenckii challenged with Mycobacterium marinum. To begin, 168,220 non-redundant contigs were acquired from the infection and control groups, and among these, 33,225 contigs have acquired annotations. A total of 4,043 differently expressed (DE) contigs between the two groups were identified, and among these, 2479 were up-regulated and 1564 were down-regulated in the infected fish. A total of 1,340 DE contigs with acquired annotations in KEGG were enriched for 124 pathways including the TNF signaling pathway, and the Toll-like receptor signaling pathway. The roles of DE genes involved in significant pathways and other processes were discussed. The 2,209 DE contigs that have yet to acquire proper annotation may represent candidate genes associated with infection in sturgeons and are expected to serve as immunogenetic resources for further study. To our best knowledge, this is the first transcriptome study on sturgeons under bacterial infection.

한국 토종닭의 전장 유전체 복제수변이(CNV) 발굴 (Genome-wide Copy Number Variation in a Korean Native Chicken Breed)

  • 조은석;정원형;최정우;장현준;박미나;김남신;김태헌;이경태
    • 한국가금학회지
    • /
    • 제41권4호
    • /
    • pp.305-311
    • /
    • 2014
  • 복제수변이(Copy number variation, CNV)는 DNA 다양한 구조적 변화의 한 형태이다. 복제수변이는 인간의 질병 및 농업의 생산성에 영향을 미치는 것으로 알려져 있다. 이전 우리나라의 닭의 품종은 유럽에서 유입되어진 품종을 기반으로 구축되어져 있었다. 따라서 농촌진흥청 국립축산과학원에서는 20년 동안 재래품종을 복원하려고 노력하였고, 5품종 12계통으로 복원하였다. 최근 염기서열분석 기술의 발달로, 해상도가 좋은 게놈 전체의 복제수변이를 발굴할 수 있게 되었다. 그러나 한국 재래닭 품종에 대해서는 체계적인 연구가 이루어지지 않고 있다. 본 연구에서는 한국 재래 닭(계통 L)에 대해서 게놈 전체의 염기서열을 분석하고 닭의 참고서열과 비교하여 재래닭에서 확인된 복제수 변이를 보고하였다. 닭의 28개 염색체에서 총 501개의 복제수 변이를 확인하였고, 이를 Gain과 Loss로 나누어서 표시하였다. 또한 우리는 501개의 복제수 변이를 포함하고 있는 유전자의 기능을 분류하였다. 그 결과, 전사 및 유전자 조절에 관련된 유전자들이 많이 분류되었다. 본 연구의 결과는 복제수 변이와 한국 재래닭의 경제형질 간의 연관성을 설명할 수 있는 기초자료로 활용될 것으로 사료된다.

PAIVS: prediction of avian influenza virus subtype

  • Park, Hyeon-Chun;Shin, Juyoun;Cho, Sung-Min;Kang, Shinseok;Chung, Yeun-Jun;Jung, Seung-Hyun
    • Genomics & Informatics
    • /
    • 제18권1호
    • /
    • pp.5.1-5.5
    • /
    • 2020
  • Highly pathogenic avian influenza (HPAI) viruses have caused severe respiratory disease and death in poultry and human beings. Although most of the avian influenza viruses (AIVs) are of low pathogenicity and cause mild infections in birds, some subtypes including hemagglutinin H5 and H7 subtype cause HPAI. Therefore, sensitive and accurate subtyping of AIV is important to prepare and prevent for the spread of HPAI. Next-generation sequencing (NGS) can analyze the full-length sequence information of entire AIV genome at once, so this technology is becoming a more common in detecting AIVs and predicting subtypes. However, an analysis pipeline of NGS-based AIV sequencing data, including AIV subtyping, has not yet been established. Here, in order to support the pre-processing of NGS data and its interpretation, we developed a user-friendly tool, named prediction of avian influenza virus subtype (PAIVS). PAIVS has multiple functions that support the pre-processing of NGS data, reference-guided AIV subtyping, de novo assembly, variant calling and identifying the closest full-length sequences by BLAST, and provide the graphical summary to the end users.

Genome Sequencing and Genome-Wide Identification of Carbohydrate-Active Enzymes (CAZymes) in the White Rot Fungus Flammulina fennae

  • Lee, Chang-Soo;Kong, Won-Sik;Park, Young-Jin
    • 한국미생물·생명공학회지
    • /
    • 제46권3호
    • /
    • pp.300-312
    • /
    • 2018
  • Whole-genome sequencing of the wood-rotting fungus, Flammulina fennae, was carried out to identify carbohydrate-active enzymes (CAZymes). De novo genome assembly (31 kmer) of short reads by next-generation sequencing revealed a total genome length of 32,423,623 base pairs (39% GC). A total of 11,591 gene models in the assembled genome sequence of F. fennae were predicted by ab initio gene prediction using the AUGUSTUS tool. In a genome-wide comparison, 6,715 orthologous groups shared at least one gene with F. fennae and 10,667 (92%) of 11,591 genes for F. fennae proteins had orthologs among the Dikarya. Additionally, F. fennae contained 23 species-specific genes, of which 16 were paralogous. CAZyme identification and annotation revealed 513 CAZymes, including 82 auxiliary activities, 220 glycoside hydrolases, 85 glycosyltransferases, 20 polysaccharide lyases, 57 carbohydrate esterases, and 45 carbohydrate binding-modules in the F. fennae genome. The genome information of F. fennae increases the understanding of this basidiomycete fungus. CAZyme gene information will be useful for detailed studies of lignocellulosic biomass degradation for biotechnological and industrial applications.

NGS 데이터를 이용한 대용량 게놈의 디노버 어셈블리 (De novo assembly of a large volume of genome using NGS data)

  • 원정임;홍상균;공진화;허선;윤지희
    • 한국정보과학회:학술대회논문집
    • /
    • 한국정보과학회 2012년도 한국컴퓨터종합학술대회논문집 Vol.39 No.1(C)
    • /
    • pp.25-27
    • /
    • 2012
  • 디노버 어셈블리는 레퍼런스 시퀀스 없이 리드의 염기 서열 정보를 이용하여 원래의 전체 시퀀스(original sequence)로 추정되는 시퀀스로 리드들을 재구성하는 방식이다. 최근의 NGS(Next Generation Sequencing) 기술은 대용량 리드를 훨씬 쉽게 저비용으로 생성할 수 있다는 장점이 있어, 이를 이용한 많은 연구가 이루어지고 있다. 그러나 NGS 리드 데이터를 이용한 디노버 어셈블리에 관한 연구는 국내외적으로 매우 미흡한 실정이다. 그 이유는 NGS 리드 데이터를 이용하여 디노버 어셈블리를 수행하는 경우 대용량 데이터, 복잡한 데이터 구조 및 처리 과정 등으로 인하여 매우 많은 시간과 공간이 소요될 뿐만 아니라 아직까지 다양한 분석 툴과 노하우 등이 충분히 개발되어 있지 않기 때문이다. 본 연구에서는 NGS 리드 데이터를 이용한 어셈블리의 실효성과 정확성을 검증한다. 또한 디노버 어셈블리의 처리 시간 및 공간 오버헤드를 해결하기 위하여 유사 종과의 리드 정렬을 활용하는 방안을 제안한다.

The comparative gene expression concern to the seed pigmentation in maize (Zea mays L.)

  • Sa, Kyu Jin;Choi, Ik-Young;Lee, Ju Kyong
    • Genomics & Informatics
    • /
    • 제18권3호
    • /
    • pp.29.1-29.11
    • /
    • 2020
  • Maize seed pigmentation is one of the important issue to develop maize seed breeding. The differently gene expression was characterized and compared for three inbred lines, such as the pigment accumulated seed (CM22) and non-pigmented seed (CM5 and CM19) at 10 days after pollination. We obtained a total of 63,870, 82,496, and 54,555 contigs by de novo assembly to identify gene expression in the CM22, CM5, and CM19, respectably. In differentially expressed gene analysis, it was revealed that 7,044 genes were differentially expressed by at least two-fold, with 4,067 upregulated in colored maize inbred lines and 2,977 upregulated in colorless maize inbred lines. Of them,18 genes were included to the anthocyanin biosynthesis pathways, while 15 genes were upregulated in both CM22/5 and CM22/19. Additionally, 37 genes were detected in the metabolic pathway concern to the seed pigmentation by BINs analysis using MAPMAN software. Finally, these differently expressed genes may aid in the research on seed pigmentation in maize breeding programs.