• Title/Summary/Keyword: Illumina De novo Sequencing

Search Result 12, Processing Time 0.024 seconds

Survey of the Applications of NGS to Whole-Genome Sequencing and Expression Profiling

  • Lim, Jong-Sung;Choi, Beom-Soon;Lee, Jeong-Soo;Shin, Chan-Seok;Yang, Tae-Jin;Rhee, Jae-Sung;Lee, Jae-Seong;Choi, Ik-Young
    • Genomics & Informatics
    • /
    • v.10 no.1
    • /
    • pp.1-8
    • /
    • 2012
  • Recently, the technologies of DNA sequence variation and gene expression profiling have been used widely as approaches in the expertise of genome biology and genetics. The application to genome study has been particularly developed with the introduction of the nextgeneration DNA sequencer (NGS) Roche/454 and Illumina/ Solexa systems, along with bioinformation analysis technologies of whole-genome $de$ $novo$ assembly, expression profiling, DNA variation discovery, and genotyping. Both massive whole-genome shotgun paired-end sequencing and mate paired-end sequencing data are important steps for constructing $de$ $novo$ assembly of novel genome sequencing data. It is necessary to have DNA sequence information from a multiplatform NGS with at least $2{\times}$ and $30{\times}$ depth sequence of genome coverage using Roche/454 and Illumina/Solexa, respectively, for effective an way of de novo assembly. Massive shortlength reading data from the Illumina/Solexa system is enough to discover DNA variation, resulting in reducing the cost of DNA sequencing. Whole-genome expression profile data are useful to approach genome system biology with quantification of expressed RNAs from a wholegenome transcriptome, depending on the tissue samples. The hybrid mRNA sequences from Rohce/454 and Illumina/Solexa are more powerful to find novel genes through $de$ $novo$ assembly in any whole-genome sequenced species. The $20{\times}$ and $50{\times}$ coverage of the estimated transcriptome sequences using Roche/454 and Illumina/Solexa, respectively, is effective to create novel expressed reference sequences. However, only an average $30{\times}$ coverage of a transcriptome with short read sequences of Illumina/Solexa is enough to check expression quantification, compared to the reference expressed sequence tag sequence.

A Study on Transcriptome Analysis Using de novo RNA-sequencing to Compare Ginseng Roots Cultivated in Different Environments

  • Yang, Byung Wook
    • Proceedings of the Plant Resources Society of Korea Conference
    • /
    • 2018.04a
    • /
    • pp.5-5
    • /
    • 2018
  • Ginseng (Panax ginseng C.A. Meyer), one of the most widely used medicinal plants in traditional oriental medicine, is used for the treatment of various diseases. It has been classified according to its cultivation environment, such as field cultivated ginseng (FCG) and mountain cultivated ginseng (MCG). However, little is known about differences in gene expression in ginseng roots between field cultivated and mountain cultivated ginseng. In order to investigate the whole transcriptome landscape of ginseng, we employed High-Throughput sequencing technologies using the Illumina HiSeqTM2500 system, and generated a large amount of sequenced transcriptome from ginseng roots. Approximately 77 million and 87 million high-quality reads were produced in the FCG and MCG roots transcriptome analyses, respectively, and we obtained 256,032 assembled unigenes with an average length of 1,171 bp by de novo assembly methods. Functional annotations of the unigenes were performed using sequence similarity comparisons against the following databases: the non-redundant nucleotide database, the InterPro domains database, the Gene Ontology Consortium database, and the Kyoto Encyclopedia of Genes and Genomes pathway database. A total of 4,207 unigenes were assigned to specific metabolic pathways, and all of the known enzymes involved in starch and sucrose metabolism pathways were also identified in the KEGG library. This study indicated that alpha-glucan phosphorylase 1, putative pectinesterase/pectinesterase inhibitor 17, beta-amylase, and alpha-glucan phosphorylase isozyme H might be important factors involved in starch and sucrose metabolism between FCG and MCG in different environments.

  • PDF

De novo assembly, annotation and gene expression profiles of gonads of Cytorace-3, a hybrid lineage of Drosophila nasuta nasuta and D. n. albomicans

  • Ponnanna, Koushik;DSouza, Stafny M.;Ramachandra, Nallur B.
    • Genomics & Informatics
    • /
    • v.19 no.1
    • /
    • pp.8.1-8.12
    • /
    • 2021
  • Cytorace-3 is a laboratory evolved hybrid lineage of Drosophila nasuta nasuta males and Drosophila nasuta albomicans females currently passing ~850 generations. To assess interracial hybridization effects on gene expression in Cytorace-3 we profiled the transcriptomes of mature ovaries and testes by employing Illumina sequencing technology and de novo transcriptome assembling strategies. We found 26% of the ovarian, and 14% of testis genes to be differentially expressed in Cytorace-3 relative to the expressed genes in the parental gonadal transcriptomes. About 5% of genes exhibited additive gene expression pattern in the ovary and 3% in the testis, while the remaining genes were misexpressed in Cytorace-3. Nearly 772 of these misexpressed genes in the ovary and 413 in the testis were either over-or under-dominant. Genes following D. n. nasuta dominance was twice (270 genes) than D. n. albomicans dominance (133 genes) in the ovary. In contrast, only 105 genes showed D. n. nasuta dominance and 207 showed D. n. albomicans dominance in testis transcriptome. Of the six expression inheritance patterns, conserved inheritance pattern was predominant for both ovary (73%) and testis (85%) in Cytorace-3. This study is the first to provide an overview of the expression divergence and inheritance patterns of the transcriptomes in an independently evolving distinct hybrid lineage of Drosophila. This recorded expression divergence in Cytorace-3 surpasses that between parental lineages illustrating the strong impact of hybridization driving rapid gene expression changes.

Transcriptome Analysis of Antrodia cinnamomea Mycelia from Different Wood Substrates

  • Jiao-Jiao Chen;Zhang Zhang;Yi Wang;Xiao-Long Yuan;Juan Wang;Yu-Ming Yang;Yuan Zheng
    • Mycobiology
    • /
    • v.51 no.1
    • /
    • pp.49-59
    • /
    • 2023
  • Antrodia cinnamomea, an edible and medicinal fungus with significant economic value and application prospects, is rich in terpenoids, benzenoids, lignans, polysaccharides, and benzoquinone, succinic and maleic derivatives. In this study, the transcriptome of A. cinnamomea cultured on the wood substrates of Cinnamomum glanduliferum (YZM), C. camphora (XZM), and C. kanehirae (NZM) was sequenced using the high-throughput sequencing technology Illumina HiSeq 2000, and the data were assembled by de novo strategy to obtain 78,729 Unigenes with an N50 of 4,463 bp. Compared with public databases, about 11,435, 6,947, and 5,994 Unigenes were annotated to the Non-Redundant (NR), Gene Ontology (GO), and Kyoto Encyclopedia of Genes and Genome (KEGG), respectively. The comprehensive analysis of the mycelium terpene biosynthesis-related genes in A. cinnamomea revealed that the expression of acetyl-CoA acetyltransferase (AACT), acyl-CoA dehydrogenase (MCAD), 3-hydroxy-3-methylglutaryl-coenzyme A (HMG-CoA), mevalonate pyrophosphate decarboxylase (MVD), and isopentenyl diphosphate isomerase (IDI) was significantly higher on NZM compared to the other two wood substrates. Similarly, the expression of geranylgeranyltransferase (GGT) was significantly higher on YZM compared to NZM and XZM, and the expression of farnesyl transferase (FTase) was significantly higher on XZM. Furthermore, the expressions of 2,3-oxidized squalene cyclase (OCS), squalene synthase (SQS), and squalene epoxidase (SE) were significantly higher on NZM. Overall, this study provides a potential approach to explore the molecular regulation mechanism of terpenoid biosynthesis in A. cinnamomea.

Development and Genetic Diversity Analysis of Microsatellite Markers Using Next-generation Sequencing in Seriola quinqueradiata (차세대 염기서열 분석법을 이용한 방어(Seriola quinqueradiata)의 microsatellite 마커의 개발 및 유전적 특성 분석)

  • Dong, Chun Mae;Lee, Mi-Nan;Kim, Eun-Mi;Park, Jung Youn;Kim, Gun-Do;Noh, Jae Koo
    • Journal of Life Science
    • /
    • v.30 no.3
    • /
    • pp.291-297
    • /
    • 2020
  • This study was conducted to develop microsatellite markers in Seriola quinqueradiata using next-generation sequencing. A total of 28,873,374 reads were generated on an Illumina Hiseq2500 system, yielding 7,247,216,874 bp sequences. The de novo assembly resulted in 466,359 contigs. A total of 132 contigs (0.43%), including 60 microsatellite loci, were derived from 30,729 contigs longer than 518 bp. A total of 60 primer sets were designed from the 132 microsatellite loci. A total of 15 polymorphic nuclear microsatellite loci were chosen to evaluate population genetic parameters in the parents and offspring. The mean number of effective alleles was 18.5, ranging from 11 to 30. The observed heterozygosity (HO) and expected heterozygosity (HE) ranged between 0.431 and 0.972 with an average of 0.812 and from 0.782 to 0.949 with an average of 0.896, respectively. No significant linkage disequilibrium was observed after Bonferroni revision in any loci. The results show that the 15 polymorphic nuclear microsatellite markers can be used to study the population and conservation genetics of S. quinqueradiata in Korea. To ensure the success of artificial seedling production technology, genetic variations between the parent and offspring populations should be monitored, and inbreeding should be controlled.

SNP discovery and applications in Brassica napus

  • Hayward, Alice;Mason, Annaliese S.;Dalton-Morgan, Jessica;Zander, Manuel;Edwards, David;Batley, Jacqueline
    • Journal of Plant Biotechnology
    • /
    • v.39 no.1
    • /
    • pp.49-61
    • /
    • 2012
  • This review summarises the biology, discovery and applications of single nucleotide polymorphisms in complex polyploid crop genomes, with a focus on the important oilseed crop $Brassica$ $napus$. $Brassica$ $napus$ is an allotetraploid species, and along with soybean and oil palm is one of the top three most important oilseed crops globally. Current efforts are well underway to $de$ $novo$ assemble the $B.$ $napus$ genome, following the release of the related $B.$ $rapa$ 'A' genome last year. The next generation of genome sequencing, SNP discovery and analysis pipelines, and the associated challenges for this work in $B.$ $napus$, will be addressed. The biological applications of SNP technology for both evolutionary and molecular geneticists as well as plant breeders and industry are far-reaching, and will be invaluable to our understanding and advancement of the $Brassica$ crop species.

Single Nucleotide Polymorphism Marker Discovery from Transcriptome Sequencing for Marker-assisted Backcrossing in Capsicum

  • Kang, Jin-Ho;Yang, Hee-Bum;Jeong, Hyeon-Seok;Choe, Phillip;Kwon, Jin-Kyung;Kang, Byoung-Cheorl
    • Horticultural Science & Technology
    • /
    • v.32 no.4
    • /
    • pp.535-543
    • /
    • 2014
  • Backcross breeding is the method most commonly used to introgress new traits into elite lines. Conventional backcross breeding requires at least 4-5 generations to recover the genomic background of the recurrent parent. Marker-assisted backcrossing (MABC) represents a new breeding approach that can substantially reduce breeding time and cost. For successful MABC, highly polymorphic markers with known positions in each chromosome are essential. Single nucleotide polymorphism (SNP) markers have many advantages over other marker systems for MABC due to their high abundance and amenability to genotyping automation. To facilitate MABC in hot pepper (Capsicum annuum), we utilized expressed sequence tags (ESTs) to develop SNP markers in this study. For SNP identification, we used Bukang $F_1$-hybrid pepper ESTs to prepare a reference sequence through de novo assembly. We performed large-scale transcriptome sequencing of eight accessions using the Illumina Genome Analyzer (IGA) IIx platform by Solexa, which generated small sequence fragments of about 90-100 bp. By aligning each contig to the reference sequence, 58,151 SNPs were identified. After filtering for polymorphism, segregation ratio, and lack of proximity to other SNPS or exon/intron boundaries, a total of 1,910 putative SNPs were chosen and positioned to a pepper linkage map. We further selected 412 SNPs evenly distributed on each chromosome and primers were designed for high throughput SNP assays and tested using a genetic diversity panel of 27 Capsicum accessions. The SNP markers clearly distinguished each accession. These results suggest that the SNP marker set developed in this study will be valuable for MABC, genetic mapping, and comparative genome analysis.

Identification and Characterization of Polymorphic Microsatellite DNA Markers Using Next-generation Sequencing in Parapristipoma trilineatum (차세대 염기서열 분석법을 사용한 벤자리(Parapristipoma trilineatum)의 microsatellite 마커의 개발 및 유전학적 특성 분석)

  • Chun Mae Dong;Mi-Nan Lee;Jae Koo Noh;Jin Woo Park;Young-Ok Kim;Eun-Mi Kim
    • Journal of Life Science
    • /
    • v.33 no.8
    • /
    • pp.623-631
    • /
    • 2023
  • This study was conducted to develop microsatellite markers in Parapristipoma trilineatum using next-generation sequencing. A total of 402,244,934 reads were generated on the Illumina Hiseq X Ten System, yielding 60,738,985,034 bp of sequences. The de novo assembly resulted in 1,320,995 contigs. A total of 952,326 contigs (0.016%) including 151 microsatellite loci were derived from the 1,320,995 contigs longer than 640 bp. A total of 34 primer sets were designed from the 151 microsatellite loci. As a result, 15 microsatellite loci were chosen and used for assuming population genetic parameters in the wild and farmed populations. The mean number of effective alleles was 12, ranging from 6 to 25. The observed heterozygosity (HO) and the expected heterozygosity (HE) ranged between 0.530 and 0.873, with an average of 0.750, and from 0.647 to 0.895, with an average of 0.793, respectively. According to these results, the developed set of 15 microsatellite markers is expected to be useful for the analysis of genetic characteristics in the population of P. trilineatum in Korea. There are requirements now for further genetic information, fishery resource management, breeding guidelines, support with the selection of breeds and studies on the effects of release, all of which will improve species conservation, and through future research, we aim to offer genetic foundational data with that goal.

Metagenomic analysis of bacterial community structure and diversity of lignocellulolytic bacteria in Vietnamese native goat rumen

  • Do, Thi Huyen;Dao, Trong Khoa;Nguyen, Khanh Hoang Viet;Le, Ngoc Giang;Nguyen, Thi Mai Phuong;Le, Tung Lam;Phung, Thu Nguyet;Straalen, Nico M. van;Roelofs, Dick;Truong, Nam Hai
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.31 no.5
    • /
    • pp.738-747
    • /
    • 2018
  • Objective: In a previous study, analysis of Illumina sequenced metagenomic DNA data of bacteria in Vietnamese goats' rumen showed a high diversity of putative lignocellulolytic genes. In this study, taxonomy speculation of microbial community and lignocellulolytic bacteria population in the rumen was conducted to elucidate a role of bacterial structure for effective degradation of plant materials. Methods: The metagenomic data had been subjected into Basic Local Alignment Search Tool (BLASTX) algorithm and the National Center for Biotechnology Information non-redundant sequence database. Here the BLASTX hits were further processed by the Metagenome Analyzer program to statistically analyze the abundance of taxa. Results: Microbial community in the rumen is defined by dominance of Bacteroidetes compared to Firmicutes. The ratio of Firmicutes versus Bacteroidetes was 0.36:1. An abundance of Synergistetes was uniquely identified in the goat microbiome may be formed by host genotype. With regard to bacterial lignocellulose degraders, the ratio of lignocellulolytic genes affiliated with Firmicutes compared to the genes linked to Bacteroidetes was 0.11:1, in which the genes encoding putative hemicellulases, carbohydrate esterases, polysaccharide lyases originated from Bacteroidetes were 14 to 20 times higher than from Firmicutes. Firmicutes seem to possess more cellulose hydrolysis capacity showing a Firmicutes/Bacteroidetes ratio of 0.35:1. Analysis of lignocellulolytic potential degraders shows that four species belonged to Bacteroidetes phylum, while two species belonged to Firmicutes phylum harbouring at least 12 different catalytic domains for all lignocellulose pretreatment, cellulose, as well as hemicellulose saccharification. Conclusion: Based on these findings, we speculate that increasing the members of Bacteroidetes to keep a low ratio of Firmicutes versus Bacteroidetes in goat rumen has resulted most likely in an increased lignocellulose digestion.

Transcriptome Profiling and Characterization of Drought-Tolerant Potato Plant (Solanum tuberosum L.)

  • Moon, Ki-Beom;Ahn, Dong-Joo;Park, Ji-Sun;Jung, Won Yong;Cho, Hye Sun;Kim, Hye-Ran;Jeon, Jae-Heung;Park, Youn-il;Kim, Hyun-Soon
    • Molecules and Cells
    • /
    • v.41 no.11
    • /
    • pp.979-992
    • /
    • 2018
  • Potato (Solanum tuberosum L.) is the third most important food crop, and breeding drought-tolerant varieties is vital research goal. However, detailed molecular mechanisms in response to drought stress in potatoes are not well known. In this study, we developed EMS-mutagenized potatoes that showed significant tolerance to drought stress compared to the wild-type (WT) 'Desiree' cultivar. In addition, changes to transcripts as a result of drought stress in WT and drought-tolerant (DR) plants were investigated by de novo assembly using the Illumina platform. One-week-old WT and DR plants were treated with -1.8 Mpa polyethylene glycol-8000, and total RNA was prepared from plants harvested at 0, 6, 12, 24, and 48 h for subsequent RNA sequencing. In total, 61,100 transcripts and 5,118 differentially expressed genes (DEGs) displaying up- or down-regulation were identified in pairwise comparisons of WT and DR plants following drought conditions. Transcriptome profiling showed the number of DEGs with up-regulation and down-regulation at 909, 977, 1181, 1225 and 826 between WT and DR plants at 0, 6, 12, 24, and 48 h, respectively. Results of KEGG enrichment showed that the drought tolerance mechanism of the DR plant can mainly be explained by two aspects, the 'photosynthetic-antenna protein' and 'protein processing of the endoplasmic reticulum'. We also divided eight expression patterns in four pairwise comparisons of DR plants (DR0 vs DR6, DR12, DR24, DR48) under PEG treatment. Our comprehensive transcriptome data will further enhance our understanding of the mechanisms regulating drought tolerance in tetraploid potato cultivars.