• Title/Summary/Keyword: Whole-genome sequence

Search Result 210, Processing Time 0.025 seconds

Whole-genome sequence analysis through online web interfaces: a review

  • Gunasekara, A.W.A.C.W.R.;Rajapaksha, L.G.T.G.;Tung, T.L.
    • Genomics & Informatics
    • /
    • v.20 no.1
    • /
    • pp.3.1-3.10
    • /
    • 2022
  • The recent development of whole-genome sequencing technologies paved the way for understanding the genomes of microorganisms. Every whole-genome sequencing (WGS) project requires a considerable cost and a massive effort to address the questions at hand. The final step of WGS is data analysis. The analysis of whole-genome sequence is dependent on highly sophisticated bioinformatics tools that the research personal have to buy. However, many laboratories and research institutions do not have the bioinformatics capabilities to analyze the genomic data and therefore, are unable to take maximum advantage of whole-genome sequencing. In this aspect, this study provides a guide for research personals on a set of bioinformatics tools available online that can be used to analyze whole-genome sequence data of bacterial genomes. The web interfaces described here have many advantages and, in most cases exempting the need for costly analysis tools and intensive computing resources.

Whole Mitochondrial Genome Sequence of an Indian Plasmodium falciparum Field Isolate

  • Tyagi, Suchi;Pande, Veena;Das, Aparup
    • Parasites, Hosts and Diseases
    • /
    • v.52 no.1
    • /
    • pp.99-103
    • /
    • 2014
  • Mitochondrial genome sequence of malaria parasites has served as a potential marker for inferring evolutionary history of the Plasmodium genus. In Plasmodium falciparum, the mitochondrial genome sequences from around the globe have provided important evolutionary understanding, but no Indian sequence has yet been utilized. We have sequenced the whole mitochondrial genome of a single P. falciparum field isolate from India using novel primers and compared with the 3D7 reference sequence and 1 previously reported Indian sequence. While the 2 Indian sequences were highly divergent from each other, the presently sequenced isolate was highly similar to the reference 3D7 strain.

Development of Workbench for Analysis and Visualization of Whole Genome Sequence (전유전체(Whole gerlome) 서열 분석과 가시화를 위한 워크벤치 개발)

  • Choe, Jeong-Hyeon;Jin, Hui-Jeong;Kim, Cheol-Min;Jang, Cheol-Hun;Jo, Hwan-Gyu
    • The KIPS Transactions:PartA
    • /
    • v.9A no.3
    • /
    • pp.387-398
    • /
    • 2002
  • As whole genome sequences of many organisms have been revealed by small-scale genome projects, the intensive research on individual genes and their functions has been performed. However on-memory algorithms are inefficient to analysis of whole genome sequences, since the size of individual whole genome is from several million base pairs to hundreds billion base pairs. In order to effectively manipulate the huge sequence data, it is necessary to use the indexed data structure for external memory. In this paper, we introduce a workbench system for analysis and visualization of whole genome sequence using string B-tree that is suitable for analysis of huge data. This system consists of two parts : analysis query part and visualization part. Query system supports various transactions such as sequence search, k-occurrence, and k-mer analysis. Visualization system helps biological scientist to easily understand whole structure and specificity by many kinds of visualization such as whole genome sequence, annotation, CGR (Chaos Game Representation), k-mer, and RWP (Random Walk Plot). One can find the relations among organisms, predict the genes in a genome, and research on the function of junk DNA using our workbench.

Complete Genome Sequence of the Enterobacter asburiae IK3 Isolated from a Soybean (Glycine max) Rhizosphere

  • Sihyun Park;GyuDae Lee;Ikwhan Kim;Yeongyu Jeong;Jae-Ho Shin
    • Microbiology and Biotechnology Letters
    • /
    • v.51 no.3
    • /
    • pp.306-308
    • /
    • 2023
  • This research presents the whole-genome sequence of Enterobacter asburiae strain IK3, which was isolated from the rhizosphere soil of soybean (Glycine max). The genome of the strain is composed of a single chromosome with 4 plasmids, total size of 5,084,040 bp, and the GC content is 55.5%.

Complete Genome Sequence of Bifidobacterium bifidum DS0908, Isolated from Human Fecal Sample

  • Haneol Yang;Yong-Sik Kim;Doo-Sang Park
    • Microbiology and Biotechnology Letters
    • /
    • v.51 no.4
    • /
    • pp.566-568
    • /
    • 2023
  • In this report, we present the whole-genome sequence of Bifidobacterium bifidum DS0908 isolated from the human fecal sample. The genome composed of a single circular chromosome is 2,223,317 bp long and the DNA G+C content is 62.65%. No virulence genes were detected in the genomic sequences of B. bifidum DS0908.

Survey of the Applications of NGS to Whole-Genome Sequencing and Expression Profiling

  • Lim, Jong-Sung;Choi, Beom-Soon;Lee, Jeong-Soo;Shin, Chan-Seok;Yang, Tae-Jin;Rhee, Jae-Sung;Lee, Jae-Seong;Choi, Ik-Young
    • Genomics & Informatics
    • /
    • v.10 no.1
    • /
    • pp.1-8
    • /
    • 2012
  • Recently, the technologies of DNA sequence variation and gene expression profiling have been used widely as approaches in the expertise of genome biology and genetics. The application to genome study has been particularly developed with the introduction of the nextgeneration DNA sequencer (NGS) Roche/454 and Illumina/ Solexa systems, along with bioinformation analysis technologies of whole-genome $de$ $novo$ assembly, expression profiling, DNA variation discovery, and genotyping. Both massive whole-genome shotgun paired-end sequencing and mate paired-end sequencing data are important steps for constructing $de$ $novo$ assembly of novel genome sequencing data. It is necessary to have DNA sequence information from a multiplatform NGS with at least $2{\times}$ and $30{\times}$ depth sequence of genome coverage using Roche/454 and Illumina/Solexa, respectively, for effective an way of de novo assembly. Massive shortlength reading data from the Illumina/Solexa system is enough to discover DNA variation, resulting in reducing the cost of DNA sequencing. Whole-genome expression profile data are useful to approach genome system biology with quantification of expressed RNAs from a wholegenome transcriptome, depending on the tissue samples. The hybrid mRNA sequences from Rohce/454 and Illumina/Solexa are more powerful to find novel genes through $de$ $novo$ assembly in any whole-genome sequenced species. The $20{\times}$ and $50{\times}$ coverage of the estimated transcriptome sequences using Roche/454 and Illumina/Solexa, respectively, is effective to create novel expressed reference sequences. However, only an average $30{\times}$ coverage of a transcriptome with short read sequences of Illumina/Solexa is enough to check expression quantification, compared to the reference expressed sequence tag sequence.

Whole Genome Resequencing of Heugu (Korean Black Cattle) for the Genome-Wide SNP Discovery

  • Choi, Jung-Woo;Chung, Won-Hyong;Lee, Kyung-Tai;Choi, Jae-Won;Jung, Kyoung-Sub;Cho, Yongmin;Kim, Namshin;Kim, Tae-Hun
    • Food Science of Animal Resources
    • /
    • v.33 no.6
    • /
    • pp.715-722
    • /
    • 2013
  • Heugu (Korea Black Cattle) is one of the indigenous cattle breeds in Korea; however there has been severe lack of genomic studies on the breed. In this study, we report the first whole genome resequencing of Heugu at higher sequence coverage using Illumina HiSeq 2000 platform. More than 153.6 Giga base pairs sequence was obtained, of which 97% of the reads were mapped to the bovine reference sequence assembly (UMD 3.1). The number of non-redundantly mapped sequence reads corresponds to approximately 28.9-fold coverage across the genome. From these data, we identified a total of over six million single nucleotide polymorphisms (SNPs), of which 29.4% were found to be novel using the single nucleotide polymorphism database build 137. Extensive annotation was performed on all the detected SNPs, showing that most of SNPs were located in intergenic regions (70.7%), which is well corresponded with previous studies. Of the total SNPs, we identified substantial numbers of non-synonymous SNPs (13,979) in 5,999 genes, which could potentially affect meat quality traits in cattle. These results provide genome-wide SNPs that can serve as useful genetic tools and as candidates in searches for phenotype-altering DNA difference implicated with meat quality traits in cattle. The importance of this study can be further pronounced with the first whole genome sequencing of the valuable local genetic resource to be used in further genomic comparison studies with diverse cattle breeds.

Five Computer Simulation Studies of Whole-Genome Fragment Assembly: The Case of Assembling Zymomonas mobilis ZM4 Sequences

  • Jung, Cholhee;Choi, Jin-Young;Park, Hyun Seck;Seo, Jeong-Sun
    • Genomics & Informatics
    • /
    • v.2 no.4
    • /
    • pp.184-190
    • /
    • 2004
  • An approach for genome analysis based on assembly of fragments of DNA from the whole genome can be applied to obtain the complete nucleotide sequence of the genome of Zymomonas mobilis. However, the problem of fragment assembly raise thorny computational issues. Computer simulation studies of sequence assembly usually show some abnormal assemblage of artificial sequences containing repetitive or duplicated regions, and suggest methods to correct those abnormalities. In this paper, we describe five simulation studies which had been performed previous to the actual genome assembly process of Zymomonas mobilis ZM4.

Determination of Complete Genome Sequence of Korean Isolate of Potato virus X

  • Choi, Sun-Hee;Ryu, Ki-Hyun
    • The Plant Pathology Journal
    • /
    • v.24 no.3
    • /
    • pp.361-364
    • /
    • 2008
  • The complete nucleotide sequences of a Korean isolate of Potato virus X(PVX-Kr) has been determined. Full-length cDNA of PVX-Kr has been directly amplified by long template reverse transcription and polymerase chain reaction(RT-PCR) using virus specific 5'-end primer and 3'-end primer, and then constructed in a plasmid vector. Consecutive subclones of a full-length cDNA clone were constructed to identify whole genome sequence of the virus. Total nucleotide sequences of genome of PVX-Kr were 6,435 excluding one adenine at poly A tail, and genome organization was identical with that of typical PVX species. Comparison of whole genome sequence of PVX-Kr with those of European and South American isolates showed 95.4-96.8% and 77.4-77.9%, in nucleotide similarity, respectively. Sequenced PVX-Kr in this study and twelve isolates already reported could be divided into two subgroups in phylogeny based on their complete nucleotide sequences. Phylogenetic tree analysis demonstrated that PVX-Kr was clustered with European and Asian isolates(Taiwan, os, bs, Kr, S, X3, UK3, ROTH1, Tula) in the same subgroup and South American isolates(CP, CP2, CP4, HB) were clustered in the other subgroup.