• 제목/요약/키워드: genome annotation

검색결과 179건 처리시간 0.024초

Genome Wide Analysis of the Potato Soft Rot Pathogen Pectobacterium carotovorum Strain ICMP 5702 to Predict Novel Insights into Its Genetic Features

  • Mallick, Tista;Mishra, Rukmini;Mohanty, Sasmita;Joshi, Raj Kumar
    • The Plant Pathology Journal
    • /
    • 제38권2호
    • /
    • pp.102-114
    • /
    • 2022
  • Pectobacterium carotovorum subsp. carotovorum (Pcc) is a gram-negative, broad host range bacterial pathogen which causes soft rot disease in potatoes as well as other vegetables worldwide. While Pectobacterium infection relies on the production of major cell wall degrading enzymes, other virulence factors and the mechanism of genetic adaptation of this pathogen is not yet clear. In the present study, we have performed an in-depth genome-wide characterization of Pcc strain ICMP5702 isolated from potato and compared it with other pathogenic bacteria from the Pectobacterium genus to identify key virulent determinants. The draft genome of Pcc ICMP5702 contains 4,774,457 bp with a G + C content of 51.90% and 4,520 open reading frames. Genome annotation revealed prominent genes encoding key virulence factors such as plant cell wall degrading enzymes, flagella-based motility, phage proteins, cell membrane structures, and secretion systems. Whereas, a majority of determinants were conserved among the Pectobacterium strains, few notable genes encoding AvrE-family type III secretion system effectors, pectate lyase and metalloprotease in addition to the CRISPR-Cas based adaptive immune system were uniquely represented. Overall, the information generated through this study will contribute to decipher the mechanism of infection and adaptive immunity in Pcc.

Complete genome sequence of Pediococcus acidilactici CACC 537 isolated from canine

  • Jung-Ae Kim;Hyun-Jun Jang;Dae-Hyuk Kim;Youn Kyoung Son;Yangseon Kim
    • Journal of Animal Science and Technology
    • /
    • 제65권5호
    • /
    • pp.1105-1109
    • /
    • 2023
  • Pedi coccus acidilactici CACC 537 was isolated from canine feces and reported to have probiotic properties. We aimed to characterize the potential probiotic properties of this strain by functional genomic analysis. Complete genome sequencing of P. acidilactici CACC 537 was performed using a PacBio RSII and Illumina platform, and contained one circular chromosome (2.0 Mb) with a 42% G + C content. The sequences were annotation revealed 1,897 protein-coding sequences, 15 rRNAs, and 56 tRNAs. It was determined that P. acidilactici CACC 537 genome carries genes known to be involved in the immune system, defense mechanisms, restriction-modification (R-M), and the CRISPR system. CACC 537 was shown to be beneficial in preventing pathogen infection during the fermentation process, help host immunity, and maintain intestinal health. These results provide for a comprehensive understanding of P. acidilactici and the development of industrial probiotic feed additives that can help improve host immunity and intestinal health.

KUGI: A Database and Search System for Korean Unigene and Pathway Information

  • Yang, Jin-Ok;Hahn, Yoon-Soo;Kim, Nam-Soon;Yu, Ung-Sik;Woo, Hyun-Goo;Chu, In-Sun;Kim, Yong-Sung;Yoo, Hyang-Sook;Kim, Sang-Soo
    • 한국생물정보학회:학술대회논문집
    • /
    • 한국생물정보시스템생물학회 2005년도 BIOINFO 2005
    • /
    • pp.407-411
    • /
    • 2005
  • KUGI (Korean UniGene Information) database contains the annotation information of the cDNA sequences obtained from the disease samples prevalent in Korean. A total of about 157,000 5'-EST high throughput sequences collected from cDNA libraries of stomach, liver, and some cancer tissues or established cell lines from Korean patients were clustered to about 35,000 contigs. From each cluster a representative clone having the longest high quality sequence or the start codon was selected. We stored the sequences of the representative clones and the clustered contigs in the KUGI database together with their information analyzed by running Blast against RefSeq, human mRNA, and UniGene databases from NCBI. We provide a web-based search engine fur the KUGI database using two types of user interfaces: attribute-based search and similarity search of the sequences. For attribute-based search, we use DBMS technology while we use BLAST that supports various similarity search options. The search system allows not only multiple queries, but also various query types. The results are as follows: 1) information of clones and libraries, 2) accession keys, location on genome, gene ontology, and pathways to public databases, 3) links to external programs, and 4) sequence information of contig and 5'-end of clones. We believe that the KUGI database and search system may provide very useful information that can be used in the study for elucidating the causes of the disease that are prevalent in Korean.

  • PDF

The complete mitochondrial genome sequence of the indigenous I pig (Sus scrofa) in Vietnam

  • Nguyen, Hieu Duc;Bui, Tuan Anh;Nguyen, Phuong Thanh;Kim, Oanh Thi Phuong;Vo, Thuy Thi Bich
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제30권7호
    • /
    • pp.930-937
    • /
    • 2017
  • Objective: The I pig is a long nurtured longstanding breed in Vietnam, and contains excellent indigenous genetic resources. However, after 1970s, I pig breeds have become a small population because of decreasing farming areas and increasing pressure from foreign breeds with a high growth rate. Thus, there is now the risk of the disappearance of the I pigs breed. The aim of this study was to focus on classifying and identifying the I pig genetic origin and supplying molecular makers for conservation activities. Methods: This study sequenced the complete mitochondrial genome and used the sequencing result to analyze the phylogenetic relationship of I pig with Asian and European domestic pigs and wild boars. The full sequence was annotated and predicted the secondary tRNA. Results: The total length of I pig mitochondrial genome (accession number KX094894) was 16,731 base pairs, comprised two rRNA (12S and 16S), 22 tRNA and 13 mRNA genes. The annotation structures were not different from other pig breeds. Some component indexes as AT content, GC, and AT skew were counted, in which AT content (60.09%) was smaller than other pigs. We built the phylogenetic trees from full sequence and D loop sequence using Bayesian method. The result showed that I pig, Banna mini, wild boar (WB) Vietnam and WB Hainan or WB Korea, WB Japan were a cluster. They were a group within the Asian clade distinct from Chinese pigs and other Asian breeds in both phylogenetic trees (0.0004 and 0.0057, respectively). Conclusion: These results were similar to previous phylogenic study in Vietnamese pig and showed the genetic distinctness of I pig with other Asian domestic pigs.

Whole genome sequence of Staphylococcus aureus strain RMI-014804 isolated from pulmonary patient sputum via next-generation sequencing technology

  • Ayesha, Wisal;Asad Ullah;Waheed Anwar;Carlos M. Morel;Syed Shah Hassan
    • Genomics & Informatics
    • /
    • 제21권3호
    • /
    • pp.34.1-34.10
    • /
    • 2023
  • Nosocomial infections, commonly referred to as healthcare-associated infections, are illnesses that patients get while hospitalized and are typically either not yet manifest or may develop. One of the most prevalent nosocomial diseases in hospitalized patients is pneumonia, among the leading causes of mortality and morbidity. Viral, bacterial, and fungal pathogens cause pneumonia. More severe introductions commonly included Staphylococcus aureus, which is at the top of bacterial infections, per World Health Organization reports. The staphylococci, S. aureus, strain RMI-014804, mesophile, on-sporulating, and non-motile bacterium, was isolated from the sputum of a pulmonary patient in Pakistan. Many characteristics of S. aureus strain RMI-014804 have been revealed in this paper, with complete genome sequence and annotation. Our findings indicate that the genome is a single circular 2.82 Mbp long genome with 1,962 protein-coding genes, 15 rRNA, 49 tRNA, 62 pseudogenes, and a GC content of 28.76%. As a result of this genome sequencing analysis, researchers will fully understand the genetic and molecular basis of the virulence of the S. aureus bacteria, which could help prevent the spread of nosocomial infections like pneumonia. Genome analysis of this strain was necessary to identify the specific genes and molecular mechanisms that contribute to its pathogenicity, antibiotic resistance, and genetic diversity, allowing for a more in-depth investigation of its pathogenesis to develop new treatments and preventive measures against infections caused by this bacterium.

Whole Genome Resequencing of Heugu (Korean Black Cattle) for the Genome-Wide SNP Discovery

  • Choi, Jung-Woo;Chung, Won-Hyong;Lee, Kyung-Tai;Choi, Jae-Won;Jung, Kyoung-Sub;Cho, Yongmin;Kim, Namshin;Kim, Tae-Hun
    • 한국축산식품학회지
    • /
    • 제33권6호
    • /
    • pp.715-722
    • /
    • 2013
  • Heugu (Korea Black Cattle) is one of the indigenous cattle breeds in Korea; however there has been severe lack of genomic studies on the breed. In this study, we report the first whole genome resequencing of Heugu at higher sequence coverage using Illumina HiSeq 2000 platform. More than 153.6 Giga base pairs sequence was obtained, of which 97% of the reads were mapped to the bovine reference sequence assembly (UMD 3.1). The number of non-redundantly mapped sequence reads corresponds to approximately 28.9-fold coverage across the genome. From these data, we identified a total of over six million single nucleotide polymorphisms (SNPs), of which 29.4% were found to be novel using the single nucleotide polymorphism database build 137. Extensive annotation was performed on all the detected SNPs, showing that most of SNPs were located in intergenic regions (70.7%), which is well corresponded with previous studies. Of the total SNPs, we identified substantial numbers of non-synonymous SNPs (13,979) in 5,999 genes, which could potentially affect meat quality traits in cattle. These results provide genome-wide SNPs that can serve as useful genetic tools and as candidates in searches for phenotype-altering DNA difference implicated with meat quality traits in cattle. The importance of this study can be further pronounced with the first whole genome sequencing of the valuable local genetic resource to be used in further genomic comparison studies with diverse cattle breeds.

Draft Genome Sequence of the Reference Strain of the Korean Medicinal Mushroom Wolfiporia cocos KMCC03342

  • Bogun Kim;Byoungnam Min;Jae-Gu Han;Hongjae Park;Seungwoo Baek;Subin Jeong;In-Geol Choi
    • Mycobiology
    • /
    • 제50권4호
    • /
    • pp.254-257
    • /
    • 2022
  • Wolfiporia cocos is a wood-decay brown rot fungus belonging to the family Polyporaceae. While the fungus grows, the sclerotium body of the strain, dubbed Bokryeong in Korean, is formed around the roots of conifer trees. The dried sclerotium has been widely used as a key component of many medicinal recipes in East Asia. Wolfiporia cocos strain KMCC03342 is the reference strain registered and maintained by the Korea Seed and Variety Service for commercial uses. Here, we present the first draft genome sequence of W. cocos KMCC03342 using a hybrid assembly technique combining both short- and long-read sequences. The genome has a total length of 55.5 Mb comprised of 343 contigs with N50 of 332 kb and 95.8% BUSCO completeness. The GC ratio was 52.2%. We predicted 14,296 protein-coding gene models based on ab initio gene prediction and evidence-based annotation procedure using RNAseq data. The annotated genome was predicted to have 19 terpene biosynthesis gene clusters, which was the same number as the previously sequenced W. cocos strain MD-104 genome but higher than Chinese W. cocos strains. The genome sequence and the predicted gene clusters allow us to study biosynthetic pathways for the active ingredients of W. cocos.

Improving spaCy dependency annotation and PoS tagging web service using independent NER services

  • Colic, Nico;Rinaldi, Fabio
    • Genomics & Informatics
    • /
    • 제17권2호
    • /
    • pp.21.1-21.6
    • /
    • 2019
  • Dependency parsing is often used as a component in many text analysis pipelines. However, performance, especially in specialized domains, suffers from the presence of complex terminology. Our hypothesis is that including named entity annotations can improve the speed and quality of dependency parses. As part of BLAH5, we built a web service delivering improved dependency parses by taking into account named entity annotations obtained by third party services. Our evaluation shows improved results and better speed.

CONVIRT: A web-based tool for transcriptional regulatory site identification using a conserved virtual chromosome

  • Ryu, Tae-Woo;Lee, Se-Joon;Hur, Cheol-Goo;Lee, Do-Heon
    • BMB Reports
    • /
    • 제42권12호
    • /
    • pp.823-828
    • /
    • 2009
  • Techniques for analyzing protein-DNA interactions on a genome-wide scale have recently established regulatory roles for distal enhancers. However, the large sizes of higher eukaryotic genomes have made identification of these elements difficult. Information regarding sequence conservation, exon annotation and repetitive regions can be used to reduce the size of the search region. However, previously developed resources are inadequate for consolidating such information. CONVIRT is a web resource for the identification of transcription factor binding sites and also features comparative genomics. Genomic information on ortholog-independent conserved regions, exons, repeats and sequences is integrated into the virtual chromosome, and statistically over-represented single or combinations of transcription factor binding sites are sought. CONVIRT provides regulatory network analysis for several organisms with long promoter regions and permits inter-species genome alignments. CONVIRT is freely available at http://biosoft.kaist.ac.kr/convirt.