• Title/Summary/Keyword: Whole-genome sequence

Search Result 213, Processing Time 0.025 seconds

Prediction of Genes Related to Positive Selection Using Whole-Genome Resequencing in Three Commercial Pig Breeds

  • Kim, HyoYoung;Caetano-Anolles, Kelsey;Seo, Minseok;Kwon, Young-jun;Cho, Seoae;Seo, Kangseok;Kim, Heebal
    • Genomics & Informatics
    • /
    • v.13 no.4
    • /
    • pp.137-145
    • /
    • 2015
  • Selective sweep can cause genetic differentiation across populations, which allows for the identification of possible causative regions/genes underlying important traits. The pig has experienced a long history of allele frequency changes through artificial selection in the domestication process. We obtained an average of 329,482,871 sequence reads for 24 pigs from three pig breeds: Yorkshire (n = 5), Landrace (n = 13), and Duroc (n = 6). An average read depth of 11.7 was obtained using whole-genome resequencing on an Illumina HiSeq2000 platform. In this study, cross-population extended haplotype homozygosity and cross-population composite likelihood ratio tests were implemented to detect genes experiencing positive selection for the genome-wide resequencing data generated from three commercial pig breeds. In our results, 26, 7, and 14 genes from Yorkshire, Landrace, and Duroc, respectively were detected by two kinds of statistical tests. Significant evidence for positive selection was identified on genes ST6GALNAC2 and EPHX1 in Yorkshire, PARK2 in Landrace, and BMP6, SLA-DQA1, and PRKG1 in Duroc. These genes are reportedly relevant to lactation, reproduction, meat quality, and growth traits. To understand how these single nucleotide polymorphisms (SNPs) related positive selection affect protein function, we analyzed the effect of non-synonymous SNPs. Three SNPs (rs324509622, rs80931851, and rs80937718) in the SLA-DQA1 gene were significant in the enrichment tests, indicating strong evidence for positive selection in Duroc. Our analyses identified genes under positive selection for lactation, reproduction, and meat-quality and growth traits in Yorkshire, Landrace, and Duroc, respectively.

From the Sequence to Cell Modeling: Comprehensive Functional Genomics in Escherichia coli

  • Mori, Hirotada
    • BMB Reports
    • /
    • v.37 no.1
    • /
    • pp.83-92
    • /
    • 2004
  • As a result of the enormous amount of information that has been collected with E. coli over the past half century (e.g. genome sequence, mutant phenotypes, metabolic and regulatory networks, etc.), we now have detailed knowledge about gene regulation, protein activity, several hundred enzyme reactions, metabolic pathways, macromolecular machines, and regulatory interactions for this model organism. However, understanding how all these processes interact to form a living cell will require further characterization, quantification, data integration, and mathematical modeling, systems biology. No organism can rival E. coli with respect to the amount of available basic information and experimental tractability for the technologies needed for this undertaking. A focused, systematic effort to understand the E. coli cell will accelerate the development of new post-genomic technologies, including both experimental and computational tools. It will also lead to new technologies that will be applicable to other organisms, from microbes to plants, animals, and humans. E. coli is not only the best studied free-living model organism, but is also an extensively used microbe for industrial applications, especially for the production of small molecules of interest. It is an excellent representative of Gram-negative commensal bacteria. E. coli may represent a perfect model organism for systems biology that is aimed at elucidating both its free-living and commensal life-styles, which should open the door to whole-cell modeling and simulation.

Korea Brassica Genome Project: Current Status and Prospective (배추 유전체열구의 현황과 전망)

  • Choi, Su-Ryun;Park, Jee-Yong;Park, Beom-Seok;Kim, Ho-Il;Lim, Yong-Pyo
    • Journal of Plant Biotechnology
    • /
    • v.33 no.3
    • /
    • pp.153-160
    • /
    • 2006
  • Brassica rape is an important species used as a vegetable, oil, and fodder worldwide. It is related phylogenically to Arabidopsis thaliana, which has already been fully sequenced as a model plant. The 'Multinational Brassica Genome Project (MBGP)'was launched by the international Brassica community with the aim of sequencing the whole genome of B. rapa in 2003 on account of its value and the fact that it has the smallest genome among the diploid Brassica. The genome study was carried out not only to know the structure of genome but also to understand the function and the evolution of the genes comprehensively. There are two mapping populations, over 1,000 molecular markers and a genetic map, 2 BAC libraries, physical map, a 22 cDHA libraries as suitable genomic materials for examining the genome of B. rapa ssp. pekinensis Chinese cabbage. As the first step for whole genome analysis, 220,000 BAC-end sequences of the KBrH and KBrB BAC library are achieved by cooperation of six countries. The results of BAC-end sequence analysis will provide a clue in understanding the structure of the genome of Brassica rapa by analyzing the gene sequence, annotation and abundant repetitive DHA. The second stage involves sequencing of the genetically mapped seed BACs and identifying the overlapping BACs for complete genome sequencing. Currently, the second stage is comprises of process genetic anchoring using communal populations and maps to identify more than 1,000 seed BACs based on a BAC-to-BAC strategy. For the initial sequencing, 629 seed BACs corresponding to the minimum tiling path onto Arabidopsis genome were selected and fully sequenced. These BACs are now anchoring to the genetic map using the development of SSR markers. This information will be useful for identifying near BAC clones with the seed BAC on a genome map. From the BAC sequences, it is revealed that the Brassica rapa genome has extensive triplication of the DNA segment coupled with variable gene losses and rearrangements within the segments. This article introduces the current status and prospective of Korea Brassica Genome Project and the bioinformatics tools possessed in each national team. In the near future, data of the genome will contribute to improving Brassicas for their economic use as well as in understanding the evolutional process.

Genomic Research of the Genus Bifidobacterium and Its Application (비피도박테리아의 분자생물학적인 연구 동향)

  • Kim, Geun-Bae
    • Journal of Dairy Science and Biotechnology
    • /
    • v.25 no.2
    • /
    • pp.21-28
    • /
    • 2007
  • Recently, the field of microbiology has been transformed by huge increasing number of publicly available whole-genome sequences. This sequence information has significantly enhanced our understanding of the physiology, genetics, and evolutionary development of bacteria. Among the gastrointestinal microorganisms, bifidobacteria represent the most important human commensals because of their contribution to maintaining a balanced gastrointestinal tract microbiota. In recent years bifidobacteria have drawn much scientific attention due to their use as live bacteria in numerous food products with various health-related claims. For this reason, these bacteria constitute a growing area of interest with respect to genomics, molecular biology, and genetics. Recent genome sequencing of a number of bifidobacterial species has allowed access to the complete genetic make-up of these bacteria. This review will focus how genomic data has allowed us to understand bifidobacterial evolution, while also revealing genetic functions that explains their presence in the particular ecological environment of the gastrointestinal tract.

  • PDF

Genomic Research as a Means to Understand Bacterial Phylogeny and Ecological Adaptation of the Genus Bifidobacterium (Bifidobacterium의 분자생물학적 연구 동향)

  • Kim, Geun-Bae
    • 한국유가공학회:학술대회논문집
    • /
    • 2007.09a
    • /
    • pp.21-29
    • /
    • 2007
  • The field of microbiology has in recent years been transformed by huge increasing number of publicly available whole-genome sequences. This sequence information has significantly enhanced our understanding of the physiology, genetics, and evolutionary development of bacteria. Among the gastrointestinal microorganisms, bifidobacteria represent important human commensals because of their perceived contribution to maintaining a balanced gastrointestinal tract microbiota. In recent years bifidobacteria have drawn much scientific attention due to their use as live bacteria in numerous food products with various health-related claims. For this reason, these bacteria constitute a growing area of interest with respect to genomics, molecular biology, and genetics. Recent genome sequencing of a number of bifidobacterial species has allowed access to the complete genetic make-up of these bacteria. This review will focus how genomic data has allowed us to understand bifidobacterial evolution, while also revealing genetic functions that explains their presence in the particular ecological environment of the gastrointestinal tract.

  • PDF

Genomic Approaches for Understanding the Characteristics of Salmonella enterica subsp. enterica Serovar Typhimurium ST1120, Isolated from Swine Feces in Korea

  • Kim, Seongok;Kim, Eunsuk;Park, Soyeon;Hahn, Tae-Wook;Yoon, Hyunjin
    • Journal of Microbiology and Biotechnology
    • /
    • v.27 no.11
    • /
    • pp.1983-1993
    • /
    • 2017
  • Salmonella enterica subsp. enterica serovar Typhimurium, one of the most common foodborne pathogens, is transmitted mainly through contaminated food derived from infected animals. In this study, S. Typhimurium ST1120, an isolate from pig feces in Korea, was subjected to whole-genome analysis to understand its genomic features associated with virulence. The genome of ST1120 was found to have a circular chromosome of 4,855,001 bp (GC content 52.2%) and a plasmid of 6,863 bp (GC content 46.0%). This chromosome was predicted to have 4,558 open reading frames (ORFs), 17 pseudogenes, 22 rRNA genes, and 86 tRNA genes. Its plasmid was predicted to have three ORFs. Comparative genome analysis revealed that ST1120 was phylogenetically close to S. Typhimurium U288, a critical isolate in piggery farms and food chains in Europe. In silico functional analysis predicted that the ST1120 genome harbored multiple genes associated with virulence and stress resistance, including Salmonella pathogenicity islands (SPIs containing SPI-1 to SPI-5, SPI-13, and SPI-14), C63PI locus, ST104 prophage locus, and various antibiotic resistance genes. In accordance with these analysis results, ST1120 showed competence in invasion and survival abilities when it was added to host cells. It also exhibited robust resistance against antibiotics in comparison with other S. Typhimurium strains. This is the first report of the complete genome sequence of S. Typhimurium isolated from swine in Korea. Comparative genome analysis between ST1120 and other Salmonella strains would provide fruitful information toward understanding Salmonella host specificity and developing control measures against S. Typhimurium infection.

Comparative Analysis of Salmonella enterica subsp. enterica Serovar Thompson Isolates associated with Outbreaks Using PFGE and wgMLST

  • Youngho Koh;Yunyoung Bae;Min-Jung Lee;Yu-Si Lee;Dong-Hyun Kang;Soon Han Kim
    • Journal of Microbiology and Biotechnology
    • /
    • v.32 no.12
    • /
    • pp.1605-1614
    • /
    • 2022
  • The strains associated with foodborne Salmonella enterica Thompson outbreaks in Korea have not been identified. Therefore, we characterized S. Thompson strains isolated from chocolate cakes linked to foodborne outbreaks in Korea. A total of 56 strains were isolated from preserved cake products, products in the supply chain distribution, the manufacturer's apparatus, and egg white liquid products used for cream preparation. Subsequently, serological typing, pathogenic gene-targeted polymerase chain reaction (PCR), pulsed-field gel electrophoresis (PFGE), and whole-genome multi-locus sequence typing (wgMLST) were performed to characterize these isolates. The antigen formula of all isolates was 7:k:1,5, namely Salmonella enterica subsp. enterica Serovar Thompson. All 56 isolates harbored invA, his, hin, and stn, and were negative for sefA and spvC based on gene-targeted PCR analyses. Based on PFGE results, these isolates were classified into one group based on the same SP6X01.011 pattern with 100% similarity. We selected 19 strains based on the region and sample type, which were subjected to wgMLST. Although the examined strains showed 100% similarity, they were classified into seven clusters based on allelic differences. According to our findings, the cause of these outbreaks was chocolate cake manufactured with egg white liquid contaminated with the same Salmonella Thompson. Additionally, comparative analysis of wgMLST on domestic isolates of S. Thompson from the three outbreaks showed genetic similarities of over 99.6%. Based on the results, the PFGE and wgMLST combination can provide highly resolved phylogeny and reliable evidence during Salmonella outbreak investigations.

Comparative Genomic Analysis of Staphylococcus aureus FORC_001 and S. aureus MRSA252 Reveals the Characteristics of Antibiotic Resistance and Virulence Factors for Human Infection

  • Lim, Sooyeon;Lee, Dong-Hoon;Kwak, Woori;Shin, Hakdong;Ku, Hye-Jin;Lee, Jong-eun;Lee, Gun Eui;Kim, Heebal;Choi, Sang-Ho;Ryu, Sangryeol;Lee, Ju-Hoon
    • Journal of Microbiology and Biotechnology
    • /
    • v.25 no.1
    • /
    • pp.98-108
    • /
    • 2015
  • Staphylococcus aureus is an important foodborne pathogen that causes diverse diseases ranging from minor infections to life-threatening conditions in humans and animals. To further understand its pathogenesis, the genome of the strain S. aureus FORC_001 was isolated from a contaminated food. Its genome consists of 2,886,017 bp double-stranded DNA with a GC content of 32.8%. It is predicted to contain 2,728 open reading frames, 57 tRNAs, and 6 rRNA operons, including 1 additional 5S rRNA gene. Comparative phylogenetic tree analysis of 40 complete S. aureus genome sequences using average nucleotide identity (ANI) revealed that strain FORC_001 belonged to Group I. The closest phylogenetic match was S. aureus MRSA252, according to a whole-genome ANI (99.87%), suggesting that they might share a common ancestor. Comparative genome analysis of FORC_001 and MRSA252 revealed two non-homologous regions: Regions I and II. The presence of various antibiotic resistance genes, including the SCCmec cluster in Region I of MRSA252, suggests that this strain might have acquired the SCCmec cluster to adapt to specific environments containing methicillin. Region II of both genomes contains prophage regions but their DNA sequence identity is very low, suggesting that the prophages might differ. This is the first report of the complete genome sequence of S. aureus isolated from a real foodborne outbreak in South Korea. This report would be helpful to extend our understanding about the genome, general characteristics, and virulence factors of S. aureus for further studies of pathogenesis, rapid detection, and epidemiological investigation in foodborne outbreak.

Whole-exome sequencing analysis in a case of primary congenital glaucoma due to the partial uniparental isodisomy

  • Zavarzadeh, Parisima Ghaffarian;Bonyadi, Morteza;Abedi, Zahra
    • Genomics & Informatics
    • /
    • v.20 no.3
    • /
    • pp.28.1-28.7
    • /
    • 2022
  • We described a clinical, laboratory, and genetic presentation of a pathogenic variant of the CYP1B1 gene through a report of a case of primary congenital glaucoma and a trio analysis of this candidate variant in the family with the Sanger sequencing method and eventually completed our study with the secondary/incidental findings. This study reports a rare case of primary congenital glaucoma, an 8-year-old female child with a negative family history of glaucoma and uncontrolled intraocular pressure. This case's whole-exome sequencing data analysis presents a homozygous pathogenic single nucleotide variant in the CYP1B1 gene (NM_000104:exon3:c.G1103A:p.R368H). At the same time, this pathogenic variant was obtained as a heterozygous state in her unaffected father but not her mother. The diagnosis was made based on molecular findings of whole-exome sequencing data analysis. Therefore, the clinical reports and bioinformatics findings supported the relation between the candidate pathogenic variant and the disease. However, it should not be forgotten that primary congenital glaucoma is not peculiar to the CYP1B1 gene. Since the chance of developing autosomal recessive disorders with low allele frequency and unrelated parents is extraordinary in offspring. However, further data analysis of whole-exome sequencing and Sanger sequencing method were applied to obtain the type of mutation and how it was carried to the offspring.

Development of PCR-based markers specific to Solanum brevicaule by using the complete chloroplast genome sequences of Solanum species (엽록체 전장유전체 비교를 통한 PCR 기반의 Solanum brevicaule 특이적 분자마커 개발)

  • Park, Tae-Ho
    • Journal of Plant Biotechnology
    • /
    • v.49 no.1
    • /
    • pp.30-38
    • /
    • 2022
  • Solanum brevicaule is one of the tuber-bearing wild Solanum species. Because of its resistance to several important pathogens infecting potatoes during cultivation, it can be used for potato breeding. However, the fact that S. brevicaule used in this study has an EBN value of two causes the sexual reproduction barriers between the species and cultivated potatoes. In this study, specific markers for discriminating S. brevicaule from other Solanum species were developed on the basis of the results of sequence alignments with the whole chloroplast genomes of S. brevicaule and seven other Solanum species. The chloroplast genome of S. brevicaule was completed by next-generation sequencing technology described in other recent studies. The total sequence length of the chloroplast genome of S. brevicaule is 155,531 bp. Its structure and gene composition are similar to those of other Solanum species. Phylogenetic analysis revealed that S. brevicaule was closely grouped with other Solanum species. BLASTN search showed that its genome sequence had 99.99% and 99.89% identity with those of S. spegazzinii (MH021562) and S. kurtzianum (MH021495), respectively. Sequence alignment identified 27 SNPs that were specific to S. brevicaule. Thus, three PCR-based CAPS markers specific to S. brevicaule were developed on the basis of these SNPs. This study will facilitate in further studies on evolutionary and breeding aspects in Solanum species.