• 제목/요약/키워드: Genome size

검색결과 353건 처리시간 0.026초

Comparative Genomic Analysis of Food-Originated Coagulase-Negative Staphylococcus: Analysis of Conserved Core Genes and Diversity of the Pan-Genome

  • Heo, Sojeong;Lee, Jung-Sug;Lee, Jong-Hoon;Jeong, Do-Won
    • Journal of Microbiology and Biotechnology
    • /
    • 제30권3호
    • /
    • pp.341-351
    • /
    • 2020
  • To shed light on the genetic differences among food-originated coagulase-negative Staphylococcus (CNS), we performed pan-genome analysis of five species: Staphylococcus carnosus (two strains), Staphylococcus equorum (two strains), Staphylococcus succinus (three strains), Staphylococcus xylosus (two strains), and Staphylococcus saprophyticus (one strain). The pan-genome size increases with each new strain and currently holds about 4,500 genes from 10 genomes. Specific genes were shown to be strain dependent but not species dependent. Most specific genes were of unknown function or encoded restriction-modification enzymes, transposases, or prophages. Our results indicate that unique genes have been acquired or lost by convergent evolution within individual strains.

The complete chloroplast genome sequence of Hibiscus sabdariffa (Malvaceae)

  • KWON, Soon-Ho;PARK, Yunmi;JANG, You Lim;KWON, Hae-Yun
    • 식물분류학회지
    • /
    • 제52권2호
    • /
    • pp.123-126
    • /
    • 2022
  • Hibiscus sabdariffa L., (roselle) in the Malvaceae family is an erect subshrub known to be native to India and Malaysia. It is widely used as a food or tea material around the world, and its therapeutic effects have been widely studied. In this study, the sequencing of the complete chloroplast genome of H. sabdariffa was carried out. The result indicates a genome size of 162,428 bp, which is composed of a large single copy of 90,327 bp, two inverted repeats of 26,242 bp each, and a small single copy of 19,617 bp. Overall, a total of 131 genes were predicted, including 86 coding sequences, 37 tRNAs, and 8 rRNAs. According to a phylogenic analysis, it was clearly distinguished from outgroups such as other species of the genus Hibiscus used in the analysis.

Whole genome sequence analysis of Ligilactobacillus agilis C7 isolated from pig feces revealed three bacteriocin gene clusters

  • Jeong Min, Yoo;Remilyn M., Mendoza;In-Chan, Hwang;Dae-Kyung, Kang
    • Journal of Animal Science and Technology
    • /
    • 제64권5호
    • /
    • pp.1008-1011
    • /
    • 2022
  • We here report the whole genome sequence of Ligilactobacillus agilis C7 with anti-listerial activity, which was isolated from pig feces. The genome size of L. agilis C7 (~ 3.0 Mb) is relatively larger compared with other L. agilis strains. L. agilis C7 carries three bacteriocin gene clusters encoding garvicin Q, salivaricin A, and Blp family class II bacteriocin. Garvicin Q and salivaricin A are reported to be active against Listeria monocytogenes and Micrococcus luteus, respectively, as well as against other Gram-positive bacteria. Meanwhile, the bacteriocin encoded in the blp cassette was shown to be active against pneumococci, mediating intraspecies competition. This report highlights the potential of L. agilis C7 for the production of bacteriocins inhibiting pathogenic bacteria.

The complete chloroplast genome sequence of Rhododendron caucasicum (Ericaceae)

  • Myounghai KWAK;Rainer W. BUSSMANN
    • 식물분류학회지
    • /
    • 제53권3호
    • /
    • pp.230-236
    • /
    • 2023
  • Rhododendron caucasicum Pall. is a shrub distributed in the mountainous areas of the Caucasus from northeastern Türkiye towards the Caspian Sea. This study reports the first complete chloroplast genome sequence of R. caucasicum. The plastome is 199,487 base pairs (bp) long and exhibits a typical quadripartite structure comprising a large single-copy region of 107,645 bp, a small single-copy region of 2,598 bp, and a pair of identical inverted repeat regions of 44,622 bp each. It contains 143 genes, comprising 93 protein-coding genes, 42 tRNA genes, and eight rRNA genes. The large chloroplast genome size is likely due to the expansion of inverted repeats. A phylogenetic analysis of chloroplast genomes with other Rhododendron species supports previously recognized infrageneric relationship.

Beta-Meta: a meta-analysis application considering heterogeneity among genome-wide association studies

  • Gyungbu Kim;Yoonsuk Lee;Jeong Ho Park;Dongmin Kim;Wonseok Lee
    • Genomics & Informatics
    • /
    • 제20권4호
    • /
    • pp.49.1-49.7
    • /
    • 2022
  • Many packages for a meta-analysis of genome-wide association studies (GWAS) have been developed to discover genetic variants. Although variations across studies must be considered, there are not many currently-accessible packages that estimate between-study heterogeneity. Thus, we propose a python based application called Beta-Meta which can easily process a meta-analysis by automatically selecting between a fixed effects and a random effects model based on heterogeneity. Beta-Meta implements flexible input data manipulation to allow multiple meta-analyses of different genotype-phenotype associations in a single process. It provides a step-by-step meta-analysis of GWAS for each association in the following order: heterogeneity test, two different calculations of an effect size and a p-value based on heterogeneity, and the Benjamini-Hochberg p-value adjustment. These methods enable users to validate the results of individual studies with greater statistical power and better estimation precision. We elaborate on these and illustrate them with examples from several studies of infertility-related disorders.

Comparison of Normalization Methods for Defining Copy Number Variation Using Whole-genome SNP Genotyping Data

  • Kim, Ji-Hong;Yim, Seon-Hee;Jeong, Yong-Bok;Jung, Seong-Hyun;Xu, Hai-Dong;Shin, Seung-Hun;Chung, Yeun-Jun
    • Genomics & Informatics
    • /
    • 제6권4호
    • /
    • pp.231-234
    • /
    • 2008
  • Precise and reliable identification of CNV is still important to fully understand the effect of CNV on genetic diversity and background of complex diseases. SNP marker has been used frequently to detect CNVs, but the analysis of SNP chip data for identifying CNV has not been well established. We compared various normalization methods for CNV analysis and suggest optimal normalization procedure for reliable CNV call. Four normal Koreans and NA10851 HapMap male samples were genotyped using Affymetrix Genome-Wide Human SNP array 5.0. We evaluated the effect of median and quantile normalization to find the optimal normalization for CNV detection based on SNP array data. We also explored the effect of Robust Multichip Average (RMA) background correction for each normalization process. In total, the following 4 combinations of normalization were tried: 1) Median normalization without RMA background correction, 2) Quantile normalization without RMA background correction, 3) Median normalization with RMA background correction, and 4) Quantile normalization with RMA background correction. CNV was called using SW-ARRAY algorithm. We applied 4 different combinations of normalization and compared the effect using intensity ratio profile, box plot, and MA plot. When we applied median and quantile normalizations without RMA background correction, both methods showed similar normalization effect and the final CNV calls were also similar in terms of number and size. In both median and quantile normalizations, RMA backgroundcorrection resulted in widening the range of intensity ratio distribution, which may suggest that RMA background correction may help to detect more CNVs compared to no correction.

Whole Genome Analysis of the Red-Crowned Crane Provides Insight into Avian Longevity

  • Lee, HyeJin;Kim, Jungeun;Weber, Jessica A.;Chung, Oksung;Cho, Yun Sung;Jho, Sungwoong;Jun, JeHoon;Kim, Hak-Min;Lim, Jeongheui;Choi, Jae-Pil;Jeon, Sungwon;Blazyte, Asta;Edwards, Jeremy S.;Paek, Woon Kee;Bhak, Jong
    • Molecules and Cells
    • /
    • 제43권1호
    • /
    • pp.86-95
    • /
    • 2020
  • The red-crowned crane (Grus japonensis) is an endangered, large-bodied crane native to East Asia. It is a traditional symbol of longevity and its long lifespan has been confirmed both in captivity and in the wild. Lifespan in birds is known to be positively correlated with body size and negatively correlated with metabolic rate, though the genetic mechanisms for the red-crowned crane's long lifespan have not previously been investigated. Using whole genome sequencing and comparative evolutionary analyses against the grey-crowned crane and other avian genomes, including the long-lived common ostrich, we identified redcrowned crane candidate genes with known associations with longevity. Among these are positively selected genes in metabolism and immunity pathways (NDUFA5, NDUFA8, NUDT12, SOD3, CTH, RPA1, PHAX, HNMT, HS2ST1, PPCDC, PSTK CD8B, GP9, IL-9R, and PTPRC). Our analyses provide genetic evidence for low metabolic rate and longevity, accompanied by possible convergent adaptation signatures among distantly related large and long-lived birds. Finally, we identified low genetic diversity in the red-crowned crane, consistent with its listing as an endangered species, and this genome should provide a useful genetic resource for future conservation studies of this rare and iconic species.

꼬막(Tegillarca granosa)의 유전적 다양성 분석을 위한 드래프트 게놈분석과 마이크로새틀라이트 마커 발굴 (Genome Survey and Microsatellite Marker Selection of Tegillarca granosa)

  • 김진무;이승재;조은아;최은경;김현진;이정식;박현
    • 한국해양생명과학회지
    • /
    • 제6권1호
    • /
    • pp.38-46
    • /
    • 2021
  • 꼬막 종류 중 하나인 Tegillarca granosa는 해양 이매패류로서 한국, 중국, 일본 등의 중요한 수산 자원 중 하나이다. 꼬막의 염색체 수는 2n=38로 알려져 있지만, 유전체의 크기와 유전 정보에 대해서는 아직 명확하게 알려져 있지 않다. 꼬막의 유전체 크기 예측을 위하여 NGS Illumina HiSeq 플랫폼을 이용하여 얻은 짧은 DNA 서열 정보를 통하여 in silico 분석으로 유전체 크기를 분석하였다. 그 결과 꼬막의 유전체 크기는 770.61 Mb로 예측되었다. 이후 MaSuRCA assembler를 통하여 드래프트 게놈 조립 작업을 수행하고, QDD pipeline을 이용하여 SSR (simple sequence repeats) 분석을 수행하였다. 꼬막의 유전체로부터 43,944개의 SSR을 발굴하였으며, 다이-뉴클레오타이드(di-nucleotide) 69.51%, 트라이-뉴클레오타이드(tri-nucleotide) 16.68%, 테트라-뉴클레오타이드(tetra-nucleotide) 12.96%, 펜타-뉴클레오타이드(penta-nucleotide) 0.82% 그리고 헥사-뉴클레오타이드(hexa-nucleotide) 0.03%로 구성되었다. 이후 꼬막의 유전적 다양성 연구에 활용할 수 있는 100개의 마이크로새틀라이트 마커의 프라이머 세트를 선별하였다. 앞으로 이번 연구를 통해서, 꼬막의 집단유전학적 연구와 유전적 다양성을 규명하는데 도움이 될 것이며, 나아가 동종들 간의 원산지 분류를 알아낼 수 있을 것이다.

계층적 정렬쌍 가시화를 이용한 유전자 클러스터 탐색 알고리즘 (A Gene Clustering Method with Hierarchical Visualization of Alignment Pairs)

  • 진희정;박수현;조환규
    • 정보처리학회논문지A
    • /
    • 제16A권3호
    • /
    • pp.143-152
    • /
    • 2009
  • 최근 생물정보학 분야의 연구는 하나하나의 유전자를 연구하던 예전의 방법에서 유전자들간의 관계를 알아보는 연구들로 변해가고 있다. 이러한 유전자들 간의 연구 중 하나가 유전자 팀(gene team)을 연구하는 것이다. 유전자 팀이란 몇몇 염색체들 사이의 유전자들이 보존되어 있는 것을 말하며, 닫힌 영역 안에 보존되어 있는 유전자들의 집합으로 볼 수 있다. 이들은 진화과정을 거치면서, 유전자 팀 내의 유전자들의 위치나 그 종류가 변한다. 이러한 유전자 팀을 찾기 위해 많은 연구들이 이루어져왔다. 본 논문은 생물정보학 분야에서 많이 사용되는 계층적 클러스터링(hierarchical clustering)방법을 변형하여 전체 유전체(whole genome) 쌍내에서의 의미 있는 영역을 찾고, 영역 내에서 gene team을 찾을 수 있는 방법을 소개한다. 본 연구 방법을 이용하면, 복잡한 구조의 두 유전체 사이의 연관 유전자들이나 유사 영역들의 맵(map)을 단계별로 간략화 하여 나타낼 수 있다.

해양퇴적물로부터 분리된 Pseudoalteromonas sp. meg-B1의 유전체 분석 (Draft genome sequence of Pseudoalteromonas sp. meg-B1 isolated from marine sediment)

  • 박수제;박세욱
    • 미생물학회지
    • /
    • 제54권3호
    • /
    • pp.280-282
    • /
    • 2018
  • Gammaproteobacteria에 속하는 Pseudoalteromonas sp. meg-B1을 제주도 해양 퇴적물로부터 분리하였다. 본 연구에서는 대략 4.15 Mb의 크기와 41.2%의 평균 G + C 함량을 가진 meg-B1 균주의 완전한 유전체를 보고한다. 유전체는 3,606개의 코딩 서열, 9개의 리보솜 RNA 및 94개의 전사 RNA 유전자가 존재하며, 한 개의 완전한 프로파지 영역이 발견되었다. 본 유전체는 해양환경에서 생존하기 위한 삼투화합성 용질합성과 관련된 유전자(예, choline dehydrogenase)들이 확인되었다.