• 제목/요약/키워드: Complete genome sequences

검색결과 173건 처리시간 0.024초

Complete Chloroplast Genome assembly and Annotation of Milk Thistle (Silybum marianum) and Phylogenetic Analysis

  • Hwajin Jung;Yedomon Ange Bovys Zoclanclounon;Jeongwoo Lee;Taeho Lee;Jeonggu Kim;Guhwang Park;Keunpyo Lee;Kwanghoon An;Jeehyoung Shim;Joonghyoun Chin;Suyoung Hong
    • 한국작물학회:학술대회논문집
    • /
    • 한국작물학회 2022년도 추계학술대회
    • /
    • pp.210-210
    • /
    • 2022
  • Silybum marianum is an annual or biennial plant from the Asteraceae family. It can grow in low-nutrient soil and drought conditions, making it easy to cultivate. From the seed, a specialized plant metabolite called silymarin (flavonolignan complex) is produced and is known to alleviate the liver from hepatitis and toxins damages. To infer the phylogenetic placement of a Korean milk thistle, we conducted a chloroplast assembly and annotation following by a comparison with existing Chinese reference genome (NC_028027). The chloroplast genome structure was highly similar with an assembly size of 152,642 bp, an 153,202 bp for Korean and Chinese milk thistle respectively. Moreover, there were similarities at the gene level, coding sequence (n = 82), transfer RNA (n = 31) and ribosomal RNA (n = 4). From all coding sequences gene set, the phylogenetic tree inference placed the Korean cultivar into the milk thistle clade; corroborating the expected tree. Moreover, an investigation the tree based only on the ycf1 gene confirmed the same tree; suggesting that ycf1 gene is a potential marker for DNA barcoding and population diversity study in milk thistle genus. Overall, the provided data represents a valuable resource for population genomics and species-centered determination since several species have been reported in the Silybum genus.

  • PDF

De novo Genome Assembly and Single Nucleotide Variations for Soybean Mosaic Virus Using Soybean Seed Transcriptome Data

  • Jo, Yeonhwa;Choi, Hoseong;Bae, Miah;Kim, Sang-Min;Kim, Sun-Lim;Lee, Bong Choon;Cho, Won Kyong;Kim, Kook-Hyung
    • The Plant Pathology Journal
    • /
    • 제33권5호
    • /
    • pp.478-487
    • /
    • 2017
  • Soybean is the most important legume crop in the world. Several diseases in soybean lead to serious yield losses in major soybean-producing countries. Moreover, soybean can be infected by diverse viruses. Recently, we carried out a large-scale screening to identify viruses infecting soybean using available soybean transcriptome data. Of the screened transcriptomes, a soybean transcriptome for soybean seed development analysis contains several virus-associated sequences. In this study, we identified five viruses, including soybean mosaic virus (SMV), infecting soybean by de novo transcriptome assembly followed by blast search. We assembled a nearly complete consensus genome sequence of SMV China using transcriptome data. Based on phylogenetic analysis, the consensus genome sequence of SMV China was closely related to SMV isolates from South Korea. We examined single nucleotide variations (SNVs) for SMVs in the soybean seed transcriptome revealing 780 SNVs, which were evenly distributed on the SMV genome. Four SNVs, C-U, U-C, A-G, and G-A, were frequently identified. This result demonstrated the quasispecies variation of the SMV genome. Taken together, this study carried out bioinformatics analyses to identify viruses using soybean transcriptome data. In addition, we demonstrated the application of soybean transcriptome data for virus genome assembly and SNV analysis.

A phylogenetic analysis of the Korean endemic species Paraphlomis koreana (Lamiaceae) inferred from nuclear and plastid DNA sequences

  • Eun-Kyeong HAN;Jung-Hyun KIM;Jin-Seok KIM;Chang Woo HYUN;Dong Chan SON;Gyu Young CHUNG;Amarsanaa GANTSETSEG;Jung-Hyun LEE;In-Su CHOI
    • 식물분류학회지
    • /
    • 제53권2호
    • /
    • pp.157-165
    • /
    • 2023
  • Paraphlomis koreana (Lamiaceae) was newly named and added to Korean flora in 2014. Paraphlomis belongs to the tribe Paraphlomideae, along with Ajugoides and Matsumurella. However, a recent study has suggested that P. koreana is morphologically similar to Matsumurella chinensis, making them difficult to distinguish from each other. Therefore, we aimed to examine the phylogenetic placement of P. koreana within the tribe and compare its genetic relationship with M. chinensis. We sequenced an additional complete plastid genome for an individual of P. koreana and generated sequences of nuclear ribosomal (nr) DNA regions of internal and external transcribed spacers (ITS and ETS) for two individuals of P. koreana. Maximum likelihood analyses based on two nrDNA regions (ITS and ETS) and four plastid DNA markers (rpl16 intron, rpl32-trnL, rps16 intron, and trnL-F) covering 13 Paraphlomis species and M. chinensis were conducted. Phylogenetic analyses concordantly supported that P. koreana forms a monophyletic group with M. chinensis. Moreover, our study revealed that P. koreana includes nrDNA sequences of M. chinensis as minor intra-individual variants, suggesting that the genetic divergence between the two taxa is incomplete and may represent intraspecific variation rather than distinct species. In conclusion, our findings suggest that the independent species status of P. koreana within Paraphlomis should be reconsidered.

한국에서 분리된 파밤나방 핵다각체병 바이러스의 전체 유전체 분석 (Complete Genome Analysis of Spodoptera exigua Nucleopolyhedrovirus Isolated in Korea)

  • 최재방;김현수;우수동
    • 한국응용곤충학회지
    • /
    • 제61권3호
    • /
    • pp.449-460
    • /
    • 2022
  • 광식성 난방제 해충인 파밤나방(Spodoptera exigua)의 친환경적 방제원으로써 이용을 위해 국내에서 분리된 파밤나방 핵다각체병바이러스(S. exigua nucleopolyhedrovirus K1: SeNPV-K1)의 형태 및 전체 유전체 서열을 분석하였다. SeNPV-K1의 다각체(polyhedra)는 0.6-1.8 um 크기의 부정형으로, 기 보고된 SeNPV와 외형적 차이는 보이지 않았다. 전체 유전체의 염기서열을 분석한 결과, 기 보고된 SeNPV와 비교할 때 145 bp 더 많은 135,756 bp로 확인되었으며, G+C 함량은 44% 였고 상동반복영역은 6개로 두 바이러스간에 차이는 없었다. ORF 분석결과, SeNPV-K1은 기 보고된 것과 비교할 때 2개 더 적은 137개를 가지며, SeNPV-K1에만 존재하는 ORF는 4개가 확인되었다. 이들 4개의 ORF는 비필수 유전자로 바이러스의 특성에는 큰 영향을 주지 않을 것으로 여겨졌다. 유전체의 vista 분석 결과, SeNPV-K1과 기 보고된 SeNPV의 전체 염기서열 유사도가 매우 높은 것으로 확인되었다. 국내에서 처음으로 분석한 SeNPV-K1의 전체 유전체는 기 보고된 SeNPV와 유사한 것으로 나타났으나 서로 다른 분리주로 국내 고유자원임을 확인하였다.

Molecular Characterization of the HERV-W Env Gene in Humans and Primates: Expression, FISH, Phylogeny, and Evolution

  • Kim, Heui-Soo;Kim, Dae-Soo;Huh, Jae-Won;Ahn, Kung;Yi, Joo-Mi;Lee, Ja-Rang;Hirai, Hirohisa
    • Molecules and Cells
    • /
    • 제26권1호
    • /
    • pp.53-60
    • /
    • 2008
  • We characterized the human endogenous retrovirus (HERV-W) family in humans and primates. In silico expression data indicated that 22 complete HERV-W families from human chromosomes 1-3, 5-8, 10-12, 15, 19, and X are randomly expressed in various tissues. Quantitative real-time RT-PCR analysis of the HERV-W env gene derived from human chromosome 7q21.2 indicated predominant expression in the human placenta. Several copies of repeat sequences (SINE, LINE, LTR, simple repeat) were detected within the complete or processed pseudo HERV-W of the human, chimpanzee, and rhesus monkey. Compared to other regions (5'LTR, Gag, Gag-Pol, Env, 3'LTR), the repeat family has been mainly integrated into the region spanning the 5'LTRs of Gag (1398 bp) and Pol (3242 bp). FISH detected the HERV-W probe (fosWE1) derived from a gorilla fosmid library in the metaphase chromosomes of all primates (five hominoids, three Old World monkeys, two New World monkeys, and one prosimian), but not in Tupaia. This finding was supported by molecular clock and phylogeny data using the divergence values of the complete HERV-W LTR elements. The data suggested that the HERV-W family was integrated into the primate genome approximately 63 million years (Myr) ago, and evolved independently during the course of primate radiation.

Characteristics of Cucumber mosaic virus Infecting Zucchini in Korea

  • Kim, Mi-Kyeong;Kwak, Hae-Ryun;Jeong, Seon-Gi;Ko, Sug-Ju;Lee, Su-Heon;Kim, Jeong-Soo;Kim, Kook-Hyung;Choi, Jang-Kyung;Choi, Hong-Soo;Cha, Byeong-Jin
    • The Plant Pathology Journal
    • /
    • 제26권2호
    • /
    • pp.139-148
    • /
    • 2010
  • A virus causing stunt, yellowing, severe mosaic, malformation symptoms on leaves and uneven development and malformation on fruits of zucchini was prevalent around Goseong, Gyeongsangnam-do, Korea. A survey conducted (2004) in the Goseong area revealed about 20% virus infection rate. The disease causative identified as Cucumber mosaic virus (CMV-Z1) was further characterized. The isolate induces mosaic symptoms on Cucumis sativus, while severe mosaic, stunt and malformation on C. pepo. Thin section analyses have shown that virus inclusions are formed in the cuticle layers as well as epidermal, parenchyma and collenchymas cells in virus-infected Nicotiana tabacum. CMV-Z1 isolate induced specific cytoplasmic inclusion bodies such as irregular clumps (IC), crystal (Cr) and irregular chloroplasts (ICh). IC was made up of virus particles interspersed with a darkly stained amorphous material and found both in the cytoplasm and vacuoles, whereas ICh and Cr were rarely found in the vacuoles. The genome of CMV-Z1 RNA-1 consists of 3359 nucleotide (nt) encoding 1a protein of 993 amino acids (aa). The CMV-Z1 RNA-2 was 3050 nt in length containing 2a (857 aa) and 2b (110 aa), while RNA-3 encoding 3a movement protein (279 aa) and coat protein (218 aa) was 2215 nt in length. Phylogenetic analyses of nucleotide sequences of CMV-Z1 isolate appeared it is more closely related to subgroup IA than to subgroup IB or II.

Comparative genetic analyses of Korean bat coronaviruses with SARS-CoV and the newly emerged SARS-CoV-2

  • Na, Eun-Jee;Lee, Sook-Young;Kim, Hak Jun;Oem, Jae-Ku
    • Journal of Veterinary Science
    • /
    • 제22권1호
    • /
    • pp.12.1-12.11
    • /
    • 2021
  • Background: Bats have been considered natural reservoirs for several pathogenic human coronaviruses (CoVs) in the last two decades. Recently, a bat CoV was detected in the Republic of Korea; its entire genome was sequenced and reported to be genetically similar to that of the severe acute respiratory syndrome CoV (SARS-CoV). Objectives: The objective of this study was to compare the genetic sequences of SARS-CoV, SARS-CoV-2, and the two Korean bat CoV strains 16BO133 and B15-21, to estimate the likelihood of an interaction between the Korean bat CoVs and the human angiotensin-converting enzyme 2 (ACE2) receptor. Methods: The phylogenetic analysis was conducted with the maximum-likelihood (ML) method using MEGA 7 software. The Korean bat CoVs receptor binding domain (RBD) of the spike protein was analyzed by comparative homology modeling using the SWISS-MODEL server. The binding energies of the complexes were calculated using PRODIGY and MM/GBGA. Results: Phylogenetic analyses of the entire RNA-dependent RNA polymerase, spike regions, and the complete genome revealed that the Korean CoVs, along with SARS-CoV and SARS-CoV-2, belong to the subgenus Sarbecovirus, within BetaCoVs. However, the two Korean CoVs were distinct from SARS-CoV-2. Specifically, the spike gene of the Korean CoVs, which is involved in host infection, differed from that of SARS-CoV-2, showing only 66.8%-67.0% nucleotide homology and presented deletions within the RBD, particularly within regions critical for cross-species transmission and that mediate interaction with ACE2. Binding free energy calculation revealed that the binding affinity of Korean bat CoV RBD to hACE2 was drastically lower than that of SARS-CoV and SARS-CoV-2. Conclusions: These results suggest that Korean bat CoVs are unlikely to bind to the human ACE2 receptor.

Genomic Analysis of Halotolerant Bacterial Strains Martelella soudanensis NC18T and NC20

  • Jung-Yun Lee;Dong-Hun Kim
    • Journal of Microbiology and Biotechnology
    • /
    • 제32권11호
    • /
    • pp.1427-1434
    • /
    • 2022
  • Two novel, halotolerant strains of Martelella soudanensis, NC18T and NC20, were isolated from deep subsurface sediment, deeply sequenced, and comparatively analyzed with related strains. Based on a phylogenetic analysis using 16S rRNA gene sequences, the two strains grouped with members of the genus Martelella. Here, we sequenced the complete genomes of NC18T and NC20 to understand the mechanisms of their halotolerance. The genome sizes and G+C content of the strains were 6.1 Mb and 61.8 mol%, respectively. Moreover, NC18T and NC20 were predicted to contain 5,849 and 5,830 genes, and 5,502 and 5,585 protein-coding genes, respectively. Both strains contain the identically predicted 6 rRNAs and 48 tRNAs. The harboring of halotolerant-associated genes revealed that strains NC18T and NC20 might tolerate high salinity through the accumulation of potassium ions in a "salt-in" strategy induced by K+ uptake protein (kup) and the K+ transport system (trkAH and kdpFABC). These two strains also use the ectoine transport system (dctPQM), the glycine betaine transport system (proVWX), and glycine betaine uptake protein (opu) to accumulate "compatible solutes," such as ectoine and glycine betaine, to protect cells from salt stress. This study reveals the halotolerance mechanism of strains NC18T and NC20 in high salt environments and suggests potential applications for these halotolerant and halophilic strains in environmental biotechnology.

COVID-19 progression towards ARDS: a genome wide study reveals host factors underlying critical COVID-19

  • Shama Mujawar;Gayatri Patil;Srushti Suthar;Tanuja Shendkar;Vaishnavi Gangadhar
    • Genomics & Informatics
    • /
    • 제21권2호
    • /
    • pp.16.1-16.14
    • /
    • 2023
  • Coronavirus disease 2019 (COVID-19) is a viral infection produced by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus epidemic, which was declared a global pandemic in March 2020. The World Health Organization has recorded around 43.3 billion cases and 59.4 million casualties to date, posing a severe threat to global health. Severe COVID-19 indicates viral pneumonia caused by the SARS-CoV-2 infections, which can induce fatal consequences, including acute respiratory distress syndrome (ARDS). The purpose of this research is to better understand the COVID-19 and ARDS pathways, as well as to find targeted single nucleotide polymorphism. To accomplish this, we retrieved over 100 patients' samples from the Sequence Read Archive, National Center for Biotechnology Information. These sequences were processed through the Galaxy server next generation sequencing pipeline for variant analysis and then visualized in the Integrative Genomics Viewer, and performed statistical analysis using t-tests and Bonferroni correction, where six major genes were identified as DNAH7, CLUAP1, PPA2, PAPSS1, TLR4, and IFITM3. Furthermore, a complete understanding of the genomes of COVID-19-related ARDS will aid in the early identification and treatment of target proteins. Finally, the discovery of novel therapeutics based on discovered proteins can assist to slow the progression of ARDS and lower fatality rates.