• Title/Summary/Keyword: Genome sequences

Search Result 851, Processing Time 0.029 seconds

The extension of the largest generalized-eigenvalue based distance metric Dij1) in arbitrary feature spaces to classify composite data points

  • Daoud, Mosaab
    • Genomics & Informatics
    • /
    • v.17 no.4
    • /
    • pp.39.1-39.20
    • /
    • 2019
  • Analyzing patterns in data points embedded in linear and non-linear feature spaces is considered as one of the common research problems among different research areas, for example: data mining, machine learning, pattern recognition, and multivariate analysis. In this paper, data points are heterogeneous sets of biosequences (composite data points). A composite data point is a set of ordinary data points (e.g., set of feature vectors). We theoretically extend the derivation of the largest generalized eigenvalue-based distance metric Dij1) in any linear and non-linear feature spaces. We prove that Dij1) is a metric under any linear and non-linear feature transformation function. We show the sufficiency and efficiency of using the decision rule $\bar{{\delta}}_{{\Xi}i}$(i.e., mean of Dij1)) in classification of heterogeneous sets of biosequences compared with the decision rules min𝚵iand median𝚵i. We analyze the impact of linear and non-linear transformation functions on classifying/clustering collections of heterogeneous sets of biosequences. The impact of the length of a sequence in a heterogeneous sequence-set generated by simulation on the classification and clustering results in linear and non-linear feature spaces is empirically shown in this paper. We propose a new concept: the limiting dispersion map of the existing clusters in heterogeneous sets of biosequences embedded in linear and nonlinear feature spaces, which is based on the limiting distribution of nucleotide compositions estimated from real data sets. Finally, the empirical conclusions and the scientific evidences are deduced from the experiments to support the theoretical side stated in this paper.

Application of Disease Resistance Markers for Developing Elite Tomato Varieties and Lines

  • Kim, Hyoun-Joung;Lee, Heung-Ryul;Hyun, Ji-Young;Won, Dong-Chan;Hong, Dong-Oh;Cho, Hwa-Jin;Lee, Kyung-Ah;Her, Nam-Han;Lee, Jang-Ha;Harn, Chee-Hark
    • Horticultural Science & Technology
    • /
    • v.29 no.4
    • /
    • pp.336-344
    • /
    • 2011
  • Using the abundant available information about the tomato genome, we developed DNA markers that are linked to disease resistant loci and performed marker-assisted selection (MAS) to construct multi-disease resistant lines and varieties. Resistance markers of Ty-1, T2, and I2, which are linked to disease resistance to Tomato yellow leaf curl virus (TYLCV), Tomato mosaic virus (ToMV), and Fusarium wilt, respectively, were developed in a co-dominant fashion. DNA sequences near the resistance loci of TYLCV, ToMV, and Fusarium wilt were used for primer design. Reported candidate markers for powdery mildew-resistance were screened and the 32.5Cla marker was selected. All four markers (Ty-1, T2, I2, and 32.5Cla) were converted to cleavage amplification polymorphisms (CAPS) markers. Then, the CAPS markers were applied to 96 tomato lines to determine the phenetic relationships among the lines. This information yielded clusters of breeding lines illustrating the distribution of resistant and susceptible characters among lines. These data were utilized further in a MAS program for several generations, and a total of ten varieties and ten inbred lines were constructed. Among four traits, three were introduced to develop varieties and breeding lines through the MAS program; several cultivars possessed up to seven disease resistant traits. These resistant trait-related markers that were developed for the tomato MAS program could be used to select early stage seedlings, saving time and cost, and to construct multi-disease resistant lines and varieties.

Transcriptional Activator Elements for Curtovirus C1 Expression Reside in the 3' Coding Region of ORF C1

  • Hur, Jingyung;Buckley, Kenneth J.;Lee, Sukchan;Davis, Keith R.
    • Molecules and Cells
    • /
    • v.23 no.1
    • /
    • pp.80-87
    • /
    • 2007
  • Beet curly top virus (BCTV) and Beet severe curly top virus (BSCTV), members of curtoviruses, encode seven open reading frames (ORFs) within a ~3 kb genome. One of these viral ORFs, C1, is known to play an important role in the early stage of viral infection in plants during initiation of viral DNA replication. We used promoter:: reporter (${\beta}$-glucuronidase) gene fusions in transgenic Arabidopsis to identify the putative promoter region of BCTV ORF C1. Unlike other geminiviruses, the intergenic region of BCTV was not sufficient to promote C1 expression in transgenic plants. When sequences extending into the coding region of C1 were tested, strong expression of the reporter protein was observed in vascular tissues of transgenic plants. This expression was not dependent on the presence of the intergenic regions or proximal 5' portions of the C1 coding region. Transgenic plants expressing a reporter gene under control of the putative complete C1 promoter were inoculated with virus to determine if any viral transcript affected C1 expression. Virus inoculated plants did not show any altered pattern or change in of reporter gene expression level. These results suggest that (1) important transcriptional activator elements for C1 expression reside in the 3' portion of C1 coding area itself, (2) C1 protein does not auto-regulate its own expression and (3) C1 expression of two curtoviruses is controlled differently compared to other geminiviruses.

Identification of a Third Haplotype of the Sequence Linked to the Restorer-of-fertility (Rf) Gene and Its Implications for Male-Sterility Phenotypes in Peppers (Capsicum annuum L.)

  • Min, Woong-ki;Lim, Heerae;Lee, Young-Pyo;Sung, Soon-Kee;Kim, Byung-Dong;Kim, Sunggil
    • Molecules and Cells
    • /
    • v.25 no.1
    • /
    • pp.20-29
    • /
    • 2008
  • Cytoplasmic male sterility (CMS), one of the most important traits in crop breeding, has been used for commercial seed production by $F_1$ hybrid cultivars of pepper (Capsicum annuum L.). To develop reliable molecular markers for allelic selection of the Restorer-of-fertility (Rf) gene, which is known to be a major determinant of pollen fertility restoration in peppers, a sequence of approximately 10 kb flanking an RAPD fragment closely linked to the Rf locus was obtained by genome walking. A homology search revealed that this sequence contained an LTR retrotransposon and a non-LTR LINE-like retrotransposon. Sequencing of this Rf-linked region to search for polymorphisms between a dominant and recessive allele revealed 98% nucleotide sequence identity between them. A third polymorphic haplotype of the Rf-linked sequence, which has 94-96% nucleotide sequence identity with the two previously isolated haplotypes, was identified among a large number of breeding lines. Utilizing polymorphic sequences in the haplotypes, PCR markers were developed for selection of particular haplotypes and used to examine the distribution of the haplotypes in diverse breeding lines, cultivars, and C. annuum germplasms. Surprisingly, the third haplotype was the predominant type in C. annuum germplasms, while its frequency in $F_1$ hybrid cultivars was relatively low. Meanwhile, analysis of breeding lines whose Rf allele genotypes and male-sterility phenotypes were already known revealed that the third haplotype was mainly present in exotic breeding lines that cause unstable male-sterility when combined with sterile cytoplasms.

Fermentation and Sporulation Characteristics of Saccharomyces cerevisiae SHY111 Isolated from Korean Traditional Rice Wine

  • Kim, Seung-Hwan;Chung, Oon-Chan;Woo, Im-Sun;Shin, Jae-Ho;Rho, Dong-Hyun;Rhee, In-Koo;Park, Heui-Dong
    • Journal of Microbiology and Biotechnology
    • /
    • v.10 no.6
    • /
    • pp.776-783
    • /
    • 2000
  • Various alcohol yeast strains have been isolated from main mashes of Korean traditional liquors, and their genetic diversities were previously reported [23]. In this study, the strain SHY111, showing the highest alcohol production, was tested for its fermentation and sporulation characteristics. Additionally, its haploid cells were isolated and tested for their growth and fermentation patterns. The strain was identified as Saccharomyces cerevisiae based on its morphological and physiological characteristics. The sequences of the ITS(internal transcribed spacer) and 5.8S rDNA regions of S. cerevisiae SHY111 were found to be identical to those of S. cerevisiae that was obtained from through the yeast genome project. The maximum fermentation ratio obtained by the strain SHY111 (96.7%) was almost the same as that by S. cerevisiae Balyun No. 1 (96.5%) that was a little higher than that by S. cerevisiae KCCM11215(95.8%). The strain was induced for sporulation in a sporulation liquid medium using log phase cells grown in different types of pre-sporulation media, and its haploid cells were obtained by spore dissection using a micromanipulator. The majority of the spores formed a small colony on a YPD agar plate, and the haploid yeast cells derived from the strain SHY111 showed a variety of growth and alcohol fermentation patterns. It was proposed that the fermentation patterns were related to their growth phenotypes in the most haploid strains, but possible not in some strains.

  • PDF

Simultaneous Detection of Cytomegalovirus, Epstein-Barr Virus, Hepatitis B Virus, and Parvovirus by a Multiplex PCR (다중 중합효소 연쇄반응을 이용한 DNA 바이러스의 동시검출)

  • Sung, Hye-Ran;Joo, Jin-Young;Lee, Chong-Kil;Chung, Yeon-Bok;Song, Suk-Gil
    • Korean Journal of Microbiology
    • /
    • v.43 no.1
    • /
    • pp.1-6
    • /
    • 2007
  • We describe a multiplex PCR method that can detect and differentiate simultaneously four different kinds of DNA viruses, Epstein-Barr virus (EBV), cytomegalovirus (CMV), hepatitis B virus (HBV) and parvovirus B19 (B19). Primers for the multiplex PCR reaction were designed to amplify specific regions of the EBV (pol), CMV (pol), HBV (pol) and B19 (ns) viral genomes and used to simultaneously detect individual viruses. In order to achieve optimal sensitivity and specificity for multiplex PCR, the thermo-cycling parameters, primer sequences, and concentration of each reaction components were optimized systematically. The sensitivity of the detection method ranged between 5 and 10 copies of viral genome with a mixture of multiple primer pairs. Furthermore, this highly sensitive test showed no cross-reactivity among the four viruses. Thus, the results obtained in this study provide evidence that the assay system is a good tool for supporting the diagnosis of viral infection and contamination.

Development and Molecular Characterization of Novel Polymorphic Genomic DNA SSR Markers in Lentinula edodes

  • Moon, Suyun;Lee, Hwa-Yong;Shim, Donghwan;Kim, Myungkil;Ka, Kang-Hyeon;Ryoo, Rhim;Ko, Han-Gyu;Koo, Chang-Duck;Chung, Jong-Wook;Ryu, Hojin
    • Mycobiology
    • /
    • v.45 no.2
    • /
    • pp.105-109
    • /
    • 2017
  • Sixteen genomic DNA simple sequence repeat (SSR) markers of Lentinula edodes were developed from 205 SSR motifs present in 46.1-Mb long L. edodes genome sequences. The number of alleles ranged from 3-14 and the major allele frequency was distributed from 0.17-0.96. The values of observed and expected heterozygosity ranged from 0.00-0.76 and 0.07-0.90, respectively. The polymorphic information content value ranged from 0.07-0.89. A dendrogram, based on 16 SSR markers clustered by the paired hierarchical clustering' method, showed that 33 shiitake cultivars could be divided into three major groups and successfully identified. These SSR markers will contribute to the efficient breeding of this species by providing diversity in shiitake varieties. Furthermore, the genomic information covered by the markers can provide a valuable resource for genetic linkage map construction, molecular mapping, and marker-assisted selection in the shiitake mushroom.

Molecular Characterization of Fusarium Graminearum Virus 2 Isolated from Fusarium graminearum Strain 98-8-60

  • Yu, Ji-Suk;Lee, Kyung-Mi;Son, Moon-Il;Kim, Kook-Hyung
    • The Plant Pathology Journal
    • /
    • v.27 no.3
    • /
    • pp.285-290
    • /
    • 2011
  • Fusarium graminearum virus 2 (FgV2) infects Fusarium graminearum strain 98-8-60 and has at least five segments of double-stranded RNAs (dsRNAs), denoted as dsRNA-1 to dsRNA-5. In this study, the genome of FgV2 was sequenced and its phylogenetic relationship with other mycoviruses was analyzed. The lengths of FgV2 dsRNAs 1-5 ranged from 2414 to 3580 base pairs (bp). The 5' and 3' untranslated regions (UTRs) are highly conserved, and each dsRNA segment had 78-105 and 84-306 bp of 5' and 3' UTRs, respectively. Each dsRNA segment contained a single open reading frame (ORF). Computer analysis of dsRNA-1 revealed a putative open reading frame (ORF) that shows high sequence identity with an RNA-dependent RNA polymerase (RdRp) containing eight conserved motifs. dsRNAs 2-5 also each contain one putative ORF coding for products of unknown function. The sequences of FgV2 dsRNA-2 and dsRNA-3 have significant sequence identity with Magnaporthe oryzae chrysovirus 1 (MoCV1) dsRNA-3 and -4, respectively. When compared to other dsRNA mycoviruses in a phylogenetic analysis of the putative RdRp protein, FgV2 was found to form a distinct virus clade with Aspergillus mycovirus 1816 and MoCV1 in the family Chrysoviridae.

Phylogenetic Analysis of Apple scar skin viroid Isolates in Korea (국내에서 발생하는 Apple scar skin viroid 분리주에 대한 계통분석)

  • Cho, Kang Hee;Kim, In-Soo;Kil, Eui-Joon;Park, Seo Jun;Kim, Se Hee;Choi, In-Myung;Kim, Dae-Hyun
    • Research in Plant Disease
    • /
    • v.21 no.4
    • /
    • pp.346-350
    • /
    • 2015
  • To identify genome sequences of Apple scar skin viroid (ASSVd) isolates in Korea, the field survey was performed from 'Hongro' apple orchards located in eight sites in South Korea (Bongwha, Cheongsong, Dangjin, Gimchoen, Muju, Mungyeong, Suwon, and Yeongwol). ASSVd was detected by RT-PCR and PCR fragments were cloned into cloning vector. Full-length viral genomes of eight ASSVd isolates were sequenced and compared with 21 isolates reported previously from Korea, India, China, Japan and Greece. Eight isolates in this study showed 92.2-99.7% nucleotide sequence identities with those reported previously. Phylogenetic analysis showed that seven isolates reported in this study belong to the same group distinct from other groups.

Novel Peptide Nucleic Acid Melting Array for the Detection and Genotyping of Toxoplasma gondii

  • Suh, Soo Hwan;Yun, Han Seong;Lee, Sang-Eun;Kwak, Hyo-Sun
    • Microbiology and Biotechnology Letters
    • /
    • v.47 no.4
    • /
    • pp.645-650
    • /
    • 2019
  • Despite differences in virulence between strains of Toxoplasma gondii, rapid and accurate genotyping methods are lacking. In this study, a method was developed to detect and genotype T. gondii in food and environmental samples using PCR and a novel peptide nucleic acid (PNA) melting array. An alignment of genome sequences for T. gondii type I, II, and III obtained from NCBI was generated, and a single nucleotide polymorphism analysis was performed to identify targets for PCR amplification and a PNA melting array. Prior to the PNA melting array, conventional PCR was used to amplify GRA6 of T. gondii. After amplification, the PNA melting array was performed using two different PNA hybridization probes with fluorescent labels (FAM and HEX) and quenchers. Melting curves for each probe were used to determine genotypes and identify mutations. A 214-bp region of the GRA6 gene of T. gondii was successfully amplified by PCR. For all T. gondii strains (type I, II, and III) used to evaluate specificity, the correct genotypes were determined by the PNA melting array. Non-T. gondii strains, including 14 foodborne pathogens and 3 protozoan parasites, such as Giardia lamblia, Cryptosporidium parvum, and Entamoeba histolytica, showed no signal, suggesting that the assay has a high specificity. Although this is only a proof-of-concept study, the assay is promising for the fast and reliable genotyping of T. gondii from food and environmental samples.