• Title/Summary/Keyword: Sequence Analysis, DNA

Search Result 2,161, Processing Time 0.028 seconds

Experimental Analysis of Recent Works on the Overlap Phase of De Novo Sequence Assembly (De novo 시퀀스 어셈블리의 overlap 단계의 최근 연구 실험 분석)

  • Lim, Jihyuk;Kim, Sun;Park, Kunsoo
    • Journal of KIISE
    • /
    • v.45 no.3
    • /
    • pp.200-210
    • /
    • 2018
  • Given a set of DNA read sequences, de novo sequence assembly reconstructs a target sequence without a reference sequence. For reconstruction, the assembly needs the overlap phase, which computes all overlaps between every pair of reads. Since the overlap phase is the most time-consuming part of the whole assembly, the performance of the assembly depends on that of the overlap phase. There have been extensive studies on the overlap phase in various fields. Among them, three state-of-the-art results for the overlap phase are Readjoiner, SOF, and Lim-Park algorithm. Recently, a rapid development of sequencing technology has made it possible to produce a large read dataset at a low cost, and many platforms for generating a DNA read dataset have been developed. Since the platforms produce datasets with different statistical characteristics, a performance evaluation for the overlap phase should consider datasets with these characteristics. In this paper, we compare and analyze the performances of the three algorithms with various large datasets.

Molecular Cloning of ATPase $\alpha$-Subunit Gene from Mitochondria of Korean Ginseng (Panu ginseng C.A. Meyer) (고려인삼(Panax ginseng C.A. Meyer) ATPase $\alpha$-subunit 유전자의 Cloning)

  • Park, Ui-Sun;Choi, Kwan-Sam;Kim, Kab-Sig;Kim, Nam-Won;Choi, Kwang-Tae
    • Journal of Ginseng Research
    • /
    • v.19 no.1
    • /
    • pp.56-61
    • /
    • 1995
  • Molecular cloning and restriction mapping on ATPase $\alpha$-subunit gene (atpA) were carried out to obtain genomic information concerned with the gene structure and organization in Korean ginseng mitochondria. Two different clones containing the homologous sequence of atpA gene were selected from SalI and PstI libraries of mitochondrial DNA (mtDNA) of Korean ginseng. The sizes of mtDNA fragments inserted in SalI and PstI clones were 3.4 kb and 13 kb, respectively. Southern blot analysis with [$^{32}P$] labelled Oenothera atPA gene probe showed that atpA gene sequence was located in 2.0 kb XkaI fragment in PstI clone and in 1.7 kb XbaI fragment in SalI clone. A partial sequening ascertained that the SalI clone included about 1.2 kb fragment from SalI restriction site to C-terminal sequence of this gene but about 0.3 kb N-terminal sequence of open reading frame was abscent. The PstI fragment was enough large to cover the full sequence of atpA gene. The same restriction pattern of the overlapped region suggests that both clones include the same fragment of atiA locus. Data of Southern blot analysis and partial nucleotide sequencing suggested that mtDNA of Korean ginseng has a single copy of atpA gene. Key words ATPase a-subunit, mitochondrial DNA, Panax ginseng.

  • PDF

Generation of Expressed Sequence Tags for Immune Gene Discovery and Marker Development in the Sea Squirt, Halocynthia roretzi

  • Kim, Young-Ok;Cho, Hyun-Kook;Park, Eun-Mi;Nam, Bo-Hye;Hur, Young-Baek;Lee, Sang-Jun;Cheong, Jae-Hun
    • Journal of Microbiology and Biotechnology
    • /
    • v.18 no.9
    • /
    • pp.1510-1517
    • /
    • 2008
  • Expresssed sequence tag (EST) analysis was developed from three cDNA libraries constructed from cells of the digestive tract, gonad, and liver of sea squirt. Randomly selected cDNA clones were partially sequenced to generate a total of 922 ESTs, in which 687 unique ESTs were identified respectively. Results of BLASTX search showed that 612 ESTs (89%) have homology to genes of known function whereas 75 ESTs (11%) were unidentified or novel. Based on the major function of their encoded proteins, the identified clones were classified into ten broad categories. We also identified several kinds of immune-related genes as identifying novel genes. Sequence analysis of ESTs revealed the presence of microsatellite-containing genes that may be valuable for further gene mapping studies. The accumulation of a large number of identified cDNA clones is invaluable for the study of sea squirt genetics and developmental biology. Further studies using cDNA microarrays are needed to identify the differentially expressed transcripts after disease infection.

Cloning and Sequence Analysis of Ribosomal Protein S4 cDNA from Root of Panax ginseng

  • In Jun-Gyo;Lee Bum-Soo;Song Won-Seob;Bae Chang-Hyu;Choi Seong-Kyu;Yang Deok-Chun
    • Plant Resources
    • /
    • v.8 no.2
    • /
    • pp.110-115
    • /
    • 2005
  • Ribosomal protein complex with ribosomal RNA to form the subunits of the ribosome serve essential functions in protein synthesis. A full-length cDNA (PRPS4) encoding ribosomal protein S4 has been isolated and its nucleotide sequence determined in ginseng plant (Panax ginseng). A PRPS4 cDNA is 1105 nucleotides long and has an open reading frame of 792 bp with a deduced amino acid sequence of 264 residues (pI 10.67). The deduced amino acid sequence of PRPS4 matched to the previously reported ribosomal protein S4 genes. Their degree of amino acid identity ranged from 68 to $92\%$. Phylogenetic analysis based on the amino acid residues showed that the PRPS4 grouped with ribosomal protein S4 of S. tuberosum (CAA54095).

  • PDF

Keratitis by Acanthamoeba triangularis: Report of Cases and Characterization of Isolates

  • Xuan, Ying-Hua;Chung, Byung-Suk;Hong, Yeon-Chul;Kong, Hyun-Hee;Hahn, Tae-Won;Chung, Dong-Il
    • Parasites, Hosts and Diseases
    • /
    • v.46 no.3
    • /
    • pp.157-164
    • /
    • 2008
  • Three Acanthamoeba isolates (KA/E9, KA/E17, and KA/E23) from patients with keratitis were identified as Acanthamoeba triangularis by analysis of their molecular characteristics, a species not previously recognized to be a corneal pathogen. Epidemiologic significance of A. triangularis as a keratopathogen in Korea has been discussed. Morphologic features of Acanthamoeba cysts were examined under a microscope with differential interference contrast (DIC) optics. Mitochondrial DNA (mtDNA) of the ocular isolates KA/E9, KA/E17, and KA/E23 were digested with restriction enzymes, and the restriction patterns were compared with those of reference strains. Complete nuclear 188 and mitochondrial (mt) 16S rDNA sequences were subjected to phylogenetic analysis and species identification. mtDNA RFLP of 3 isolates showed very similar patterns to those of SH621, the type strain of A. triangularis. 16S and 18S rDNA sequence analysis confirmed 3 isolates to be A. triangularis. 18S rDNA sequence differences of the isolates were 1.3% to 1.6% and those of 16S rDNA, 0.4% to 0.9% from A. triangularis SH621. To the best of our knowledge, this is the first report, confirmed by 18S and 16S rDNA sequence analysis, of keratitis caused by A. triangularis of which the type strain was isolated from human feces. Six isolates of A. triangularis had been reported from contaminated contact lens cases in southeastern Korea.

Genetic Variation of Cytochrome P450 Genes in Garlic Cultivars (마늘유래 Cytochrome P450 유전자의 변이 분석)

  • Kwon, Soon-Tae;Kamiya, Juli
    • Korean Journal of Plant Resources
    • /
    • v.24 no.5
    • /
    • pp.584-590
    • /
    • 2011
  • Wound inducible P450-Esg cDNA, one of cytochrome P450 gene family, was isolated from shoot of Euiseong garlic cultivar. P450-Esg cDNA possesses highly conserved heme-binding domain in the nucleotide sequence, and 1,419 bp of open reading frame (ORF) coding of 473 amino acids. Based on the nucleotide sequence analysis of P450-Esg homologous from twelve garlic cultivars, two domains, one domain between 472 to 510 bp, and the other between 1,210 to 1,249 bp from start codon (ATG), showed various nucleotide polymorphism among cultivars. Sequence of heme-binding domain in P450-Esg homologous, which is located at the domain between 1,210 to 1,240 bp from start codon, showed various nucleotide polymorphism as well as amino acid sequence polymorphism among twelve garlic cultivars. Anther domain, between 472 to 510 bp from start codon, showed exactly same amino acid sequence in the twelve garlic cultivars, but there were various single nucleotide polymorphism to the cultivars.

(CA/GT)n Simple Sequence Repeat DNA Polymorphism in Chlamydomonas reinhardtii (녹조류 Chlamydomonas reinhardtii의 (CA/GT)n Simple Sequence Repeat DNA 다형현상)

  • ;;Marvin W. FAWLEY
    • Korean Journal of Plant Tissue Culture
    • /
    • v.24 no.2
    • /
    • pp.113-117
    • /
    • 1997
  • Simple sequence repeats (SSR) are widely dispersed throughout eukaryotic genomes, highly polymorphic, and easily typed using polymerase chain reaction (PCR). The objective of this study was to determine the polymorphism of different Chlamydomonas reinhartdtii strains and to determine the mode of inheritance of the SSR locus in Chlamydomonas. A genomic DNA library of C. reinhardtii was constructed and screened with a radiolabeled $(AC)_{11}$ probe for the selection of (CA/GT)n repeat clone. Selected clone was seqeuenced, and PCR primer set flanking (CA/GT)n sequence was constructed. PCR was used to specifically amplify the SSR locus from multiple isolates of C. reinhardtii. The locus was polymorphic in some of the C. reinhardtii isolates. However, the locus was amplified only 4 of 6 isolates of C. reinhardtii, not in other 2 isolates of C. reinhardtii, suggesting that this locus is not extensively conserved. A simple Mendelian inheritance pattern was found, which showed 2:2 segregation in the tetrads resulting from a cross between C. reinhardtii and C. smithii. Our results suggest that this simple sequence repeat DNA polymorphism will be useful for identity testing, population studies, linkage analysis, and genome mapping in Chlamydomonas.

  • PDF

Detection of Mycobacterium kansasii Using DNA-DNA Hybridization with rpoB Probe

  • Kweon, Tae-Dong;Bai, Sun-Joon;Choi, Chang-Shik;Hong, Seong-Karp
    • Journal of information and communication convergence engineering
    • /
    • v.10 no.2
    • /
    • pp.210-214
    • /
    • 2012
  • A microtiter well plate DNA hybridization method using Mycobacterium kansasii-specific rpoB DNA probe (kanp) were evaluated for the detection of M. kansasii from culture isolates. Among the 201 isolates tested by this method, 27 strains show positive results for M. kansasii, but the other 174 isolates were negative results for M. kansasii. This result was consistent with partial rpoB sequence analysis of M. kansasii and the result of biochemical tests. The negative strains by this DNA-DNA hybridization method were identified as Mycobacterium tuberculosis (159 strains), Mycobacterium avim (5 strains), Mycobacterium intracellulare (8 strains), and Mycobacterium flavescens (2 strain) by rpoB DNA sequence analysis. Due to high sensitivity and specificity of this test result, we suggest that DNA-DNA hybridization method using rpoB DNA probes of M. kansasii could be used for the rapid and convenient detection of M. kansasii.

Genotypic Characterization of Cherry Witches' Broom Pathogen Taphrina wiesneri Strains (벚나무 빗자루병균 Taphrina wiesneri의 유전적 특성)

  • Seo, Sang-Tae;Jeong, Su-Jee;Lee, Seung-Kyu;Kim, Kyung-Hee
    • Research in Plant Disease
    • /
    • v.17 no.1
    • /
    • pp.99-101
    • /
    • 2011
  • The ascomycetous fungus Taphrina wiesneri, the pathogen of cherry witches' broom, is highly pathogenic to Prunus yedoensis, the most widely planted cherry trees in Korea as park and roadside trees. A collection of 13 strains of the pathogen in Korea and Japan was characterized by 18S rDNA gene sequence and restriction fragment length polymorphism (RFLP) analysis. In cluster analysis based on 18S rDNA gene sequence the strains were divided into 2 clusters. In RFLP analysis of the rDNA-IGS region using HhaI, the strains were separated into four patterns, B, C, D and G, of which pattern G was new.

A Pattern Matching Extended Compression Algorithm for DNA Sequences

  • Murugan., A;Punitha., K
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.8
    • /
    • pp.196-202
    • /
    • 2021
  • DNA sequencing provides fundamental data in genomics, bioinformatics, biology and many other research areas. With the emergent evolution in DNA sequencing technology, a massive amount of genomic data is produced every day, mainly DNA sequences, craving for more storage and bandwidth. Unfortunately, managing, analyzing and specifically storing these large amounts of data become a major scientific challenge for bioinformatics. Those large volumes of data also require a fast transmission, effective storage, superior functionality and provision of quick access to any record. Data storage costs have a considerable proportion of total cost in the formation and analysis of DNA sequences. In particular, there is a need of highly control of disk storage capacity of DNA sequences but the standard compression techniques unsuccessful to compress these sequences. Several specialized techniques were introduced for this purpose. Therefore, to overcome all these above challenges, lossless compression techniques have become necessary. In this paper, it is described a new DNA compression mechanism of pattern matching extended Compression algorithm that read the input sequence as segments and find the matching pattern and store it in a permanent or temporary table based on number of bases. The remaining unmatched sequence is been converted into the binary form and then it is been grouped into binary bits i.e. of seven bits and gain these bits are been converted into an ASCII form. Finally, the proposed algorithm dynamically calculates the compression ratio. Thus the results show that pattern matching extended Compression algorithm outperforms cutting-edge compressors and proves its efficiency in terms of compression ratio regardless of the file size of the data.