• Title/Summary/Keyword: Sequence pattern analysis

Search Result 318, Processing Time 0.025 seconds

Isolation of a cDNA Encoding a Chloroplast Triosephosphate Isomerase from Strawberry

  • Kim, In-Jung;Lee, Byung-Hyun;Jinki Jo;Chung, Won-Il
    • Journal of Plant Biotechnology
    • /
    • v.2 no.3
    • /
    • pp.115-121
    • /
    • 2000
  • A cDNA clone encoding chloroplast triosephosphate isomerase (TPI-cp) was isolated from strawberry fruit cDNA library. Sequence analyses indicated that the cDNA contains an open reading frame of 314 amino acids (33.5 kDa) composed of a transit peptide (59 amino acids) in amino terminal region and mature protein (255 amino acids). The existence of transit peptide in the deduced amino acid sequence implies that it encodes a chloroplast isoform. The protein sequence is more similar to other plant chloroplast isoforms than cytosolic isoforms. RNA blot analysis indicated that its expression is ubiquitous in examined five tissues, flowers, leaves, petioles, roots and fruits, and shows differential pattern according to fruit ripening. Genomic DNA blot analysis showed that TPI-cp is encoded by multiple genes in strawberry. Through sequence comparison and phylogenetic tree construction, TPI-cp is distinctively grouped into dicot and chloroplast isoforms.

  • PDF

Sequence Selectivity of DNA Alkylation by Adozelesin and Carzelesin

  • Yoon, Jung-Hoon;Lee, Chong-Soon
    • Archives of Pharmacal Research
    • /
    • v.21 no.4
    • /
    • pp.385-390
    • /
    • 1998
  • Adozelesin and carzelesin are synthetic analogues of the extremely potent antitumor antibiotic CC-1065, which alkylates N3 of adenine in a consensus sequence $5^1$-(A/T)(A/T)$A^*$ ($A^*$ is the site of alkylation). We have investigated the DNA sequence selectivity of adozelesin and carzelesin by thermally ind ced DNA strand cleavage assay using radiolabeled restriction DNA fragments. An analysis of alkylation patterns shows that the consensus sequences for carzelesin and adozelesin have been found to be $5^1$-(A/T)(A/T)$A^*$ and $5^1$-(A/F)(G/C)(A/T)$A^*$. A new consensus sequence, $5^1$-(A/T)(A/T)$CA^*$, has been observed to display an additional alkylation site for adozelesin but not for carzelesin. These results indicate that the pattern of sequence selectivity induced by carzelesin is similar but not identical to those induced by adozelosin.

  • PDF

Mining Maximal Frequent Contiguous Sequences in Biological Data Sequences

  • Kang, Tae-Ho;Yoo, Jae-Soo;Kim, Hak-Yong;Lee, Byoung-Yup
    • International Journal of Contents
    • /
    • v.3 no.2
    • /
    • pp.18-24
    • /
    • 2007
  • Biological sequences such as DNA and amino acid sequences typically contain a large number of items. They have contiguous sequences that ordinarily consist of more than hundreds of frequent items. In biological sequences analysis(BSA), a frequent contiguous sequence search is one of the most important operations. Many studies have been done for mining sequential patterns efficiently. Most of the existing methods for mining sequential patterns are based on the Apriori algorithm. In particular, the prefixSpan algorithm is one of the most efficient sequential pattern mining schemes based on the Apriori algorithm. However, since the algorithm expands the sequential patterns from frequent patterns with length-1, it is not suitable for biological datasets with long frequent contiguous sequences. In recent years, the MacosVSpan algorithm was proposed based on the idea of the prefixSpan algorithm to significantly reduce its recursive process. However, the algorithm is still inefficient for mining frequent contiguous sequences from long biological data sequences. In this paper, we propose an efficient method to mine maximal frequent contiguous sequences in large biological data sequences by constructing the spanning tree with a fixed length. To verify the superiority of the proposed method, we perform experiments in various environments. The experiments show that the proposed method is much more efficient than MacosVSpan in terms of retrieval performance.

Anlaysis of Eukaryotic Sequence Pattern using GenScan (GenScan을 이용한 진핵생물의 서열 패턴 분석)

  • Jung, Yong-Gyu;Lim, I-Suel;Cha, Byung-Heun
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.11 no.4
    • /
    • pp.113-118
    • /
    • 2011
  • Sequence homology analysis in the substances in the phenomenon of life is to create database by sorting and indexing and to demonstrate the usefulness of informatics. In this paper, Markov models are used in GenScan program to convert the pattern of complex eukaryotic protein sequences. It becomes impossible to navigate the minimum distance, complexity increases exponentially as the exact calculation. It is used scorecard in amino acid substitutions between similar amino acid substitutions to have a differential effect score, and is applied the Markov models sophisticated concealment of the transition probability model. As providing superior method to translate sequences homologous sequences in analysis using blast p, Markov models. is secreted protein structure of sequence translations.

Detecting smartphone user habits using sequential pattern analysis

  • Lu, Dang Nhac;Nguyen, Thu Trang;Nguyen, Thi Hau;Nguyen, Ha Nam;Choi, Gyoo Seok
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.7 no.1
    • /
    • pp.20-22
    • /
    • 2015
  • Recently, the study of smart phone user habits has become a highly focused topic due to the rapid growth of the smart phone market. Indeed, sequential pattern analysis methods were efficiently used for web-based user habit mining long time ago. However, by means of simulations, it has been observed that these methods might fail for smart phone-based user habit mining. In this paper, we propose a novel approach that leads to a considerably increased performance of the traditional sequential pattern analysis methods by reasonably cutting off each chronological sequence of user logs on a device into shorter ones, which represent the sequential user activities in various periods of a day.

Trend and Technology of Gene and Genome Research (유전자 및 유전체 연구 기술과 동향)

  • 이진성;김기환;서동상;강석우;황재삼
    • Journal of Sericultural and Entomological Science
    • /
    • v.42 no.2
    • /
    • pp.126-141
    • /
    • 2000
  • A major step towards understanding of the genetic basis of an organism is the complete sequence determination of all genes in target genome. The nucleotide sequence encoded in the genome contains the information that specifies the amino acid sequence of every protein and functional RNA molecule. In principle, it will be possible to identify every protein resposible for the structure and function of the body of the target organism. The pattern of expression in different cell types will specify where and when each protein is used. The amino acid sequence of the proteins encoded by each gene will be derived from the conceptional translation of the nucleotide sequence. Comparison of these sequences with those of known proteins, whose sequences are sorted in database, will suggest an approximate function for many proteins. This mini review describes the development of new sequencing methods and the optimization of sequencing strategies for whole genome, various cDNA and genomic analysis.

  • PDF

(CA/GT)n Simple Sequence Repeat DNA Polymorphism in Chlamydomonas reinhardtii (녹조류 Chlamydomonas reinhardtii의 (CA/GT)n Simple Sequence Repeat DNA 다형현상)

  • ;;Marvin W. FAWLEY
    • Korean Journal of Plant Tissue Culture
    • /
    • v.24 no.2
    • /
    • pp.113-117
    • /
    • 1997
  • Simple sequence repeats (SSR) are widely dispersed throughout eukaryotic genomes, highly polymorphic, and easily typed using polymerase chain reaction (PCR). The objective of this study was to determine the polymorphism of different Chlamydomonas reinhartdtii strains and to determine the mode of inheritance of the SSR locus in Chlamydomonas. A genomic DNA library of C. reinhardtii was constructed and screened with a radiolabeled $(AC)_{11}$ probe for the selection of (CA/GT)n repeat clone. Selected clone was seqeuenced, and PCR primer set flanking (CA/GT)n sequence was constructed. PCR was used to specifically amplify the SSR locus from multiple isolates of C. reinhardtii. The locus was polymorphic in some of the C. reinhardtii isolates. However, the locus was amplified only 4 of 6 isolates of C. reinhardtii, not in other 2 isolates of C. reinhardtii, suggesting that this locus is not extensively conserved. A simple Mendelian inheritance pattern was found, which showed 2:2 segregation in the tetrads resulting from a cross between C. reinhardtii and C. smithii. Our results suggest that this simple sequence repeat DNA polymorphism will be useful for identity testing, population studies, linkage analysis, and genome mapping in Chlamydomonas.

  • PDF

Driver's Behavioral Pattern in Driver Assistance System (운전자 사용자경험기반의 인지향상 시스템 연구)

  • Jo, Doori;Shin, Donghee
    • Journal of Digital Contents Society
    • /
    • v.15 no.5
    • /
    • pp.579-586
    • /
    • 2014
  • This paper analyzes the recognition of driver's behavior in lane change using context-free grammar. In contrast to conventional pattern recognition techniques, context-free grammars are capable of describing features effectively that are not easily represented by finite symbols. Instead of coordinate data processing that should handle features in multiple concurrent events respectively, effective syntactic analysis was applied for patterning of symbolic sequence. The findings proposed the effective and intuitive method for drivers and researchers in driving safety field. Probabilistic parsing for the improving this research will be the future work to achieve a robust recognition.

Sequential pattern load modeling and warning-system plan in modular falsework

  • Peng, Jui-Lin;Wu, Cheng-Lung;Chan, Siu-Lai
    • Structural Engineering and Mechanics
    • /
    • v.16 no.4
    • /
    • pp.441-468
    • /
    • 2003
  • This paper investigates the structural behavior of modular falsework system under sequential pattern loads. Based on the studies of 25 construction sites, the pattern load sequence modeling is defined as models R (rectangle), L and U. The study focuses on the system critical loads, regions of largest reaction forces, discrepancy between the pattern load and the uniform load, and the warning-system plan. The analysis results show that the critical loads of modular falsework systems with sequential pattern loads are very close to those with the uniform load used in design. The regions of largest reaction forces are smaller than those calculated by the uniform load. However, the regions of largest reaction forces of three models under sequential pattern loads can be considered as the crucial positions of warning-system based on the measured index of loading. The positions of the sensors for the warning-system for these three different models are not identical.