• Title/Summary/Keyword: sequence alignment

Search Result 351, Processing Time 0.027 seconds

Bioinformatics based Identification and Characterization of Epoxide Hydrolase of Gordonia westfalica for the Production of Chiral Epoxides (Bioinformatics를 활용한 토양미생물인 Gordonia westfalica Epoxide Hydrolase 생촉매 개발 및 Chiral Epoxides 제조 특성 분석)

  • Lee Soo Jung;Lee Eun Jung;Kim Hee Sook;Lee Eun Yeol
    • KSBB Journal
    • /
    • v.20 no.4
    • /
    • pp.311-316
    • /
    • 2005
  • Epoxide hydrolases (EHs) are versatile biocatalysts for the preparation of chiral epoxides by enantioselective hydrolysis from racemic epoxides. Various microorganisms were identified to possess a EH activity by multiple sequence alignment and analysis of conserved domain sequence from genomic and megaplasmid sequence data. We successfully isolated Gordonia westfalica possessing EH activity from various microbial strains from culture type collections. G. westfalica exhibited (R)-styrene oxide preferred enantioselective hydrolysis activity. Chiral (S)-styrene oxide with high optical purity $(>\;99\%)\;ee)$ and yield of $36.5\%$ was obtained from its racemate using whole-cell of G. westfalica.

Cloning and Sequence Analysis of a Levansucrase Gene from Rahnella aquatilis ATCC15552

  • Kim, Hyun-Jin;Yang, Ji-Young;Lee, Hyeon-Gye;Cha, Jae-Ho
    • Journal of Microbiology and Biotechnology
    • /
    • v.11 no.4
    • /
    • pp.693-699
    • /
    • 2001
  • An intracellular levansucrase gene, lscR from Rahnella aquatilis ATCC 15552, was cloned and its nucleotide sequence was determined. Nucleotide sequence analysis of this gene revealed a 1,238 bp open reading frame coding for a protein of 415 amino acids. The levansucrase was expressed by using a T7 promoter in Escherichia coli BL21 (DE3) and the enzyme activity was detected in the cytoplasmic fraction. The optimum pH and temperature of this enzyme for levan formation was pH 6 and $30^{\circ}C$, respectively. The deduced amino acid sequence of the lscR gene showed a high sequence similarity (59-89%) with Gram-negative levansucrses, while the level of similarity with Gram-positive enzymes was less than 42%. Multiple alignments of levansucrase sequences reported from Gram-negative and Gram-positive bacteria revealed seven conserved regions. A comparison of the catalytic properties and deduced amino acid sequence of lscR with those of other bacterial levansucrases strongly suggest that Gram-negative and Gram-positive levansucrases have an overall different structure, but they have a similar structure at the active site.

  • PDF

Optimal Sequence Alignment Algorithm Using Space Division Technique (공간 분할 방법을 이용한 최적 서열정렬 알고리즘)

  • Ahn, Heui-Kook;Roh, Hi-Young
    • Journal of KIISE:Software and Applications
    • /
    • v.34 no.5
    • /
    • pp.397-406
    • /
    • 2007
  • The problem of finding an optimal alignment between sequence A and B can be solved by dynamic programming algorithm(DPA) efficiently. But, if the length of string was longer, the problem might not be solvable because it requires O(m*n) time and space complexity.(where, $m={\mid}A{\mid},\;n={\mid}B{\mid}$) For space, Hirschberg developed a linear space and quadratic time algorithm, so computer memory was no longer a limiting factor for long sequences. As computers's processor and memory become faster and larger, a method is needed to speed processing up, although which uses more space. For this purpose, we present an algorithm which will solve the problem in quadratic time and linear space. By using division method, It computes optimal alignment faster than LSA, although requires more memory. We generalized the algorithm about division problem for not being divided into integer and pruned additional space by entry/exit node concept. Through the proofness and experiment, we identified that our algorithm uses d*(m+n) space and a little more (m*n) time faster than LSA.

Taint Inference for Cross-Site Scripting in Context of URL Rewriting and HTML Sanitization

  • Pan, Jinkun;Mao, Xiaoguang;Li, Weishi
    • ETRI Journal
    • /
    • v.38 no.2
    • /
    • pp.376-386
    • /
    • 2016
  • Currently, web applications are gaining in prevalence. In a web application, an input may not be appropriately validated, making the web application susceptible to cross-site scripting (XSS), which poses serious security problems for Internet users and websites to whom such trusted web pages belong. A taint inference is a type of information flow analysis technique that is useful in detecting XSS on the client side. However, in existing techniques, two current practical issues have yet to be handled properly. One is URL rewriting, which transforms a standard URL into a clearer and more manageable form. Another is HTML sanitization, which filters an input against blacklists or whitelists of HTML tags or attributes. In this paper, we make an analogy between the taint inference problem and the molecule sequence alignment problem in bioinformatics, and transfer two techniques related to the latter over to the former to solve the aforementioned yet-to-be-handled-properly practical issues. In particular, in our method, URL rewriting is addressed using local sequence alignment and HTML sanitization is modeled by introducing a removal gap penalty. Empirical results demonstrate the effectiveness and efficiency of our method.

A Web-Based High Performance Multiple Sequence Alignment System Design and Implementation (웹 기반 고성능 다중서열정렬시스템 설계 및 구현)

  • Kim, Tae-Kyung;Kim, Hun-Gi;Choi, Chi-Hwan;Jung, Seung-Hyun;Hou, Bo-Kyeng;Cho, Wan-Sup
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2010.07a
    • /
    • pp.79-82
    • /
    • 2010
  • 다중서열정렬 알고리즘은 생명정보학 분야에서 서열기반의 계통분류 분석에 가장 많이 사용되며, 가장 대표적인 공개 프로그램은 ClustalW로 사용자가 로컬시스템에 설치하여 이용할 수 있다. 그러나 실제로 사용자들이 ClustalW을 설치한 후, 서열데이터의 준비, 가공, 처리 및 타 시스템과 연동 등과 같은 작업을 하는데 여러 가지 어려움이 있다. 따라서 본 논문에서는 다중서열정렬 작업을 편리하고 빠르게 수행할 수 있는 웹기반의 고성능 다중서열정렬시스템을 제안한다. 제안된 시스템의 특징은, (1) Inter-Query 라우팅 알고리즘을 통해 다수의 PC 자원을 효율적으로 활용하여 계산 성능을 극대화하였으며, (2) 사용자 편의성을 고려한 웹인터페이스의 제공을 통해 개인화된 데이터관리, 실시간 모니터링, 데이터 편집 등을 지원하여 사용자가 손쉽게 서열데이터의 수집, 관리 및 처리할 수 있도록 지원한다.

  • PDF

NOGSEC: A NOnparametric method for Genome SEquence Clustering (녹섹(NOGSEC): A NOnparametric method for Genome SEquence Clustering)

  • 이영복;김판규;조환규
    • Korean Journal of Microbiology
    • /
    • v.39 no.2
    • /
    • pp.67-75
    • /
    • 2003
  • One large topic in comparative genomics is to predict functional annotation by classifying protein sequences. Computational approaches for function prediction include protein structure prediction, sequence alignment and domain prediction or binding site prediction. This paper is on another computational approach searching for sets of homologous sequences from sequence similarity graph. Methods based on similarity graph do not need previous knowledges about sequences, but largely depend on the researcher's subjective threshold settings. In this paper, we propose a genome sequence clustering method of iterative testing and graph decomposition, and a simple method to calculate a strict threshold having biochemical meaning. Proposed method was applied to known bacterial genome sequences and the result was shown with the BAG algorithm's. Result clusters are lacking some completeness, but the confidence level is very high and the method does not need user-defined thresholds.

Functional Role of a Conserved Sequence Motif in the Oxygen-dependent Degradation Domain of Hypoxia-inducible Factor 1α in the Recognition of p53

  • Chi, Seung-Wook
    • Genomics & Informatics
    • /
    • v.6 no.2
    • /
    • pp.72-76
    • /
    • 2008
  • Hypoxia-inducible factor $1{\alpha}\;(HIF1{\alpha})$ is a transcription factor that plays a key role in the adaptation of cells to low oxygen stress and oxygen homeostasis. The oxygen-dependent degradation (ODD) domain of $HIF1{\alpha}$ is responsible for the negative regulation of $HIF1{\alpha}$ in normoxia. The interactions of the $HIF1{\alpha}$ ODD domain with partner proteins such as von Hippel-Lindau tumor suppressor (pVHL) and p53 are mediated by two sequence motifs, the N- and C-terminal ODD(NODD and CODD). Multiple sequence alignment with $HIF1{\alpha}$ homologs from human, monkey, pig, rat, mouse, chicken, frog, and zebrafish has demonstrated that the NODD and CODD motifs have noticeably high conservation of the primary sequence across different species and isoforms. In this study, we carried out molecular dynamics simulation of the structure of the $HIF1{\alpha}$ CODD motif in complex with the p53 DNA-binding domain (DBD). The structure reveals specific functional roles of highly conserved residues in the CODD sequence motif of $HIF1{\alpha}$ for the recognition of p53.

The Complete Genome Sequence of Southern rice black-streaked dwarf virus Isolated from Vietnam

  • Dinh, Thi-Sau;Zhou, Cuiji;Cao, Xiuling;Han, Chenggui;Yu, Jialin;Li, Dawei;Zhang, Yongliang
    • The Plant Pathology Journal
    • /
    • v.28 no.4
    • /
    • pp.428-432
    • /
    • 2012
  • We determined the complete genome sequence of a Vietnamese isolate of Southern rice black-streaked dwarf virus (SRBSDV). Whole genome comparisons and phylogenetic analysis showed that the genome of the Vietnamese isolate shared high nucleotide sequence identities of over 97.5% with those of the reported Chinese isolates, confirming a common origin of them. Moreover, the greatest divergence between different SRBSDV isolates was found in the segments S1, S3, S4 and S6, which differs from the sequence alignment results between SRBSDV and Rice black streaked dwarf virus (RBSDV), implying that SRBSDV evolved in a unique way independent of RBSDV. This is the first report of a complete nucleotide sequence of SRBSDV from Vietnam and our data provides new clues for further understanding of molecular variation and epidemiology of SRBSDV in Southeast Asia.