• Title/Summary/Keyword: Sequence Analysis

Search Result 6,410, Processing Time 0.029 seconds

In Silico Sequence Analysis Reveals New Characteristics of Fungal NADPH Oxidase Genes

  • Detry, Nicolas;Choi, Jaeyoung;Kuo, Hsiao-Che;Asiegbu, Fred O.;Lee, Yong-Hwan
    • Mycobiology
    • /
    • v.42 no.3
    • /
    • pp.241-248
    • /
    • 2014
  • NADPH oxidases (Noxes), transmembrane proteins found in most eukaryotic species, generate reactive oxygen species and are thereby involved in essential biological processes. However, the fact that genes encoding ferric reductases and ferric-chelate reductases share high sequence similarities and domains with Nox genes represents a challenge for bioinformatic approaches used to identify Nox-encoding genes. Further, most studies on fungal Nox genes have focused mainly on functionality, rather than sequence properties, and consequently clear differentiation among the various Nox isoforms has not been achieved. We conducted an extensive sequence analysis to identify putative Nox genes among 34 eukaryotes, including 28 fungal genomes and one Oomycota genome. Analyses were performed with respect to phylogeny, transmembrane helices, di-histidine distance and glycosylation. Our analyses indicate that the sequence properties of fungal Nox genes are different from those of human and plant Nox genes, thus providing novel insight that will enable more accurate identification and characterization of fungal Nox genes.

Computing Method of Cross-Correlation of Non-Linear Sequences Using Subfield (부분체를 이용한 비선형 수열의 상호상관관계의 효율적인 계산방법)

  • Choi, Un-Sook;Cho, Sung-Jin;Kim, Seok-Tae
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.16 no.8
    • /
    • pp.1686-1692
    • /
    • 2012
  • Spreading sequence play an important role in wireless communications, such as in a CDMA(code division multiple access) communication system and multi-carrier spectrum communication system. Spreading sequences with low cross-correlation, in a direct-sequence spread spectrum communication system, help to minimize multiple access interference and to increase security degree of system. Analysis of cross-correlations between the sequences is a necessary process to design sequences. However it require lots of computing time for analysis of cross-correlations between sequences. In this paper we propose a method which is possible to compute effectively cross-correlation using subfield in the process of practical computation of cross-correlation between nonlinear binary sequences.

Sequence Analysis of the Coat Protein Gene of a Korean Isolate of Iris Severe Mosaic Potyvirus from Iris Plant

  • Park, Won-Mok;Lee, Sang-Seon;Park, Sun-Hee;Ju;Ryu, Ki-Hyun
    • The Plant Pathology Journal
    • /
    • v.16 no.1
    • /
    • pp.36-42
    • /
    • 2000
  • The coat protein gene of iris severe mosaic potyvirus, which was isolated in Korea, ISMV-K, from iris plant was cloned and its nucleotide sequence was determined. The coat protein of the virus contained 252 amino acid residues, including five potential N-glyxosylation site motifs. The coat protein of ISMV-K has 99.1% and 98.4% sequence identities with those of the Netherlands isolate of ISMV (ISMV-Ne) form crocus for the nucleotide and amino acids, respectively. The coat protein of ISMV-K has 50.4% to 60.3% nucleotide sequence identities and 47.3% to 55.7% amino acid identities with those of other 21 potyviruses, indicating ISMV to be a distinct species of the genus. The coat protein of ISMV-K was closely related with bean yellow mosaic virus and clover yellow vein virus in the phylogenetic tree analysis among the potyviruses analyzed. ISMV was easily and reliably detected from virus-infected iris leaves by RT-PCR with a set of the virus-specific primers.

  • PDF

Molecular Characterization of a Chinese cabbage cDNA, C-DH, Predominantly Induced by Water-Deficit Stress and Plant Hormone, ABA (수분부족 및 식물호르몬, ABA에 의하여 발현이 유도되는 배추의 C-DH cDNA에 대한 분자적 특성)

  • 정나은;이균오;홍창휘;정배교;박정동;이상열
    • Korean Journal Plant Pathology
    • /
    • v.14 no.3
    • /
    • pp.240-246
    • /
    • 1998
  • A cDNA encoding desiccation-related protein was isolated from a flower bud cDNA library of Chinese cabbage (C-DH) and its nucleotide sequence was characterized. It contains 679 bp nucleotides with 501 bp open reading frame. The amino acid sequence of the putative protein showed the highest amino acid sequence homology (79 % identity) to dehydrin protein in Gossypium hirsutum. Also, the C-DH shares 48-52% amino acid sequence identity with the other typical dehydrin proteins in plant cells. When the amino acid sequence of their proteins were aligned, several peptide motifs were well conserved, of which function has to be solved. Particularly the C-DH contains 15 additional amino acids at its N-terminus. Genomic Southern blot analysis using the coding region of C-DH showed that the C-DH consists of a single copy gene in Chinese cabbage genome. The C-DH mRNA, whose transcript size is 0.7 kb, was expressed with a tissue-specific manner. It was highly expressed in seed, flower buds and low expression as detected in root, stem or leaf tissues of Chinese cabbage. And the transcript level of C-DH was significantly induced by the treatment of plant hormone, abscisic acid and water-deficit conditions.

  • PDF

Molecular Cloning, Sequencing, and Expression of a Fibrinolytic Serine-protease Gene from the Earthworm Lumbricus rubellus

  • Cho, Il-Hwan;Choi, Eui-Sung;Lee, Hyung-Hoan
    • BMB Reports
    • /
    • v.37 no.5
    • /
    • pp.574-581
    • /
    • 2004
  • The full-length cDNA of the lumbrokinase fraction 6 (F6) protease gene of Lumbricus rubellus was amplified using an mRNA template, sequenced and expressed in E. coli cells. The F6 protease gene consisted of pro- and mature sequences by gene sequence analysis, and the protease was translated and modified into active mature polypeptide by N-terminal amino acid sequence analysis of the F6 protease. The pro-region of F6 protease consisted of the 44 residues from methionine-1 to lysine-44, and the mature polypeptide sequence (239 amino acid residues and one stop codon; 720 bp) started from isoleucine-45 and continued to the terminal residue. F6 protease gene clones having pro-mature sequence and mature sequence produced inclusion bodies in E. coli cells. When inclusion bodies were orally administrated rats, generated thrombus weight in the rat' venous was reduced by approximately 60% versus controls. When the inclusion bodies were solubilized in pepsin and/or trypsin solutions, the solubilized enzymes showed hemolytic activity in vitro. It was concluded the F6 protease has hemolytic activity, and that it is composed of pro- and mature regions.

Improvement of Performance of Malware Similarity Analysis by the Sequence Alignment Technique (서열 정렬 기법을 이용한 악성코드 유사도 분석의 성능 개선)

  • Cho, In Kyeom;Im, Eul Gyu
    • KIISE Transactions on Computing Practices
    • /
    • v.21 no.3
    • /
    • pp.263-268
    • /
    • 2015
  • Malware variations could be defined as malicious executable files that have similar functions but different structures. In order to classify the variations, this paper analyzed sequence alignment, the method used in Bioinformatics. This method found common parts of the Malwares' API call information. This method's performance is dependent on the API call information's length; if the length is too long, the performance should be very poor. Therefore we removed the repeated patterns in API call information in order to improve the performance of sequence alignment analysis, before the method was applied. Finally the similarity between malware was analyzed using sequence alignment. The experimental results with the real malware samples were presented.

Random Sequence Analysis of the Genomic DNA of Methanopyrus kandleri and Molecular Cloning of the Gene Encoding a Homologue of the Catalytic Subunit of Carbon Monoxide Dehydrogenase

  • Shin, Hyun-Seock;Ryu, Jae-Ryeon;Han, Ye-Sun;Choi, Yong-Jin;Yu, Yeon-Gyu
    • Journal of Microbiology and Biotechnology
    • /
    • v.9 no.4
    • /
    • pp.404-413
    • /
    • 1999
  • Methanopyrus kandleri is a hyperthermophilic methanogen that represents one of the most heat-resistant organisms: the maximum growth temperature of M. kandleri is $110^{\circ}C$. A random sequence analysis of the genomic DNA of M. kandleri has been performed to obtain genomic information. More than 200 unique sequence tags were obtained and compared with the sequences in the GenBank and PIR databases. About 30% of the analyzed tags showed strong sequence similarity to previously identified genes involved in various cellular processes such as biosynthesis, transport, methanogenesis, or metabolism. When statistics relating to the frequency of codons were examined, the sequenced open reading frames showed highly biased codon usage and a high content of charged amino acids. Among the identified genes, a homologue of the catalytic subunit of carbon monoxide dehydrogenase (CODH) that reduces $CO_2$ to CO was cloned and sequenced in order to examine its detailed gene structure. The cloned gene includes consensus promoters. The amino acid sequence of the cloned gene shows a strong homology with the CODH genes from methanogenic Archaea, especially in the presumed binding sites for Fe-S centers.

  • PDF

TRAPR: R Package for Statistical Analysis and Visualization of RNA-Seq Data

  • Lim, Jae Hyun;Lee, Soo Youn;Kim, Ju Han
    • Genomics & Informatics
    • /
    • v.15 no.1
    • /
    • pp.51-53
    • /
    • 2017
  • High-throughput transcriptome sequencing, also known as RNA sequencing (RNA-Seq), is a standard technology for measuring gene expression with unprecedented accuracy. Numerous bioconductor packages have been developed for the statistical analysis of RNA-Seq data. However, these tools focus on specific aspects of the data analysis pipeline, and are difficult to appropriately integrate with one another due to their disparate data structures and processing methods. They also lack visualization methods to confirm the integrity of the data and the process. In this paper, we propose an R-based RNA-Seq analysis pipeline called TRAPR, an integrated tool that facilitates the statistical analysis and visualization of RNA-Seq expression data. TRAPR provides various functions for data management, the filtering of low-quality data, normalization, transformation, statistical analysis, data visualization, and result visualization that allow researchers to build customized analysis pipelines.

Development of Workbench for Analysis and Visualization of Whole Genome Sequence (전유전체(Whole gerlome) 서열 분석과 가시화를 위한 워크벤치 개발)

  • Choe, Jeong-Hyeon;Jin, Hui-Jeong;Kim, Cheol-Min;Jang, Cheol-Hun;Jo, Hwan-Gyu
    • The KIPS Transactions:PartA
    • /
    • v.9A no.3
    • /
    • pp.387-398
    • /
    • 2002
  • As whole genome sequences of many organisms have been revealed by small-scale genome projects, the intensive research on individual genes and their functions has been performed. However on-memory algorithms are inefficient to analysis of whole genome sequences, since the size of individual whole genome is from several million base pairs to hundreds billion base pairs. In order to effectively manipulate the huge sequence data, it is necessary to use the indexed data structure for external memory. In this paper, we introduce a workbench system for analysis and visualization of whole genome sequence using string B-tree that is suitable for analysis of huge data. This system consists of two parts : analysis query part and visualization part. Query system supports various transactions such as sequence search, k-occurrence, and k-mer analysis. Visualization system helps biological scientist to easily understand whole structure and specificity by many kinds of visualization such as whole genome sequence, annotation, CGR (Chaos Game Representation), k-mer, and RWP (Random Walk Plot). One can find the relations among organisms, predict the genes in a genome, and research on the function of junk DNA using our workbench.

Nucleotide Sequence Analysis and Secondary Structure Modeling of the 3'-Noncoding Regions of Two Korean Strains of Turnip Mosaic Virus (순무 모자이크 바이러스 두 한국계통의 3' 말단 비번역부위에 대한 염기서열분석 및 2차구조 모델링)

  • 최장경;류기현;최국선;박원목
    • Korean Journal Plant Pathology
    • /
    • v.11 no.3
    • /
    • pp.271-277
    • /
    • 1995
  • The RNA nucleotide sequences of the 3/-noncoding regions (3'-NCRs) of two Korean strains of turnip mosaic virus (TuMV), Ca and cqs, have been determined from their cDNA clones that encompassed the 3'-terminal regions of the viral genomic RNAs. The 3'-NCRs of both strains were 209 nucleotides long, terminated with GAC residues and poly (A) tails. The potential polyadenylational signal motif, UAUGU, was located 140 nucleotides upstream from the poly (A) tail in each of the virus. A highly conserved hexanucleotide sequence [A G U G A/U G/C], which was common in the 3'-NCRs of the potyvirus RNAs, was also found at the regions of 119 bases upstream from the 3'-end. Comparison of the 3'-NCRs of the two Korean isolates with those of four strains from Canada, China and Japan showed significantly identical genotypes (94.3∼99.5%). The secondary structure of three loops with long stems was found within the 3'-NCRs by sequence analysis. The substituted bases in the region among the six TuMV strains did not alter their secondary structures. Length of the 3'-NCRs of the know 11 potyviral RNAs and TuMV RNAs was different from one another and their nucleotide sequences showed 55.7% to 24.0% of homology. The 3'-NCR, therefore, is considered to be useful for phylogenetic studies in potyviruses.

  • PDF