• Title/Summary/Keyword: sequence databases

Search Result 224, Processing Time 0.026 seconds

Discrimination of Flaviviruses with High Frequency of Infection in Asian Countries: Epitope Prediction by Bioinformatic Approaches (아시아 국가 내에서 감염빈도가 높은 플라비바이러스의 구별: 생물정보학적 접근을 통한 항원결정기 예측)

  • Choi, Jae-Won;Jo, Byung-Gwan;Kim, Min Jung;Park, Suji;Kim, Hak Yong
    • The Journal of the Korea Contents Association
    • /
    • v.18 no.4
    • /
    • pp.99-113
    • /
    • 2018
  • Recently, global warming has widened the habitat of mosquitoes and infection chances for mosquito-borne diseases are increasing. Flavivirus is a typical mosquito-borne virus. Flaviviruses with a relatively high frequency of infection in Asian countries include Zika, Dengue, and Japanese encephalitis viruses. Although distinctive diagnosis of flaviviruses is required because the symptoms and therapeutic method differ, there is no diagnostic method that can distinguish them accurately yet. In this study, we propose distinctive diagnosis method of flaviviruses using informations and analysis tools constructed in bioinformatic databases. The envelope protein and non-structural protein 1 which are useful protein for the immuno-diagnostics of three flaviviruses were selected. Their homology was analyzed by multiple sequence alignments and epitope candidates consisting of 10-15 amino acids were selected. Finally two epitopes were suggested to be most useful by immunogenicity analysis and 3D structure prediction. These approaches and results are expected to be great value in the distinctive diagnosis of three flaviviruses with a high frequency of infection in Asian countries.

Gramene database: A resource for comparative plant genomics, pathways and phylogenomics analyses

  • Tello-Ruiz, Marcela K.;Stein, Joshua;Wei, Sharon;Preece, Justin;Naithani, Sushma;Olson, Andrew;Jiao, Yinping;Gupta, Parul;Kumari, Sunita;Chougule, Kapeel;Elser, Justin;Wang, Bo;Thomason, James;Zhang, Lifang;D'Eustachio, Peter;Petryszak, Robert;Kersey, Paul;Lee, PanYoung Koung;Jaiswal, kaj;Ware, Doreen
    • Proceedings of the Korean Society of Crop Science Conference
    • /
    • 2017.06a
    • /
    • pp.135-135
    • /
    • 2017
  • The Gramene database (http://www.gramene.org) is a powerful online resource for agricultural researchers, plant breeders and educators that provides easy access to reference data, visualizations and analytical tools for conducting cross-species comparisons. Learn the benefits of using Gramene to enrich your lectures, accelerate your research goals, and respond to your organismal community needs. Gramene's genomes portal hosts browsers for 44 complete reference genomes, including crops and model organisms, each displaying functional annotations, gene-trees with orthologous and paralogous gene classification, and whole-genome alignments. SNP and structural diversity data, available for 11 species, are displayed in the context of gene annotation, protein domains and functional consequences on transcript structure (e.g., missense variant). Browsers from multiple species can be viewed simultaneously with links to community-driven organismal databases. Thus, while hosting the underlying data for comparative studies, the portal also provides unified access to diverse plant community resources, and the ability for communities to upload and display private data sets in multiple standard formats. Our BioMart data mining interface enable complex queries and bulk download of sequence, annotation, homology and variation data. Gramene's pathway portal, the Plant Reactome, hosts over 240 pathways curated in rice and inferred in 66 additional plant species by orthology projection. Users may compare pathways across species, query and visualize curated expression data from EMBL-EBI's Expression Atlas in the context of pathways, analyze genome-scale expression data, and conduct pathway enrichment analysis. Our integrated search database and modern user interface leverage these diverse annotations to facilitate finding genes through selecting auto-suggested filters with interactive views of the results.

  • PDF

Ac/Ds-mediated gene tagging system in rice

  • Eun, Moo-Young;Yun, Doh-Won;Nam, Min-Hee;Yi, Gi-Hwan;Han, Chang-Deok;Kim, Doh-Hoon;Park, Woong-June;Kim, Cheol-Soo;Park, Soon-Ki
    • Proceedings of the Korean Society of Plant Biotechnology Conference
    • /
    • 2005.11a
    • /
    • pp.95-105
    • /
    • 2005
  • Transposon-mediated insertional mutagenesis provides one of the most powerful tools for functional studies of genes in higher plants. This project has been performed to develop a large population of insertional mutations, and to construct databases of molecular information on Ds insertion sites in rice. Ultimate goals are to supply genetic materials and information to analyze gene function and to identify and utilize agronomically important genes for breeding purpose. Two strategies have been employed to generate the large scale of transposon population in a Japonica type rice, Dongjin Byeo; 1) genetic crosses between Ac and Ds lines and 2) plant regeneration from seeds carrying Ac and Ds. Our study showed that over 70% of regenerated plants generally carried independent Ds elements and high activity of transposition was detected only during regeneration period. Ds-flanking DNA amplified from leaf tissues of F2 and T1 (or T2) plants have been amplified via TAIL-PCR and directly sequenced. So far, over 65,000 Ds lines have been generated and over 9,500 Ds loci have been mapped on chromosomes by sequence analysis. Database of molecular information on Ds insertion sites has been constructed, and has been opened to the public and will be updated soon at http://www.niab.go.kr. Detailed functional analysis of more than 30 rice mutants has been performed. Several Ds-tagged rice genes that have been selected for functional analysis will be briefly introduced. We expect that a great deal of information and genetic resources of Ds lines would be obtained during the course of this project, which will be shared with domestic and international rice researchers. In addition to the Japonica rice, we have established the tagging system in an rice line of indica genetic background, MGRI079. MGRI079 (Indica/Japonica) was transformed with Agrobacteria carrying Ac and Ds T-DNA vectors. Among transgenic lines, we successfully identified single-copy Ds and Ac lines in MGR1079. These lines were served as ‘starter lines’ to mutagenize Indica genetic background. To achieve rapid, large scale generation of Ds transposant lines, MGR1079 transformants carrying homozygous Ac were crossed with ones with homozygous Ds, and $F_2$seeds were used for plant regeneration. In this year, over 2,000 regeneration plants were grown in the field. We are able to evaluate the tagging efficiency in the Indica genetic background in the fall.

  • PDF

Comparative Analysis of the Community of Culturable Bacteria Associated with Sponges, Spirastrella abata and Spirastrella panis by 16S rDNA-RFLP (16S rDNA-RFLP에 의한 Spirastrella abata와 Spirastrella panis 해면에 서식하는 배양가능한 공생세균 군집의 비교)

  • Cho, Hyun-Hee;Park, Jin-Sook
    • Korean Journal of Microbiology
    • /
    • v.45 no.2
    • /
    • pp.155-162
    • /
    • 2009
  • A cultivation-based approach was employed to compare the culturable bacterial diversity associated with two phylogenetically closely related marine sponges, Spirastrella abata and Spirastrella panis, which have geologically overlapping distribution patterns. The bacteria associated with sponge were cultivated using MA medium supplemented with 3% sponge extracts. Community structures of the culturable bacteria of the two sponge species were analyzed with PCR-RFLP (restriction fragment length polymorphism) based on 16S rDNA sequences. The RFLP fingerprinting of 16S rDNA digested with HaeIII and MspI, revealed 24 independent RFLP types, in which 1-5 representative strains from each type were partially sequenced. The sequence analysis showed >98.4% similarity to known bacterial species in public databases. Overall, the microbial populations of two sponges investigated were found to be the members of the classes; Alphaproteobacteria, Gammaproteobacteria, Firmicutes, and Actinobacteria. The Alphaproteobacteria were predominant in the bacterial communities of the two sponges. Gammaproteobacteria represented 38.5% of bacterial community in S. abata. Whereas only 1.6% of this class was present in S. panis. Bacillus species were dominat in S. panis. Bacillus species were found to be 44.3% of bacterial species in S. panis, while they were only 9.7% in S. abata. It is interesting to note that Planococcus maritimus (8.1%, phylum Firmicutes) and Psychrobacter nivimaris (28.9%, phylum Gammaproteobacteria) were found only in S. abata. This result revealed that profiles of bacterial communities from the sponges with a close phylogenetic relationship were highly species-specific.

Optimizing Multi-way Join Query Over Data Streams (데이타 스트림에서의 다중 조인 질의 최적화 방법)

  • Park, Hong-Kyu;Lee, Won-Suk
    • Journal of KIISE:Databases
    • /
    • v.35 no.6
    • /
    • pp.459-468
    • /
    • 2008
  • A data stream which is a massive unbounded sequence of data elements continuously generated at a rapid rate. Many recent research activities for emerging applications often need to deal with the data stream. Such applications can be web click monitoring, sensor data processing, network traffic analysis. telephone records and multi-media data. For this. data processing over a data stream are not performed on the stored data but performed the newly updated data with pre-registered queries, and then return a result immediately or periodically. Recently, many studies are focused on dealing with a data stream more than a stored data set. Especially. there are many researches to optimize continuous queries in order to perform them efficiently. This paper proposes a query optimization algorithm to manage continuous query which has multiple join operators(Multi-way join) over data streams. It is called by an Extended Greedy query optimization based on a greedy algorithm. It defines a join cost by a required operation to compute a join and an operation to process a result and then stores all information for computing join cost and join cost in the statistics catalog. To overcome a weak point of greedy algorithm which has poor performance, the algorithm selects the set of operators with a small lay, instead of operator with the smallest cost. The set is influenced the accuracy and execution time of the algorithm and can be controlled adaptively by two user-defined values. Experiment results illustrate the performance of the EGA algorithm in various stream environments.

Protein Structure Alignment Based on Maximum of Residue Pair Distance and Similarity Graph (정렬된 잔기 사이의 최대거리와 유사도 그래프에 기반한 단백질 구조 정렬)

  • Kim, Woo-Cheol;Park, Sang-Hyun;Won, Jung-Im
    • Journal of KIISE:Databases
    • /
    • v.34 no.5
    • /
    • pp.396-408
    • /
    • 2007
  • After the Human Genome Project finished the sequencing of a human DNA sequence, the concerns on protein functions are increasing. Since the structures of proteins are conserved in divergent evolution, their functions are determined by their structures rather than by their amino acid sequences. Therefore, if similarities between two protein structures are observed, we could expect them to have common biological functions. So far, a lot of researches on protein structure alignment have been performed. However, most of them use RMSD(Root Mean Square Deviation) as a similarity measure with which it is hard to judge the similarity level of two protein structures intuitively. In addition, they retrieve only one result having the highest alignment score with which it is hard to satisfy various users of different purpose. To overcome these limitations, we propose a novel protein structure alignment algorithm based on MRPD(Maximum of Residue Pair Distance) and SG (Similarity Graph). MRPD is more intuitive similarity measure by which fast tittering of unpromising pairs of protein pairs is possible, and SG is a compact representation method for multiple alignment results with which users can choose the most plausible one among various users' needs by providing multiple alignment results without compromising the time to align protein structures.

Development of a Recombinant Streptomyces griseus with sprA and sprB Genes for Proteolytic Enzyme Production (Streptomyces griseus IFO13350 유래 sprA 및 sprB 유전자를 이용한 Pretense 생산균주 개발)

  • Hwang Ji-Hwan;Lee Chang-Kwon;Lee Kang-Mu;Jo Byoung-Kee;Park Hae-Ryong;Hwang Yong-Il
    • Korean Journal of Microbiology
    • /
    • v.41 no.1
    • /
    • pp.87-92
    • /
    • 2005
  • Pronase, a protease produced for commercial purpose by Streptomyces griseus, was composed of serine protease, alkaline protease, aminopeptidase and carboxypeptidase complex, and it has been widely used as anti-inflammatory drugs for human therapy. In this study, we developed a new integration vector, pHJ101 derived from pSET152, containing strong promoter, ermE, to overexpress a certain protease gene. Specific PCR primers for cloning of sprA (a gene for S. griseus protease A) and sprB (a gene for S. griseus protease B) genes were designed from the basis of nucleotide sequence in databases and amplified by PCR. Plasmid pHJ201 and pHJ202 were constructed by inserting of amplified each gene in a vector pHJ101. S. griseus HA and S. griseus HB were respectively obtained by conjugal process of a parent strain, S. griseus IFO 13350 with the recombinant Escherichia coli harboring plasmid pHJ201 or pHJ202. When protease activity was measured in flask cultivation, produced protease levels of S. griseus HA and S. griseus HB increased about 5.3 times and 5 times, respectively, more than that of parent strain. And, the constructed integrating plasmid pHJ101 was applicable for overexpression of a certain gene in Streptomyces sp.

Sequencing of cDNA Clones Expressed in Adipose Tissues of Korean Cattle

  • Bong, J.J.;Tong, K.;Cho, K.K.;Baik, M.G.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.18 no.4
    • /
    • pp.483-489
    • /
    • 2005
  • To understand the molecular mechanisms that regulate intramuscular fat deposition and its release, cDNA clones expressed in adipose tissues of Korean cattle were identified by differential screening from adipose tissue cDNA library. By partial nucleotide sequencing of 486 clones and a search for sequence similarity in NCBI nucleotide databases, 245 clones revealed unique clones. By a functional grouping of the clones, 14% of the clones were categorized to metabolism and enzyme-related group (stearoyl CoA desaturase, lactate dehydrogenase, fatty acid synthase, ATP citrate lyase, lipoprotein lipase, acetyl CoA synthetase, etc), and 6% to signal transduction/cell cycle-related group (C/EBP, cAMP-regulated phosphoprotein, calmodulin, cyclin G1, cyclin H, etc), and 4% to cytoskeleton and extracellular matrix components (vimentin, ankyrin 2, gelosin, syntenin, talin, prefoldin 5). The obtained 245 clones will be useful to study lipid metabolism and signal transduction pathway in adipose tissues and to study obesity in human. Some clones were subjected to full-sequencing containing open reading frame. The cDNA clone of bovine homolog of human prefoldin 5 gene had a total length of 959 nucleotides coding for 139 amino acids. Comparison of the deduced amino acid sequences of bovine prefoldin 5 with those of human and mouse showed over 95% identity. The cDNA clone of bovine homolog of human ubiquitin-like/S30 ribosomal fusion protein gene had a total length of 484 nucleotides coding for 133 amino acids. Comparison of the deduced amino acid sequences of bovine ubiquitin-like/S30 ribosomal fusion protein gene with those of human, rat and mouse showed over 97% identity. The cDNA clone of bovine homolog of human proteolipid protein 2 mRNA had a total length of 928 nucleotides coding for 152 amino acids. Comparison of the deduced amino acid sequences of bovine proteolipid protein 2 with those of human and mouse showed 87.5% similarity. The cDNA clone of bovine homolog of rat thymosin beta 4 had a total length of 602 nucleotides coding for 44 amino acids. Comparison of the deduced amino acid sequences of bovine thymosin beta 4 gene with those of human, mouse and rat showed 93.1% similarity. The cDNA clone of bovine homolog of human myotrophin mRNA had a total length of 790 nucleotides coding for 118 amino acids. Comparison of the deduced amino acid sequences of bovine myotrophin gene with those of human, mouse and rat showed 83.9% similarity. The functional role of these clones in adipose tissues needs to be established.

Reordering Scheme of Location Identifiers for Indexing RFID Tags (RFID 태그의 색인을 위한 위치 식별자 재순서 기법)

  • Ahn, Sung-Woo;Hong, Bong-Hee
    • Journal of KIISE:Databases
    • /
    • v.36 no.3
    • /
    • pp.198-214
    • /
    • 2009
  • Trajectories of RFID tags can be modeled as a line, denoted by tag interval, captured by an RFID reader and indexed in a three-dimensional domain, with the axes being the tag identifier (TID), the location identifier (LID), and the time (TIME). Distribution of tag intervals in the domain space is an important factor for efficient processing of a query for tracing tags and is changed according to arranging coordinates of each domain. Particularly, the arrangement of LIDs in the domain has an effect on the performance of queries retrieving the traces of tags as times goes by because it provides the location information of tags. Therefore, it is necessary to determine the optimal ordering of LIDs in order to perform queries efficiently for retrieving tag intervals from the index. To do this, we propose LID proximity for reordering previously assigned LIDs to new LIDs and define the LID proximity function for storing tag intervals accessed together closely in index nodes when a query is processed. To determine the sequence of LIDs in the domain, we also propose a reordering scheme of LIDs based on LID proximity. Our experiments show that the proposed reordering scheme considerably improves the performance of Queries for tracing tag locations comparing with the previous method of assigning LIDs.

Linear Resource Sharing Method for Query Optimization of Sliding Window Aggregates in Multiple Continuous Queries (다중 연속질의에서 슬라이딩 윈도우 집계질의 최적화를 위한 선형 자원공유 기법)

  • Baek, Seong-Ha;You, Byeong-Seob;Cho, Sook-Kyoung;Bae, Hae-Young
    • Journal of KIISE:Databases
    • /
    • v.33 no.6
    • /
    • pp.563-577
    • /
    • 2006
  • A stream processor uses resource sharing method for efficient of limited resource in multiple continuous queries. The previous methods process aggregate queries to consist the level structure. So insert operation needs to reconstruct cost of the level structure. Also a search operation needs to search cost of aggregation information in each size of sliding windows. Therefore this paper uses linear structure for optimization of sliding window aggregations. The method comprises of making decision, generation and deletion of panes in sequence. The decision phase determines optimum pane size for holding accurate aggregate information. The generation phase stores aggregate information of data per pane from stream buffer. At the deletion phase, panes are deleted that are no longer used. The proposed method uses resources less than the method where level structures were used as data structures as it uses linear data format. The input cost of aggregate information is saved by calculating only pane size of data though numerous stream data is arrived, and the search cost of aggregate information is also saved by linear searching though those sliding window size is different each other. In experiment, the proposed method has low usage of memory and the speed of query processing is increased.