• Title/Abstract/Keywords: Sequence database

Identification of highly pathogenic Beauveria bassiana strain against Pieris rapae larvae

  • DING, Jun-nan;LAI, Yong-cai
    • Entomological Research / v.48 no.5 / pp.339-347 / 2018
  • Seven strains of Beauveria bassiana were tested in a bioassay on Pieris rapae larvae. One strain showed relatively high pathogenicity towards P. rapae larvae: the adjusted mortality rate was 92.86 % and the infection rate was 85.71 % at 10 days post inoculation. Molecular identification was performed on the unknown strain. Internal Transcribed Spacer sequence analysis showed that the polymerase chain reaction amplicon of the unknown Beauveria sp. strain was 573 bp long, with 99 % sequence similarity to known B. bassiana sequences in the NCBI database. The strain was named Bb01. Changes in the proteins and PPO of P. rapae larvae infected with B. bassiana Bb01 were determined at different times. PPO activity increased from 1 to 6 d after inoculation and decreased again at 7 d. Invasion of the insect body by B. bassiana affected the balance of proteins and PPO.

The Brassica rapa Tissue-specific EST Database (배추의 조직 특이적 발현유전자 데이터베이스)

  • Yu, Hee-Ju;Park, Sin-Gi;Oh, Mi-Jin;Hwang, Hyun-Ju;Kim, Nam-Shin;Chung, Hee;Sohn, Seong-Han;Park, Beom-Seok;Mun, Jeong-Hwan
    • Horticultural Science & Technology / v.29 no.6 / pp.633-640 / 2011
  • Brassica rapa is an A genome model species for Brassica crop genetics, genomics, and breeding. With sequencing of the B. rapa genome complete, functional analysis of the genome is the next issue. Expressed sequence tags (ESTs) are fundamental resources supporting annotation and functional analysis of the genome, including identification of tissue-specific genes and promoters. As of July 2011, 147,217 ESTs from 39 cDNA libraries of B. rapa had been reported in the public database. However, little information can be retrieved from the sequences because organized databases are lacking. To leverage the sequence information and to maximize the use of publicly available EST collections, the Brassica rapa tissue-specific EST database (BrTED) was developed. BrTED includes sequence information for 23,962 unigenes assembled by the StackPack program. The unigene set is used as a query unit for various analyses such as BLAST against the TAIR gene model, functional annotation using MIPS and UniProt, gene ontology analysis, and prediction of tissue-specific unigene sets based on statistical tests. The database is composed of two main units: an EST sequence processing and information retrieval unit, and a tissue-specific expression profile analysis unit. Information and data in both units are tightly interconnected through a web-based browsing system. RT-PCR evaluation of 29 selected unigene sets successfully produced amplicons from the target tissues of B. rapa. BrTED allows the user to identify and analyze the expression of genes of interest and aids efforts to interpret the B. rapa genome through functional genomics. In addition, it can be used as a public resource providing reference information for studying the genus Brassica and other closely related crucifer crops.

Improving Bidirectional LSTM-CRF model Of Sequence Tagging by using Ontology knowledge based feature (온톨로지 지식 기반 특성치를 활용한 Bidirectional LSTM-CRF 모델의 시퀀스 태깅 성능 향상에 관한 연구)

  • Jin, Seunghee;Jang, Heewon;Kim, Wooju
    • Journal of Intelligence and Information Systems / v.24 no.1 / pp.253-266 / 2018
  • This paper proposes applying a sequence tagging methodology to improve the performance of NER (Named Entity Recognition) used in QA systems. To retrieve the correct answers stored in a database, the user's query must be converted into a database language such as SQL (Structured Query Language) so that the computer can interpret the user's language. This requires identifying the class or data name contained in the database. The existing method, which looks up the words of the query in the database and recognizes objects from the matches, cannot identify homophones and word phrases because it does not consider the context of the user's query. If there are multiple search results, all of them are returned, so the query can have many interpretations and the time complexity of the calculation becomes large. To overcome these problems, this study reflects the contextual meaning of the query using a Bidirectional LSTM-CRF. We also address a disadvantage of neural network models, which cannot identify untrained words, by using an ontology knowledge based feature. Experiments were conducted on an ontology knowledge base of the music domain, and the performance was evaluated. To evaluate the proposed L-Bidirectional LSTM-CRF accurately, we converted words included in the learned queries into untrained words and tested whether words that were in the database but untrained were correctly identified. As a result, the model could recognize objects in context and recognize untrained words without re-training the L-Bidirectional LSTM-CRF model, and the overall object recognition performance was confirmed to improve.

A Design of Matching Engine for a Practical Query-by-Singing/Humming System with Polyphonic Recordings

  • Lee, Seok-Pil;Yoo, Hoon;Jang, Dalwon
    • KSII Transactions on Internet and Information Systems (TIIS) / v.8 no.2 / pp.723-736 / 2014
  • This paper proposes a matching engine for a query-by-singing/humming (QbSH) system that works with polyphonic music files such as MP3 files. Because the pitch sequences extracted from polyphonic recordings may be distorted, we use chroma-scale representation, pre-processing, compensation, and asymmetric dynamic time warping to reduce the influence of the distortions. In an experiment with a 28-hour music DB, the performance of our QbSH system based on a polyphonic database is very promising in comparison with the published QbSH system based on a monophonic database: it achieves an MRR (Mean Reciprocal Rank) of 0.725. Our matching engine can also be used for a QbSH system based on a MIDI DB, and that performance was verified in MIREX 2011.
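
The Mean Reciprocal Rank quoted in this abstract can be illustrated with a short sketch. It is not the authors' evaluation code: the function names and the toy song identifiers are hypothetical, and it assumes each query has exactly one correct song and that a query whose correct song is missing from the ranked list contributes 0.

```python
from typing import Hashable, Sequence


def reciprocal_rank(ranked: Sequence[Hashable], target: Hashable) -> float:
    """Return 1/rank of the target in a ranked list, or 0.0 if it is absent."""
    for position, candidate in enumerate(ranked, start=1):
        if candidate == target:
            return 1.0 / position
    return 0.0


def mean_reciprocal_rank(
    results: Sequence[Sequence[Hashable]], targets: Sequence[Hashable]
) -> float:
    """Average the reciprocal rank over all queries."""
    if len(results) != len(targets):
        raise ValueError("results and targets must have the same length")
    if not results:
        return 0.0
    total = sum(reciprocal_rank(r, t) for r, t in zip(results, targets))
    return total / len(results)


# Toy example with hypothetical song IDs: the correct song is ranked
# 1st for the first query, 3rd for the second, and is missing for the third.
toy_results = [["a", "b"], ["x", "y", "z"], ["p"]]
toy_targets = ["a", "z", "q"]
toy_mrr = mean_reciprocal_rank(toy_results, toy_targets)  # (1 + 1/3 + 0) / 3
```

Under this definition, a value of 0.725 means the correct song tends to appear at or near the top of the returned list.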

Protein-ligand interactions from the perspective of binding specificity

  • Ahmad, Shandar
    • Proceedings of the Korean Society for Bioinformatics Conference / 2003.10a / pp.4-4 / 2003
  • A large number of in-vitro experiments on the inhibition of kinases and proteases are reported in the literature and compiled in the ProLINT database. Using this wealth of knowledge, we have carried out an analysis of the ligand specificity of these two classes of proteins. Each of the proteases and kinases included in the database has been assigned a consensus ligand fragment signature, based on the available information about its interaction with different ligands. A set of 43 fragments efficiently represents every ligand. We have then organized the consensus fragment signatures for every protein in the form of a cluster-tree diagram. Such a tree is also constructed from other sequence, structure, and physical considerations. Cluster-cluster comparison between these analyses provides valuable information about ligand-specific interactions and similarities between proteins.

Design and Implementation of gene sequence database with streptomyces data (유전자 데이터베이스의 설계 및 구현: streptomyces data를 예로)

  • Kim, Jin;Kim, Bun-Joon;Kim, Jeong-Mi;Kim, Dong-Hoi
    • Proceedings of the Korean Information Science Society Conference / 2001.04b / pp.160-162 / 2001
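  • As gene sequences and related information grow explosively, gene information services for users, efficient online analysis of sequence information, efficient management of sequence information, and information sharing among related researchers have become necessary. This paper discusses the design and implementation of a system that efficiently manages streptomyces gene data on the Internet while providing useful services to users. Users can download the gene information they want from the system. They can also compare a gene they wish to analyze with the genes in the streptomyces database to infer useful information.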

A Study on CAD interfaced CAPP System for Turning Operation ( I ) : Automatic Feature Recognition and Process Selection (선삭공정에서 CAD 인터페이스된 자동공정계획시스템개발에 관한 연구( I ) : 형상특징의 자동인식과 공정선정)

  • Cho, Kyu-Kap;Kim, In-Ho
    • Journal of Korean Institute of Industrial Engineers / v.17 no.2 / pp.1-16 / 1991
  • This paper deals with some critical activities of a CAPP system, such as generation of the part description database, part feature recognition, process and operation selection, and the sequencing method for turning operations on symmetric rotational parts. The part description database is generated from CAD data by a data conversion module, and part features are recognized using both pattern primitives and feature recognition rules. Machining processes and operations are selected based on machining surface features, and their sequence is determined by rules acquired from a process planning expert. AutoCAD is employed as the CAD system, and the computer program is developed in Turbo-C on an IBM PC/AT compatible system.

ChimerDB - Database of Chimeric Sequences in the GenBank

  • Kim, Namshin;Shin, Seokmin;Cho, Kwang-Hwi;Lee, Sanghyuk
    • Genomics & Informatics / v.2 no.2 / pp.61-66 / 2004
  • Fusion proteins resulting from chimeric sequences are excellent targets for therapeutic drug development. We developed a database of chimeric sequences by examining the genomic alignment of mRNA and EST sequences in GenBank. We identified 688 chimeric mRNA and 20,998 chimeric EST sequences. Including EST sequences greatly expands the scope of chimeric sequences, even though it inevitably brings in many artifacts. Chimeric sequences are clustered according to the ECgene ID so that the user can easily find chimeric sequences related to a specific gene. Alignments of chimeric sequences are displayed as custom tracks in the UCSC genome browser. ChimerDB, available at http://genome.ewha.ac.kr/ECgene/ChimerDB/, should be a valuable resource for finding drug targets to treat cancers.

Similarity-Based Subsequence Search in Image Sequence Databases (이미지 시퀀스 데이터베이스에서의 유사성 기반 서브시퀀스 검색)

  • Kim, In-Bum;Park, Sang-Hyun
    • The KIPS Transactions:PartD / v.10D no.3 / pp.501-512 / 2003
  • This paper proposes an indexing technique for fast retrieval of similar image subsequences using the multi-dimensional time warping distance. The time warping distance is a more suitable similarity measure than the Lp distance in many applications where sequences may have different lengths and/or different sampling rates. Our indexing scheme employs a disk-based suffix tree as the index structure and uses a lower-bound distance function to filter out dissimilar subsequences without false dismissals. It applies normalization for easier control of the relative weighting of feature dimensions, and discretization to compress the index tree. Experiments on medical and synthetic image sequences verify that the proposed method significantly outperforms the naive method and scales well to large image sequence databases.
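
As a rough illustration of why a time warping distance can compare sequences of different lengths, the sketch below computes a basic multi-dimensional time warping distance by dynamic programming. It is an assumption-laden sketch, not the paper's method: the function name is hypothetical, the Euclidean distance between feature vectors is an assumed choice of element distance, and the suffix-tree index, lower-bound filter, normalization, and discretization described in the abstract are not included.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def time_warping_distance(a: Sequence[Vector], b: Sequence[Vector]) -> float:
    """Classic O(len(a) * len(b)) time warping distance between two
    sequences of feature vectors. Element cost is Euclidean (an assumption)."""
    if not a or not b:
        raise ValueError("both sequences must be non-empty")
    m = len(b)
    # prev[j] holds the best cumulative cost of aligning the first i-1
    # elements of a with the first j elements of b.
    prev: List[float] = [math.inf] * (m + 1)
    prev[0] = 0.0
    for i in range(1, len(a) + 1):
        curr: List[float] = [math.inf] * (m + 1)
        for j in range(1, m + 1):
            cost = math.dist(a[i - 1], b[j - 1])
            # Extend the cheapest of: stretch b, stretch a, or advance both.
            curr[j] = cost + min(prev[j], curr[j - 1], prev[j - 1])
        prev = curr
    return prev[m]


# The second sequence repeats one frame, as if sampled at a different rate;
# warping aligns the repeated frame at no extra cost.
seq_a = [(0.0, 0.0), (1.0, 1.0), (2.0, 2.0)]
seq_b = [(0.0, 0.0), (1.0, 1.0), (1.0, 1.0), (2.0, 2.0)]
warped = time_warping_distance(seq_a, seq_b)
```

An Lp distance is not even defined for `seq_a` and `seq_b` without resampling, since their lengths differ, which is the situation the abstract describes.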

Object Retrieval Using the Corners Area Variability Based on Correlogram (코너영역 분산치 기반 코렐로그램을 이용한 형태검출)

  • An, Young-Eun;Lee, Ji-Min;Yang, Won-Ii;Choi, Young-Il;Chang, Min-Hyuk
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.11 no.6 / pp.283-288 / 2011
  • This paper proposes an object retrieval method using corner area variability based on a correlogram. The proposed algorithm proceeds as follows. First, the corner points of the object in an image are extracted and the feature vectors are obtained. These are rearranged according to the number dimension to form sequence vectors. The similarity is then measured based on the maximum of the sequence vectors. The proposed technique is invariant to rotation or translation of the objects and is more efficient when the objects have a simple structure. In a simulation using Wang's database, the recall of the method improves by 0.03% or more over the standard corner patch histogram.