• Title/Summary/Keyword: Analysis of Query

Search Result 457, Processing Time 0.021 seconds

Accelerating Group Fusion for Ligand-Based Virtual Screening on Multi-core and Many-core Platforms

  • Mohd-Hilmi, Mohd-Norhadri;Al-Laila, Marwah Haitham;Hassain Malim, Nurul Hashimah Ahamed
    • Journal of Information Processing Systems
    • /
    • v.12 no.4
    • /
    • pp.724-740
    • /
    • 2016
  • The performance issues of screening large database compounds and multiple query compounds in virtual screening highlight a common concern in Chemoinformatics applications. This study investigates these problems by choosing group fusion as a pilot model and presents efficient parallel solutions in parallel platforms, specifically, the multi-core architecture of CPU and many-core architecture of graphical processing unit (GPU). A study of sequential group fusion and a proposed design of parallel CUDA group fusion are presented in this paper. The design involves solving two important stages of group fusion, namely, similarity search and fusion (MAX rule), while addressing embarrassingly parallel and parallel reduction models. The sequential, optimized sequential and parallel OpenMP of group fusion were implemented and evaluated. The outcome of the analysis from these three different design approaches influenced the design of parallel CUDA version in order to optimize and achieve high computation intensity. The proposed parallel CUDA performed better than sequential and parallel OpenMP in terms of both execution time and speedup. The parallel CUDA was 5-10x faster than sequential and parallel OpenMP as both similarity search and fusion MAX stages had been CUDA-optimized.

Experimental Analysis of Correct Answer Characteristics in Question Answering Systems (질의응답시스템에서 정답 특징에 관한 실험적 분석)

  • Han, Kyoung-Soo
    • Journal of Digital Contents Society
    • /
    • v.19 no.5
    • /
    • pp.927-933
    • /
    • 2018
  • One of the factors that have the greatest influence on the error of the question answering system that finds and provides answers to natural language questions is the step of searching for documents or passages that contain correct answers. In order to improve the retrieval performance, it is necessary to understand the characteristics of documents and passages containing correct answers. This paper experimentally analyzes how many question words appear in the correct answer documents, how the location of the question word is distributed, and how the topic of the question and the correct answer document are similar using the corpus composed of the question, the documents with correct answer, and the documents without correct answer. This study explains the causes of previous search research results for question answer system and discusses the necessary elements of effective search step.

The performance analysis of the selective element encryption method (선택적 요소 암호화 방식에 대한 성능 분석)

  • Yang, Xue;Kim, Ji-Hong
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.19 no.4
    • /
    • pp.848-854
    • /
    • 2015
  • There are a lot of encryption methods to secure database proposed recently. Those encryption methods can protect the sensitive data of users effectively, but it deteriorates the search performance of database query. In this paper, we proposed the selective element encryption method in order to complement those drawbacks. In addition, we compared the performance of the proposed method with that of tuple level encryption method using the various queries. As a result, we found that the proposed method, which use the selective element encryption with bloom filter as a index, has better performance than the other encryption method.

A Study of CBIR(Content-based Image Retrieval) Computer-aided Diagnosis System of Breast Ultrasound Images using Similarity Measures of Distance (거리 기반 유사도 측정을 통한 유방 초음파 영상의 내용 기반 검색 컴퓨터 보조 진단 시스템에 관한 연구)

  • Kim, Min-jeong;Cho, Hyun-chong
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.66 no.8
    • /
    • pp.1272-1277
    • /
    • 2017
  • To assist radiologists for the characterization of breast masses, Computer-aided Diagnosis(CADx) system has been studied. The CADx system can improve the diagnostic accuracy of radiologists by providing objective information about breast masses. Morphological and texture features were extracted from the breast ultrasound images. Based on extracted features, the CADx system retrieves masses that are similar to a query mass from a reference library using a k-nearest neighbor (k-NN) approach. Eight similarity measures of distance, Euclidean, Chebyshev(Minkowski family), Canberra, Lorentzian($F_2$ family), Wave Hedges, Motyka(Intersection family), and Cosine, Dice(Inner Product family) are evaluated by ROC(Receiver Operating Characteristic) analysis. The Inner Product family measure used with the k-NN classifier provided slightly higher performance for classification of malignant and benign masses than those with the Minkowski, $F_2$, and Intersection family measures.

A Combined Pharmacophore-Based Virtual Screening, Docking Study and Molecular Dynamics (MD) Simulation Approach to Identify Inhibitors with Novel Scaffolds for Myeloid cell leukemia (Mcl-1)

  • Bao, Guang-Kai;Zhou, Lu;Wang, Tai-Jin;He, Lu-Fen;Liu, Tao
    • Bulletin of the Korean Chemical Society
    • /
    • v.35 no.7
    • /
    • pp.2097-2108
    • /
    • 2014
  • Chemical feature based quantitative pharmacophore models were generated using the HypoGen module implemented in DS2.5. The best hypothesis, Hypo1, which was characterized by the highest correlation coefficient (0.96), the highest cost difference (61.60) and the lowest RMSD (0.74), consisted of one hydrogen bond acceptor, one hydrogen bond donor, one hydrophobic and one ring aromatic. The reliability of Hypo1 was validated on the basis of cost analysis, test set, Fischer's randomization method and GH test method. The validated Hypo1 was used as a 3D search query to identify novel inhibitors. The screened molecules were further refined by employing ADMET, docking studies and visual inspection. Three compounds with novel scaffolds were selected as the most promising candidates for the designing of Mcl-1 antagonists. Finally, a 10 ns molecular dynamics simulation was carried out on the complex of receptor and the retrieved ligand to demonstrate that the binding mode was stable during the MD simulation.

Distributed Information Extraction in Wireless Sensor Networks using Multiple Software Agents with Dynamic Itineraries

  • Gupta, Govind P.;Misra, Manoj;Garg, Kumkum
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.8 no.1
    • /
    • pp.123-144
    • /
    • 2014
  • Wireless sensor networks are generally deployed for specific applications to accomplish certain objectives over a period of time. To fulfill these objectives, it is crucial that the sensor network continues to function for a long time, even if some of its nodes become faulty. Energy efficiency and fault tolerance are undoubtedly the most crucial requirements for the design of an information extraction protocol for any sensor network application. However, most existing software agent based information extraction protocols are incapable of satisfying these requirements because of static agent itineraries and large agent sizes. This paper proposes an Information Extraction protocol based on Multiple software Agents with Dynamic Itineraries (IEMADI), where multiple software agents are dispatched in parallel to perform tasks based on the query assigned to them. IEMADI decides the itinerary for an agent dynamically at each hop using local information. Through mathematical analysis and simulation, we compare the performance of IEMADI with a well known static itinerary based protocol with respect to energy consumption and response time. The results show that IEMADI provides better performance than the static itinerary based protocols.

Speech Query Recognition for Tamil Language Using Wavelet and Wavelet Packets

  • Iswarya, P.;Radha, V.
    • Journal of Information Processing Systems
    • /
    • v.13 no.5
    • /
    • pp.1135-1148
    • /
    • 2017
  • Speech recognition is one of the fascinating fields in the area of Computer science. Accuracy of speech recognition system may reduce due to the presence of noise present in speech signal. Therefore noise removal is an essential step in Automatic Speech Recognition (ASR) system and this paper proposes a new technique called combined thresholding for noise removal. Feature extraction is process of converting acoustic signal into most valuable set of parameters. This paper also concentrates on improving Mel Frequency Cepstral Coefficients (MFCC) features by introducing Discrete Wavelet Packet Transform (DWPT) in the place of Discrete Fourier Transformation (DFT) block to provide an efficient signal analysis. The feature vector is varied in size, for choosing the correct length of feature vector Self Organizing Map (SOM) is used. As a single classifier does not provide enough accuracy, so this research proposes an Ensemble Support Vector Machine (ESVM) classifier where the fixed length feature vector from SOM is given as input, termed as ESVM_SOM. The experimental results showed that the proposed methods provide better results than the existing methods.

MRI Image Retrieval Using Wavelet with Mahalanobis Distance Measurement

  • Rajakumar, K.;Muttan, S.
    • Journal of Electrical Engineering and Technology
    • /
    • v.8 no.5
    • /
    • pp.1188-1193
    • /
    • 2013
  • In content based image retrieval (CBIR) system, the images are represented based upon its feature such as color, texture, shape, and spatial relationship etc. In this paper, we propose a MRI Image Retrieval using wavelet transform with mahalanobis distance measurement. Wavelet transformation can also be easily extended to 2-D (image) or 3-D (volume) data by successively applying 1-D transformation on different dimensions. The proposed algorithm has tested using wavelet transform and performance analysis have done with HH and $H^*$ elimination methods. The retrieval image is the relevance between a query image and any database image, the relevance similarity is ranked according to the closest similar measures computed by the mahalanobis distance measurement. An adaptive similarity synthesis approach based on a linear combination of individual feature level similarities are analyzed and presented in this paper. The feature weights are calculated by considering both the precision and recall rate of the top retrieved relevant images as predicted by our enhanced technique. Hence, to produce effective results the weights are dynamically updated for robust searching process. The experimental results show that the proposed algorithm is easily identifies target object and reduces the influence of background in the image and thus improves the performance of MRI image retrieval.

A Compound Term Retrieval Model Using Statistical lnformation (통계적 정보를 이용한 복합명사 검색 모델)

  • 박영찬;최기선
    • Korean Journal of Cognitive Science
    • /
    • v.6 no.3
    • /
    • pp.65-81
    • /
    • 1995
  • Compound nouns as a composition of multiple nouns exhibit diverse occurence patterns in the texts and have varying degree of meaning coherence.The problem of compound nouns in information retrieval is to find a method to represent and identify the compositive patterns of each words.This paper explains how the cooccurrence patterns are related with the meaning of each compound noun and the information of such relations that can be mechanically acquired from texts is used in ranking the candidated documents for a given query.The main theme of the paper is that compound nouns can be categorized according to their occurrence patterns of simple nouns and these occurrence patterns can be formalized by statistical analysis without large dictionary or complex compositive rules.Our suggested model achieved about 7.75% improvement over the best precision of the other methods at each recall measurements on Korean test collection.

  • PDF

Analysis of user's query and design of system for implementation of highway traffic datawarehouse (교통정보 이력자료 통합데이터베이스 구축을 위한 사용자 요구사항 분석 및 시스템 설계)

  • Cheong, Su-Jeong;Yun, Hye-Jung;Song, Soo-Kyung;Lee, Yoon-Kyung;Lee, Min-Soo;Oh, Cheol;NamGung, Sung
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2007.05a
    • /
    • pp.88-90
    • /
    • 2007
  • 수집 및 가공된 교통자료와 시스템 운영 자료의 체계적인 분석 수단으로 사용될 교통 이력자료 관리 시스템의 요구 기능을 정의하는 것은 매우 중요하다. 시스템 구축을 위한 사용자 요구사항을 구체적으로 정의, 설명함으로써 시스템 완성도와 활용도를 높인다. 본 논문에서는 교통 이력자료 관리시스템의 주요 기능으로 '자료 저장 기능', '자료 분석 기능', '자료 보고 기능' 을 제안한다. 이러한 시스템의 사용자 요구 기능은 Rational Rose tool 을 이용하여 Use Case 다이어그램으로 시각화 되어지며 이후 교통정보 이력자료 통합데이터베이스 구축을 위한 개발자들에게 더욱 쉬운 이해를 제공할 수 있다.

  • PDF