• Title/Summary/Keyword: Complex Query

Search Result 128, Processing Time 0.04 seconds

Acceleration of Building Thesaurus in Fuzzy Information Retrieval Using Relational products

  • Kim, Chang-Min;Kim, Young-Gi
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 1998.06a
    • /
    • pp.240-245
    • /
    • 1998
  • Fuzzy information retrieval which uses the concept of fuzzy relation is able to retrieve documents in the way based on not morphology but semantics, dissimilar to traditional information retrieval theories. Fuzzy information retrieval logically consists of three sets : the set of documents, the set of terms and the set of queries. It maintains a fuzzy relational matrix which describes the relationship between documents and terms and creates a thesaurus with fuzzy relational product. It also provides the user with documents which are relevant to his query. However, there are some problems on building a thesaurus with fuzzy relational product such that it has big time complexity and it uses fuzzy values to be processed with flating-point. Actually, fuzzy values have to be expressed and processed with floating-point. However, floating-point operations have complex logics and make the system be slow. If it is possible to exchange fuzzy values with binary values, we could expect sp eding up building the thesaurus. In addition, binary value expressions require just a bit of memory space, but floating -point expression needs couple of bytes. In this study, we suggest a new method of building a thesaurus, which accelerates the operation of the system by pre-applying an ${\alpha}$-cut. The experiments show the improvement of performance and reliability of the system.

  • PDF

A study on Implementation of English Sentence Generator using Lexical Functions (언어함수를 이용한 영문 생성기의 구현에 관한 연구)

  • 정희연;김희연;이웅재
    • Journal of Internet Computing and Services
    • /
    • v.1 no.2
    • /
    • pp.49-59
    • /
    • 2000
  • The majority of work done to date on natural language processing has focused on analysis and understanding of language, thus natural language generation had been relatively less attention than understanding, And people even tends to regard natural language generation CIS a simple reverse process of language understanding, However, need for natural language generation is growing rapidly as application systems, especially multi-language machine translation systems on the web, natural language interface systems, natural language query systems need more complex messages to generate, In this paper, we propose an algorithm to generate more flexible and natural sentence using lexical functions of Igor Mel'uk (Mel'uk & Zholkovsky, 1988) and systemic grammar.

  • PDF

Implementation of Protein Motif Prediction System Using integrated Motif Resources (모티프 자원 통합을 이용한 단백질 모티프 예측 시스템 구현)

  • Lee, Bum-Ju;Choi, Eun-Sun;Ryu, Keun-Ho
    • The KIPS Transactions:PartD
    • /
    • v.10D no.4
    • /
    • pp.679-688
    • /
    • 2003
  • Motif databases are used in the function and structure prediction of proteins which appear on new and rapid release of raw data from genome sequencing projects. Recently, the frequency of use about these databases increases continuously. However, existing motif databases were developed and extended independently and were integrated mainly by using a web-based cross-reference, thus these databases have a heterogeneous search result problem, a complex query process problem and a duplicate database entry handling problem. Therefore, in this paper, we suppose physical motif resource integration and describe the integrated search method about a family-based protein prediction for solving above these problems. Finally, we estimate our implementation of the motif integration database and prediction system for predicting protein motifs.

Bagged Auto-Associative Kernel Regression-Based Fault Detection and Identification Approach for Steam Boilers in Thermal Power Plants

  • Yu, Jungwon;Jang, Jaeyel;Yoo, Jaeyeong;Park, June Ho;Kim, Sungshin
    • Journal of Electrical Engineering and Technology
    • /
    • v.12 no.4
    • /
    • pp.1406-1416
    • /
    • 2017
  • In complex and large-scale industries, properly designed fault detection and identification (FDI) systems considerably improve safety, reliability and availability of target processes. In thermal power plants (TPPs), generating units operate under very dangerous conditions; system failures can cause severe loss of life and property. In this paper, we propose a bagged auto-associative kernel regression (AAKR)-based FDI approach for steam boilers in TPPs. AAKR estimates new query vectors by online local modeling, and is suitable for TPPs operating under various load levels. By combining the bagging method, more stable and reliable estimations can be achieved, since the effects of random fluctuations decrease because of ensemble averaging. To validate performance, the proposed method and comparison methods (i.e., a clustering-based method and principal component analysis) are applied to failure data due to water wall tube leakage gathered from a 250 MW coal-fired TPP. Experimental results show that the proposed method fulfills reasonable false alarm rates and, at the same time, achieves better fault detection performance than the comparison methods. After performing fault detection, contribution analysis is carried out to identify fault variables; this helps operators to confirm the types of faults and efficiently take preventive actions.

Implementation of System Retrieving Multi-Object Image Using Property of Moments (모멘트 특성을 이용한 다중 객체 이미지 검색 시스템 구현)

  • 안광일;안재형
    • Journal of Korea Multimedia Society
    • /
    • v.3 no.5
    • /
    • pp.454-460
    • /
    • 2000
  • To retrieve complex data such as images, the content-based retrieval method rather than keyword based method is required. In this paper, we implemented a content-based image retrieval system which retrieves object of user query effectively using invariant moments which have invariant properties about linear transformation like position transition, rotation and scaling. To extract the shape feature of objects in an image, we propose a labeling algorithm that extracts objects from an image and apply invariant moments to each object. Hashing method is also applied to reduce a retrieval time and index images effectively. The experimental results demonstrate the high retrieval efficiency i.e precision 85%, recall 23%. Consequently, our retrieval system shows better performance than the conventional system that cannot express the shale of objects exactly.

  • PDF

A Compound Term Retrieval Model Using Statistical lnformation (통계적 정보를 이용한 복합명사 검색 모델)

  • 박영찬;최기선
    • Korean Journal of Cognitive Science
    • /
    • v.6 no.3
    • /
    • pp.65-81
    • /
    • 1995
  • Compound nouns as a composition of multiple nouns exhibit diverse occurence patterns in the texts and have varying degree of meaning coherence.The problem of compound nouns in information retrieval is to find a method to represent and identify the compositive patterns of each words.This paper explains how the cooccurrence patterns are related with the meaning of each compound noun and the information of such relations that can be mechanically acquired from texts is used in ranking the candidated documents for a given query.The main theme of the paper is that compound nouns can be categorized according to their occurrence patterns of simple nouns and these occurrence patterns can be formalized by statistical analysis without large dictionary or complex compositive rules.Our suggested model achieved about 7.75% improvement over the best precision of the other methods at each recall measurements on Korean test collection.

  • PDF

Design and Implementation of Video Documents Management System (비디오 문서 관리시스템의 설계 및 구현)

  • Kweon, Jae-Gil;Bae, Jong-Min
    • The Transactions of the Korea Information Processing Society
    • /
    • v.7 no.8
    • /
    • pp.2287-2297
    • /
    • 2000
  • Video documents which have audio-visual and other semantics information have complex relationship among media. While user requests for topic retrieval or specific region retrieval increase, it is difficult to meet these requests with the existing design methodology, In order to support the systematic management and the various retrieval capabilities of video document, we must formulate structural and systematic model on metadata using semantics and structural informations which are abstracted automaticallv or manuallv. This paper suggests generic metadata model with which we analyze the characteristics of video document, supports various query types and serves as a generic framework for video applications, we propose the generic integrated management model(GIMM)for generic metadata,, design video documents management system(VDMS) and implement it using GIMM.

  • PDF

A Combined Pharmacophore-Based Virtual Screening, Docking Study and Molecular Dynamics (MD) Simulation Approach to Identify Inhibitors with Novel Scaffolds for Myeloid cell leukemia (Mcl-1)

  • Bao, Guang-Kai;Zhou, Lu;Wang, Tai-Jin;He, Lu-Fen;Liu, Tao
    • Bulletin of the Korean Chemical Society
    • /
    • v.35 no.7
    • /
    • pp.2097-2108
    • /
    • 2014
  • Chemical feature based quantitative pharmacophore models were generated using the HypoGen module implemented in DS2.5. The best hypothesis, Hypo1, which was characterized by the highest correlation coefficient (0.96), the highest cost difference (61.60) and the lowest RMSD (0.74), consisted of one hydrogen bond acceptor, one hydrogen bond donor, one hydrophobic and one ring aromatic. The reliability of Hypo1 was validated on the basis of cost analysis, test set, Fischer's randomization method and GH test method. The validated Hypo1 was used as a 3D search query to identify novel inhibitors. The screened molecules were further refined by employing ADMET, docking studies and visual inspection. Three compounds with novel scaffolds were selected as the most promising candidates for the designing of Mcl-1 antagonists. Finally, a 10 ns molecular dynamics simulation was carried out on the complex of receptor and the retrieved ligand to demonstrate that the binding mode was stable during the MD simulation.

Locally-Weighted Polynomial Neural Network for Daily Short-Term Peak Load Forecasting

  • Yu, Jungwon;Kim, Sungshin
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.16 no.3
    • /
    • pp.163-172
    • /
    • 2016
  • Electric load forecasting is essential for effective power system planning and operation. Complex and nonlinear relationships exist between the electric loads and their exogenous factors. In addition, time-series load data has non-stationary characteristics, such as trend, seasonality and anomalous day effects, making it difficult to predict the future loads. This paper proposes a locally-weighted polynomial neural network (LWPNN), which is a combination of a polynomial neural network (PNN) and locally-weighted regression (LWR) for daily shortterm peak load forecasting. Model over-fitting problems can be prevented effectively because PNN has an automatic structure identification mechanism for nonlinear system modeling. LWR applied to optimize the regression coefficients of LWPNN only uses the locally-weighted learning data points located in the neighborhood of the current query point instead of using all data points. LWPNN is very effective and suitable for predicting an electric load series with nonlinear and non-stationary characteristics. To confirm the effectiveness, the proposed LWPNN, standard PNN, support vector regression and artificial neural network are applied to a real world daily peak load dataset in Korea. The proposed LWPNN shows significantly good prediction accuracy compared to the other methods.

A Data Structuring Technique for Performance Enhancement of Query Processing in the Data Warehouses (DW에서의 질의어처리 성능향상을 위한 데이터 구조화 방법)

  • Lee Deok Heun;Oh Mi Hwa;Cho Jae Hun;Choi In Soo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.10 no.1 s.33
    • /
    • pp.7-14
    • /
    • 2005
  • An OLAP(On-Line Analytical Processing) system is the decision support tool with which a user can analyze the information interactively in the various aspects. However, the traditional existing construction of an OLAP system has the inefficiency Problem of increasing the processing time and cost caused by the use of complex MDX(Multidimensional Expressions) queries. In an attempt to solve this problem, a new concept of data structuring technique, where a unit column whose elements are all 1 is added to the fact table, was suggested. With the data structuring technique, we can reduce the processing time and cost in OLAP systems.

  • PDF