• Title/Summary/Keyword: Structure-based Retrieval

Search Result 265, Processing Time 0.022 seconds

Performance Evaluation on Structure-based Retrievals of XML Documents (XML 문서의 구조기반 검색성능 평가)

  • Kim, Su-Hee
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.10 no.2
    • /
    • pp.396-406
    • /
    • 2009
  • In extension to our previous study, we develop metadata that specify elements' structural orders, to increase the efficiency level of XML document's retrieval process. Then, we proposed a structure-based indexing model. We expect the model to generate a more efficient retrieval process of horizontally and vertically related elements. To evaluate the model's performance level, we developed an experimental prototype and conducted an experiment on an XML corpus. On average, descendant, ancestor and sibling retrievals were approximately twelve percent faster than the ETID model. And retrievals specifying structural orders of particular element types were approximately twenty-five percent faster than the ETID model. In conclusion, metadata, such as Etype, Asso and Lsso, may make a meaningful contribution to retrieval processes that specify elements' order.

Semantic-based Query Generation For Information Retrieval

  • Shin Seung-Eun;Seo Young-Hoon
    • International Journal of Contents
    • /
    • v.1 no.2
    • /
    • pp.39-43
    • /
    • 2005
  • In this paper, we describe a generation mechanism of semantic-based queries for high accuracy information retrieval and question answering. It is difficult to offer the correct retrieval result because general information retrieval systems do not analyze the semantic of user's natural language question. We analyze user's question semantically and extract semantic features, and we .generate semantic-based queries using them. These queries are generated using the se-mantic-based question analysis grammar and the query generation rule. They are represented as semantic features and grammatical morphemes that consider semantic and syntactic structure of user's questions. We evaluated our mechanism using 100 questions whose answer type is a person in the TREC-9 corpus and Web. There was a 0.28 improvement in the precision at 10 documents when semantic-based queries were used for information retrieval.

  • PDF

A New process Structure for Constructing Efficient Information Retrieval Systems (효율적인 정보 검색 시스템 구축을 위한 새로운 프로세스 구조)

  • Go, Hyeong-Dae;Yu, Jae-Su;Kim, Byeong-Gi
    • The Transactions of the Korea Information Processing Society
    • /
    • v.4 no.1
    • /
    • pp.76-86
    • /
    • 1997
  • Many information retrieval systems have a simple process structure that a client process for a user is mapped to a server process for information retrieval. That is, when using information retrieval systems, each user is allocated a big process that consists of user interfaces, retrieval automatic indexing systems and storage systems. Therefor when many users use the information retrieval systems, it might be difficult to use the information retrieval systems. This is because the system overhead is increased as enormously much as users cannot use them. In this paper, we propose a new process structure for constructing efficient information retrieval systems that solves the problem resulting from he process structure. The proposed process structure contributes to the whole operational performance improvement of information retrieval systems and the efficientnt use of computer system resources. It is constructed based on a multi-threading scheme and a transaction processing monitor.

  • PDF

A new approach for content-based video retrieval

  • Kim, Nac-Woo;Lee, Byung-Tak;Koh, Jai-Sang;Song, Ho-Young
    • International Journal of Contents
    • /
    • v.4 no.2
    • /
    • pp.24-28
    • /
    • 2008
  • In this paper, we propose a new approach for content-based video retrieval using non-parametric based motion classification in the shot-based video indexing structure. Our system proposed in this paper has supported the real-time video retrieval using spatio-temporal feature comparison by measuring the similarity between visual features and between motion features, respectively, after extracting representative frame and non-parametric motion information from shot-based video clips segmented by scene change detection method. The extraction of non-parametric based motion features, after the normalized motion vectors are created from an MPEG-compressed stream, is effectively fulfilled by discretizing each normalized motion vector into various angle bins, and by considering the mean, variance, and direction of motion vectors in these bins. To obtain visual feature in representative frame, we use the edge-based spatial descriptor. Experimental results show that our approach is superior to conventional methods with regard to the performance for video indexing and retrieval.

Retrieval of Non-rigid 3D Models Based on Approximated Topological Structure and Local Volume

  • Hong, Yiyu;Kim, Jongweon
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.11 no.8
    • /
    • pp.3950-3964
    • /
    • 2017
  • With the increasing popularity of 3D technology such as 3D printing, 3D modeling, etc., there is a growing need to search for similar models on the internet. Matching non-rigid shapes has become an active research field in computer graphics. In this paper, we present an efficient and effective non-rigid model retrieval method based on topological structure and local volume. The integral geodesic distances are first calculated for each vertex on a mesh to construct the topological structure. Next, each node on the topological structure is assigned a local volume that is calculated using the shape diameter function (SDF). Finally, we utilize the Hungarian algorithm to measure similarity between two non-rigid models. Experimental results on the latest benchmark (SHREC' 15 Non-rigid 3D Shape Retrieval) demonstrate that our method works well compared to the state-of-the-art.

A Study on Effective Internet Data Extraction through Layout Detection

  • Sun Bok-Keun;Han Kwang-Rok
    • International Journal of Contents
    • /
    • v.1 no.2
    • /
    • pp.5-9
    • /
    • 2005
  • Currently most Internet documents including data are made based on predefined templates, but templates are usually formed only for main data and are not helpful for information retrieval against indexes, advertisements, header data etc. Templates in such forms are not appropriate when Internet documents are used as data for information retrieval. In order to process Internet documents in various areas of information retrieval, it is necessary to detect additional information such as advertisements and page indexes. Thus this study proposes a method of detecting the layout of Web pages by identifying the characteristics and structure of block tags that affect the layout of Web pages and calculating distances between Web pages. This method is purposed to reduce the cost of Web document automatic processing and improve processing efficiency by providing information about the structure of Web pages using templates through applying the method to information retrieval such as data extraction.

  • PDF

Query Processing Model Using Two-level Fuzzy Knowledge Base (2단계 퍼지 지식베이스를 이용한 질의 처리 모델)

  • Lee, Ki-Young;Kim, Young-Un
    • Journal of the Korea Society of Computer and Information
    • /
    • v.10 no.4 s.36
    • /
    • pp.1-16
    • /
    • 2005
  • When Web-based special retrieval systems for scientific field extremely restrict the expression of user's information request, the process of the information content analysis and that of the information acquisition become inconsistent. Accordingly, this study suggests the re-ranking retrieval model which reflects the content based similarity between user's inquiry terms and index words by grasping the document knowledge structure. In order to accomplish this, the former constructs a thesaurus and similarity relation matrix to provide the subject analysis mechanism and the latter propose the algorithm which establishes a search model such as query expansion in order to analyze the user's demands. Therefore, the algorithm that this study suggests as retrieval utilizing the information structure of a retrieval system can be content-based retrieval mechanism to establish a 2-step search model for the preservation of recall and improvement of accuracy which was a weak point of the previous fuzzy retrieval model.

  • PDF

An Experimental Study on the Performance of Element-based XML Document Retrieval (엘리먼트 기반 XML 문서검색의 성능에 관한 실험적 연구)

  • Yoon, So-Young;Moon, Sung-Been
    • Journal of the Korean Society for information Management
    • /
    • v.23 no.1 s.59
    • /
    • pp.201-219
    • /
    • 2006
  • This experimental study suggests an element-based XML document retrieval method that reveals highly relevant elements. The models investigated here for comparison are divergence and smoothing method, and hierarchical language model. In conclusion, the hierarchical language model proved to be most effective in element-based XML document retrieval with regard to the improved exhaustivity and harmed specificity.

An Implementation of XML Database System for Semantic-Based E-Catalog Image Retrieval (의미기반 전자 카탈로그 이미지 검색을 위한 XML 데이타베이스 시스템 구현)

  • Hong Sungyong;Nah Yunmook
    • Journal of Korea Multimedia Society
    • /
    • v.7 no.9
    • /
    • pp.1219-1232
    • /
    • 2004
  • Recently, the web sites, such as e-business sites and shopping mall sites, deal with lots of catalog image information and contents. As a result, it is required to support semantic-based image retrieval efficiently on such image data. This paper presents a semantic-based image retrieval system, which adopts XML and Fuzzy technology. To support semantic-based retrieval on product catalog images containing multiple objects, we use a multi-level metadata structure which represents the product information and semantics of image data. To enable semantic-based retrieval on such image data, we design a XML database for storing the proposed metadata and study how to apply fuzzy data. This paper proposes a system, generate the fuzzy data automatically to use the image metadata, that can support semantic-based image retrieval by utilizing the generating fuzzy data. Therefore, it will contribute in improving the retrieval correctness and the user's satisfaction on semantic-based e-catalog image retrieval.

  • PDF

Consideration of a Robust Search Methodology that could be used in Full-Text Information Retrieval Systems (퍼지 논리를 이용한 사용자 중심적인 Full-Text 검색방법에 관한 연구)

  • Lee, Won-Bu
    • Asia pacific journal of information systems
    • /
    • v.1 no.1
    • /
    • pp.87-101
    • /
    • 1991
  • The primary purpose of this study was to investigate a robust search methodology that could be used in full-text information retrieval systems. A robust search methodology is one that can be easily used by a variety of users (particularly naive users) and it will give them comparable search performance regardless of their different expertise or interests In order to develop a possibly robust search methodology, a fully functional prototype of a fuzzy knowledge based information retrieval system was developed. Also, an experiment that used this prototype information retreival system was designed to investigate the performance of that search methodology over a small exploratory sample of user queries To probe the relatonships between the possibly robust search performance and the query organization using fuzzy inference logic, the search performance of a shallow query structure was analyzes. Consequently the following several noteworthy findings were obtained: 1) the hierachical(tree type) query structure might be a better query organization than the linear type query structure 2) comparing with the complex tree query structure, the simple tree query structure that has at most three levels of query might provide better search performance 3) the fuzzy search methodology that employs a proper levels of cut-off value might provide more efficient search performance than the boolean search methodology. Even though findings could not be statistically verified because the experiments were done using a single replication, it is worth noting however, that the research findings provided valuable information for developing a possibly robust search methodology in full-text information retrieval.

  • PDF