• Title/Summary/Keyword: example retrieval

Search Result 108, Processing Time 0.027 seconds

Structure-based Clustering for XML Document Retrieval (XML 문서 검색을 위한 구조 기반 클러스터링)

  • Hwang Jeong Hee;Ryu Keun Ho
    • The KIPS Transactions:PartD
    • /
    • v.11D no.7 s.96
    • /
    • pp.1357-1366
    • /
    • 2004
  • As the importance or XML is increasing to manage information and exchange data efficiently in the web, there are on going works about structural integration and retrieval. The XML. document with the defined structure can retrieve the structure through the DTD or XML schema, but the existing method can't apply to XML. documents which haven't the structure information. Therefore. in this paper we propose a new clus-tering technique at a basic research which make it possible to retrieve structure fast about the XML documents that haven't the structure information. We first estract the feature of frequent structure from each XML document. And we cluster based on the similar structure by con-sidering the frequent structure as representative structure of the XML document, which makes it possible to retrieve the XML document raster than dealing with the whole documents that have different structure. And also we perform the structure retrieval about XML documents based on the clusters which is the group of similar structure. Moreover, we show efficiency of proposed method to describe how to apply the structure retrieval as well as to display the example of application result.

Design and Implementation of OCR Correction Model for Numeric Digits based on a Context Sensitive and Multiple Streams (제한적 문맥 인식과 다중 스트림을 기반으로 한 숫자 정정 OCR 모델의 설계 및 구현)

  • Shin, Hyun-Kyung
    • The KIPS Transactions:PartD
    • /
    • v.18D no.1
    • /
    • pp.67-80
    • /
    • 2011
  • On an automated business document processing system maintaining financial data, errors on query based retrieval of numbers are critical to overall performance and usability of the system. Automatic spelling correction methods have been emerged and have played important role in development of information retrieval system. However scope of the methods was limited to the symbols, for example alphabetic letter strings, which can be reserved in the form of trainable templates or custom dictionary. On the other hand, numbers, a sequence of digits, are not the objects that can be reserved into a dictionary but a pure markov sequence. In this paper we proposed a new OCR model for spelling correction for numbers using the multiple streams and the context based correction on top of probabilistic information retrieval framework. We implemented the proposed error correction model as a sub-module and integrated into an existing automated invoice document processing system. We also presented the comparative test results that indicated significant enhancement of overall precision of the system by our model.

Typology of Retrieval Systems based on the Degree of Connections between Systems and Information Resources: Specific Domain Focus Model (SDFM) for Information Retrieval Interaction (시스템-정보자료 군(群) 연계정도 기반 검색시스템 유형화 - 특정영역 초점 정보검색 상호작용 모형 -)

  • Kim, Yang-woo
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.30 no.2
    • /
    • pp.145-166
    • /
    • 2019
  • While a significant number of user-related models have been presented in Human Information Behavior (HIB) research community, the basic assumption of the present study is most of those models including information interaction models are multi-domain models associated with comprehensive research components. Based on such an assumption, this study discusses the shortcomings of multi-domain models and proposes the need to present a new type of model. Accordingly, the study elaborates four essential models of HIB reach community and presents a new type of model based on Specific Domain Focus Modeling (SDFM). As an example of such modeling, this study presents the present author's information retrieval interaction model based on the degree of connections between systems and information resources.

Intelligent Retrieval System for finding important travel information (중요 여행 정보를 찾기 위한 지능 검색 시스템)

  • Yun, Un-Il;Shin, Hyeon-Il;Ryu, Keun-Ho
    • Journal of the Korea Society of Computer and Information
    • /
    • v.14 no.11
    • /
    • pp.113-121
    • /
    • 2009
  • The increasing interest in leisure activities of a five-day work per week has been recently prevailed. Additionally, as internet and mobile infrastructures have been becoming widespread, the user can get specific information using a search engine. However, it is difficult for the user to get accurate information they really want as shared information has been rapidly increased and the information has been searched. For example, users can retrieve required travel information, but they also must see a huge number of travel advertisements. In this paper, we design and implement a retrieval system using travel information collecting agent. The information gathering agent regularly visits travel-related category pages of the portal sites and major media travel-article pages to collect information related to travel, and the agent stores the gathered information to a database. Then, users can search the travel information conveniently without the need to view advertisements.

A Study On the Optimization Model for the Design of Automated Warehouses (자동창고 설계를 위한 최적화 모형에 관한 연구)

  • 김성태;김재연
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.16 no.27
    • /
    • pp.73-82
    • /
    • 1993
  • In this paper, We determine the expected travel time for several forks Storage/Retrieval machine which is allowed multiple stops in aisle. When throughput is increased, We propose adding to fork number of each S/R machine rather than adding to number of S/R machine, We also describe such a model which determines the optimal number of each several forks S/R machine subject to constraints on the hourly throughput and warehouse dimensions. Numerical example is presented to compare warehouse shapes against each single fork, twin forks, triple forks S/R machine for various throughput values.

  • PDF

Application of UNIX System to Automatic Control (UNIX SYSTEM을 이용한 종합제어 장치)

  • Park, Sung-Sik;Shin, Young-Kil;Rho, Tae-Suk;Cho, Kyu-Bok
    • Proceedings of the KIEE Conference
    • /
    • 1991.07a
    • /
    • pp.874-876
    • /
    • 1991
  • This paper deals with the topics when Unix based computer is applied to automatic manufacturing system. The total control of material flow in the automatic storage and retrieval system is taken as an example. And some technical issues are proposed for the wider application of UNIX to automatic control.

  • PDF

An Exploratory Study of Image Retrieval Using Aesthetic Impressions (심미적 인상을 이용한 이미지 검색에 관한 실험적 연구)

  • Yu, So-Young;Moon, Sung-Been
    • Journal of the Korean Society for information Management
    • /
    • v.21 no.4 s.54
    • /
    • pp.187-208
    • /
    • 2004
  • In this study, aesthetic impressions were used for a high-level feature of image retrieval. The term, 'aesthetic' has been studied in psychology, art, and literature. It means unconscious, instantaneous parts of visual perception and emotion. The literatures related to aesthetic impressions were reviewed and four kinds of aesthetic impressions were defined operationally : strong impression, soft impression, courteous impression, and refined impression. 66 image files of paintings were sampled randomly from 1100 paintings and low-level color features were extracted from them by a using perceptual color model(Lai, & Tait, 1998). The high-level features of an image, that is, four kinds of aesthetic impressions of each painting were measured by 4 subjects and averaged. In CBIR, 2 subjects performed image retrievals using example queries. They were asked to retrieve images by using the aesthetic impressions or the keywords. In evaluations, subjects showed that they were satisfied with the aesthetic impression-based image retrieval system on the average. And R-precision of the image retrieval with both color features and aesthetic impressions was higher than that of the image retrieval with color features only. But further studies with larger test collections and query sets should be followed for generalization of the result of this study.

Some Legal Arguments on the Portal Service Providers' Information Retrieval (포털사업자의 검색서비스에 관한 법률문제)

  • Kim, Yun-Myung
    • Journal of Information Management
    • /
    • v.38 no.3
    • /
    • pp.183-209
    • /
    • 2007
  • The representative example of the business model on internet environment, the business of the Naver, Empas and Google which provides information retrieval service is the internet portal. The portal sites provide information retrieval service which provides users information what they want to find, that is a huge social contribution. The portal site which provides a search service leads much problems. Consequently, the regulation against information retrieval is asserted powerfully in spite of the public interest. Namely, the regulation regarding the search business owner is tried. Finally, portal business owner puts the social responsibility as OSP. But, there is a doubt that portal business owner who has much problem which occurred on the portal site indirectly has responsibility directly. That is duty on portal site owner the censorship on the contents transferred. So, this thesis researches on the social critical opinion relating with a information retrieval from the legal side against the problem of the Internet.

A Study on a Multilingual name Retrieval (다중 언어 인명 검색에 관한 연구)

  • Cho, Young-Hwa;Song, Jae-Yong;Ryu, Keun-Ho
    • The Transactions of the Korea Information Processing Society
    • /
    • v.5 no.9
    • /
    • pp.2271-2280
    • /
    • 1998
  • In this paper, we propose a method to retneve english written korcan names efficientl, and design a multilingual name retrieval system, It is very difficult to retrieve english-written korean names in typical IR sytems. For example, "홍길동" is written in english as vanous forms such like "Hong, gildong", "Gildong Hong", "Hong kil dong", "Hong kil dong" and so on, We not only propose a rule-based querv expansion method to retrieve english-written korean names efficiently but also design a multiligual name retneval system which is consisted of query classifier, exception handler, query expander, query executor, exception list and rulebase, Finally we will try to show that english-written korean names could be efficiently retrieved with rule based name generator.

  • PDF

MPEG-7 Homogeneous Texture Descriptor

  • Ro, Yong-Man;Kim, Mun-Churl;Kang, Ho-Kyung;Manjunath, B.S.;Kim, Jin-Woong
    • ETRI Journal
    • /
    • v.23 no.2
    • /
    • pp.41-51
    • /
    • 2001
  • MPEG-7 standardization work has started with the aims of providing fundamental tools for describing multimedia contents. MPEG-7 defines the syntax and semantics of descriptors and description schemes so that they may be used as fundamental tools for multimedia content description. In this paper, we introduce a texture based image description and retrieval method, which is adopted as the homogeneous texture descriptor in the visual part of the MPEG-7 final committee draft. The current MPEG-7 homogeneous texture descriptor consists of the mean, the standard deviation value of an image, energy, and energy deviation values of Fourier transform of the image. These are extracted from partitioned frequency channels based on the human visual system (HVS). For reliable extraction of the texture descriptor, Radon transformation is employed. This is suitable for HVS behavior. We also introduce various matching methods; for example, intensity-invariant, rotation-invariant and/or scale-invariant matching. This technique retrieves relevant texture images when the user gives a querying texture image. In order to show the promising performance of the texture descriptor, we take the experimental results with the MPEG-7 test sets. Experimental results show that the MPEG-7 texture descriptor gives an efficient and effective retrieval rate. Furthermore, it gives fast feature extraction time for constructing the texture descriptor.

  • PDF