• Title/Summary/Keyword: Document searching

Search Result 170, Processing Time 0.028 seconds

A Study on Implementation for in based Electronic Catalog Management System (XML기반 전자카탈로그 관리시스템의 구현에 관한 연구)

  • 김진영;김연수
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.25 no.1
    • /
    • pp.35-41
    • /
    • 2002
  • XML(eXtensible Markup Language) based electronic catalog is very useful for searching target information because of its structural and contents based searching support capability. And XML document editing is easier than HTML because of XML document is divided by structure, contents and presentation. This paper is to present a prototype of XML based Electronic Catalog Management System(ECMS) whose system consists of data input, output and manipulation system for inserting, updating, editing and deletion. A proposed system could resolved the problems at virtual intermediary shopping mall invloved in the difficulty of interoperability when customer try to compare similar products at mixed shopping mall and reduced web service costs at independent shopping mall by using XML format. The proposed ECMS offers rapid response capability for product data change of electronic catalog and easy and friendly interoperability among similar products.

Ranking Decision Method of Retrieved Documents Using User Profile from Searching Engine (검색 엔진에서 사용자 프로파일을 이용한 문서 순위결정 방법)

  • Kim Yong-Ho;Kim Hyeong-Gyun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.10 no.9
    • /
    • pp.1590-1595
    • /
    • 2006
  • This paper proposes a technique of user oriented document ranking using user refile to provide more satisfied results which reflect preference of specific users. User profile is constructed to represent his or her preference. User pfofile consists of 'term array' and 'preference vector' according to the interest field of one. And the User profile for a particular person is updated by 'user access', 'latent relaeon', 'User Profile' proposed in this paper. The latent structures of documents in same domain are analysed by singular value decomposition(SVD). Then, the rank of documents is determined by comparison of user profile with analyzed document on the basis of relevance.

Object Modeling for Mapping from XML Document and Query to UML Class Diagram based on XML-GDM (XML-GDM을 기반으로 한 UML 클래스 다이어그램으로 사상을 위한 XML문서와 질의의 객체 모델링)

  • Park, Dae-Hyun;Kim, Yong-Sung
    • The KIPS Transactions:PartD
    • /
    • v.17D no.2
    • /
    • pp.129-146
    • /
    • 2010
  • Nowadays, XML has been favored by many companies internally and externally as a means of sharing and distributing data. there are many researches and systems for modeling and storing XML documents by an object-oriented method as for the method of saving and managing web-based multimedia document more easily. The representative tool for the object-oriented modeling of XML documents is UML (Unified Modeling Language). UML at the beginning was used as the integrated methodology for software development, but now it is used more frequently as the modeling language of various objects. Currently, UML supports various diagrams for object-oriented analysis and design like class diagram and is widely used as a tool of creating various database schema and object-oriented codes from them. This paper proposes an Efficinet Query Modelling of XML-GL using the UML class diagram and OCL for searching XML document which its application scope is widely extended due to the increased use of WWW and its flexible and open nature. In order to accomplish this, we propose the modeling rules and algorithm that map XML-GL. which has the modeling function for XML document and DTD and the graphical query function about that. In order to describe precisely about the constraint of model component, it is defined by OCL (Object Constraint Language). By using proposed technique creates a query for the XML document of holding various properties of object-oriented model by modeling the XML-GL query from XML document, XML DTD, and XML query while using the class diagram of UML. By converting, saving and managing XML document visually into the object-oriented graphic data model, user can prepare the base that can express the search and query on XML document intuitively and visually. As compared to existing XML-based query languages, it has various object-oriented characteristics and uses the UML notation that is widely used as object modeling tool. Hence, user can construct graphical and intuitive queries on XML-based web document without learning a new query language. By using the same modeling tool, UML class diagram on XML document content, query syntax and semantics, it allows consistently performing all the processes such as searching and saving XML document from/to object-oriented database.

A Study on the online of PDF Electronic Documents System (인터넷 원거리출판의 응용과 PDF의 인쇄활용에 관한 연구)

  • 유영수;강영립;김병현;이광수
    • Proceedings of the Korean Printing Society Conference
    • /
    • 2001.06a
    • /
    • pp.63-77
    • /
    • 2001
  • PDF(Portable Document Format) is a file format that Adobe advances postscritp technique and use in managing document information or electric publishing(internet, CD-ROM, DVD). PDF is a devised document type for being able to read and print anywhere, independent of OS, printer type, resolution, and the kind of computer etc. Because this includes a compressing function, it transfers document through a small size of file in internet or intranet. In addition, that is a file format has various advantages-sharing of information and transfering documents in on line or off line environment. In this paper, we developed electronic document system using PDF format. Electronic document system consists of filter, automatic indexing, special searching system and web server. The information used in this paper is database made using Zwon\`s DocuCom. The filter recognizes various kinds of document structure. And according to property of document, it produces ASCII output. In addition to processing various formats of document, the filter can extract keywords in documents of MS WORD, Excel, Powerpoint, PDF, CAD etc. This filter uses the structure of window printer drive and can extract the information for text, page, font type and size from relevant document. The automatic indexing recognizes the formatted tag of document form ASCII text produced by filter and extracts adequate keyword to structure and property of document. PDF electronic document systems proposed in this paper can be used in Internet, PC communication. Users can choose and read electronic documents by two ways. First, users can choose and read relevant books using PDF electronic document homepage. Second, users can use PDF integrated-search system. User can search after inputing keyword and choose reference field and type of data. But, now, PDF products of Adobe can\`t support the Korean character. If this problem is resolved, we thick that PDF applications system looks active. Although there is limited function in case of using Zwon DocuCom used in this study, we think that there isn\`t a great deal of difficulty in electronic document and building digital database.

  • PDF

Design and Implementation of Document Management System using Near Field Communication (비접촉 근거리 무선통신을 이용한 문서관리 시스템의 설계 및 구현)

  • Kim, Cheolho;Lee, Wooyong;Hwang, Mintae
    • Journal of Korea Multimedia Society
    • /
    • v.17 no.5
    • /
    • pp.613-622
    • /
    • 2014
  • In spite of the convenience and cost-effectiveness of an electronic document management system the paper-type documents still should be stored for a comparison against the original documents. In this paper, for the efficient management of paper documents, we designed and implemented a document management system using smart devices equipped with NFC(Near Field Communication) technology. To implement the proposed system we designed a database for document management and developed an Android application for smart device using Eclipse 3.0 and Java programming. Whenever we touch the smart phone on the NFC tags which are attached to the paper-type documents and document boxes, it is possible to registering, searching and carrying in and out services for documents and boxes. This study provides smart phone users with systematic, economical and convenient paper-type documents management functions, and thus enhances the business efficiency.

A Keyword Matching for the Retrieval of Low-Quality Hangul Document Images

  • Na, In-Seop;Park, Sang-Cheol;Kim, Soo-Hyung
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.47 no.1
    • /
    • pp.39-55
    • /
    • 2013
  • It is a difficult problem to use keyword retrieval for low-quality Korean document images because these include adjacent characters that are connected. In addition, images that are created from various fonts are likely to be distorted during acquisition. In this paper, we propose and test a keyword retrieval system, using a support vector machine (SVM) for the retrieval of low-quality Korean document images. We propose a keyword retrieval method using an SVM to discriminate the similarity between two word images. We demonstrated that the proposed keyword retrieval method is more effective than the accumulated Optical Character Recognition (OCR)-based searching method. Moreover, using the SVM is better than Bayesian decision or artificial neural network for determining the similarity of two images.

대학도서관 문헌제공봉사의 현황분석과 강화방안

  • 윤희윤
    • Journal of Korean Library and Information Science Society
    • /
    • v.29
    • /
    • pp.27-63
    • /
    • 1998
  • The purpose of this study is to analyze the document delivery service(DDS) of the academic libraries and suggest its improvement model in Korea. DDS means providing copies of information requests in any format and from any source. And DDS is gaining in importance as libraries turn to 'just-in-time' access rather than 'just-in-case' collection to meet user information needs. By good fortune, rising journal subscription prices, declining financial resources, canceling some of journal subscriptions, electronic transmission technologies, and the rise of commercial document delivery services have allowed libraries to begin to deliver articles to users in a much more rapid and acceptable time frame. Therefore, the library paradigm for the 2000s must be the creation of new document delivery structures which capitalize on the access tolls and structures created by librarians during the past generations. First of all, library-based document service requires a close review of existing library-to-library delivery mechanisms, application of technology to transfer of facsimiles of materials and facilitated use of existing fee-based document sources. The ideal document delivery system would feature a transparent, seamless electronic service incorporating searching and browsing identification and marking of desired items, and transmission and fulfillment of requests. And requested items would be supplied from library collection, commercial suppliers, or other sources. But the future of DDS will succeed when physical resources, policies, personnel, and practices are organized to provide timely information delivery to users.

  • PDF

Keyword Spotting on Hangul Document Images Using Image-to-Image Matching (영상 대 영상 매칭을 이용한 한글 문서 영상에서의 단어 검색)

  • Park Sang Cheol;Son Hwa Jeong;Kim Soo Hyung
    • The KIPS Transactions:PartB
    • /
    • v.12B no.3 s.99
    • /
    • pp.357-364
    • /
    • 2005
  • In this paper, we propose an accurate and fast keyword spotting system for searching user-specified keyword in Hangul document images by using two-level image-to-image matching. The system is composed of character segmentation, creating a query image, feature extraction, and matching procedure. Two different feature vectors are used in the matching procedure. An experiment using 1600 Hangul word images from 8 document images, downloaded from the website of Korea Information Science Society, demonstrates that the proposed system is superior to conventional image-based document retrieval systems.

Document Image Segmentation and Classification using Texture Features and Structural Information (텍스쳐 특징과 구조적인 정보를 이용한 문서 영상의 분할 및 분류)

  • Park, Kun-Hye;Kim, Bo-Ram;Kim, Wook-Hyun
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.11 no.3
    • /
    • pp.215-220
    • /
    • 2010
  • In this paper, we propose a new texture-based page segmentation and classification method in which table region, background region, image region and text region in a given document image are automatically identified. The proposed method for document images consists of two stages, document segmentation and contents classification. In the first stage, we segment the document image, and then, we classify contents of document in the second stage. The proposed classification method is based on a texture analysis. Each contents in the document are considered as regions with different textures. Thus the problem of classification contents of document can be posed as a texture segmentation and analysis problem. Two-dimensional Gabor filters are used to extract texture features for each of these regions. Our method does not assume any a priori knowledge about content or language of the document. As we can see experiment results, our method gives good performance in document segmentation and contents classification. The proposed system is expected to apply such as multimedia data searching, real-time image processing.

A Study on Development of Integrated OPAC Based on Hypermedia Techniques (하이퍼미디어 기술에 기반한 통합 OPAC구현에 관한 연구)

  • Ahn, Tae Kyoung;Kim, Hyun Hee
    • Journal of Information Management
    • /
    • v.27 no.1
    • /
    • pp.1-39
    • /
    • 1996
  • The purpose of this paper is to design a model of integrated OPAC called as EconRef. This model not only provides users of libraries with systematic, rapid information service, but also supports librarians to do their tasks effectively. The designed model is constructed based on two operating systems such as REGIS system and The Book House and is developed by using KPWin++ is an expert system shell which combines hypertext and expert system functions. The designed system consists of six modules ; three reference expert systems for document sources, experts and statistical sources; OPAC ; external database ; user's guide. For the evaluation of the designed system, performance of EconRef system is compared with that of the naive and expert reference librarians. And also the features of the system are compared with those of REGIS systems. The tests comparing BconRef system searching with librarians searching have shown that EconRef system is at least as good as searching with expert librarians and much superior to searching with naive librarians.

  • PDF