• Title/Summary/Keyword: Concept-based Information Retrieval

Semantic Clustering Model for Analytical Classification of Documents in Cloud Environment (클라우드 환경에서 문서의 유형 분류를 위한 시맨틱 클러스터링 모델)

  • Kim, Young Soo;Lee, Byoung Yup
    • The Journal of the Korea Contents Association / v.17 no.11 / pp.389-397 / 2017
  • Recently, semantic web documents are increasingly produced and added to repositories in cloud computing environments, which calls for an intelligent semantic agent for analytical classification of documents and information retrieval. Traditional information retrieval methods use keywords for the query and deliver a list of documents returned by the search. Users carry a heavy workload in examining the contents because these methods do not provide much semantic similarity information. To solve these problems, we suggest a semantic clustering model based on keyword frequency and concept matching, using Hadoop and NoSQL, to improve the accuracy of similarity-based classification. Implementing the suggested technique in a cloud computing environment offers the ability to classify and discover similar documents with improved classification accuracy. The suggested model is expected to be used in the construction of semantic web retrieval systems, making the retrieval of proper documents more flexible.
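
The combination of keyword frequency and concept matching described in this abstract can be illustrated with a small sketch. It is not the authors' Hadoop/NoSQL implementation: the `CONCEPTS` map, the sample documents, and the use of cosine similarity are all assumptions chosen for illustration.

```python
import math
from collections import Counter

# Hypothetical keyword-to-concept map; a real system would draw this from an ontology.
CONCEPTS = {"car": "vehicle", "truck": "vehicle", "apple": "fruit", "pear": "fruit"}

def concept_vector(text):
    """Count term frequencies after mapping each keyword to its concept, if it has one."""
    tokens = text.lower().split()
    return Counter(CONCEPTS.get(t, t) for t in tokens)

def cosine(a, b):
    """Cosine similarity between two sparse frequency vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

# Made-up documents for illustration only.
docs = {
    "d1": "car truck car",
    "d2": "truck truck",
    "d3": "apple pear",
}
vectors = {name: concept_vector(text) for name, text in docs.items()}
sim_12 = cosine(vectors["d1"], vectors["d2"])  # both documents map to the "vehicle" concept
sim_13 = cosine(vectors["d1"], vectors["d3"])  # no shared keyword or concept
```

Here `d1` and `d2` come out as highly similar although their keyword counts differ, because both reduce to the same concept, while `d1` and `d3` share nothing. A clustering step could then group documents by such similarity scores.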

Personal Information Management Based on the Concept Lattice of Formal Concept Analysis (FCA 개념 망 기반 개인정보관리)

  • Kim, Mi-Hye
    • Journal of Internet Computing and Services / v.6 no.6 / pp.163-178 / 2005
  • The ultimate objective of Personal Information Management (PIM) is to collect, handle, and manage wanted information in a systematic way that enables individuals to search it more easily and effectively. However, existing personal information management systems are usually based on a traditional hierarchical directory model for storing information, which limits effective organization and retrieval of information and provides little support for search by the associative interrelationships between objects (documents) and their attributes. To address these problems, in this paper we propose a personal information management model based on the concept lattice of Formal Concept Analysis (FCA) to easily build and maintain individuals' own information on the Web. The proposed system can overcome the limitations of the traditional hierarchy approach and, beyond a narrow search, supports the search of other useful information through the interrelationships between objects and their attributes in the concept lattice of FCA.
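
To make the idea of a concept lattice concrete, the following minimal sketch enumerates the formal concepts (extent, intent pairs) of a tiny context of documents and attributes. The `CONTEXT` data and the brute-force enumeration over attribute subsets are assumptions for illustration; they are not taken from the paper and would not scale to a real document collection.

```python
from itertools import combinations

# Hypothetical formal context: documents (objects) and their attributes.
CONTEXT = {
    "doc1": {"paper", "fca"},
    "doc2": {"paper", "ir"},
    "doc3": {"paper", "fca", "ir"},
}

def extent(attrs):
    """Objects that have every attribute in attrs."""
    return frozenset(o for o, a in CONTEXT.items() if attrs <= a)

def intent(objs):
    """Attributes shared by every object in objs."""
    all_attrs = set().union(*CONTEXT.values())
    return frozenset(a for a in all_attrs if all(a in CONTEXT[o] for o in objs))

def formal_concepts():
    """Enumerate (extent, intent) pairs by closing every attribute subset (tiny contexts only)."""
    attrs = sorted(set().union(*CONTEXT.values()))
    concepts = set()
    for r in range(len(attrs) + 1):
        for subset in combinations(attrs, r):
            ext = extent(set(subset))
            concepts.add((ext, intent(ext)))
    return concepts

concepts = formal_concepts()
```

Each concept pairs a set of documents with exactly the attributes they share, so a query for `{"fca"}` lands on the concept containing `doc1` and `doc3`, and its neighbors in the lattice suggest related documents that a strict directory hierarchy would not expose.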

A Mobile P2P Semantic Information Retrieval System with Effective Updates

  • Liu, Chuan-Ming;Chen, Cheng-Hsien;Chen, Yen-Lin;Wang, Jeng-Haur
    • KSII Transactions on Internet and Information Systems (TIIS) / v.9 no.5 / pp.1807-1824 / 2015
  • As technologies advance, mobile peer-to-peer (MP2P) networks and systems have become one of the major ways to share resources and information. On such systems, information retrieval (IR), including the development of scalable infrastructures for indexing, becomes more complicated because of the huge increase in the amount of information and the rapid rate at which it changes. To keep systems on MP2P networks reliable and consistent, the index structures need to be updated frequently. For a semantic IR system, the index structure is even more complicated than in a classic IR system and generally has a higher update cost. The best-known indexing technique used in semantic IR systems is Latent Semantic Indexing (LSI), whose index structure is generated by singular value decomposition (SVD). Although LSI performs well, updating the index structure is not easy and is time-consuming. In an MP2P environment, which is fully distributed and dynamic, the update becomes even more challenging. In this work, we consider how to update the semantic index generated by LSI and keep the index consistent across the whole MP2P network. The proposed Concept Space Update (CSU) protocol, based on a distributed two-phase locking strategy, can effectively achieve these objectives in terms of two measurements: coverage speed and update cost. By using the proposed synchronization mechanism together with efficient updates on the SVD, re-computing the whole index on the P2P overlay can be avoided and consistency can be achieved. Simulated experiments are also performed to validate our analysis of the proposed CSU protocol. The experimental results indicate that CSU is effective in updating the concept space with an LSI/SVD index structure in MP2P semantic IR systems.

Software Component Retrieval System for Version Control (버전제어를 위한 소프트웨어 구성요소의 검색 시스템)

  • O, Sang-Yeop;Kim, Heung-Jin;Jang, Deok-Cheol
    • The Transactions of the Korea Information Processing Society / v.3 no.5 / pp.1093-1102 / 1996
  • For the reuse, configuration management, and version control of software, the composition of retrieval systems and libraries, which make it possible to retrieve the software components concerned, is of the greatest importance. A retrieval system, which must be able to store many components, must make it possible to retrieve the components concerned with deadwoods in the fastest way. Current retrieval systems are usually based either on keywords or on the concept of an inverted file over part of the content. In this paper, however, new retrieval systems are suggested that use the Set and Bag classes of Smalltalk, an object-oriented programming language, and find the components concerned based either on keywords or on part of the content. This method improves the user interface and its management. A library is also suggested along with the new retrieval systems, and a user interface is designed and implemented for its management and control. The new retrieval systems of this paper can be employed through an interface in another language, and the system provides the user with appropriate retrieval systems and a library for version control.

Relevance Feedback using Region-of-interest in Retrieval of Satellite Images (위성영상 검색에서 사용자 관심영역을 이용한 적합성 피드백)

  • Kim, Sung-Jin;Chung, Chin-Wan;Lee, Seok-Lyong;Kim, Deok-Hwan
    • Journal of KIISE:Databases / v.36 no.6 / pp.434-445 / 2009
  • Content-based image retrieval (CBIR) is a retrieval technique that uses the contents of images. However, in contrast to text data, multimedia data are ambiguous, and there is a large difference between a system's low-level representation and a human's high-level concepts. As a result, points that are near each other in the vector space are not always similar from the user's point of view. We call this the semantic gap problem, and it degrades image retrieval performance. To solve it, relevance feedback (RF), which uses the user's feedback information, is applied. However, existing RF does not consider the user's region of interest (ROI), and therefore irrelevant regions are used in computing new query points. Because the system does not know the user's ROI, RF proceeds at the image level. In this paper, we propose a new ROI RF method that guides the user to select ROIs from relevant images for the retrieval of complex satellite images, which improves retrieval accuracy by computing more accurate query points. We also propose a pruning technique that improves retrieval accuracy by using the images not selected by the user. Experiments show the efficiency of the proposed ROI RF and the pruning technique.
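
The difference between image-level and ROI-level relevance feedback can be sketched as follows. The abstract does not say how the query point is computed, so the centroid rule, the two-dimensional feature vectors, and the `roi` flag are all assumptions made for illustration.

```python
def centroid(vectors):
    """Component-wise mean of equal-length feature vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

# Hypothetical relevant images: each region has a feature vector and an ROI flag set by the user.
relevant_images = [
    [{"feat": [1.0, 0.0], "roi": True}, {"feat": [0.0, 9.0], "roi": False}],
    [{"feat": [3.0, 2.0], "roi": True}, {"feat": [8.0, 8.0], "roi": False}],
]

all_regions = [r["feat"] for img in relevant_images for r in img]
roi_regions = [r["feat"] for img in relevant_images for r in img if r["roi"]]

image_level_query = centroid(all_regions)  # regions outside the ROI pull the query point away
roi_query = centroid(roi_regions)          # only user-selected regions contribute
```

In this toy data the image-level query point is dragged toward the unselected regions, while the ROI-based query point stays close to what the user actually marked, which is the effect the abstract attributes to ROI RF.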

Building Domain Ontology Based on Linguistic Patterns

  • Kim, Kweon-Yang;Lim, Soo-Yeon
    • Journal of the Korean Institute of Intelligent Systems / v.16 no.6 / pp.766-771 / 2006
  • In this paper, we focus on building a domain ontology from a corpus by extracting concepts and property relationships based on linguistic patterns. The pharmacy field is selected as the experimental domain, and we present an algorithm that extracts a hierarchical structure for terminology based on the noun/suffix patterns of terms in domain texts. To show the usefulness of our domain ontology, we compare a typical keyword-based retrieval method with an ontology-based retrieval method that uses related information in the ontology for related feedback. As a result, our method improves precision by 4.97% without losing recall.

Towards Intelligent Web Interaction

  • Takama, Yasufumi
    • Proceedings of the Korean Institute of Intelligent Systems Conference / 2003.09a / pp.134-137 / 2003
  • Browsing and retrieval with search engines are the major operations that establish the interaction between users and the Web. Although the two operations are usually combined to locate information on the Web, the recent growth of the Web has overtaken the potential of this conventional interaction. This paper proposes the concept of Retrieve, Browse, and Analyze (RBA)-based interaction as an improvement on the conventional Retrieve and Browse (RB)-based interaction. A prototype interface based on RBA-based interaction is also presented.

Region-Based Image Retrieval System using Spatial Location Information as Weights for Relevance Feedback (공간 위치 정보를 적합성 피드백을 위한 가중치로 사용하는 영역 기반 이미지 검색 시스템)

  • Song Jae-Won;Kim Deok-Hwan;Lee Ju-Hong
    • Journal of the Korea Society of Computer and Information / v.11 no.4 s.42 / pp.1-7 / 2006
  • Recently, studies of relevance feedback for increasing image retrieval performance have been active. In this paper, a new region weighting method for region-based image retrieval with relevance feedback is proposed to reduce the semantic gap between the low-level feature representation and the high-level concept in a given query image. The new weighting method determines the importance of regions according to their spatial locations in an image. Experimental results demonstrate that the recall of our method is about 18% better than that of the area percentage approach and about 11% better than that of the region frequency weighted by inverse image frequency approach, and that the retrieval time of our method is a tenth of that of the region frequency approach.

Building Intelligent User Interface Agent for Semantically Reformulating User Query in Medicine

  • Lim, Chae-Myung;Chu, Sung-Joon;Lee, Dong-Hoon;Park, Duck-Whan;Park, Tae-Young;Yang, Jung-Jin
    • Proceedings of the KAIS Fall Conference / 2003.11a / pp.57-64 / 2003
  • Realizing the benefits of recent discoveries in the Human Genome Project still requires a way to retrieve and analyze exponentially expanding bio-related information. Research in bio-related fields naturally applies discovered knowledge to the current problem and makes inferences to extract new information, so shared concepts and the data containing information need to be defined and used in a coherent way. In such a professional domain, while the need to help users reduce their work and to improve search results has emerged, methods for systematic retrieval and adequate exchange of relevant information are still in their infancy. The design of our system aims at improving the quality of information retrieval in a professional domain by utilizing both corpus-based and concept-based ontologies. Meta-rules that help users make an adequate query are formed into an ontology in the domain. The integration of this knowledge permits the system to retrieve relevant information in a more semantic and systematic fashion. This work mainly describes the query models, with details of the GUI and the secondary query generation of the system.

Developing CBR System for Bolt's CAPP (볼트의 자동공정계획수립을 위한 CBR시스템의 개발)

  • Kim, Jin-Baek
    • Asia pacific journal of information systems / v.9 no.2 / pp.19-37 / 1999
  • Computer-aided process planning (CAPP) is a key to implementing CIM. It is the bridge between CAD and CAM and translates design information into manufacturing instructions. Generally, manufacturing is an area where intelligent systems cannot rely on methods that require formalized knowledge. Manufacturing lacks a body of knowledge that is specific, formalized, and rigorous, and that can be coded as rules or procedures. Thus, expertise in manufacturing is developed over a period of many years. Case-based reasoning (CBR) offers a new approach to developing intelligent systems. In the case-based approach, the problem-solving experience of experts is encoded in the form of cases. The retrieval process of CBR can be divided into two steps: a matching step followed by a selection step. For selecting the base case, new preference heuristics were introduced using the concept of similarity, which has three dimensions: entity similarity, structural similarity, and goal similarity. In this paper, process planning for bolts was selected as the application domain. According to the test results, the new preference heuristics proved to be a useful procedure in CAPP.
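
The two-step retrieval (matching, then selection) over the three similarity dimensions can be sketched roughly as below. The abstract does not give the preference heuristics themselves, so the case names, scores, `WEIGHTS`, `MATCH_THRESHOLD`, and the weighted-sum rule are all hypothetical stand-ins.

```python
# Hypothetical case base: each case carries similarity scores against the new problem
# on the three dimensions named in the abstract (values are made up for illustration).
cases = {
    "bolt_A": {"entity": 0.9, "structural": 0.6, "goal": 0.8},
    "bolt_B": {"entity": 0.4, "structural": 0.9, "goal": 0.3},
    "bolt_C": {"entity": 0.8, "structural": 0.8, "goal": 0.9},
}
WEIGHTS = {"entity": 0.4, "structural": 0.3, "goal": 0.3}  # assumed weights
MATCH_THRESHOLD = 0.5  # assumed cutoff for the matching step

def overall(sim):
    """Weighted sum of the three similarity dimensions (an assumed combination rule)."""
    return sum(WEIGHTS[d] * sim[d] for d in WEIGHTS)

def retrieve(case_base):
    # Step 1 (matching): keep cases whose entity similarity passes the threshold.
    matched = {n: s for n, s in case_base.items() if s["entity"] >= MATCH_THRESHOLD}
    # Step 2 (selection): pick the matched case with the highest weighted similarity.
    return max(matched, key=lambda n: overall(matched[n])) if matched else None

base_case = retrieve(cases)
```

With these made-up numbers, `bolt_B` is dropped in the matching step and `bolt_C` is selected as the base case, whose process plan would then be adapted to the new bolt.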