• Title/Summary/Keyword: Distributed Information Retrieval

Protecting the iTrust Information Retrieval Network against Malicious Attacks

  • Chuang, Yung-Ting;Melliar-Smith, P. Michael;Moser, Louise E.;Lombera, Isai Michel
    • Journal of Computing Science and Engineering / v.6 no.3 / pp.179-192 / 2012
  • This paper presents novel statistical algorithms for protecting the iTrust information retrieval network against malicious attacks. In iTrust, metadata describing documents, and requests containing keywords, are randomly distributed to multiple participating nodes. The nodes that receive the requests try to match the keywords in the requests with the metadata they hold. If a node finds a match, the matching node returns the URL of the associated information to the requesting node. The requesting node then uses the URL to retrieve the information from the source node. The novel detection algorithm determines empirically the probabilities of the specific number of matches based on the number of responses that the requesting node receives. It also calculates the analytical probabilities of the specific numbers of matches. It compares the observed and the analytical probabilities to estimate the proportion of subverted or non-operational nodes in the iTrust network using a window-based method and the chi-squared statistic. If the detection algorithm determines that some of the nodes in the iTrust network are subverted or non-operational, then the novel defensive adaptation algorithm increases the number of nodes to which the requests are distributed to maintain the same probability of a match when some of the nodes are subverted or non-operational as compared to when all of the nodes are operational. Experimental results substantiate the effectiveness of the detection and defensive adaptation algorithms for protecting the iTrust information retrieval network against malicious attacks.
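
The detection and adaptation steps described in the abstract above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' algorithm: it assumes `n` participating nodes, metadata held by `m` of them, and requests sent to `r` distinct nodes chosen uniformly at random, so that the number of matches is hypergeometric. The function names and the parameter `x` (the proportion of operational nodes) are hypothetical, and the window-based estimation is omitted.

```python
from math import comb

def match_pmf(n, m, r, k):
    """Analytical probability of exactly k matches (hypergeometric assumption)."""
    return comb(m, k) * comb(n - m, r - k) / comb(n, r)

def p_any_match(n, m, r):
    """Probability that at least one queried node holds matching metadata."""
    return 1.0 - comb(n - m, r) / comb(n, r)

def chi_squared(observed, expected):
    """Pearson chi-squared statistic between observed and expected counts."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)

def adapted_r(n, m, r, x):
    """Smallest number of queried nodes that restores the original match
    probability when only a proportion x of metadata holders is operational
    (hypothetical adaptation rule). Returns None if no such number exists."""
    target = p_any_match(n, m, r)
    m_eff = round(m * x)  # assumed number of operational metadata holders
    for r2 in range(r, n + 1):
        if p_any_match(n, m_eff, r2) >= target:
            return r2
    return None
```

For example, `adapted_r(100, 20, 20, 0.5)` returns a value larger than 20, reflecting that more nodes must be queried once half of the metadata holders no longer respond.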

Development of Web-based Bio-Image Retrieval System (웨이블릿 변환을 이용한 실시간 화재 감지 알고리즘)

  • Cheong, Kwang-Ho;Ko, Byoung-Chul;Nam, Jae-Yeal
    • Proceedings of the Korea Information Processing Society Conference / 2006.11a / pp.227-230 / 2006
  • This thesis designs and implements a content-based image retrieval system using MPEG-7. The implemented system uses the existing MPEG-7 Visual Descriptors. In addition, a new descriptor for efficient retrieval of bio-images is proposed and used in the developed system. Compared with the Edge Histogram Descriptor (EHD) and the Region Shape Descriptor (RSD), the proposed Compact Binary Shape Descriptor (CBSD) shows good retrieval performance in terms of NMRR. The proposed descriptor is robust to large changes in brightness and contrast, and it especially improves retrieval performance when searching for images with similar shapes. The proposed system also adopts a distributed architecture to address increased server load and network delay. The client's update module efficiently reduces the download time for metadata. The developed system can efficiently retrieve images without overloading the server.
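
NMRR, the measure named in the abstract above, can be computed along the following lines. This is a sketch of the Normalized Modified Retrieval Rank as it is commonly defined for MPEG-7 evaluations, not code from the thesis. The function name and argument layout are assumptions: `ranks` holds the 1-based retrieval rank of each ground-truth image (or `None` if it was not retrieved), `ng` is the number of ground-truth images for the query, and `gtm` is the largest ground-truth set size over all queries.

```python
def nmrr(ranks, ng, gtm):
    """Normalized Modified Retrieval Rank: 0.0 is perfect, 1.0 is worst."""
    k = min(4 * ng, 2 * gtm)  # cut-off rank for this query
    # Ground-truth items ranked beyond k (or missing) get the penalty rank 1.25 * k.
    penalised = [r if (r is not None and r <= k) else 1.25 * k for r in ranks]
    avr = sum(penalised) / ng        # average rank
    mrr = avr - 0.5 * (1 + ng)       # modified retrieval rank
    return mrr / (1.25 * k - 0.5 * (1 + ng))
```

Lower values indicate better retrieval: a query whose ground-truth images occupy the top `ng` ranks scores 0, and a query that retrieves none of them within the cut-off scores 1.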

A study on constructing GIS component repository on web using registration/retrieval agents

  • Jo, Myung-Hee;Jo, Yun-Won;Bu, Ki-Dong
    • Proceedings of the KSRS Conference / 2003.11a / pp.949-951 / 2003
  • To improve the reusability and interoperability of GIS components, we propose registration/retrieval agents that can locate users' frequently used components not only in the GIS domain but also in other spatial information technologies such as GPS, ITS, RS and FM. The registration/retrieval agents increase reusability through GIS component-based development in a distributed GIS component environment and enable rapid setup of applications on the web. Moreover, users can easily understand the information about GIS components and gain effective investment, timeliness and reliability, while the agents reduce maintenance effort. To design and implement this system on the web, HTML, ASP (Active Server Pages) and Java were used. In addition, the performance of this system was verified by comparison with similar systems.

Development of the Management Tool for S&T information in distributed retrieval database (분산 저장된 과학기술정보 서비스를 위한 검색 데이터베이스 관리 도구의 설계 및 개발)

  • Lee, Seok-Hyoung;Yoon, Hee-Jun;Yeo, Il-Yeon;Choi, Sung-Pil;Yoon, Hwa-Mook
    • Proceedings of the Korea Contents Association Conference / 2006.11a / pp.677-681 / 2006
  • In this paper, we propose a GUI management tool, named K-Manager, for the management and service of science and technology (S&T) information stored in distributed retrieval databases. Generally, a retrieval database system must be adopted for a web-based S&T content service. However, it is inconvenient for content managers or system administrators to control the information, because information retrieval database systems do not support an S&T information management tool like TOAD, which is available for relational databases. Using K-Manager, content managers can process the S&T content and system managers can manage the databases easily. The proposed tool effectively controls the information stored in the distributed retrieval databases, guarantees safe management of the stored content, and lets retrieval operate efficiently. Our tool consists of two subsystems: a content manager and a database manager for YESKISTI, based on KRISTAL-IRMS.

Application of the 2-Poisson Model to Full-Text Information Retrieval System (2-포아송 모형의 전문검색시스템 응용에 관한 연구)

  • 문성빈
    • Journal of the Korean Society for Information Management / v.16 no.3 / pp.49-63 / 1999
  • The purpose of this study is to investigate whether query terms are distributed according to the 2-Poisson model in documents represented by abstract/title or by full text. Retrieval experiments using the binary independence model and the 2-Poisson independence model, both based on probabilistic theory, were conducted to see whether the 2-Poisson distribution of the query terms influences retrieval effectiveness, particularly in a full-text information retrieval system.
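
For readers unfamiliar with the 2-Poisson model mentioned above, the sketch below shows the mixture it refers to: the within-document frequency of a term is modeled as drawn from one of two Poisson distributions, one for documents that are "about" the term and one for the rest. The parameter names (`pi`, `lam_elite`, `lam_other`) are illustrative assumptions and do not come from the study.

```python
from math import exp, factorial

def poisson_pmf(k, lam):
    """Probability of observing k occurrences under a Poisson with mean lam."""
    return exp(-lam) * lam ** k / factorial(k)

def two_poisson_pmf(k, pi, lam_elite, lam_other):
    """Mixture of two Poissons: weight pi on the 'elite' documents' rate,
    weight (1 - pi) on the rate for all other documents."""
    return pi * poisson_pmf(k, lam_elite) + (1 - pi) * poisson_pmf(k, lam_other)
```

A term fits the model well when its observed frequency histogram across documents is close to `two_poisson_pmf` for some choice of the three parameters.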

A Mobile P2P Semantic Information Retrieval System with Effective Updates

  • Liu, Chuan-Ming;Chen, Cheng-Hsien;Chen, Yen-Lin;Wang, Jeng-Haur
    • KSII Transactions on Internet and Information Systems (TIIS) / v.9 no.5 / pp.1807-1824 / 2015
  • As technologies advance, mobile peer-to-peer (MP2P) networks or systems have become one of the major ways to share resources and information. On such a system, information retrieval (IR), including the development of scalable infrastructures for indexing, becomes more complicated due to a huge increase in the amount of information and rapid information change. To keep systems on MP2P networks reliable and consistent, the index structures need to be updated frequently. For a semantic IR system, the index structure is even more complicated than in a classic IR system and generally has a higher update cost. The best-known indexing technique used in semantic IR systems is Latent Semantic Indexing (LSI), whose index structure is generated by singular value decomposition (SVD). Although LSI performs well, updating the index structure is difficult and time-consuming. In an MP2P environment, which is fully distributed and dynamic, the update becomes more challenging. In this work, we consider how to update the semantic index generated by LSI and keep the index consistent across the whole MP2P network. The proposed Concept Space Update (CSU) protocol, based on a distributed 2-phase locking strategy, can effectively achieve the objectives in terms of two measurements: coverage speed and update cost. Using the proposed synchronization mechanism together with efficient updates of the SVD, re-computing the whole index on the P2P overlay can be avoided and consistency can be achieved. Simulated experiments are also performed to validate our analysis of the proposed CSU protocol. The experimental results indicate that CSU is effective at updating the concept space with an LSI/SVD index structure in MP2P semantic IR systems.
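
As background for the LSI/SVD index discussed above, the standard rank-k formulation is shown below together with the textbook "folding-in" step for adding a new document without recomputing the decomposition. This is general LSI background under the usual assumptions (a term-document matrix A, a new document's term vector d), not the CSU protocol itself, whose update rules are not given in the abstract.

```latex
% Rank-k truncated SVD of the term-document matrix A (t terms, n documents)
A \approx A_k = U_k \Sigma_k V_k^{\mathsf{T}},
\qquad U_k \in \mathbb{R}^{t \times k},\;
\Sigma_k \in \mathbb{R}^{k \times k},\;
V_k \in \mathbb{R}^{n \times k}

% Folding in a new document with term vector d \in \mathbb{R}^{t}:
% project it into the existing k-dimensional concept space
\hat{d} = \Sigma_k^{-1} U_k^{\mathsf{T}} d
```

Folding-in is cheap but leaves the basis vectors in U_k unchanged, so accuracy degrades as more documents are added; this is one reason why dedicated SVD update schemes are of interest.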

Development of an Integrated Retrieval System on Distributed KRISTAL-2002 Systems with Metadata Information (메타데이터 정보를 이용한 분산 KRISTAL-2002 시스템의 통합 검색 시스템 개발)

  • Choe Gui-ja;Kim Jae-Gon;Seo Jung-Hyun;Cho Han-Hyung;Lee Min-Ho;Jung Chang-Hu;Park Dong-In;Nam Young-Kwang
    • The KIPS Transactions: Part D / v.12D no.1 s.97 / pp.135-150 / 2005
  • In this paper, we propose an integrated information retrieval system for multiple distributed KRISTAL-2002 systems that uses metadata information. The system integrates existing systems for different areas, or systems for the same area with different schemas, so that users can get answers from all the systems at once. The proposed system consists of the Source Server Manager (SSM), which supports the mapping between the integrated metadata database and the source servers; the Integrated Metadata Manager (ISM), which registers and manages the metadata and schema mappings; the Distributed Query Processor (DQP), which translates the user query into source server queries; the Distributed Data Set Integrated Manager (DDSIM), which merges the retrieval results and transforms them into HTML format; and the integrated retrieval engine, which manages the query results. The integrated metadata is assumed to follow the ISO/IEC 11179 metadata registration procedure, using a metadata registry system that is a subsystem of the proposed system. Users can issue two kinds of queries: the basic query and the detailed query. Before submitting a query, users may select the databases or organizations from which results are drawn. The proposed system has been developed on top of KRISTAL-2002 systems with Visual C++ and C++ CGI, and it has been tested and verified with six database systems.

Integrated Information Retrieval with Metadata Interface for Heterogeneous Distributed XML Documents (메타정보 인터페이스를 이용한 이질 구조 분석 XML문서 통합 검색)

  • 류성준;황재문;김태훈;남영광
    • Journal of KIISE: Software and Applications / v.31 no.11 / pp.1505-1518 / 2004
  • We propose an extremely lightweight DDXMI approach for semi-automated integration of structurally and semantically heterogeneous distributed XML documents. In the proposed prototype, a DDXMI (Distributed Documents XML Metadata Interface) is defined and a user interface generator is developed. The prototype takes the sources' DTDs as input and generates a friendly graphical user interface for application users. The user can easily describe the semantic mapping between the integrated virtual database DTD and the sources' DTDs by assigning index numbers and specifying associated function names, so that the DDXMI based on the mappings is generated automatically. Quilt is selected as the XML query language that processes user queries according to the DDXMI. It is assumed that the application users know what they want from the different sources; that is, they have their own integrated database schema in mind and know the semantics of the XML databases involved. A small global DTD and a mid-size global DTD are generated to verify the query generation and retrieval results with three XML document databases: Master's/Ph.D. theses, research reports, and journal databases. The system has been developed with JavaCC and Java Servlet.

Formal Modeling and Verification of an Information Retrieval System using SMV

  • Kim, Jong-Hwan;Park, Hea-Sook;Baik, Doo-Kwon
    • Proceedings of the Korea Society for Simulation Conference / 2001.10a / pp.141-146 / 2001
  • An Information Retrieval System offers an integrated view of SCM (Supply Chain Management) information to the enterprise by making it possible to exchange data between regionally distributed heterogeneous computers and by enabling these computers to access various types of databases. The Information Retrieval System is modeled using the Data Registry Model based on X3.285. Among the core parts, we verify only the MetaData Registry Manager (MDR Manager), using SMV (Symbolic Model Verifier), to check whether our model satisfies the requirements under the given assumptions.

A Study on Distribution Query Conversion Method for Real-time Integrating Retrieval based on TMDR (TMDR 기반의 실시간 통합 검색을 위한 분산질의 변환 기법에 대한 연구)

  • Hwang, Chi-Gon;Shin, Hyo-Young;Jung, Kye-Dong;Choi, Young-Keun
    • Journal of the Korea Institute of Information and Communication Engineering / v.14 no.7 / pp.1701-1707 / 2010
  • This study aims to implement a system environment that helps integrate and retrieve various types of data in real time by providing semantic interoperability among distributed heterogeneous information systems. The semantic interoperability is made possible by a TMDR (Topicmaps Metadata Registry), a set of ontologies. The TMDR, which is built by combining an MDR (MetaData Registry) with TopicMaps and storing them in a database, can generate distributed queries and provide knowledge efficiently. MDR is a metadata management technique for distributed data management. TopicMaps is an ontology representation technique that takes hierarchy and association into consideration when accessing knowledge data. We have created a TMDR, a kind of ontology, that fits any system and can detect and resolve semantic conflicts at the data and schema levels. With this system, we propose a query-processing technique to integrate and access heterogeneous information sources. Unlike existing retrieval methods, it enables efficient retrieval and reasoning by providing associations that focus on subjects.