• Title/Abstract/Keywords: internet indexing

Search results: 103 items, processing time 0.021 s

An Efficient Indexing Structure for Multidimensional Categorical Range Aggregation Query

  • Yang, Jian;Zhao, Chongchong;Li, Chao;Xing, Chunxiao
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • Vol. 13, No. 2
    • /
    • pp.597-618
    • /
    • 2019
  • Categorical range aggregation (CRA), which is conceptually equivalent to running a range aggregation query separately on multiple datasets, returns the query result for each dataset. The challenge is that when the number of datasets reaches hundreds or thousands, a query costs a great deal of computation time and I/O. Previous work handles a range restriction on only a single dimension, whereas in practice more and more applications need statistics under range restrictions on multiple dimensions. We propose the MCRI-Tree, an index structure designed to answer multi-dimensional categorical range aggregation queries, which can utilize main memory to maximize the efficiency of CRA queries. Specifically, the MCRI-Tree answers any query in $O(nk^{n-1})$ I/Os (where n is the number of dimensions, and k denotes the maximum number of pages covered in one dimension among all the n dimensions during a query). The practical efficiency of our technique is demonstrated with extensive experiments.
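
To make the query type concrete, the following is a minimal sketch of what a multi-dimensional CRA query returns, written as a brute-force scan. It is not the MCRI-Tree and does not model pages or I/O; the function name `categorical_range_count`, the sample records, and the choice of COUNT as the aggregate are assumptions made for illustration.

```python
from collections import defaultdict


def categorical_range_count(records, ranges):
    """Count, per category, the records whose point lies inside every range.

    records: iterable of (category, point), where point is a tuple of numbers.
    ranges:  one inclusive (low, high) pair per dimension.
    """
    counts = defaultdict(int)
    for category, point in records:
        # A record qualifies only if it satisfies the range on all dimensions.
        if all(low <= value <= high for value, (low, high) in zip(point, ranges)):
            counts[category] += 1
    return dict(counts)


# Hypothetical two-dimensional data: (category, (dimension 1, dimension 2)).
records = [
    ("books", (10, 3)),
    ("books", (25, 7)),
    ("music", (12, 4)),
    ("music", (40, 9)),
    ("games", (50, 1)),
]

result = categorical_range_count(records, [(10, 30), (2, 8)])
print(result)  # {'books': 2, 'music': 1}
```

The scan touches every record of every category, which is the cost an index structure for CRA queries is meant to avoid.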

A New Three-dimensional Integrated Multi-index Method for CBIR System

  • Zhang, Mingzhu
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • Vol. 15, No. 3
    • /
    • pp.993-1014
    • /
    • 2021
  • This paper proposes a new image retrieval method called the 3D integrated multi-index, which fuses SIFT (Scale Invariant Feature Transform) visual words with other features at the indexing level. The advantage of the 3D integrated multi-index is that it produces finer subdivisions of the search space. Compared with inverted indices built on a medium-sized codebook, the proposed method slightly increases preprocessing and query time. In particular, the SIFT, contour and colour features are fused into the integrated multi-index, and the cooperation of these complementary features significantly reduces the impact of false positive matches, so that effective image retrieval can be achieved. Extensive experiments on five benchmark datasets show that the 3D integrated multi-index significantly improves retrieval accuracy. Compared with other methods, its memory usage and query time remain acceptable. Importantly, we show that the 3D integrated multi-index complements many prior techniques well, which makes our method compare favorably with the state of the art.

A Mobile P2P Semantic Information Retrieval System with Effective Updates

  • Liu, Chuan-Ming;Chen, Cheng-Hsien;Chen, Yen-Lin;Wang, Jeng-Haur
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • Vol. 9, No. 5
    • /
    • pp.1807-1824
    • /
    • 2015
  • As technologies advance, mobile peer-to-peer (MP2P) networks or systems are becoming one of the major ways to share resources and information. On such a system, information retrieval (IR), including the development of scalable infrastructures for indexing, becomes more complicated because of the huge increase in the amount of information and the rapid change of information. To keep the systems on MP2P networks reliable and consistent, the index structures need to be updated frequently. For a semantic IR system, the index structure is even more complicated than in a classic IR system and generally has a higher update cost. The best-known indexing technique used in semantic IR systems is Latent Semantic Indexing (LSI), whose index structure is generated by singular value decomposition (SVD). Although LSI performs well, updating the index structure is difficult and time-consuming. In an MP2P environment, which is fully distributed and dynamic, the update becomes even more challenging. In this work, we consider how to update the semantic index generated by LSI and keep the index consistent across the whole MP2P network. The proposed Concept Space Update (CSU) protocol, based on a distributed 2-Phase locking strategy, can effectively achieve these objectives in terms of two measurements: coverage speed and update cost. By using the proposed synchronization mechanism together with efficient updates on the SVD, re-computing the whole index on the P2P overlay can be avoided and consistency can be achieved. Simulated experiments are also performed to validate our analysis of the proposed CSU protocol. The experimental results indicate that CSU is effective in updating the concept space with the LSI/SVD index structure in MP2P semantic IR systems.

Semantic Indexing for Soccer Videos Using Web-Extracted Information

  • ;김명훈;설상훈
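    • Korean Institute of Information Scientists and Engineers: Conference Proceedings
    • /
    • Proceedings of the Korean Institute of Information Scientists and Engineers 2007 Fall Conference, Vol.34 No.2 (C)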
    • /
    • pp.41-45
    • /
    • 2007
  • The rapid growth of video content production makes it necessary to develop more complex indexing systems that allow efficient searching, retrieval and presentation of the desired video segments. This paper presents a method for indexing soccer video through automatic extraction of information from the Internet. The paper defines a metadata structure to formally represent the knowledge of soccer matches and provides an automatic method to extract semantic information from websites. This approach improves the capability to extract more reliable and richer semantic information for soccer videos. Experimental results demonstrate that the proposed method performs efficiently.


An Intelligent Search Modeling using Avatar Agent

  • Kim, Dae Su
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • Vol. 4, No. 3
    • /
    • pp.288-291
    • /
    • 2004
  • This paper proposes an intelligent search model using an avatar agent. The system consists of modules such as an agent interface, agent management, a preprocessor, and an interface machine. The Core-Symbol Database and the Spell Checker are related to the preprocessor module, and the Interface Machine is connected with the Best Aggregate Designer. Our avatar agent system performs indexing, which converts the user's natural-language sentence into words suitable for information retrieval in a specific field. Indexing is one of the preprocessing steps that make it possible to guarantee the specificity of the user's input and increase the reliability of the result. It references a database that consists of a synonym dictionary and a field-specific dictionary. The symbol that results from indexing is used for a draft search by the Internet search engine. The positions of the retrieved pages and their link information are stored in the database. We tested our system with the stock market keywords SAMSUNG_SDI, IBM, and SONY and compared the results with those of the Altavista and Google search engines. The system produced excellent results.

Xp-tree: A new spatial-based indexing method to accelerate Xpath location steps

  • Trang, Nguyen-Van;Hwang, Jeong-Hee;Ryu, Keun-Ho
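    • Korean Institute of Information Scientists and Engineers: Conference Proceedings
    • /
    • Proceedings of the Korean Institute of Information Scientists and Engineers 2004 Spring Conference, Vol.31 No.1 (B)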
    • /
    • pp.10-12
    • /
    • 2004
  • The rapid emergence of XML as a standard for data exchange over the Internet has led to considerable interest in data management requirements such as storing and querying XML documents. The location path language XPath is of particular importance for XML applications, since it is a core component of many XML processing standards such as XSLT and XQuery. This paper gives a brief overview of the method and design of a new spatial-based indexing method, namely Xp-tree, that is used to support XPath. Spatial indexing techniques have proved their capacity for searching large databases. Based on accelerating node access using a planar representation combined with a numbering scheme, we devise efficient derivative algorithms, which are simple but useful. In addition, the method allows all nodes related to a context node to be traced in a manner that supports queries natural to the types, especially XPath queries with predicates.
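
The abstract does not say which numbering scheme is combined with the plane. As an assumption, the sketch below uses the familiar preorder/postorder numbering, under which each node becomes a point (pre, post) and an XPath axis such as `descendant` becomes a rectangular window query of the kind a spatial index can serve. The helper names `number_nodes` and `descendants` are hypothetical, and the linear scan stands in for a real spatial index.

```python
import xml.etree.ElementTree as ET


def number_nodes(root):
    """Assign a (preorder, postorder) rank pair to every element."""
    ranks = {}
    counters = {"pre": 0, "post": 0}

    def visit(node):
        pre = counters["pre"]
        counters["pre"] += 1
        for child in node:
            visit(child)
        # The postorder rank is assigned once all children are finished.
        ranks[node] = (pre, counters["post"])
        counters["post"] += 1

    visit(root)
    return ranks


def descendants(ranks, context):
    """Descendant axis: points with a larger pre and a smaller post rank."""
    pre, post = ranks[context]
    return [node for node, (p, q) in ranks.items() if p > pre and q < post]


root = ET.fromstring("<a><b><c/><d/></b><e/></a>")
ranks = number_nodes(root)
b = root.find("b")
tags = sorted(node.tag for node in descendants(ranks, b))
print(tags)  # ['c', 'd']
```

The other major axes follow the same pattern with different windows, for example `ancestor` selects points with a smaller pre and a larger post rank.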


Design and Implementation of a Simulation Framework for Wireless Data Broadcasting based on Data ID Space Partition

  • Im, Seokjin
    • International journal of advanced smart convergence
    • /
    • Vol. 7, No. 4
    • /
    • pp.10-18
    • /
    • 2018
  • For information services that support requests for data items from a great number of mobile clients, wireless data broadcasting is an effective approach because it can accommodate any number of clients. In wireless data broadcasting, various air indexing schemes and data scheduling schemes have been developed to enable clients to access their desired data items efficiently. A broadcasting system needs a method to simulate newly designed air indexing and scheduling schemes and to evaluate the performance parameters of those schemes. In this paper, we design an expandable and efficient simulation framework for wireless data broadcasting based on the partition of the data ID space. The framework can adopt regular and irregular space partitions and evaluate various performance parameters of the broadcasting system. Using the framework, we implement a testbed of the broadcasting system that adopts IIP, GDI and EXP as its air indexing schemes. We simulate the system using the testbed and evaluate its performance parameters. In this way, we show the efficiency and expandability of the designed and implemented framework.

A Way to Speed up Evaluation of Path-oriented Queries using An Abbreviation-paths and An Extendible Hashing Technique

  • 박희숙;조우현
    • The KIPS Transactions: Part D
    • /
    • Vol. 11D, No. 7
    • /
    • pp.1409-1416
    • /
    • 2004
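  • With the recent explosive growth and popularity of the Internet, information exchange over the Internet is increasing dramatically. XML is also becoming both a standard and an important means for exchanging data on the Internet. Improving the evaluation speed of path-oriented queries is therefore an important issue in retrieving XML documents. This paper proposes a new indexing method to improve the retrieval performance of path-oriented queries in document databases. To process path-oriented queries effectively, the new indexing method creates an abbreviation-path file and uses its hash code value as the index key. In addition, the evaluation of search queries is accelerated by combining the abbreviation-path file with an extendible hashing technique.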
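
The idea of keying an index by the hash code of a path and growing it with extendible hashing can be sketched as follows. This is an illustration under stated assumptions, not the paper's design: the abstract does not describe how abbreviation paths are built, so the sketch takes path strings as given, uses CRC-32 as an arbitrary stand-in hash, and the names `path_code`, `Bucket`, and `ExtendiblePathIndex` are hypothetical.

```python
import zlib


def path_code(path):
    """Hash code used as the index key (CRC-32 is an arbitrary stand-in)."""
    return zlib.crc32(path.encode("utf-8"))


class Bucket:
    def __init__(self, depth):
        self.depth = depth   # local depth: number of low hash bits this bucket owns
        self.entries = {}    # hash code -> list of (path, doc_id)


class ExtendiblePathIndex:
    def __init__(self, capacity=2):
        self.capacity = capacity          # max distinct hash codes per bucket
        self.global_depth = 1
        self.directory = [Bucket(1), Bucket(1)]

    def _bucket(self, code):
        # The low global_depth bits of the code select the directory slot.
        return self.directory[code & ((1 << self.global_depth) - 1)]

    def insert(self, path, doc_id):
        code = path_code(path)
        while True:
            bucket = self._bucket(code)
            if code in bucket.entries or len(bucket.entries) < self.capacity:
                bucket.entries.setdefault(code, []).append((path, doc_id))
                return
            self._split(bucket)           # bucket is full: split and retry

    def _split(self, bucket):
        if bucket.depth == self.global_depth:
            # Double the directory; both halves point at the same buckets.
            self.directory = self.directory + self.directory
            self.global_depth += 1
        high_bit = 1 << bucket.depth
        bucket.depth += 1
        sibling = Bucket(bucket.depth)
        # Slots whose newly significant bit is set move to the sibling.
        for slot, candidate in enumerate(self.directory):
            if candidate is bucket and slot & high_bit:
                self.directory[slot] = sibling
        for code in [c for c in bucket.entries if c & high_bit]:
            sibling.entries[code] = bucket.entries.pop(code)

    def lookup(self, path):
        code = path_code(path)
        hits = self._bucket(code).entries.get(code, [])
        # Compare the stored path as well, so a hash collision cannot match.
        return [doc_id for stored, doc_id in hits if stored == path]


index = ExtendiblePathIndex(capacity=2)
for doc_id, path in [(1, "/book/title"), (2, "/book/author/name"),
                     (3, "/book/title"), (4, "/article/title"),
                     (5, "/article/section/para"), (6, "/thesis/abstract")]:
    index.insert(path, doc_id)

print(index.lookup("/book/title"))  # [1, 3]
```

A lookup costs one hash computation and one directory probe regardless of how many paths are stored, which is the property that makes a hashed path key attractive for path-oriented queries.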

Path Signatures: Path-oriented Query Processing System for XML document Retrieval

  • 박희숙;박주현;조우현
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • Vol. 11, No. 7
    • /
    • pp.1311-1317
    • /
    • 2007
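  • With the recent explosive growth and popularity of the Internet, the exchange of information over the Internet is increasing very rapidly. XML is also establishing itself as both a means of and a standard for information exchange on the Internet. This paper therefore presents a new indexing technique for evaluating path-oriented queries, together with the design and implementation of a path-oriented query processing system that provides users with convenient XML document retrieval results. The proposed indexing technique combines a binary trie structure with a path signature file to improve the retrieval performance for XML documents.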

XML View Indexing Using an RDBMS based XML Storage System

  • 박대성;김영성;강현철
    • Journal of the Korean Society for Internet Information
    • /
    • Vol. 6, No. 4
    • /
    • pp.59-73
    • /
    • 2005
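  • Caching query results and using them to process subsequent queries is an important query optimization technique, which includes the materialized view technique and view indexing. These techniques, much studied for relational databases, have recently also been studied for XML data since XML emerged as the standard for data exchange on the Web. XML view indexing represents an XML view xv, which is the result of an XML query, with a structure called an XML view index (XVI). The XVI stores the identifiers of the source document elements of xv and information about xv. The XVI for xv stores only the identifiers of XML elements, not the elements themselves. Therefore, when xv is requested, a materialization process must be performed against the underlying source XML documents of xv through its XVI. This paper addresses the problem of integrating an XVI management system with a relational DBMS based XML storage system. The proposed system was implemented in Java for each of two different commercial relational DBMSs in a Windows 2000 Server environment, and was used to evaluate the performance improvement of XML query processing through XML view indexing as well as the overhead of XML view indexing. The performance experiments confirmed that XML view indexing on a relational DBMS based XML storage system is very efficient and that its overhead is negligible.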
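
The distinction between storing element identifiers and storing the elements themselves can be sketched as follows. This is a minimal in-memory illustration, not the paper's system: the actual XVI sits on a relational DBMS and is implemented in Java, whereas the sample document, the `id` attribute used as the element identifier, and the functions `build_view_index` and `materialize` are assumptions made for this example.

```python
import xml.etree.ElementTree as ET

# Hypothetical source document; each book carries an identifier attribute.
SOURCE = """<catalog>
  <book id="b1"><title>Indexing</title><price>30</price></book>
  <book id="b2"><title>Hashing</title><price>12</price></book>
  <book id="b3"><title>Views</title><price>45</price></book>
</catalog>"""


def build_view_index(root, predicate):
    """Keep only the identifiers of the elements that belong to the view."""
    return [book.get("id") for book in root.iter("book") if predicate(book)]


def materialize(root, view_index):
    """Rebuild the view by fetching the identified elements from the source."""
    by_id = {book.get("id"): book for book in root.iter("book")}
    view = ET.Element("view")
    for element_id in view_index:
        view.append(by_id[element_id])
    return view


root = ET.fromstring(SOURCE)

# The view "books priced above 20" is cached as identifiers only.
xvi = build_view_index(root, lambda book: int(book.findtext("price")) > 20)

# Materialization happens when the view is requested.
view = materialize(root, xvi)
titles = [book.findtext("title") for book in view]
print(xvi, titles)  # ['b1', 'b3'] ['Indexing', 'Views']
```

Because the index holds identifiers rather than copies, the cached view stays small, at the price of a lookup against the source document each time the view is requested.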
