• Title/Summary/Keyword: Information storage and retrieval

Search Result 228, Processing Time 0.024 seconds

CS-Tree : Cell-based Signature Index Structure for Similarity Search in High-Dimensional Data (CS-트리 : 고차원 데이터의 유사성 검색을 위한 셀-기반 시그니쳐 색인 구조)

  • Song, Gwang-Taek;Jang, Jae-U
    • The KIPS Transactions:PartD
    • /
    • v.8D no.4
    • /
    • pp.305-312
    • /
    • 2001
  • Recently, high-dimensional index structures have been required for similarity search in such database applications s multimedia database and data warehousing. In this paper, we propose a new cell-based signature tree, called CS-tree, which supports efficient storage and retrieval on high-dimensional feature vectors. The proposed CS-tree partitions a high-dimensional feature space into a group of cells and represents a feature vector as its corresponding cell signature. By using cell signatures rather than real feature vectors, it is possible to reduce the height of our CS-tree, leading to efficient retrieval performance. In addition, we present a similarity search algorithm for efficiently pruning the search space based on cells. Finally, we compare the performance of our CS-tree with that of the X-tree being considered as an efficient high-dimensional index structure, in terms of insertion time, retrieval time for a k-nearest neighbor query, and storage overhead. It is shown from experimental results that our CS-tree is better on retrieval performance than the X-tree.

  • PDF

Efficient Incorporation of Tertiary Storage in a Multimedia DBMS (멀티미디어 DBMS에서 3차 저장장치의 효율적 활용 기법)

  • Mun, Chan-Ho;Gang, Hyeon-Cheol
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.7
    • /
    • pp.1724-1737
    • /
    • 1999
  • Multimedia data service applications have to store and manipulate LOBs(unstructured large objects) composing multimedia data. As such, the tertiary storage devices such as an optical disk jukebox and a tape library that consist of a number of platters (the disks in case of an optical disk jukebox and the cartridge tapes in case of a tape library) have been considered essential for the storage system of a DBMS in order to efficiently support storage and management of vary large volume of data. Since the latency with tertiary storage is too long, the schemes for efficient retrieval of LOBs out of tertiary storage need to be investigated. In this paper, we investigated the tertiary I/O Considering the performance characteristics of the LOBs, we proposed various I/O scheduling heuristic algorithms that reduce latency in query processing with LOB retrieval from tertiary storage, and evaluated their performance through a detailed simulation.

  • PDF

Efficient Multimedia Data File Management and Retrieval Strategy on Big Data Processing System

  • Lee, Jae-Kyung;Shin, Su-Mi;Kim, Kyung-Chang
    • Journal of the Korea Society of Computer and Information
    • /
    • v.20 no.8
    • /
    • pp.77-83
    • /
    • 2015
  • The storage and retrieval of multimedia data is becoming increasingly important in many application areas including record management, video(CCTV) management and Internet of Things (IoT). In these applications, the files containing multimedia that need to be stored and managed is tremendous and constantly scaling. In this paper, we propose a technique to retrieve a very large number of files, in multimedia format, using the Hadoop Framework. Our strategy is based on the management of metadata that describes the characteristic of files that are stored in Hadoop Distributed File System (HDFS). The metadata schema is represented in Hbase and looked up using SQL On Hadoop (Hive, Tajo). Both the Hbase, Hive and Tajo are part of the Hadoop Ecosystem. Preliminary experiment on multimedia data files stored in HDFS shows the viability of the proposed strategy.

Analysis Performance Characterics of Dynamic Signature File Methods

  • Yoo, Jae-Soo;Choi, Kil-Seong;Kim, Myoung-Ho
    • Journal of Electrical Engineering and information Science
    • /
    • v.2 no.4
    • /
    • pp.37-45
    • /
    • 1997
  • With rapid increase of information requirements from various application areas, there has been much research on dynamic information storage structures that effectively support insertions, deletions and updates. In this appear we evaluate the performance of he existing dynamic signature file methods such as the S-tree, Quick Filter and HS file and provide guidelines for the most effective usage to a given operational environment. We derive analytic performance evaluation models of the storage structures based on retrieval time, storage overhead and insertion time. We also perform extensive experiments with various data distributions such as uniform, normal and exponential distributions. The relationships among various performance parameters are throughly investigated. We show through performance comparison based on analytic models and experiments that regardless of data distribution, the HS file significantly improves performance in both the retreival time and the storage overhead over S-tree and Quick Filter.

  • PDF

A Study of base-contents retrieval for using Multimedia code (멀티미디어 부호화를 이용한 내용기반 검색에 관한 연구)

  • 박재필;강진석;고석만;최연성;김장형
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2000.05a
    • /
    • pp.325-330
    • /
    • 2000
  • Recent progress on computer and related technology, especially including communication network, multimedia information processing and communication service technology. accelerates the entrance of information society. Especially, WWW brought the information crisis in its demand and size on the communication network, by making it easy to develop information service, like electronic (internet) news, electronic conference, multimedia information retrieval, and so on, on the internet. Due to this change, DBMS should provide efficient ways to store andmanage various types of Multimedia data and to model complex information structures. In order to satisfy these requirements, there have been man researches on architecture of multimedia DBMS, content-based retrieval for multimedia information, tertiary storage system for huge multimedia data, multimedia information modeling,

  • PDF

A Study on Retrieval of Storage Heat Flux in Urban Area (우리나라 도심지에서의 저장열 산출에 관한 연구)

  • Lee, Darae;Kim, Honghee;Lee, Sang-Hyun;Lee, Doo-Il;Hong, Jinkyu;Hong, Je-Woo;Lee, Keunmin;Lee, Kyeong-sang;Seo, Minji;Han, Kyung-Soo
    • Korean Journal of Remote Sensing
    • /
    • v.34 no.2_1
    • /
    • pp.301-306
    • /
    • 2018
  • Urbanization causes urban floods and urban heat island in the summer, so it is necessary to understanding the changes of the thermal environment through urban climate and energy balance. This can be explained by the energy balance, but in urban areas, unlike the typical energy balance, the storage heat flux saved in the building or artificial land cover should be considered. Since the environment of each city is different, there is a difficulty in applying the method of retrieving the storage heat flux of the previous research. Especially, most of the previous studies are focused on the overseas cities, so it is necessary to study the storage heat retrieval suitable for various land cover and building characteristics of the urban areas in Korea. Therefore, the object of this study, it is to derive the regression formula which can quantitatively retrieve the storage heat using the data of the area where various surface types exist. To this end, nonlinear regression analysis was performed using net radiation and surface temperature data as independent variables and flux tower based storage heat estimates as dependent variables. The retrieved regression coefficients were applied to each independent variable to derive the storage heat retrieval regression formula. As a result of time series analysis with flux tower based storage heat estimates, it was well simulated high peak at day time and the value at night. Moreover storage heat retrieved in this study was possible continuous retrieval than flux tower based storage heat estimates. As a result of scatter plot analysis, accuracy of retrieved storage heat was found to be significant at $50.14Wm^{-2}$ and bias $-0.94Wm^{-2}$.

Storing Digital Information in Long-Read DNA

  • Ahn, TaeJin;Ban, Hamin;Park, Hyunsoo
    • Genomics & Informatics
    • /
    • v.16 no.4
    • /
    • pp.30.1-30.6
    • /
    • 2018
  • There is urgent need for effective and cost-efficient data storage, as the worldwide requirement for data storage is rapidly growing. DNA has introduced a new tool for storing digital information. Recent studies have successfully stored digital information, such as text and gif animation. Previous studies tackled technical hurdles due to errors from DNA synthesis and sequencing. Studies also have focused on a strategy that makes use of 100-150-bp read sizes in both synthesis and sequencing. In this paper, we a suggest novel data encoding/decoding scheme that makes use of long-read DNA (~1,000 bp). This enables accurate recovery of stored digital information with a smaller number of reads than the previous approach. Also, this approach reduces sequencing time.

Signature-based Indexing Scheme for Similar Sub-Trajectory Retrieval of Moving Objects (이동 객체의 유사 부분궤적 검색을 위한 시그니쳐-기반 색인 기법)

  • Shim, Choon-Bo;Chang, Jae-Woo
    • The KIPS Transactions:PartD
    • /
    • v.11D no.2
    • /
    • pp.247-258
    • /
    • 2004
  • Recently, there have been researches on storage and retrieval technique of moving objects, which are highly concerned by user in database application area such as video databases, spatio-temporal databases, and mobile databases. In this paper, we propose a new signature-based indexing scheme which supports similar sub-trajectory retrieval at well as good retrieval performance on moving objects trajectories. Our signature-based indexing scheme is classified into concatenated signature-based indexing scheme for similar sub-trajectory retrieval, entitled CISR scheme and superimposed signature-based indexing scheme for similar sub-trajectory retrieval, entitled SISR scheme according to generation method of trajectory signature based on trajectory data of moving object. Our indexing scheme can improve retrieval performance by reducing a large number of disk access on data file because it first scans all signatures and does filtering before accessing the data file. In addition, we can encourage retrieval efficiency by appling k-warping algorithm to measure the similarity between query trajectory and data trajectory. Final]y, we evaluate the performance on sequential scan method(SeqScan), CISR scheme, and SISR scheme in terms of data insertion time, retrieval time, and storage overhead. We show from our experimental results that both CISR scheme and SISR scheme are better than sequential scan in terms of retrieval performance and SISR scheme is especially superior to the CISR scheme.

Representing and retrieving the Structured Information of XML Documents (XML 문서에 포함된 구조 정보의 표현과 검색)

  • Jo, Yun-Gi;Jo, Jeong-Gil;Lee, Byeong-Ryeol;Gu, Yeon-Seol
    • The KIPS Transactions:PartD
    • /
    • v.8D no.4
    • /
    • pp.361-366
    • /
    • 2001
  • As growing the number of Webs, the total amount of accessible information has been greater than ever. To storage and retrieve the vast information on the Webs effectively, many researchers have been made utilizing XML (extensible Markup Language). In this paper, we propose an effective method of representation and retrieval mechanism for the structured retrieval of the XML documents : (1) the fixed sized LETID (Leveled Element Type ID) that contains the information of elements such as parent node, sibling nodes, and identical sibling nodes, and the hierachical information of current node, and (2) content index, structure index, attribute index model, and the information retrieval algorithm for the structured information retrieval. With our methods, we can effectively represent the structured information of XML documents, and can directly access the specific elements by simple operations to process various queries.

  • PDF

A Study on the Retention Efficiency of Library Materials of Automated Storage and Retrieval System (자동서고의 자료 수장 효율성에 관한 연구)

  • Kim, Young-Seok
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.49 no.3
    • /
    • pp.437-456
    • /
    • 2015
  • This study aims to compare the retention efficiency of library materials of Automated Storage and Retrieval System(AS/RS) revealed by the literature review to other types of book stacks. Domestic and foreign academic libraries adopted AS/RS as part of a plan to improve the retention efficiency. Literature review reveals that in general, AS/RS is better in the retention efficiency in compare to other types of book stacks. This study compared the amount of library materials stored by the AS/RS of S University Library to the general book stack and the fixed-type and the mobile rack-type of virtual book stacks. The study reveals that AS/RS is up to 6 times more efficient than general book stacks, up to 3.61 times more efficient than fixed-type book stacks and up to 1.45 times more efficient than mobile rack-type book stacks. However, the study claims that the retention efficiency of library materials of AS/RS is lower than what is known.