Browse > Article

Design of an Efficient Parallel High-Dimensional Index Structure  

Park, Chun-Seo (Electronics and Telecommunications Research Institute)
Song, Seok-Il (Dept. of Information Communication Engineering, Chungbuk National University)
Sin, Jae-Ryong (Dept. of Information Communication Engineering, Chungbuk National University)
Yu, Jae-Su (Dept. of Information Communication Engineering, Chungbuk National University)
Abstract
Generally, multi-dimensional data such as image and spatial data require large amount of storage space. There is a limit to store and manage those large amount of data in single workstation. If we manage the data on parallel computing environment which is being actively researched these days, we can get highly improved performance. In this paper, we propose a parallel high-dimensional index structure that exploits the parallelism of the parallel computing environment. The proposed index structure is nP(processor)-n$\times$mD(disk) architecture which is the hybrid type of nP-nD and lP-nD. Its node structure increases fan-out and reduces the height of a index tree. Also, A range search algorithm that maximizes I/O parallelism is devised, and it is applied to K-nearest neighbor queries. Through various experiments, it is shown that the proposed method outperforms other parallel index structures.
Keywords
parallel environment; high-dimensional index structure; range quer; K-nearest neighbor query;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 K. Lin, H. V. Jagadish, and C. Faloutsos, 'The TV-Tree an index structure for high dimensional data,' VLDB Journal, pp. 517-542, 1994   DOI
2 A. Henrich, 'Improving the performance of multidimensional access structures based on k-d-trees,' In Proc. Information and Knowledge Management Conf., pp. 68-75, 1996   DOI
3 Stefan Berchtold, 'Improving the Query performance of High-Dimensional Index Structure by Bulk Load Operation,' In Proc. EDBT Conf., pp. 216-230, 1998
4 A. Guttman, 'R-Trees: A dynamic index structure for spatial searching,' In Proc. ACM SIGMOD Conf., pp. 47-57, 1984   DOI
5 J.T. Robinson. 'The K-D-B-tree: A search structure for large multidimensional dynamic indexed.' In Proc. ACM SIGMOD Conf., pp. 10-18, 1981   DOI
6 T.Sellis, N. Roussopoulos and C. Faloutsos, 'The R--Tree: a dynamic index for multi-dimensional objects,' In Proc. VLDB Conf., pp. 507-518, 1987
7 Ibrahim Kamel and Christos Faloutsos, 'Parallel R-trees,' In Proc. ACM SIGMOD Conf., pp. 195-204, 1992   DOI
8 K.I. Lin, H. Jagadish, and C, Faloutsos, 'The TV-tree : An Index Structure for High Dimensional Data,' VLDB Journal, Vol 3, pp. 517-542, 1994   DOI
9 S. Berchtold, D. A. Keim and H-P. Kriegel, 'The X-tree : An Index Structure for High-Dimensional Data,' In Proc. 22nd VLDB Coni pp. 28-39, 1996
10 D. A. White and R. Jain, 'Similarity Indexing with the SS-tree,' In Proc. ICDE, New Orleans, pp. 516-523, 1996
11 Lomet D. and Salzberg B, 'The hB-Tree: A Robust Multiattribute Search Structure,' In Proc. ICDE Conf, pp. 296-304, 1989   DOI
12 N. Katayama and S. Satoh, 'The SR-Tree : An index structure for high dimensional nearest neighbor queries,' In Proc. SIGMOD conf., pp. 369-380 1997   DOI   ScienceOn
13 K. Chakrabarti and S. Mehrotra. 'The Hybrid Tree : An Index Structure for High-Dimensional Feature Spaces,' In Proc. ICDE conf., pp. 440-447, 1999   DOI
14 이석회, 유재수, 조기형, 허대영, 'CIR-Tree : 효율적인 고차원 색인기법', 한국정보과학회 논문지(B), 한국정보과학회 제26권 제6호, pp. 724-734, Jun 1999
15 A. Henrich, H.-W. Six and P. Widmayer, 'The LSD-tree: spatial access to multidimensional point and non-point objects,' In Proc. VLDB Conf., pp. 45-53, 1989
16 M. Freeston, 'The BANG file: a new kind of grid file,' In Proc. VLDB conf., pp. 260-269, 1987
17 조성훈, 김성주, 이준호, 이주영, 박석천, 'SANS의 구조와 기술 요소', 정보처리학회지 제8권 제4호, pp.19-28, 2001   과학기술학회마을
18 J. Nievergelt, H. Hinterberger, and K. Sevcik, 'The grid file: An adaptable, symmetric multikey file structure,' ACM Transactions on Database Systems(TODS). 1984   DOI   ScienceOn
19 Weber R., Scheck H.-J and Blott S., 'A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces,' In Proc. VLDB Conf., pp. 194-205, 1998
20 Aristides Gionis, Piotr Indyk and Rajeev Motwani, 'Similarity Search in High Dimensions via Hashing,' In Proc. VLDB Conf., pp. 518-529, 1999
21 Berchtold S., 'Independent Quantization: An Index Compression Technique for High-Dimensional Data Spaces,' In Proc. ICDE Conf., pp. 577-588, 2000   DOI
22 Ning An, Liujian Quian, Anand Sivasubramaniam and Tom Kecfe, 'Evaluating Parallel R-Tree Implementations on a Network of Workstations,' In Proc. ACM GIS Conf., pp. 159-160, 1998   DOI
23 I. Kamel and C. Faloutsos, 'Parallel R-trees,' CS-TR-2820, UMIACS-TR-92-1, Computer Science Technical Report Series, University of Maryland, Collage Park, MD, 1992
24 Roger Weber, 'Parallel VA-File,' VLDB, 1999
25 Stefan Berchtold, Christian Bohm, Bemhard Braunmuller, Daniel A.Keim and Hans-Peter Kriegel, 'Fast Parallel Similarity Search in Multimedia Databases,' In Proc. SIGMOD Conf., pp. 1-12, 1997   DOI   ScienceOn
26 Kap S. Bang and Huizu Lu, 'The PML-tree: An Efficient Parallel Spatial Index Structure for Spatial Databases,' In Proc. ACM Conf., pp. 79-88. 1996   DOI
27 N. Koudas, C. Faloutsos and I. Kamel, 'Declus-tering R-trees on Multi-Computer Architectures,' Technical Research Report ISR 1994
28 Botao Wang, Hiroyuki Horinokuchi, Kunihiko Kaneko and Akifumi Makinouchi, 'Parallel R-tree Search Algorithm on DSVM,' In Proc. DASFAA Conf., pp. 237-245, 1999   DOI
29 Xiaodong Fu, Dingxing Wang, Weimin Zheng and Mciming Sheng, 'GPR-Tree: A Global Parallel Index Structure for Multiattribute Declustering on Cluster of Workstations,' IEEE, 1997   DOI
30 Bemad Scnnitzer and Scott T.Leutenegger, 'Master-Client R-trees: A New Parallel R-trec Architecture,' In Proc. SSDBM Conf. pp. 68-77, 1999   DOI
31 http://www.metastor.com/products/sans/E4400_ datasheet.pdf
32 N. Beckmann, H.P. Komacker, R. Schneider and B. Seeger, 'The R*-Tree: An Efficient and Robust Access Method for Points and Rectangles,' In Proc. ACM SIGMOD Conf., pp. 322-331, 1990