Browse > Article
http://dx.doi.org/10.3837/tiis.2015.07.015

Fast Search with Data-Oriented Multi-Index Hashing for Multimedia Data  

Ma, Yanping (Scholol of Information and Electrical Engineering, Ludong University)
Zou, Hailin (Scholol of Information and Electrical Engineering, Ludong University)
Xie, Hongtao (Institute of Information Engineering, CAS, National Engineering Laboratory for Information Security Technologies)
Su, Qingtang (Scholol of Information and Electrical Engineering, Ludong University)
Publication Information
KSII Transactions on Internet and Information Systems (TIIS) / v.9, no.7, 2015 , pp. 2599-2613 More about this Journal
Abstract
Multi-index hashing (MIH) is the state-of-the-art method for indexing binary codes, as it di-vides long codes into substrings and builds multiple hash tables. However, MIH is based on the dataset codes uniform distribution assumption, and will lose efficiency in dealing with non-uniformly distributed codes. Besides, there are lots of results sharing the same Hamming distance to a query, which makes the distance measure ambiguous. In this paper, we propose a data-oriented multi-index hashing method (DOMIH). We first compute the covariance ma-trix of bits and learn adaptive projection vector for each binary substring. Instead of using substrings as direct indices into hash tables, we project them with corresponding projection vectors to generate new indices. With adaptive projection, the indices in each hash table are near uniformly distributed. Then with covariance matrix, we propose a ranking method for the binary codes. By assigning different bit-level weights to different bits, the returned bina-ry codes are ranked at a finer-grained binary code level. Experiments conducted on reference large scale datasets show that compared to MIH the time performance of DOMIH can be improved by 36.9%-87.4%, and the search accuracy can be improved by 22.2%. To pinpoint the potential of DOMIH, we further use near-duplicate image retrieval as examples to show the applications and the good performance of our method.
Keywords
Nearest Neighbor Search; Binary Codes; Indexing;
Citations & Related Records
연도 인용수 순위
  • Reference