Browse > Article
http://dx.doi.org/10.3837/tiis.2018.09.017

RLDB: Robust Local Difference Binary Descriptor with Integrated Learning-based Optimization  

Sun, Huitao (Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology)
Li, Muguo (State Key Laboratory of Coastal and Offshore Engineering, Dalian University of Technology)
Publication Information
KSII Transactions on Internet and Information Systems (TIIS) / v.12, no.9, 2018 , pp. 4429-4447 More about this Journal
Abstract
Local binary descriptors are well-suited for many real-time and/or large-scale computer vision applications, while their low computational complexity is usually accompanied by the limitation of performance. In this paper, we propose a new optimization framework, RLDB (Robust-LDB), to improve a typical region-based binary descriptor LDB (local difference binary) and maintain its computational simplicity. RLDB extends the multi-feature strategy of LDB and applies a more complete region-comparing configuration. A cascade bit selection method is utilized to select the more representative patterns from massive comparison pairs and an online learning strategy further optimizes descriptor for each specific patch separately. They both incorporate LDP (linear discriminant projections) principle to jointly guarantee the robustness and distinctiveness of the features from various scales. Experimental results demonstrate that this integrated learning framework significantly enhances LDB. The improved descriptor achieves a performance comparable to floating-point descriptors on many benchmarks and retains a high computing speed similar to most binary descriptors, which better satisfies the demands of applications.
Keywords
Computer vision; local feature; binary descriptor; linear discriminant projections; image matching;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Y. Gao, W. Huang and Y. Qiao, "Local multi-grouped binary descriptor with ring-based pooling configuration and optimization," IEEE Transactions on Image Processing, vol. 24, no. 12, pp. 4820-4833, December, 2015.   DOI
2 V. Balntas, L. Tang and K. Mikolajczyk, "BOLD-binary online learned descriptor for efficient image matching," in Proc. of IEEE International Conference on Computer Vision and Pattern Recognition, pp. 2367-2375, June 7-12, 2015.
3 T. Trzcinski and V. Lepetit, "Efficient discriminative projections for compact binary descriptors," in Proc. of European Conference on Computer Vision, pp. 228-242, October 7-13, 2012.
4 T. Trzcinski, M. Christoudias and V. Lepetit, "Learning image descriptors with boosting," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 37, no. 3, pp. 597-610, March, 2015.   DOI
5 S. Winder, G. Hua and M. Brown, "Picking the best daisy," in Proc. of IEEE International Conference on Computer Vision and Pattern Recognition, pp. 178-185, June 20-26, 2009.
6 V. Balntas, E. Johns, L. Tang and K. Mikolajczyk, "PN-Net: conjoined triple deep network for learning local image descriptors," arXiv preprint, January, 2016.
7 D. G. Lowe, "Distinctive image features from scale-invariant keypoints," International Journal of Computer Vision, vol. 60, no. 2, pp. 91-110, November, 2004.   DOI
8 H. Bay, A. Ess, T. Tuytelaars and L. V. Gool, "Speeded-Up Robust Features (SURF)," Computer Vision & Image Understanding, vol. 110, no. 3, pp. 346-359, June, 2008.   DOI
9 E. Tola, V. Lepetit and P. Fua, "DAISY: an efficient dense descriptor applied to wide-baseline stereo," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 32, no. 5, pp. 815-830, May, 2010.   DOI
10 X. Han, T. Leung, Y. Jia, R. Sukthankar and A. C. Berg, "MatchNet: unifying feature and metric learning for patch-based matching," in Proc. of IEEE International Conference on Computer Vision and Pattern Recognition, pp. 3279-3286, June 7-12, 2015.
11 K. Simonyan, A. Vedaldi and A. Zisserman, "Learning local feature descriptors using convex optimisation," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 36, no. 8, pp. 1573-1585, August, 2014.   DOI
12 C. Strecha, A. Bronstein, M. Bronstein and P. Fua, "LDAHash: improved matching with smaller descriptors," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 34, no. 1, pp. 66-78, January 2012.   DOI
13 M. Calonder, V. Lepetit, C. Strecha and P. Fua, "BRIEF: Binary Robust Independent Elementary Features," in Proc. of European Conference on Computer Vision, pp. 778-792, September 5-11, 2010.
14 E. Rublee, V. Rabaud, K. Konolige and G. Bradski, "ORB: an efficient alternative to SIFT or SURF," in Proc. of IEEE International Conference on Computer Vision, pp. 2564-2571, November 6-13, 2011.
15 Y. Ke and R. Sukthankar, "PCA-SIFT: a more distinctive representation for local image descriptors," in Proc. of IEEE International Conference on Computer Vision and Pattern Recognition, pp. 506-513, June 27-July 2, 2004.
16 H. Cai, K. Mikolajczyk and J. Matas, "Learning linear discriminant projections for dimensionality reduction of image descriptors," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 33, no. 2, pp. 338-352, February, 2011.   DOI
17 R. Ortiz, "FREAK: fast retina keypoint," in Proc. of IEEE International Conference on Computer Vision and Pattern Recognition, pp. 510-517, June 16-21, 2012.
18 G. Hua, M. Brown and S. Winder, "Discriminant embedding for local image descriptors," in Proc. of IEEE International Conference on Computer Vision, pp. 1-8, October 14-20, 2007.
19 M. Brown, G. Hua and S. Winder, "Discriminative learning of local image descriptors," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 33, no. 1, pp. 43-57, January, 2011.   DOI
20 A. Richardson and E. Olson, "TailoredBRIEF: online per-feature descriptor customization," in Proc. of IEEE/RSJ International Conference on Intelligent Robots and Systems, September 28-October 2, 2015.
21 P. Viola and M. J. Jones, "Robust real-time face detection," International Journal of Computer Vision, vol. 57, no. 2, pp. 137-154, May, 2004.   DOI
22 B. Guo and J. Liu, "Real-time keypoint-based object tracking via online learning," in Proc. of IEEE International Conference on Information Science and Technology, pp. 907-911, March 23-25, 2013.
23 J. M. Morel and G. Yu, "ASIFT: a new framework for fully affine invariant image comparison," Siam Journal on Imaging Sciences, vol. 2, no. 2, pp. 438-469, April, 2009.   DOI
24 M. Ozuysal, M. Calonder, V. Lepetit and P. Fua, "Fast keypoint recognition using random ferns," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 32, no. 3, pp. 448-61, March, 2010.   DOI
25 D. Nister and H. Stewenius, "Scalable recognition with a vocabulary tree," in Proc. of IEEE International Conference on Computer Vision and Pattern Recognition, pp. 2161-2168, June 17-22, 2006.
26 S. A. J. Winder and M. Brown, "Learning local image descriptors," in Proc. of IEEE International Conference on Computer Vision and Pattern Recognition, pp. 1-8, June 17-22, 2007.
27 M. Brown, "Multi-view stereo correspondence dataset,"
28 K. Mikolajczyk and C. Schmid, "A performance evaluation of local descriptors," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 27, no. 10, pp. 1615-1630, October 2005.   DOI
29 Visual Geometry Group, Department of Engineering Science, University of Oxford, "Affine covariant regions datasets,"
30 H. Shao, T. Svoboda and L. V. Gool, "Zubud-zurich buildings database for image based recognition, "
31 D. Nister and H. Stewenius, "Recognition benchmark images,"
32 M. Oszust, "An optimisation approach to the design of a fast, compact and distinctive binary descriptor," Signal Image & Video Processing, vol. 10, no. 8, pp. 1-8, November 2016.   DOI
33 S. Liao, X. Zhu, Z. Lei, L. Zhang and S. Z. Li, "Learning multi-scale block local binary patterns for face recognition," in Proc. of International Conference on Biometrics, pp. 828-837, August 27-29, 2007.
34 H. Li, Y. Guan, L. Liu, F. Wang and L. Wang, "Re-ranking for microblog retrieval via multiple graph model," Multimedia Tools & Applications, vol. 75, no. 15, pp. 8939-8954, August, 2016.   DOI
35 M. U. Kim and K. Yoon, "Performance evaluation of large-scale object recognition system using bag-of-visual words model," Multimedia Tools & Applications, vol. 74, no. 7, pp. 2499-2517, April, 2015.   DOI
36 L. Mansourian, M. T. Abdullah, L. N. Abdullah, A. Azman and M. R. Mustaffa, "A Salient Based Bag of Visual Word model (SBBoVW): improvements toward difficult object recognition and object location in image retrieval," KSII Transactions on Internet and Information Systems, vol. 10, no. 2, pp. 769-786, February, 2016.   DOI
37 S. Leutenegger, M. Chli and R. Y. Siegwart, "BRISK: binary robust invariant scalable keypoints," in Proc. of IEEE International Conference on Computer Vision, pp. 2548-2555, November 6-13, 2011.
38 X. Yang and K. T. T. Cheng, "Local difference binary for ultrafast and distinctive feature description," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 36, no. 1, pp. 188-194, January, 2014.   DOI
39 B. Fan, Q. Kong, T. Trzcinski, Z. Wang and C. Pan, "Receptive fields selection for binary feature description," IEEE Transactions on Image Processing, vol. 23, no. 6, pp. 2583-2595, June, 2014.   DOI
40 Y. Furukawa and J. Ponce, "Accurate, dense, and robust multiview stereopsis," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 32, no. 8, pp. 1362-1376, August, 2010.   DOI
41 X. Zhang, B. Guo and Y. Yan, "Image retrieval method based on IPDSH and SRIP," KSII Transactions on Internet and Information Systems, vol. 8, no. 5, pp. 1676-1689, May, 2014.   DOI
42 Y. Guo, G. Zhao, Z. Zhou and M. Pietikainen, "Video texture synthesis with multi-frame LBP-TOP and diffeomorphic growth model," IEEE Transactions on Image Processing, vol. 22, no. 10, pp. 3879-3891, October, 2013.   DOI