Bandwidth Efficient Summed Area Table Generation for CUDA |
Ha, Sang-Won
(Dept. of Computer Science, Yonsei Univ.)
Choi, Moon-Hee (Samsung Electronics Corp.) Jun, Tae-Joon (Dept. of Computer Science, Yonsei Univ.) Kim, Jin-Woo (Dept. of Computer Science, Yonsei Univ.) Byun, Hye-Ran (Dept. of Computer Science, Yonsei Univ.) Han, Tack-Don (Dept. of Computer Science, Yonsei Univ.) |
1 | Hensley, J., Scheuermann, T., Coombe, G., Singh, M., and Lastra, A. "Fast summed-area table generation and its applications," Computer Graphics Forum, Vol. 24, No. 3, pp 547-555, Sept. 2005. DOI |
2 | Demers, J., "Depth of Field: A Survey of Techniques," GPU Gems, Addison Wesley, pp 375-390, 2004. |
3 | Grabner, M., Grabner, H., and Bischof, H., "Fast approximated SIFT," ACCV 2006, LNCS, Vol. 3851, pp 918-927, 2006. |
4 | Bay, H., Tuytelaars, T., and Gool, L. V., "SURF: Speeded Up Robust Features," ECCV 2006, LNCS, Vol. 3951, pp 404-417, 2006. |
5 | Harris, M., Sengupta, S., and Owens, J. D. "Parallel prefix sum (scan) with CUDA," In Nguyen, H., ed., GPU Gems 3. Addison Wesley, 2007. |
6 | NVIDIA CUDA C Programming Guide, Ver. 4.0, 2011. |
7 | Harris, M., Sengupta, S., and Owens, J.D., "Parallel Prefix Sum (Scan) with CUDA," GPU Gems 3, H. Nguyen, Addison-Wesley, Ch. 31, Aug. 2007. |
8 | Kogge, P. M. and Stone, S. S., "A Parallel Algorithm for the Efficient Solution of a General Class of Recurrence Equations," IEEE Trans. on Computers, Vol. C-22, No. 8, pp 786-793, 1973. DOI |
9 | CUDA Data Parallel Primitives Library, http://code.google.com/p/cudpp |
10 | Crow, F. C. "Summed-area tables for texture mapping," In SIGGRAPH '84: Proceedings of the 11th annual conference on Computer graphics and interactive techniques, NY, NY, USA, pp 207-212, 1984. |
11 | Heckbert, P. S., "Filtering by Repeated Integration," ACM SIGGRAPH Computer Graphics, Vol. 20, No. 4, pp 315-321, 1986. DOI |