DOI QR코드

DOI QR Code

Quantization Parameter Determination Method for Face Depth Image Encoding

깊이 얼굴 영상 부호화에서의 양자화 인자 결정 방법

  • 박동진 (동의대학교 컴퓨터소프트웨어공학과) ;
  • 권순각 (동의대학교 컴퓨터소프트웨어공학과)
  • Received : 2020.01.10
  • Accepted : 2020.02.02
  • Published : 2020.02.29

Abstract

In this paper, we propose a quantization parameter determination method for face depth image encoding in order to minimize an impact on a face recognition accuracy. When a face depth image is compressed through quantization in H.264/AVC, differential quantization parameters are assigned according to an accuracy of ellipsoid modeling prediction and an importance degree of a unit block in extracting facial features. The simulation results show that the face recognition success rates are improved by up to 6% at the same compression rates through the proposed compression rate determination method.

본 논문에서는 얼굴 인식 정확도에 미치는 영향을 최소화하면서 효율적으로 깊이 얼굴 영상을 압축하기 위한 양자화 변수 결정 방법을 제안한다. H.264/AVC의 양자화를 적용하여 깊이 얼굴 영상을 압축 할 때 얼굴 특징을 최대한 유지할 수 있도록 타원체 모델링의 예측 정확도와 각각의 양자화 단위 블록의 얼굴 인식에서의 중요도를 이용하여 양자화 인자를 차등적으로 부여한다. 모의실험 결과 제안된 방법을 통해 같은 압축율에서 얼굴 인식 성공률이 최대 6% 개선되었다.

Keywords

References

  1. Kwon, S. K. (2019). Face Recognition Using Depth and Infrared Pictures, Nonlinear Theory and Its Applications, IEICE, 10(1), 2-15. https://doi.org/10.1587/nolta.10.2.
  2. Wu, H., Sun, X., Yang, J., and Wu, F. (2019). 3D Mesh Based Inter-Image Prediction for Image Set Compression, Proceeding of the IEEE International Conference on Multimedia and Expo, July 8-12, Shanghai, China.
  3. Schnabel, R., and Klein, R. (2006). Octree-based Point-cloud Compression. Proceeding of the Eurographics Symposium on Point-Based Graphics, July 29-30, Boston, Massachusetts, USA.
  4. Gumhold, S., Karni, Z., Isenburg, M., and Seidel, H. (2005) Predictive Pointcloud Compression, Proceeding of ACM SIGGRAPH, Jul. 31-Aug. 4, Los Angeles, California, USA.
  5. Morvan, Y., Farin, D., and deWith, P.H.N. (2007). Depth-image Compression Based on An R-D Optimized Quadtree Decomposition for The Transmission of Multiview Images, Proceeding of the IEEE International Conference on Image Processing, Sep. 16-Oct. 19, San Antonio, Texas, USA.
  6. Milani, S., and Calvagno, G. (2010). A Depth Image Coder Based on Progressive Silhouettes, IEEE Signal Process. Letters, 17(8), 711-714. https://doi.org/10.1109/LSP.2010.2051619.
  7. Shen, G., Kim, W., Narang, S., Orterga, A., Lee, J., and Wey, H. (2010). Edge Adaptive Transform for Efficient Depth Map Coding, Proceeding of Picture Coding Symposium, Dec. 8-10, Nagoya, Japan.
  8. Maitre, M., and Do, M. (2010). Depth and Depth-Color Coding Using Shape Adaptive Wavelets, Journal of Visual Communication and Image Representation, 21(5-6), 513-522. https://doi.org/10.1016/j.jvcir.2010.03.005.
  9. Fu, J., Miao, D., Yu, W., Wang, S., Lu, Y., and Li, S. (2013). Kinect-Like Depth Data Compression, IEEE Transactions on Multimedia, 15(6), 1340-1352. https://doi.org/10.1109/TMM.2013.2247584
  10. Wang, X., Sekercioglu, Y. A., Drummond, T., Natalizio, E., Fantoni, I., and Fremont, V. (2016). Fast Depth Video Compression for Mobile RGB-D Sensors, IEEE Transactions on Circuits and Systems for Video Technology, 26(4), 673-686, https://doi.org/10.1109/TCSVT.2015.2416571
  11. Mamou, K., Zaharia, T., and Preteux, F. (2008). FAMC: The MPEG-4 Standard for Animated Mesh Compression, Proceeding of the IEEE International Conference on Image Processing, Oct. 12-15, San Diego, California, USA.
  12. Paysan, P., Knothe, R., Amberg, B., Romdhani, S., and Vetter, T. (2009). A 3D Face Model for Pose and Illumination Invariant Face Recognition, Proceeding of IEEE International Conference on Advanced Video and Signal Based Surveillance, Sep. 2-4, Genova, Italy.
  13. Lee, D. S. and Kwon, S. K. (2018). Intra Prediction of Depth Picture with Plane Modeling, Symmetry, 10, 1-16. https://doi.org/10.3390/sym10120715
  14. Takagi, K., Takishima, Y., and Nakajima, Y. (2003). A Study on Rate Distortion Optimization Scheme for JVT Coder, Proceeding of SPIE, Apr. 21, Orlando, Florida, USA.
  15. Ma, S., Wen, G., and Yan, L. (2005). Rate-Distortion Analysis for H.264/AVC Video Coding and its Application to Rate Control, IEEE Transactions on Circuits and Systems for Video Technology, 15(12), 1533-1544. https://doi.org/10.1109/TCSVT.2005.857300
  16. Hg, R. I., Jasek, P., Rofidal, C., Nasrollahi, K., and Moeslund, T. B. (2012). An RGB-D Database using Microsoft's Kinect for Windows for Face Detection, Proceeding of the IEEE 8th International Conference on Signal Image Technology & Internet Based Systems, Nov. 25-29, Sorrento, Naples, Italy.
  17. Lee, D. S., Han D. H., and Kwon, S. K. (2018). Face Recognition Method by Using Infrared and Depth Images, Journal of the Korea Industrial Information Systems Research, 23(2), 1-9. http://dx.doi.org/10.9723/jksiis.2018.23.2.001.