[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.5909/JBE.2018.23.1.86

Coding History Detection of Speech Signal using Deep Neural Network

Cho, Hyo-Jin (Dept. of Electronics Engineering, Kwangwoon University)
Jang, Won (Dept. of Electronics Engineering, Kwangwoon University)
Shin, Seong-Hyeon (Dept. of Electronics Engineering, Kwangwoon University)
Park, Hochong (Dept. of Electronics Engineering, Kwangwoon University)

Publication Information

Journal of Broadcast Engineering / v.23, no.1, 2018 , pp. 86-92 More about this Journal

Abstract

In this paper, we propose a method for coding history detection of digital speech signal. In digital speech communication and storage, the signal is encoded to reduce the number of bits. Therefore, when a speech signal waveform is given, we need to detect its coding history so that we can determine whether the signal is an original or an coded one, and if coded, determine the number of times of coding. In this paper, we propose a coding history detection method for 12.2kbps AMR codec in terms of original, single coding, and double coding. The proposed method extracts a speech-specific feature vector from the given speech, and models the feature vector using a deep neural network. We confirm that the proposed feature vector provides better performance in coding history detection than the feature vector computed from the general spectrogram.

Keywords

coding history; feature vector; speech parameter; DNN;

Citations & Related Records

Times Cited By KSCI : 1 (Citation Analysis)

Reference
Cited By KSCI

1	B. D'Alessandro and Y. Q. Shi, "MP3 bit rate quality detection through frequency spectrum analysis," Proc. 11th ACM Workshop on Multimedia and Security, pp. 57-61, 2009.
2	T. Bianchi, A. De Rosa, M. Fontani, G. Rocciolo and A. Piva, "Detection and classification of double compressed MP3 audio tracks," Proc. 1st ACM Workshop on Information Hiding and Multimedia Security, pp. 159-164, 2013.
3	D. Luo, W. Luo, R. Yang and J. Huang, "Identifying compression history of wave audio and its applications," ACM Trans. on Multimedia Computing, Communications, and Applications, vol. 10, no. 3, pp. 30:1-30:19, 2014.
4	D. Seichter, L. Cuccovillo and P. Aichroth, "AAC encoding detection and bitrate estimation using a convolutional neural network," Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, pp. 2069-2073, 2016.
5	D. Luo, R. Yang, B. Li and J. Huang, "Detection of Double Compressed AMR Audio Using Stacked Autoencoder," IEEE Trans. on Information Forensics and Security, vol. 12, no. 2, pp. 432-444, 2017. DOI
6	Y. LeCun, Y. Bengio and G. Hinton, "Deep learning," Nature, 521.7553: 436-444, 2015. DOI
7	K. L. Priddy and P. E. Keller, Artificial neural networks: an introduction, SPIE Press, 2005.
8	S. Ioffe and C. Szegedy, "Batch normalization: accelerating deep network training by reducing internal covariate shift," Int. Conf. on Machine Learning(ICML), pp. 448-456, 2015.
9	H.-W. Yun, S.-H. Shin, W.-J. Jang and H. Park, "On-line audio genre classification using spectrogram and deep neural network," J. of Broadcast Engineering, vol. 21, no. 6, pp. 977-985, Nov. 2016. DOI

KSCI

Coding History Detection of Speech Signal using Deep Neural Network 심층 신경망을 이용한 음성 신호의 부호화 이력 검출

Coding History Detection of Speech Signal using Deep Neural Network