Browse > Article
http://dx.doi.org/10.5392/JKCA.2020.20.05.001

Pornographic Content Detection Scheme Using Bi-directional Relationships in Audio Signals  

Song, KwangHo (인하대학교 정보통신공학과)
Kim, Yoo-Sung (인하대학교 정보통신공학과)
Publication Information
Abstract
In this paper, we propose a new pornographic content detection scheme using bi-directional relationships between neighboring auditory signals in order to accurately detect sound-centered obscene contents that are rapidly spreading via the Internet. To capture the bi-directional relationships between neighboring signals, we design a multilayered bi-directional dilated-causal convolution network by stacking several dilated-causal convolution blocks each of which performs bi-directional dilated-causal convolution operations. To verify the performance of the proposed scheme, we compare its accuracy to those of the previous two schemes each of which uses simple auditory feature vectors with a support vector machine and uses only the forward relationships in audio signals by a previous stack of dilated-causal convolution layers. As the results, the proposed scheme produces an accuracy of up to 84.38% that is superior performance up to 25.80% than other two comparison schemes.
Keywords
Pornographic Contents Detection; Audio Signal; Bi-directional Selationship; Dilated Convolution;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 Zhiyi Qu, Jing Yu, and Qiang Niu, "Pornographic Audio Detection Using MFCC Feature and Vector Quantization," International Conference on Computational and Information Sciences, pp.924-927, 2010.
2 조동욱, 김지영, "음란 유해사이트 차단을 위한 음향신호 처리 및 분석," 한국콘텐츠학회논문지, Vol.4, No.2, pp.1-6, 2004.
3 김용운, 김봉완, 최대림, 김태권, 고락환, 이용주, "안드로이드 OS기반 음향정보를 이용한 음란동영상 검출 서비스 구현," 한국멀티미디어학회 학술발표논문집, pp.416-419, 2010.
4 Ziqiang Shi, Tieran Zheng, Jiqing Han, and Boyang Gao, "Erotic Audio Recognition Using Heterogeneouos Ensemble Classifiers," International Journal of Computer and Electrical Engineering, Vol.4, No.5, pp.666-669, 2012.
5 KwangHo Song and Yoo-Sung Kim, "Pornographic Video Detection Scheme Using Multimodal Feature," Journal of Engineering and Applied Sciences, Vol.13, No.5, pp.1174-1182, 2018.
6 M. Slaney, A critique of pure audition, Computational Auditory Scene Analysis, 1997.
7 Google Deepmind, "WAVENET: A GENERATIVE MODEL FOR RAW AUDIO," arXiv:1609.03499v2 [cs.SD], 2016.
8 B. Kim, D. Choi, and Y. Lee, "Speech/music discrimination using mel-cepstrum modulation energy," Lecture Notes in Artificial Intelligence, Vol.4629, pp.106-414, 2007.
9 J. Pearl, "Causal inference in statistics: An overview," Statistics surveys, Vol.3, pp.96-146, 2009.   DOI
10 X. Zhang, J. Yao, and Q. He, "Research of STRAIGHT Spectrogram and Difference Subspace Algorithm for Speech Recognition," 2nd International Congress on Image and Signal Processing, pp.1-4, 2009.
11 Daniel Moreira, Sandra Avila, Mauricio Perez, Daniel Moraes, Vanessa Testoni, Eduardo Valle, Siome Goldenstein, and Anderson Rocha, "Pornography Classification: The Hidden Clues in Video Space-Time," Forensic Science International, Vol.268, pp.46-61, 2016.   DOI
12 김현영, 김재웅, "'문화예술 콘텐츠 제작 및 유통에서의 빅데이터 활용 연구," 한국콘텐츠학회논문지, Vol.19, No.7, pp.384-392, 2019.   DOI
13 B. McFee, C. Raffel, D. Liang, D. P. W. Ellis, M. McVicar, E. Battenberg, and O. Nieto, "librosa: Audio and music signal analysis in python," in Proceedings of The 14th Python in Science Conference, pp.18-25, 2015.
14 I. Mierswa and K. Moric, "Automatic feature extraction for classifying audio data," Machine Learning Journal, Vol.58, pp.127-149, 2005.   DOI