References
- L. Gerosa, G. Valenzise, M. Tagliasacchi, F. Antonacci, and A. Sarti, "Scream and Gunshot Detection in Noisy Environments," In Proc. the IEEE Conf. on Signal Processing, Poznan, Poland, Sept. 2007.
- J. Park, J. Lim, J. Yang, J. Kyung, and M. Hahn, "False Positive Movie Clip Decision in Black-box Using Car Door-Closing Sound Classification," In Proc. the Institute of Electronics Engineers of Korea, vol. 2014, no. 6, 2014, pp. 761-763.
- W. Huang, T. Chiew, H. Li, T. Kok, and J. Biswas, "Scream detection for home applications," In Proc. the IEEE Conf. on Industrial Electronics and Applications, Taichung, Taiwan, June 2010.
- S. Oh, J. Uee, H. Lee, Y. Chung, and D. Park, "Abnormal Sound Detection and Identification in Surveillance System," J. of Korean Institute of Information Scientists and Engineers, vol. 39, no. 2, 2012, pp. 144-152.
- M. Lim, D. Kim, K. Kim, and J. Kim, "Audio Event Classification Using Deep Neural Networks," J. of the Korean Society of Speech Sciences, vol. 7, no. 4, 2015, pp. 27-33.
- D. Wei, J. Li, P. Pham, S. Das, and Shuhui Qu, Florian Metze, "Sound Event Detection for Real Life Audio DCASE Challenge," In Proc. European Signal Processing Conf. on Detection and Classification of Acoustic Scenes and Events, Budapest, Hungary, Sept. 2016.
- Q. Kong and I. Sobieraj, W. Wang and M. Plumbley, "Deep Neural Network Baseline for DCASE Challenge 2016," In Proc. European Signal Processing Conf. on Detection and Classification of Acoustic Scenes and Events, Budapest, Hungary, Sept. 2016.
- S. Bang, "Implementation of Image based Fire Detection System Using Convolution Neural Network," J. of the Korea Institute of Electronic Communication Sciences, vol. 12, no. 2, 2017, pp. 331-336. https://doi.org/10.13067/JKIECS.2017.12.2.331
- S. Lim and D. Kim, "Semantic Segmentation using Convolutional Neural Network with Conditional Random Field," J. of the Korea Institute of Electronic Communication Sciences, vol. 12, no. 3, 2017, pp. 451-456. https://doi.org/10.13067/JKIECS.2017.12.3.451
- E. Cakir, G. Parascandolo, T. Heittola, H. Huttunen, and T. Virtanen, "Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection," EEE/ACM Trans. Audio, Speech, and Language Processing, vol. 25, no. 6, 2017, pp. 1291-1303. https://doi.org/10.1109/TASLP.2017.2690575
- A. Mesaros, T. Heittola, A. Diment, B. Elizalde, A. Shah, E. Vincent, B. Raj, and T. Virtanen, "DCASE 2017 Challenge setup: Tasks, datasets and baseline system" In Proc. DCASE 2017 - Workshop on Detection and Classification of Acoustic Scenes and Events, Munich, Germany, Nov. 2017.
- Y. Lee and P. Moon, "A Comparison and Analysis of Deep Learning Framework," J. of the Korea Institute of Electronic Communication Sciences, vol. 12, no. 1, 2017, pp. 115-122. https://doi.org/10.13067/JKIECS.2017.12.1.115
- A. Mesaros, T. Heittola, and T. Virtanen, "Metrics for polyphonic sound event detection," Applied Sciences, vol. 6, no. 6, 2016, pp. 321-337 https://doi.org/10.3390/app6110321