Fig. 1. Overall Structure of the Proposed Method
Fig. 2. Sample Waveform and Spectrogram of Railway Point Machine Sound Data
Fig. 3. F1 Score of the Proposed Method on Railway Sound Data Under Various Noise Conditions
Fig. 4. Sample Waveform and Spectrogram of Porcine Sound Data
Fig. 5. F1 Score of the Proposed Method on Porcine Sound Data Under Various Noise Conditions
Table 1. Basic Statistics of Environmental Noise on Railway Point Machine Sound Data
Table 2. Results of Similarity Measurement Between Noisy Signal and Enhanced Signal on Railway Sound Data
Table 3. Basic Statistics of Environmental Noise on Porcine Sound Data
Table 4. Results of Similarity Measurement Between Noisy Signal and Enhanced Signal on Porcine Sound Data
Table 5. Quantitative and Qualitative Comparison Analysis Between the Proposed Method and Other Methods
References
- Y. Choi, J. Lee, D. Park, and Y. Chung, “Noise-Robust Porcine Respiratory Diseases Classification Using Texture Analysis and CNN,” KIPS Transactions on Software and Data Engineering, Vol. 7, No. 3, pp. 91-98, 2018. https://doi.org/10.3745/KTSDE.2018.7.3.91
- Y. Kim, J. Sa, Y. Chung, D. Park, and S. Lee, “Resource-Efficient Pet Dog Sound Events Classification Using LSTMFCN Based on Time-Series Data,” Sensors, Vol. 8, No. 18, pp. 4019, 2018.
- J. Sa, Y. Choi, Y. Chung, H. Kim, D. Park, and S. Yoon, "Replacement Condition Detection of Railway Point Machines Using an Electric Current Sensor," Sensors, Vol. 17, pp. 263, 2017. https://doi.org/10.3390/s17020263
- J. Salamon and J.P. Bello, “Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification,” IEEE Signal Processing Letters, Vol. 24, No. 3, pp. 279-283, 2017. https://doi.org/10.1109/LSP.2017.2657381
- J. Lee, H. Choi, D. Park, Y. Chung, H.Y. Kim, and S. Yoon, “Fault Detection and Diagnosis of Railway Point Machines by Sound Analysis,” Sensors, Vol. 16, No. 4, pp. 549, 2016. https://doi.org/10.3390/s16040549
- Y. Choi, J. Lee, D. Park, J. Lee, Y. Chung, H.Y. Kim, and S. Yoon, “Stress Detection of Railway Point Machine Using Sound Analysis,” KIPS Transactions on Software and Data Engineering, Vol. 5, No. 9, pp. 433-440, 2016. https://doi.org/10.3745/KTSDE.2016.5.9.433
- M. Guarino, P. Jans, A. Costa, J.M. Aerts, and D. Berckmans, “Field Test of Algorithm for Automatic Cough Detection in Pig Houses,” Computers and Electronics in Agriculture, Vol. 62, No. 1, pp. 22-28, 2008. https://doi.org/10.1016/j.compag.2007.08.016
- Y. Chung, S. Oh, J. Lee, D. Park, H. Chang, and S. Kim, “Automatic Detection and Recognition of Pig Wasting Diseases Using Sound Data in Audio Surveillance,” Sensors, Vol. 13, No. 10, pp. 12929-12942, 2013. https://doi.org/10.3390/s131012929
- J. Lee, L. Jin, D. Park, Y. Chung, and H. Chang, “Acoustic Features for Pig Wasting Disease Detection,” International Journal of Information Processing and Management, Vol. 6, No. 1, pp. 37-46, 2015.
- R. Zazo, T.N. Sainath, G. Simko, and C. Parada, "Feature Learning with Raw-Waveform CLDNNs for Voice Activity Detection," In Proceeding of Interspeech, pp. 3668-3672, 2016.
- H. Zhang, I. McLoughlin, Y. Song, "Robust Sound Event Recognition Using Convolutional Neural Networks," IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 559-563, 2015.
- Y. Choi, O. Atif, J. Lee, D. Park, and Y. Chung, “Noise-Robust Sound-Event Classification System with Texture Analysis,” Symmetry, Vol. 10, No. 9, pp. 402, 2018. https://doi.org/10.3390/sym10090402
- Z. Zhang, J. Geiger, J. Pohjalainen, A.E.D. Mousa, W. Jin, and B. Schuller, “Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments,” ACM Transactions on Intelligent Systems and Technology, Vol. 9, No. 5, pp. 49, 2018.
- Y. Choi, Y. Jung, Y. Kim, Y. Suh, and H. Kim, “An Endto-End Method for Korean Text-to-Speech Systems,” Phonetics and Speech Sciences, Vol. 10, No. 1, pp. 39-48, 2018.
- S. Pascual, A. Bonafonte, and J. Serra, "SEGAN: Speech Enhancement Generative Adversarial Network," In Proceedings of Interspeech, pp. 3642-3646, 2017.
- S. Dieleman and B. Schrauwen, "End-to-End Learning for Music Audio," IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 6964-6968, 2014.
- R. Collobert, C. Puhrsch, and G. Synnaeve, "Wav2Letter: An End-to-End ConvNet-Based Speech Recognition System," arXiv preprint arXiv:1609.03193, 2016.
- Y. Zhang, W. Chan, and N. Jaitly, "Very Deep Convolutional Networks for End-to-End Speech Recognition," IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 4845-4849, 2017.
- S. Kim, T. Hori, and S. Watanabe, "Joint CTC-Attention Based End-to-End Speech Recognition Using Multi-Task Learning," IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 4835-4839, 2017.
- R.V. Sharan and T.J. Moir, "Noise Robust Audio Surveillance Using Reduced Spectrogram Image Feature and Oneagainst-all SVM," Neurocomputing, Vol. 158, pp. 90-99, 2015. https://doi.org/10.1016/j.neucom.2015.02.001
- J. Lee, Y. Choi, D. Park, and Y. Chung, “Sound Noise-Robust Porcine Wasting Diseases Detection and Classification System Using Convolutional Neural Network,” Journal of Korean Institute of Information Technology, Vol. 16, No. 5, pp. 1-13, 2018. https://doi.org/10.14801/jkiit.2018.16.5.1
- I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, and Y. Bengio, "Generative Adversarial Nets," In Advances in Neural Information Processing Systems, pp. 2672-2680, 2014.
- C. Zhang and Y. Peng, "Stacking VAE and GAN for Context-aware Text-to-Image Generation," IEEE Fourth International Conference on Multimedia Big Data, pp. 1-5, 2018.
- T. Asada, C. Roberts, and T. Koseki, "An Algorithm for Improved Performance of Railway Condition Monitoring Equipment: Alternating-Current Point Machine Case Study," Transportation Research Part C: Emerging Technologies, Vol. 30, pp. 81-92, 2013. https://doi.org/10.1016/j.trc.2013.01.008
- A.W. Rix, J.G. Beerends, M.P. Hollier, and A.P. Hekstra, "Perceptual Evaluation of Speech Quality (PESQ)-A New Method for Speech Quality Assessment of Telephone Networks and Codecs," IEEE International Conference on Acoustics, Speech and Signal Processing, Vol. 2, pp. 749-752, 2001.
- J. Hansen and B. Pellom, "An Effective Quality Evaluation Protocol for Speech Enhancement Algorithms," International Conference on Spoken Language Processing, Vol. 7, pp. 2819-2822, 1998.
- B. Shao, D. Wang, T. Li, and M. Ogihara, “Music Recommendation Based on Acoustic Features and User Access Patterns,” IEEE Transactions on Audio, Speech, and Language Processing, Vol. 17, No. 8, pp. 1602-1611, 2009. https://doi.org/10.1109/TASL.2009.2020893
- J. Han, M. Kamber, and J. Pei, "Data Mining: Concepts and Techniques," 3rd ed., Morgan Kaufman, San Francisco, CA, USA, 2012.
- S. Theodoridis and K. Koutroumbas, "Pattern Recognition," 4th ed., Academic Press: Kidlington, Oxford, UK, 2009.
- D.M. Powers, “Evaluation: From Precision, Recall and FFactor to ROC, Informedness Markedness and Correlation,” Journal of Machine Learning Technologies, Vol. 2, No. 1, pp. 37-63, 2011.