DOI QR코드

DOI QR Code

Infant cry recognition using a deep transfer learning method

딥 트랜스퍼 러닝 기반의 아기 울음소리 식별

  • Bo, Zhao (Dept. of Computer Information Science, Korea University) ;
  • Lee, Jonguk (Dept. of Computer Convergence Software, Korea University) ;
  • Atif, Othmane (Dept. of Computer Information Science, Korea University) ;
  • Park, Daihee (Dept. of Computer Convergence Software, Korea University) ;
  • Chung, Yongwha (Dept. of Computer Convergence Software, Korea University)
  • 박철 (고려대학교 컴퓨터정보학과) ;
  • 이종욱 (고려대학교 컴퓨터융합소프트웨어학과) ;
  • 오스만 (고려대학교 컴퓨터정보학과) ;
  • 박대희 (고려대학교 컴퓨터융합소프트웨어학과) ;
  • 정용화 (고려대학교 컴퓨터융합소프트웨어학과)
  • Published : 2020.11.05

Abstract

Infants express their physical and emotional needs to the outside world mainly through crying. However, most of parents find it challenging to understand the reason behind their babies' cries. Failure to correctly understand the cause of a baby' cry and take appropriate actions can affect the cognitive and motor development of newborns undergoing rapid brain development. In this paper, we propose an infant cry recognition system based on deep transfer learning to help parents identify crying babies' needs the same way a specialist would. The proposed system works by transforming the waveform of the cry signal into log-mel spectrogram, then uses the VGGish model pre-trained on AudioSet to extract a 128-dimensional feature vector from the spectrogram. Finally, a softmax function is used to classify the extracted feature vector and recognize the corresponding type of cry. The experimental results show that our method achieves a good performance exceeding 0.96 in precision and recall, and f1-score.

Keywords

Acknowledgement

This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF-2018R1D1A3B07044938 and NRF-2020R1I1A3070835).