과제정보
이 성과는 2020년도 정부(과학기술정보통신부)의 재원으로 한국연구재단의 지원을 받아 수행된 연구임(No. 2020R1C1C1A01013020).
참고문헌
- Alfassy A, Karlinsky L, Aides A, Shtok J, Harary S, Feris, R, Giryes R, and Bronstein AM (2019). Laso: label-set operations networks for multi-label few-shot learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 6548-6557.
- Brown JC (1991). Calculation of a constant Q spectral transform, The Journal of the Acoustical Society of America, 89, 425-434. https://doi.org/10.1121/1.400476
- Chen Z, Fu Y, Wang YX, Ma L, Liu W, and Hebert M (2019). Image deformation meta-networks for one-shot learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 8680-8689.
- Delgado H, Todisco M, Sahidulla Md, Evans N, Kinnunen T, Lee K, and Yamagishi J (2018). Asvspoof 2017 Version 2.0: Meta-Data Analysis and Baseline En-hancements.
- Goodfellow I, Warde-Farley D, Mirza M, Courville A, and Bengio Y (2013). Maxout networks. In International Conference on Machine Learning, PMLR, 1319-1327.
- Kinnunen T, Sahidulla Md, Delgado H, Todisco M, Evans N, Yamagishi J, and Lee K (2017). The asvspoof 2017 challenge: assessing the limits of replayspoofing attack detection, Interspeech 2017, 2-6.
- Ko T, Peddinti V, Povey D, and Khudanpur S (2015). Audio augmentation for speech recognition. In Sixteenth Annual Conference of the International Speech Communication Association.
- Korshunov P and Marcel S (2016). Cross-database evaluation of audio-based spoofing detection systems, In Interspeech.
- Lavrentyeva G, Novoselov S, Malykh E, Kozlov A, Kudashev O, and Shchemelinin V (2017). Audio replay attack detection with deep learning frameworks, In Interspeech, 82-86.
- McFee B, Humphrey EJ, and Bello JP (2015). A software framework for musical data augmentation, ISMIR, 248-254.
- Mikolajczyk A and Grochowski M (2018). Data augmentation for improving deep learning in image classification problem. In 2018 International Interdisciplinary PhD Workshop (IIPhDW), IEEE, 117-122.
- Perez L and Wang J (2017). The Effectiveness of Data Augmentation in Image Classification Using Deep Learning, arXiv preprint arXiv:1712.04621.
- Salamon J and Bello JP(2017). Deep convolutional neural networks and data augmentation for environmental sound classification, IEEE Signal Processing Letters, 24, 279-283. https://doi.org/10.1109/LSP.2017.2657381
- Srivastava N, Hinton G, Krizhevsky A, Sutskever I, and Salakhutdinov R (2014). Dropout: a simple way to prevent neural networks from overfitting, The Journal of Machine Learning Research, 15, 1929-1958.
- Todisco M, Delgado H, and Evans N (2017). Constant Q cepstral coefficients: a spoofing countermeasure for automatic speaker verification, Computer Speech & Language, 45, 516-535. https://doi.org/10.1016/j.csl.2017.01.001
- Wu X, He, R, Sun Z, and Tan T (2018). A light CNN for deep face representation with noisy labels, IEEE Transactions on Information Forensics and Security, 13, 2884-2896. https://doi.org/10.1109/TIFS.2018.2833032