Search | Korea Science

Optimization of Multi-time Scale Loss Function Suitable for DNN-based Audio Coder (심층신경망 기반 오디오 부호화기를 위한 Multi-time Scale 손실함수의 최적화)

Shin, Seung-Min;Byun, Joon;Park, Young-Cheol;Beack, Seung-kwon;Sung, Jong-mo
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2022.06a
- /
- pp.1315-1317
- /
- 2022
최근, 심층신경망 기반 오디오 부호화기가 활발히 연구되고 있다. 심층신경망 기반 오디오 부호화기는 기존의 전통적인 오디오 부호화기보다 구조적으로 간단하지만, 네트워크의 복잡도를 증가시키지 않고 인지적 성능향상을 기대하는 것은 어렵다. 이 문제를 해결하기 위하여 인간의 청각적 특성을 활용한 심리음향모델 기반 손실함수를 사용한 기법들이 소개되었다. 심리음향 모델 기반 손실함수를 사용한 오디오 부호화기는 양자화 잡음을 잘 제어하였지만, 여전히 지각적인 향상이 필요하다. 본 논문에서는 심층신경망 기반 오디오 부호화기를 위한 Multi-time Scale 손실함수의 지역 손실함수 윈도우 크기의 최적화 제안한다. Multi-time Scale 손실함수의 지역 손실함수 계산을 위한 윈도우 크기를 조절하며, 이를 통하여 오디오 부호화에 적합한 윈도우 사이즈를 결정한다. 실험을 통해 얻은 최적의 Multi-time Scale 손실함수를 사용하여 네트워크를 훈련하였고, 주관적 평가를 통해 기존의 심리음향모델 기반 손실함수보다 좋은 음성 품질을 보여주는 것을 확인하였다.
PDF

Effects of the Loss Function for Korean Left-To-Right Dependency Parser (의존 구문 분석에 손실 함수가 미치는 영향: 한국어 Left-To-Right Parser를 중심으로)

Lee, Jinu;Choi, Maengsik;Lee, Chunghee;Lee, Yeonsoo
- Annual Conference on Human and Language Technology
- /
- 2020.10a
- /
- pp.93-97
- /
- 2020
본 연구는 딥 러닝 기반 의존 구문 분석에서, 학습에 적용하는 손실 함수에 따른 성능을 평가하였다. Pointer Network를 이용한 Left-To-Right 모델을 총 세 가지의 손실 함수(Maximize Golden Probability, Cross Entropy, Local Hinge)를 이용하여 학습시켰다. 그 결과 LH 손실 함수로 학습한 모델이 선행 연구와 같이 MGP 손실 함수로 학습한 것에 비해 UAS/LAS가 각각 0.86%p/0.87%p 상승하였으며, 특히 의존 거리가 먼 경우에 대하여 분석 성능이 크게 향상됨을 확인하였다. 딥러닝 의존 구문 분석기를 구현할 때 학습모델과 입력 표상뿐만 아니라 손실 함수 역시 중요하게 고려되어야 함을 보였다.
PDF

A study on loss combination in time and frequency for effective speech enhancement based on complex-valued spectrum (효과적인 복소 스펙트럼 기반 음성 향상을 위한 시간과 주파수 영역 손실함수 조합에 관한 연구)

Jung, Jaehee;Kim, Wooil
- The Journal of the Acoustical Society of Korea
- /
- v.41 no.1
- /
- pp.38-44
- /
- 2022
Speech enhancement is performed to improve intelligibility and quality of the noise-corrupted speech. In this paper, speech enhancement performance was compared using different loss functions in time and frequency domains. This study proposes a combination of loss functions to utilize advantage of each domain by considering both the details of spectrum and the speech waveform. In our study, Scale Invariant-Source to Noise Ratio (SI-SNR) is used for the time domain loss function, and Mean Squared Error (MSE) is used for the frequency domain, which is calculated over the complex-valued spectrum and magnitude spectrum. The phase loss is obtained using the sin function. Speech enhancement result is evaluated using Source-to-Distortion Ratio (SDR), Perceptual Evaluation of Speech Quality (PESQ), and Short-Time Objective Intelligibility (STOI). In order to confirm the result of speech enhancement, resulting spectrograms are also compared. The experimental results over the TIMIT database show the highest performance when using combination of SI-SNR and magnitude loss functions.
https://doi.org/10.7776/ASK.2022.41.1.038 인용 PDF KSCI

PRESSURE DROP, RESPIRATION MODELS AND AUTOMATIC AERATION OF ROUGH RICE (벼의 압력손실 및 호흡 모델과 자동통풍에 관한 연구)

Chung, J.H.;Verma, Lalit R.
- Journal of Biosystems Engineering
- /
- v.16 no.3
- /
- pp.298-311
- /
- 1991
빈 시스템에서 적정 팬을 선정하기 위해서, 빈에서 벼를 건조 혹은 통풍시킬 때 벼의 저항에 의해 발생되는 공기의 압력손실을 예측하는 모델을 개발하였다. 또한 벼의 건물 손실을 예측하기 위하여 벼의 호흡 모델을 개발하였다. 그리고 온도 및 습도 센서들을 이용한 자동계측 시스템을 사용하여 저장된 벼의 상태를 연속적으로 측정, 분석함으로써 벼의 통풍기준을 결정하고 이를 근거로 빈의 자동통풍 시스템을 개발하여 평가하였다. 공기의 정압 손실은 공기의 속도 및 벼의 함수율의 함수로서 나타내어졌으며, 일정 곡물 깊이에서 벼의 함수율이 낮을수록 그 정압손실은 증가하였다. 벼의 호흡에 의해 발생되는 이산화탄소의 양은 저장온도, 벼의 함수율, 저장 기간의 함수로서 나타낼 수 있었다. 벼의 안전 저장을 위해 곡물의 온도 및 함수율, 평형상대습도, 벼의 품질저하지수(deterioration index)에 대한 자동통풍 기준을 결정하였으며 이들을 이용해서 퍼스널 컴퓨터로 팬, 제습기 등의 통풍 장치들을 자동제어하는 자동통풍 시스템을 개발하였다. 이 시스템은 곡물의 상태를 예측, 제어함으로써 14% 이하의 함수율과 4이하의 품질저하지수, 그리고 어떤 균류도 생성시키지 않음으로써 벼를 안전하게 저장할 수 있었다.
PDF

Performance comparison evaluation of speech enhancement using various loss functions (다양한 손실 함수를 이용한 음성 향상 성능 비교 평가)

Hwang, Seo-Rim;Byun, Joon;Park, Young-Cheol
- The Journal of the Acoustical Society of Korea
- /
- v.40 no.2
- /
- pp.176-182
- /
- 2021
This paper evaluates and compares the performance of the Deep Nerual Network (DNN)-based speech enhancement models according to various loss functions. We used a complex network that can consider the phase information of speech as a baseline model. As the loss function, we consider two types of basic loss functions; the Mean Squared Error (MSE) and the Scale-Invariant Source-to-Noise Ratio (SI-SNR), and two types of perceptual-based loss functions, including the Perceptual Metric for Speech Quality Evaluation (PMSQE) and the Log Mel Spectra (LMS). The performance comparison was performed through objective evaluation and listening tests with outputs obtained using various combinations of the loss functions. Test results show that when a perceptual-based loss function was combined with MSE or SI-SNR, the overall performance is improved, and the perceptual-based loss functions, even exhibiting lower objective scores showed better performance in the listening test.
https://doi.org/10.7776/ASK.2021.40.2.176 인용 PDF KSCI

Development of Loss Function for Estimation of Flood Damage Cost in Main Public Facilities - Road·Water and Sewerage Facilities - (주요 공공시설물의 홍수피해액 추정을 위한 손실함수 개발 - 도로 및 상·하수도시설물 -)

Hwang, Shin Bum;Kim, Sang Ho
- Proceedings of the Korea Water Resources Association Conference
- /
- 2021.06a
- /
- pp.49-49
- /
- 2021
홍수 발생 시 제내지에 존재하는 도로 및 상·하수도시설물은 저지대를 중심으로 생성되는 침수지역이 아닌 대부분 집중호우, 태풍으로 인해 발생한 유출량이 지표면 유출로 이어져 지면 경사를 따라 유하하면서 흐름을 방해하거나 노후된 시설물 등에서 피해가 발생한다. 이러한 피해발생 특성을 고려하여 홍수피해액을 추정하기에는 침수면적과 시설물 현황 등을 활용하는 기존의 손실 함수 개발 방법으로는 부족한 부분이 존재하며, 유수 흐름의 주요 인자인 침수심, 유속 등과 같은 수리특성을 고려하여 시설물에 대한 홍수피해액을 추정하는 방안이 필요하다. 본 연구에서는 수리특성을 고려한 시설물의 홍수피해액을 추정하기 위한 손실함수를 개발하고자 국가재난정보관리시스템(NDMS) DB에서 해당 시설물의 상세주소를 이용하여 피해 발생위치와 피해액을 파악하였으며, 2차원 수리해석 모형인 FLO-2D를 활용하여 시설물의 피해위치에서 발생된 수리특성 인자인 침수심과 유속을 분석하였다. 시설물의 단위면적 당 피해액을 종속변수로, 분석된 평균 침수심과 평균 유속을 독립변수로 선정한 후 변수 자료들의 신뢰성과 함수의 설명력을 향상시키기 위하여 이상자료들을 제거한 후 손실함수를 개발하였다. 본 연구에서 개발된 손실함수는 수리특성 인자인 침수심과 유속에 의하여 홍수피해액을 직접적으로 추정하는 방법으로 향후 홍수재해에 대한 사전 재산피해 추정을 통하여 합리적인 선제적 예방조치 등의 홍수재해 예방 활동 등에 활용될 수 있을 것으로 기대한다.
PDF

Comparison of Deep Learning Loss Function Performance for Medical Video Biomarker Extraction (의료 영상 바이오마커 추출을 위한 딥러닝 손실함수 성능 비교)

Seo, Jin-beom;Cho, Young-bok
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2021.05a
- /
- pp.72-74
- /
- 2021
The deep learning process currently utilized in various fields consists of data preparation, data preprocessing, model generation, model learning, and model evaluation. In the process of model learning, the loss function compares the value of the model with the actual value and outputs the difference. In this paper, we analyze various loss functions used in the deep learning model for biomarker extraction, which measure the degree of loss of neural network output values, and try to find the best loss function through experiments.
PDF

Audio Fingerprint Binarization by Minimizing Hinge-Loss Function (경첩 손실 함수 최소화를 통한 오디오 핑거프린트 이진화)

Seo, Jin Soo
- The Journal of the Acoustical Society of Korea
- /
- v.32 no.5
- /
- pp.415-422
- /
- 2013
This paper proposes a robust binary audio fingerprinting method by minimizing hinge-loss function. In the proposed method, the type of fingerprints is binary, which is conducive in reducing the size of fingerprint DB. In general, the binarization of features for fingerprinting deteriorates the performance of fingerprinting system, such as robustness and discriminability. Thus it is necessary to minimize such performance loss. Since the similarity between two audio clips is represented by a hinge-like function, we propose a method to derive a binary fingerprinting by minimizing a hinge-loss function. The derived hinge-loss function is minimized by using the minimal loss hashing. Experiments over thousands of songs demonstrate that the identification performance of binary fingerprinting can be improved by minimizing the proposed hinge loss function.
https://doi.org/10.7776/ASK.2013.32.5.415 인용 PDF KSCI

Development of Loss Functions for River Facilities (하천시설물에 대한 손실함수의 개발)

Kim, Sang Ho;Hwang, Shin Bum;Kim, Yeon Su;Hee, Chang
- Proceedings of the Korea Water Resources Association Conference
- /
- 2017.05a
- /
- pp.122-122
- /
- 2017
재해로부터 국민의 인명과 재산을 보호하기 위해서는 재해 발생의 빈도 증가와 대형화 추세에 따라 예상되어지는 피해규모의 분석과 예측을 통한 대책 마련이 필요하다. 한국의 경우 피해지역 조사를 통하여 획득한 피해일시, 피해시설물, 피해내역, 피해액과 같은 피해액 정보를 관리하는 국가재난관리시스템(National Disaster Management System, NDMS)이 운영되고 있다. 그러나 공공시설물 중 가장 많은 피해액을 나타내고 있는 하천시설물에 대한 피해규모의 예측에 대한 연구는 전무한 실정이다. 본 연구에서는 국가재난관리시스템의 과거 하천 피해정보를 이용하여 하천의 구간별 평균유속과 피해연장으로부터 하천의 피해액을 추정할 수 있는 하천 손실함수를 개발하였으며, 시범 대상지역에 적용하여 검증하였다. 하천 손실함수는 향후 피해액 추정에 따른 피해규모 분석을 통하여 재해저감대책을 마련하는데 필요한 유용한 정보로 활용될 수 있을 것이다.
PDF

A Multi-band Loss Function for Improving Time-Domain Autoencoder (시간 영역 오토인코더의 성능 개선을 위한 다중 대역 손실 함수)

Lim, Yujin;Yu, Jeongchan;Seo, Eunmi;Park, Hochong
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- fall
- /
- pp.78-79
- /
- 2021
본 논문에서는 시간 영역 오토인코더의 성능 개선을 위한 다중 대역 손실 함수를 제안한다. 기존의 시간 영역 오토인코더를 사용하는 압축 및 복원 모델은 저 대역 손실에 치중되어 고 대역 신호를 생성하지 못하고 다운 샘플링된 신호를 결과로 출력하는 문제점을 가진다. 이를 해결하기 위해 대역별로 손실을 분리하여 가중치를 조절할 수 있는 다중 대역 손실 함수를 제안한다. 제안하는 손실 함수가 적용된 오토인코더에 음성 신호를 입력하여 학습을 진행한 결과, 다운 샘플링이 발생하지 않으며 고 대역 신호가 복원되는 것을 스펙트로그램을 통해 확인하였다.
PDF

Search Result 708, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)