• Title/Summary/Keyword: Audio quality evaluation

Search Result 43, Processing Time 0.021 seconds

Performance Analysis of Audio Data Hiding Method based on Phase Information with Various Window Length (주파수 변환의 길이에 따른 위상 기반 오디오 정보 은닉 기술의 음질 및 성능 분석)

  • Cho, Kiho;Kim, Nam Soo
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.50 no.12
    • /
    • pp.232-237
    • /
    • 2013
  • The role of the window length of time-frequency transformation is important for the audio data hiding methods utilizing phase information. In this paper, the experiments for our audio data hiding method were conducted in order to evaluate the audio quality and robustness against reverberant environment. The experimental results showed the tendency that the worse audio quality but better robustness were obtained when the lengthy window was applied. The important reason for quality degradation was pre-echo which flatters the percussive sound. The results also indicated that the wireless communication theory related to the length of time-frequency transform can be applied in the field of audio data hiding and acoustic data transmission.

Human Laughter Generation using Hybrid Generative Models

  • Mansouri, Nadia;Lachiri, Zied
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.5
    • /
    • pp.1590-1609
    • /
    • 2021
  • Laughter is one of the most important nonverbal sound that human generates. It is a means for expressing his emotions. The acoustic and contextual features of this specific sound are different from those of speech and many difficulties arise during their modeling process. During this work, we propose an audio laughter generation system based on unsupervised generative models: the autoencoder (AE) and its variants. This procedure is the association of three main sub-process, (1) the analysis which consist of extracting the log magnitude spectrogram from the laughter database, (2) the generative models training, (3) the synthesis stage which incorporate the involvement of an intermediate mechanism: the vocoder. To improve the synthesis quality, we suggest two hybrid models (LSTM-VAE, GRU-VAE and CNN-VAE) that combine the representation learning capacity of variational autoencoder (VAE) with the temporal modelling ability of a long short-term memory RNN (LSTM) and the CNN ability to learn invariant features. To figure out the performance of our proposed audio laughter generation process, objective evaluation (RMSE) and a perceptual audio quality test (listening test) were conducted. According to these evaluation metrics, we can show that the GRU-VAE outperforms the other VAE models.

Audio Transcoding for Audio Streams from a T-DTV Broadcasting Station to a T-DMB Receiver

  • Bang, Kyoung-Ho;Park, Young-Cheol;Seo, Jeong-Il
    • ETRI Journal
    • /
    • v.28 no.5
    • /
    • pp.664-667
    • /
    • 2006
  • We propose an efficient audio transcoding algorithm that can convert audio streams from terrestrial digital television broadcasting service stations to those for terrestrial digital multimedia broadcasting hand-held receivers. The proposed algorithm avoids the complicated psychoacoustic analysis by calculating the scalefactors of the bit-sliced arithmetic coding encoder directly from the signal-to-noise ratio parameters of the AC-3 decoder. The bit-allocation process is also simplified by cascading the nested distortion control loop. Through subjective evaluation, it is shown that the proposed algorithm provides comparable audio quality to tandem coding but it requires much smaller complexity.

  • PDF

A Quality Evaluation and Analysis of the On-Line Edutainment Software (온라인 에듀테인먼트 품질 평가 및 분석)

  • Lho, Young-Uhg;Park, Sang-Won;Jung, Deok-Gil
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.11 no.11
    • /
    • pp.2192-2198
    • /
    • 2007
  • An edutainment is a composite which are joining with the serveral matter and the multiplicity of techqniues An Eduainment software use serveral audio and video media. The human have different preference to audio and video media. It is difficult to evaluate quality of an edutainment software than a general software. So, we need an evaluation factors to compare quality of edutainment software. We developed the evaluation factors for edutainment software. And the middle school students used the evaluation factors to evaluate the favorite edutainment softwares among them. We evaluate and analyze the results.

Implementation of the High-Quality Audio System with the Separately Processed Musical Instrument Channels (악기별 분리처리를 통한 고음질 오디오 시스템 구현)

  • Kim, Tae-Hoon;Lee, Sang-Hak;Kim, Dae-Kyung;Lee, Sang-Chan
    • The Journal of the Acoustical Society of Korea
    • /
    • v.32 no.4
    • /
    • pp.346-353
    • /
    • 2013
  • This paper deals with the implementation of a high-quality audio system for karaoke. For improving the key/tempo changes performance, we separated the audio into many musical instrument channels. By separating musical instrument channels, high-quality key/tempo changes can be achieved and we confirmed this using the cross-correlation distribution and the MOS evaluation. The improved audio system was implemented using the TMS320C6747 DSP with fixed/floating-point operations. The implemented audio system can perform the multi-channel WMA decoding, the MP3 encoding/decoding, the wav playing, the EQ, and the key/tempo changes in real time. The WMA channels used for processing the separated instrument channels. The audio system includs the MP3 encoding/decoding function for playing and recording and the wav channel for the effect sound.

Determination of the Speaker Position and Evaluation of the Audio System of the Passenger Car (자동차 스피커의 위치선정 및 오디오 성능평가 방법)

  • 이장명;권오상
    • Transactions of the Korean Society of Automotive Engineers
    • /
    • v.4 no.4
    • /
    • pp.1-8
    • /
    • 1996
  • The sound quality of the car audio system is affected by the serveral factors such as the dimensions of the room, the boundary condition of the wall, the location of the speakers, etc. Among these factors, the location of the car speakers has been focused to find the best location of the car speakers assuming that the flat response is better. To verify the suggestion, the subjective test is adopted using 10 people. The developed method is utilizd to evaluate the function of the audio system with fixed speaker position.

  • PDF

LED Communication based Multi-hop Audio Data Transmission Network System (LED 통신 기반 멀티 홉 오디오 데이터 전송네트워크시스템)

  • Jo, Seung Wan;Le, The Dung;An, Beongku
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.50 no.6
    • /
    • pp.180-187
    • /
    • 2013
  • In this paper, we propose a LED communication based multi-hop audio data transmission network system. The main contribution and features of the proposed system are as follows. First, the contribution of this research is to develope the LED communication based multi-hop transmission network system which can transmit audio data signal with long distance via multi-hops. Second, the developed system has the following features: In transmitter, audio data is transmitted after encoding with S/PDIF format via a general LED. The relay receives digital audio signal by using photo diode and then transmits the signal to receiver after error checking and amplifying. The receiver receives the encoded audio data via photo diode and then converts to analog audio signal by using decoding and amplifying. The performance evaluation of the proposed system is conducted in the laboratory with fluorescent light source. The results of the performance evaluation confirm that the system can provide high quality audio transmission from transmiter to receiver via multi-hop relays in a long distance while we can see there are differences in the transmitted audio quality according to the used LED colors.

The Measurement Method and The Sound Quality Evaluation of Headphones and Earphones (헤드폰 및 이어폰의 데이터 측정 및 객관적 음질 평가 방법)

  • Sung Ho Young;Kim Jong-Bae;Lee Joon-Hyun;Jang Seong-Cheol
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • autumn
    • /
    • pp.505-506
    • /
    • 2004
  • 이어폰과 헤드폰의 성능 향상을 위해서는 특성에 대한 정확한 측정과 평가가 요구된다. 이어폰과 헤드폰은 room 과 같은 acoustic channel 을 거치지 않고 청취자의 귀에 직접 소리가 전달되며 ear canal 특성이 포함되기 때문에 스피커와는 다른 기준이 필요하다. 그러나 사람 귀의 canal 특성은 개인에 따른 편차가 심하여 정확한 측정 및 성능 평가에 어려움이 따른다. 본 논문에서는 이어폰과 헤드폰의 특성을 측정하는 적절한 방법을 고찰하고 측정된 데이터를 이용하여 음질 성능을 평가할 수 있는 객관적인 방법을 제시하고자 한다.

  • PDF

A Study on Vocal Removal Scheme of SAOC Using Harmonic Information (하모닉 정보를 이용한 SAOC의 보컬 신호 제거 방법에 관한 연구)

  • Park, Ji-Hoon;Jang, Dae-Geun;Hahn, Min-Soo
    • Journal of Korea Multimedia Society
    • /
    • v.16 no.10
    • /
    • pp.1171-1179
    • /
    • 2013
  • Interactive audio service provide with audio generating and editing functionality according to user's preference. A spatial audio object coding (SAOC) scheme is audio coding technology that can support the interactive audio service with relatively low bit-rate. However, when the SAOC scheme remove the specific one object such as vocal object signal for Karaoke mode, the scheme support poor quality because the removed vocal object remain in the SAOC-decoded background music. Thus, we propose a new SAOC vocal harmonic extranction and elimination technique to improve the background music quality in the Karaoke service. Namely, utilizing the harmonic information of the vocal object, we removed the harmonics of the vocal object remaining in the background music. As harmonic parameters, we utilize the pitch, MVF(maximum voiced frequency), and harmonic amplitude. To evaluate the performance of the proposed scheme, we perform the objective and subjective evaluation. As our experimental results, we can confirm that the background music quality is improved by the proposed scheme comparing with the SAOC scheme.

Visible Light Communication based Multi-hop Multimedia Data Transmission Networks System (VLC 기반 멀티 홉 멀티미디어 데이터 전송 네트워크 시스템)

  • Park, In-Chul;Shin, Jung-Jin;Park, Joo-Young;Dung, Le The;An, Beongku
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.14 no.3
    • /
    • pp.21-31
    • /
    • 2014
  • In this paper, we propose VLC(visible light communication) based multi-hop multimedia data transmission system. The main contributions and features of the proposed system are as follows. First, the contribution of this research is to develope the LED communication based multi-hop transmission network system which can transmit multimedia data(audio data, video data) with long distance. Second, the developed system has the following features: In transmitter, audio data and video data are transmitted via multi-hops using two channels. The relay in audio channel receives digital audio signal by using photo diode and then transmits the signal to receiver after error checking and amplifying. The receiver receives the encoded audio data via photo diode and then converts to analog audio signal by using decoding and amplifying. The relay in video channel receives video signal by using photo diode and then amplify the video signal using OP-AMP and then transmits the signal to receiver. The receiver amplifies the received signal from photo diode and then sends it to the monitor. The performance evaluation of the proposed system is conducted in the laboratory with fluorescent light source. The results of the performance evaluation confirm that the system can provide high quality multimedia data transmission from transmiter to receiver via multi-hop relays in a long distance while we can see there are differences in the transmitted multimedia(audio and video) quality according to the used LED colors.