• Title/Summary/Keyword: band-limited audio

Search Result 9, Processing Time 0.022 seconds

Deep Learning based Raw Audio Signal Bandwidth Extension System (딥러닝 기반 음향 신호 대역 확장 시스템)

  • Kim, Yun-Su;Seok, Jong-Won
    • Journal of IKEEE
    • /
    • v.24 no.4
    • /
    • pp.1122-1128
    • /
    • 2020
  • Bandwidth Extension refers to restoring and expanding a narrow band signal(NB) that is damaged or damaged in the encoding and decoding process due to the lack of channel capacity or the characteristics of the codec installed in the mobile communication device. It means converting to a wideband signal(WB). Bandwidth extension research mainly focuses on voice signals and converts high bands into frequency domains, such as SBR (Spectral Band Replication) and IGF (Intelligent Gap Filling), and restores disappeared or damaged high bands based on complex feature extraction processes. In this paper, we propose a model that outputs an bandwidth extended signal based on an autoencoder among deep learning models, using the residual connection of one-dimensional convolutional neural networks (CNN), the bandwidth is extended by inputting a time domain signal of a certain length without complicated pre-processing. In addition, it was confirmed that the damaged high band can be restored even by training on a dataset containing various types of sound sources including music that is not limited to the speech.

A 3D Audio Codec Employing a Revised Noise Filling Method (수정된 잡음 채움 기법을 적용한 3D 오디오 부호기)

  • Kim, Rin Chul
    • Journal of Broadcast Engineering
    • /
    • v.26 no.3
    • /
    • pp.327-330
    • /
    • 2021
  • In this paper, a new noise filling method is proposed for improving the performance of the 3D audio codec. In the new method, the core band is limited up to MAX_SFB, not up to the IGF start frequency. And the noise filling is applied to all frequency range of the IGF source patches. We conduct the MUSHRA test and find that the proposed noise filling method demonstrates better performance than the conventional method.

An Implementation of Sound Enhanced MPEG-1 Audio Decoder on Embedded OS Platform (음질향상 알고리즘을 내장한 MPEG-1 오디오 디코더의 Embedded OS 플랫폼에의 구현)

  • Hong, Sung-Min;Park, Kyu-Sik
    • Journal of Korea Multimedia Society
    • /
    • v.10 no.8
    • /
    • pp.958-966
    • /
    • 2007
  • In this paper, we implement a sound-enhanced MPEG-1 audio decoder on embedded OS Platform. Low bit rate lossy audio codecs such as MP3, OGG, and AAC for mitigating the problems in storage space and network bandwidth suffer a major common problem such as a loss of high frequency fidelity of audio signal. This high frequency loss will reproduce only a band-limited low-frequency part of audio in the standard CD-quality audio. In order to overcome this problem, we embedded a sound enhancement algorithm into the MPEG-1 audio decoder and then the algorithms optimized according to the characteristic of the MPEG-1 audio layer I, II, III were implemented on an embedded OS platform. From the experimental results with spectrum analysis and listening test, we confirm the superiority of the proposed system compared to the standard MPEG-1 audio decoder.

  • PDF

An Implementation of an ARM Platform based MP3 Sound Enhancement System (ARM 플랫폼 기반의 MP3 오디오 음질 향상 시스템 구현)

  • Oh, Sang-Hun;Park, Kyu-Sik
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.44 no.1
    • /
    • pp.70-75
    • /
    • 2007
  • In order to mitigate the problems in storage space and network bandwidth for the full CD quality audio with 44.1 kHz sampling rate, current existing digital audio is always restricted by sampling rate and bandwidth. This kind of restriction normally can be resolved by using low bit rate audio codec such as MP3, OGG, and AAC. However it suffers a major problem such as a loss of high frequency fidelity. This high frequency loss will reproduce only the band-limited low-frequency part of audio in the standard CD-quality audio. In general, the high frequency contents of audio have lots of information such as localization and ambient information, and bright nature of audio. The purpose of this paper is to implement on ARM platform system that can effectively estimate and compensate the missing high frequency contents of MP3 audio. From the experimental results with spectrum analysis and listening test, we confirm the superiority of the proposed algorithms for MP3 audio quality enhancement.

Audio Quality Enhancement at a Low-bit Rate Perceptual Audio Coding (저비트율로 압축된 오디오의 음질 개선 방법)

  • 서정일;서진수;홍진우;강경옥
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.6
    • /
    • pp.566-575
    • /
    • 2002
  • Low-titrate audio coding enables a number of Internet and mobile multimedia streaming service more efficiently. For the help of next-generation mobile telephone technologies and digital audio/video compression algorithm, we can enjoy the real-time multimedia contents on our mobile devices (cellular phone, PDA notebook, etc). But the limited available bandwidth of mobile communication network prohibits transmitting high-qualify AV contents. In addition, most bandwidth is assigned to transmit video contents. In this paper, we design a novel and simple method for reproducing high frequency components. The spectrum of high frequency components, which are lost by down-sampling, are modeled by the energy rate with low frequency band in Bark scale, and these values are multiplexed with conventional coded bitstream. At the decoder side, the high frequency components are reconstructed by duplicating with low frequency band spectrum at a rate of decoded energy rates. As a result of segmental SNR and MOS test, we convinced that our proposed method enhances the subjective sound quality only 10%∼20% additional bits. In addition, this proposed method can apply all kinds of frequency domain audio compression algorithms, such as MPEG-1/2, AAC, AC-3, and etc.

Audio Fingerprint Extraction Method Using Multi-Level Quantization Scheme (다중 레벨 양자화 기법을 적용한 오디오 핑거프린트 추출 방법)

  • Song Won-Sik;Park Man-Soo;Kim Hoi-Rin
    • The Journal of the Acoustical Society of Korea
    • /
    • v.25 no.4
    • /
    • pp.151-158
    • /
    • 2006
  • In this paper, we proposed a new audio fingerprint extraction method, based on Philips' music retrieval algorithm, which uses the energy difference of neighboring filter-bank and probabilistic characteristics of music. Since Philips method uses too many filter-banks in limited frequency band, it may cause audio fingerprints to be highly sensitive to additive noises and to have too high correlation between neighboring bands. The proposed method improves robustness to noises by reducing the number of filter-banks while it maintains the discriminative power by representing the energy difference of bands with 2 bits where the quantization levels are determined by probabilistic characteristics. The correlation which exists among 4 different levels in 2 bits is not only utilized in similarity measurement. but also in efficient reduction of searching area. Experiments show that the proposed method is not only more robust to various environmental noises (street, department, car, office, and restaurant), but also takes less time for database search than Philips in the case where music is highly degraded.

A Comparative Study of Vowels Produced by Normal Subjects and Patients with Malignant Vocal Folds by Correlation Coefficient and Difference Sum of Narrow-band Spectra (악성종양환자와 정상인이 발성한 모음의 좁은대역 스펙트럼값의 상관계수와 절대차이합 비교)

  • Yang, Byung-Gon;Wang, Soo-Geun;Jo, Cheol-Woo;Kim, Hyung-Soon;Kim, Eun-Ji;Kwon, Soon-Bok
    • Speech Sciences
    • /
    • v.10 no.4
    • /
    • pp.189-200
    • /
    • 2003
  • The objective of this study was to examine two new parameters by which we could screen people with malignant vocal folds. The new parameters were the difference sums and Pearson correlation coefficients between adjacent pairs of intensity level matrices of narrow-band spectra. Audio files from the Korean Disordered Speech Database were analyzed by Praat, a speech analysis software, to obtain matrices of 400 intensity levels at 16 time points of each sustained vowel spectra. We limited our study to 12 normal subjects and 20 patients with malignant vocal folds who recorded at least three Korean vowels at a sound-proofed booth in Busan National University Hospital. Results indicated that the average coefficients of the abnormal subjects were much lower than those of the normal subjects while the average difference sums of the patients were much higher than those of the normal ones. Also, we found that the degree of the malignancy of the vocal folds was related to the coefficients and sums. However, some subjects at the initial stages of cancerous vocal folds yielded almost comparable coefficients and difference sums to those of the normal speakers. Further studies on larger databases will be desirable to set certain criteria or threshold levels for screening people with vocal fold diseases.

  • PDF

A Quality Improvement of MP3-Coded Audios Using Bandwidth Extension (대역 확장을 통한 MP3 오디오의 음질 향상)

  • Heo, So-Young;Kim, Rin-Chul
    • Journal of Broadcast Engineering
    • /
    • v.13 no.5
    • /
    • pp.744-751
    • /
    • 2008
  • In this paper, we investigate methods to enhance the perceptual quality of MP3-coded audios. Based on the high frequency reconstruction method by Liu, in the proposed method, we determine adaptively the starting point of high frequency reconstruction. We also present an improved linear estimation method. For high frequency component generation, we compare two methods. One is a replication of low-frequency components and the other is an insertion of additive white Gaussian noise signals. Through subjective tests, we shall show that the proposed method can improve the perceptual quality of MP3-coded audio.

Text Mining-Based Analysis of Hyundai Automobile Consumer Satisfaction and Dissatisfaction Factors in the Chinese Market: A Comparison with Other Brands (텍스트 마이닝을 이용한 현대 자동차 중국시장 소비자의 만족 및 불만족 요인 분석 연구: 다른 브랜드와의 비교)

  • Cui Ran;Inyong Nam
    • The Journal of the Convergence on Culture Technology
    • /
    • v.10 no.1
    • /
    • pp.539-549
    • /
    • 2024
  • This study employed text mining techniques like frequency analysis, word clouds, and LDA topic modeling to assess consumer satisfaction and dissatisfaction with Hyundai Motor Company in the Chinese market, compared to brands such as Toyota, Volkswagen, Buick, and Geely. Focusing on compact vehicles from these brands between 2021 and 2023, this study analyzed customer reviews. The results indicated Hyundai Avante's positive factors, including a long wheelbase. However, it also highlighted dissatisfaction aspects like Manipulate, engine performance, trunk space, chassis and suspension, safety features, quantity and brand of audio speakers, music membership service, separation band, screen reflection, CarLife, and map services. Addressing these issues could significantly enhance Hyundai's competitiveness in the Chinese market. Previous studies mainly focused on literature research and surveys, which only revealed consumer perceptions limited to the variables set by the researchers. This study, through text mining and comparing various car brands, aims to gain a deeper understanding of market trends and consumer preferences, providing useful information for marketing strategies of Hyundai and other brands in the Chinese market.