Search | Korea Science

Speech/Music Signal Classification Based on Spectrum Flux and MFCC For Audio Coder (오디오 부호화기를 위한 스펙트럼 변화 및 MFCC 기반 음성/음악 신호 분류)

Sangkil Lee;In-Sung Lee
- The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.16 no.5 / pp.239-246 / 2023
In this paper, we propose an open-loop algorithm that classifies speech and music signals for an audio coder using spectral flux parameters and Mel-Frequency Cepstral Coefficient (MFCC) parameters. To increase responsiveness, the MFCCs are used as short-term feature parameters, and to improve accuracy, spectral fluxes are used as long-term feature parameters. The overall speech/music classification decision is made by combining the short-term and long-term classification methods. A Gaussian Mixture Model (GMM) is used for pattern recognition, and the optimal GMM parameters are estimated with the Expectation-Maximization (EM) algorithm. The proposed combined long-term and short-term speech/music classification method showed an average classification error rate of 1.5% on various audio sources, reducing the error rate by 0.9% compared with the short-term-only method and by 0.6% compared with the long-term-only method. Compared with the Unified Speech and Audio Coding (USAC) audio classification method, the proposed method reduced the classification error rate by 9.1% for percussive music signals with attacks and by 5.8% for speech signals.
https://doi.org/10.17661/jkiiect.2023.16.5.239
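
The abstract above does not state which definition of spectral flux the authors use. As a minimal sketch, assuming one common definition (the sum of squared differences between the normalized magnitude spectra of two consecutive frames), the long-term feature could be computed as follows. The function names, the 64-sample frame length, and the two test tones are illustrative assumptions and are not taken from the paper.

```python
import cmath
import math


def magnitude_spectrum(frame):
    """Magnitudes of the positive-frequency DFT bins of one frame (naive O(N^2) DFT)."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]


def spectral_flux(prev_frame, frame):
    """Sum of squared differences between two normalized magnitude spectra.

    Assumed definition; the paper's exact formula is not given in the abstract.
    """
    def normalize(spec):
        total = sum(spec) or 1.0  # avoid division by zero on a silent frame
        return [v / total for v in spec]

    a = normalize(magnitude_spectrum(prev_frame))
    b = normalize(magnitude_spectrum(frame))
    return sum((y - x) ** 2 for x, y in zip(a, b))


# Hypothetical test signals: two pure tones that fall exactly on DFT bins 4 and 12.
n = 64
tone_a = [math.sin(2 * math.pi * 4 * t / n) for t in range(n)]
tone_b = [math.sin(2 * math.pi * 12 * t / n) for t in range(n)]

flux_same = spectral_flux(tone_a, tone_a)      # unchanged spectrum
flux_changed = spectral_flux(tone_a, tone_b)   # spectrum jumps to another bin
```

With this definition, two identical frames give zero flux, while a jump from one tone to another gives a large value. A classifier along the lines described in the abstract would presumably collect such values over many frames before feeding them to the GMM.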

Active Noise Control of Blower Fan Noise at the Small-medium Size Factories (중소규모 공장에 설치된 송풍기의 소음 감소를 위한 능동소음제어)

Oh, Wongeun
- Journal of the Korea Academia-Industrial cooperation Society / v.15 no.7 / pp.4659-4664 / 2014
Noise produced in a factory is a source of noise complaints from surrounding residential areas. It also affects the work efficiency and health of workers. This paper presents the results of a basic study on using an active noise controller (ANC) in three-dimensional space to reduce the noise generated by blowers, which are commonly used in small and medium-sized factories. For this purpose, a simulator program was developed that can compare various parameters of the original and controlled noise, such as sound pressure levels, power spectra, and equivalent noise levels. Noise data were recorded at 17 points around a turbo fan blower currently operating in a small-to-medium-sized factory. The simulation results showed that the power spectrum was reduced by up to 40 dB in the low-frequency band and that the average equivalent noise level attenuation was 12.6 dB.
https://doi.org/10.5762/KAIS.2014.15.7.4659
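
The equivalent noise level mentioned in the abstract is an energy average, not an arithmetic average of decibel values. A minimal sketch of the standard formula, L_eq = 10 log10((1/N) Σ 10^(L_i/10)), is shown below. The level values are made-up illustration data, not measurements from the paper, and the function name is an assumption.

```python
import math


def equivalent_level(levels_db):
    """Equivalent continuous level (dB) of equally spaced sound level samples."""
    mean_energy = sum(10 ** (level / 10) for level in levels_db) / len(levels_db)
    return 10 * math.log10(mean_energy)


# Hypothetical sound pressure levels (dB) before and after control.
original = [80.0, 83.0, 86.0, 83.0]
controlled = [70.0, 70.0, 70.0, 70.0]

leq_original = equivalent_level(original)
leq_controlled = equivalent_level(controlled)
attenuation = leq_original - leq_controlled  # positive when the control reduces noise
```

Because the average is taken over energies, the loudest samples dominate: the equivalent level of the hypothetical `original` series is higher than the arithmetic mean of its decibel values.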

Review of Researches on Clubroot Disease of Chinese Cabbage in Korea and Future Tasks for Its Management (우리나라 배추 뿌리혹병 연구 현황과 향후과제)

Kim, Choong-Hoe;Cho, Won-Dae;Lee, Sang-Bum
- Research in Plant Disease / v.9 no.2 / pp.57-63 / 2003
Clubroot disease of crucifer crops, caused by Plasmodiophora brassicae, was first reported in Korea in 1928 and remained mild until the 1980s. Since the 1990s the disease has become severe in alpine areas of Kyonggi and Kangwon, has gradually spread to plain fields throughout the country, and remains the greatest limiting factor for their production. Research on the disease began in the late 1990s after severe epidemics. Surveys of occurrence and etiological studies have been carried out, particularly on pathogen physiology, race identification, quantification of the soil pathogen population, and the host spectrum of the pathogen. The ecology of gall formation and decay, yield loss assessment in relation to time of infection, and the relationship between crop rotation and disease incidence were also studied during the late 1990s. In control studies, more than 200 crucifer cultivars were evaluated for resistance to the disease. Lime application to field soil was also attempted to reduce disease incidence. Resistant radish and Welsh onion were recommended as rotation crops with crucifers after 3-year field experiments. So far, however, most studies on clubroot disease in Korea have focused on chemical control. Two fungicides, fluazinam and flusulfamide, were selected and extensively studied for their application technologies and their combined effects with lime application or other soil treatments. To develop environmentally friendly control methods, solar disinfection of soil, phosphoric acid as a nontoxic compound, and root-parasitizing endophytes as biocontrol agents were examined for their effects on the disease in the field.
In the future, more research is needed on the development of resistant varieties effective against several races of the pathogen, the establishment of an economically sound crop rotation system, the improvement of soil disinfection techniques applicable to Korean field conditions, and the development of methods for pretreating seeds and seedbeds with fungicides.
https://doi.org/10.5423/RPD.2003.9.2.057

Evaluation for Noise Reduction of the HVAC by Modification of CAM Curve (CAM 곡선 개선에 의한 차량용 공조기의 소음 저감 평가)

Jeong, J.E.;Jung, C.Y.;Seo, B.J.;Jeong, U.C.;Oh, J.E.
- Transactions of the Korean Society for Noise and Vibration Engineering / v.21 no.9 / pp.787-797 / 2011
Noise in a vehicle is an important factor for customers purchasing a car. In particular, reducing the noise generated by the HVAC (heating, ventilation, and air conditioning) system is very important because it has a considerable effect on interior noise. In general, identifying the noise source is crucial to reducing the noise level, and the complex acoustic intensity method is widely used to measure and identify noise sources accurately. In a previous study, the noise sources of the HVAC were therefore identified experimentally using the complex acoustic intensity method and were found to be the motor and the cam area. In this study, we confirm the reduction in noise level by comparing results before and after a modification of the cam curve that was based on the identified noise source. We performed experiments to compare the noise level before and after the cam curve modification. In particular, the complex acoustic intensity method, which uses both active and reactive intensity, proved vital in devising a strategy for comparing noise levels. The vector flow of acoustic intensity was also investigated to identify the sound intensity distribution and energy flow in the near field of the HVAC.
https://doi.org/10.5050/KSNVE.2011.21.9.787

Interior Noise Characteristics of the Electric Trains in Gyeongchun Line (경춘선 전동열차의 실내 소음 특성)

Ann, Yong Chan;Lee, Jung Hyeok;Kim, Seock Hyun
- Transactions of the Korean Society of Mechanical Engineers A / v.38 no.7 / pp.817-822 / 2014
Since the opening of the double-track railway for the Gyeongchun local electric train and the semi-high-speed ITX train, the floating population between Seoul and Chuncheon has increased rapidly. This is attributable to the competitiveness of the railway service in terms of punctuality, operational safety, mass transportation, and low fares. However, many passengers have expressed strong dissatisfaction with the interior noise and its sharp increase, particularly in tunnel sections. In this study, the interior noise characteristics of the Gyeongchun local electric train and the ITX were analyzed and compared. Noise levels, frequency spectra, and sound quality indices were compared for open-land, tunnel, and bridge sections. Finally, from the noise levels at different locations in the vehicle compartment, the noise transmission path was determined and a basic strategy for reducing the interior noise was developed.
https://doi.org/10.3795/KSME-A.2014.38.7.817

Design of a tracking and demodulation circuit for wideband CDMA in IMT-2000 (IMT-2000 광대역 CDMA의 동기추적 및 데이터 복조 회로구현)

권형철;오현서;이재호;조경록
- The Journal of Korean Institute of Communications and Information Sciences / v.24 no.6A / pp.871-880 / 1999
In this paper, pseudo-noise (PN) tracking and demodulation circuits are analyzed and designed for a direct-sequence spread-spectrum multiple-access system over a mobile fading channel. We consider a noncoherent delay-locked loop (DLL) with 1/8 PN chip resolution as the PN code tracking loop. The tracking performance of the DLL is evaluated in terms of locking time from a loose state and tracking jitter. The received signal is demodulated to the original data by despreading it with the PN code locked by the DLL. The designed circuit also supports a 32 kbps sound service and an in-band signal with a 4.096 MHz chip clock. The circuits were implemented and verified on an FPGA, which showed complete data recovery under AWGN of 7 dB, and will be applicable to IMT-2000.
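
The despreading step described in the abstract can be illustrated with a toy direct-sequence example. This is only a sketch under stated assumptions: the 7-chip code, the data bits, and all function names are hypothetical, and the code models neither the authors' DLL nor its 1/8-chip resolution. It only shows why the correlation peaks when the local PN code is aligned with the received chips.

```python
# Hypothetical 7-chip PN code in +1/-1 form (illustrative, not the code from the paper).
PN = [1, 1, 1, -1, -1, 1, -1]


def spread(bits):
    """Multiply each +1/-1 data bit by the full PN code."""
    return [bit * chip for bit in bits for chip in PN]


def correlation(chips, offset):
    """Correlate one code period of the received chips with the local PN code."""
    return sum(chips[offset + j] * PN[j] for j in range(len(PN)))


def despread(chips, offset=0):
    """Recover data bits from the sign of the correlation over each code period."""
    n = len(PN)
    return [
        1 if correlation(chips, i) >= 0 else -1
        for i in range(offset, len(chips) - n + 1, n)
    ]


data = [1, -1, -1, 1]
tx = spread(data)

# Flip one chip to mimic a channel error; the correlation sign still survives.
rx = list(tx)
rx[0] = -rx[0]
recovered = despread(rx)

aligned = correlation(tx, 0)     # local code aligned with the received chips
misaligned = correlation(tx, 1)  # local code off by one chip
```

The gap between the aligned and misaligned correlation values is what a code tracking loop exploits to keep the local PN code locked to the incoming signal.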

The First Formant Characteristics in Vocalize of One Soprano (소프라노 1인의 모음곡 발성 시 제 1 포먼트의 변화양상)

Song, Yun-Kyung;Jin, Sung-Min
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.16 no.1 / pp.10-14 / 2005
Background and Objectives: Vowels are characterized on the basis of formant patterns. The first formant (F1) is determined by the high-low placement of the tongue, and the second formant (F2) by its front-back placement. The fundamental frequency (F0) of a soprano often exceeds the normal frequency of the first formant, and vocal intensity is boosted when F0 is high and a harmonic coincides with a formant. This is called formant tuning. Experienced singers thus learn to tune their formants over a reasonable range by lowering the tongue to maximize vocal intensity. The current study therefore aimed to identify formant tuning in one experienced soprano by comparing the first formants of the vowel [i] in three different types of voice production: speech, ascending scale, and vocalize. Materials and Method: All voice recordings of the vowel [i] in speech, ascending scale (from the F4 note to the A4 note), and vocalize ("Ridente la calma") were made with a digital audio tape recorder in a sound-treated room. The captured data were analyzed by the long-term average (LTA) power spectrum using the FFT algorithm of the Computerized Speech Lab (CSL, Kay Elemetrics, Model 4300B). Results: Although the first formant of the vowel [i] in speech was 238 Hz, those of the ascending-scale [i] were 377 Hz, 405 Hz, and 453 Hz at the F4 (349 Hz), G4 (392 Hz), and A4 (440 Hz) notes, respectively, and 722 Hz, 820 Hz, and 918 Hz at the F5 (698 Hz), G5 (784 Hz), and A5 (880 Hz) notes, respectively. In vocalize, the first formants of [i] were 380 Hz, 398 Hz, and 453 Hz at the F4, G4, and A4 notes, respectively, and 720 Hz, 821 Hz, and 890 Hz at the F5, G5, and A5 notes, respectively. Conclusion: These results showed that the first formant in the ascending scale and in vocalize stayed above the fundamental frequency at high pitch. This finding suggests that the formant tuning of the vowel [i] seen in the ascending scale also occurs in vocalize.
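
The conclusion of the abstract above can be checked directly against the figures it reports. The short sketch below tabulates how far the first formant sits above the fundamental frequency for each note. The frequencies are copied from the abstract (reading "349z" as 349 Hz), while the variable and function names are illustrative assumptions.

```python
SPEECH_F1_HZ = 238  # first formant of [i] in speech, as reported in the abstract

# Fundamental frequency of each sung note (Hz).
NOTE_F0_HZ = {"F4": 349, "G4": 392, "A4": 440, "F5": 698, "G5": 784, "A5": 880}

# Measured first formants (Hz) in the two singing conditions.
SCALE_F1_HZ = {"F4": 377, "G4": 405, "A4": 453, "F5": 722, "G5": 820, "A5": 918}
VOCALIZE_F1_HZ = {"F4": 380, "G4": 398, "A4": 453, "F5": 720, "G5": 821, "A5": 890}


def margins(f1_by_note):
    """Distance in Hz by which F1 exceeds F0 for each note."""
    return {note: f1_by_note[note] - f0 for note, f0 in NOTE_F0_HZ.items()}


scale_margins = margins(SCALE_F1_HZ)
vocalize_margins = margins(VOCALIZE_F1_HZ)

# Every sung note already lies above the speech F1, so an untuned formant
# would fall below the fundamental.
f0_above_speech_f1 = all(f0 > SPEECH_F1_HZ for f0 in NOTE_F0_HZ.values())
```

In both singing conditions every margin is positive, which is consistent with the reported finding that F1 stays above F0 at high pitch.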

Problems of Strobovideolaryngoscopic Findings and Usual Voice Management of Vocal Major Students, and Acoustic Characteristics of Singing Voice (성악도들의 음성관리 및 성대화상술상의 문제점과 발성에 대한 음향분석학적 특징)

진성민;김대영;반재호;이상혁;송윤경;권기환;이경철;이용배
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.10 no.1 / pp.43-49 / 1999
Objectives: The purpose of this study was to systematically analyze and compare the acoustic sound structure of vocal major students' singing voices. Materials and Methods: Nineteen vocal major students formed the subject group, and nineteen healthy females formed the control group. The subject group underwent strobovideolaryngoscopy using flexible nasopharyngoscopy, and acoustic analysis was performed for both groups. In addition, thirty-six vocal major students were surveyed about their usual voice problems and voice management. Results: On strobovideolaryngoscopy, the subject group showed many findings of functional voice disorder, such as AP contraction (44%), phase difference (36%), tremor (25%), posterior gap (17%), hyperadduction of the vestibular fold (6%), and anterior gap (3%). In the narrow-band spectrum study, the vocal major students showed a greater number of high-frequency harmonic partials when singing than the control group. However, there was no significant difference between the two groups in jitter, shimmer, or noise-to-harmonic ratio. In the survey, almost all vocal major students reported voice problems in singing, such as loss of high notes (17%), loss of quiet voice (17%), and effortful and tired voice (36%). They made efforts to prevent vocal dysfunction using various methods such as voice rest (28%), hydration (28%), and gargling with salt (11%). Conclusions: Vocal major students always take care to maintain good voice condition, but many of them showed abnormal strobovideolaryngoscopic findings and lacked a concept of systematic and scientific voice management. Young singers therefore need good voice training and a voice therapy program based on a good relationship between the laryngologist and the voice training teacher.

An Implementation of an ARM Platform based MP3 Sound Enhancement System (ARM 플랫폼 기반의 MP3 오디오 음질 향상 시스템 구현)

Oh, Sang-Hun;Park, Kyu-Sik
- Journal of the Institute of Electronics Engineers of Korea SP / v.44 no.1 / pp.70-75 / 2007
To mitigate the storage space and network bandwidth demands of full CD-quality audio with a 44.1 kHz sampling rate, existing digital audio is usually restricted in sampling rate and bandwidth. This restriction is normally handled by using a low-bit-rate audio codec such as MP3, OGG, or AAC. However, such codecs suffer from a major problem: loss of high-frequency fidelity. Because of this loss, only the band-limited low-frequency part of the standard CD-quality audio is reproduced. In general, the high-frequency content of audio carries a great deal of information, such as localization and ambience cues and the brightness of the sound. The purpose of this paper is to implement, on an ARM platform, a system that can effectively estimate and compensate for the missing high-frequency content of MP3 audio. From experimental results with spectrum analysis and a listening test, we confirm the superiority of the proposed algorithms for MP3 audio quality enhancement.

Robust Audio Watermarking in Frequency Domain for Copyright Protection (저작권 보호를 위한 주파수 영역에서의 강인한 오디오 워터마킹)

Dhar, Pranab Kumar;Kim, Jong-Myon
- Journal of the Korea Society of Computer and Information / v.15 no.2 / pp.109-117 / 2010
Digital watermarking has drawn extensive attention as a means of protecting digital content from unauthorized copying. This paper proposes a new frequency-domain watermarking scheme for copyright protection of digital audio. In the proposed system, the original audio is segmented into non-overlapping frames. Watermarks are then embedded into selected prominent peaks in the magnitude spectrum of each frame and are extracted by performing the inverse of the embedding process. Simulation results indicate that the proposed scheme is robust against various kinds of attacks, such as noise addition, cropping, resampling, re-quantization, MP3 compression, and low-pass filtering. The proposed system outperforms Cox's method in terms of imperceptibility while keeping comparable robustness. It achieves SNR (signal-to-noise ratio) values ranging from 20 dB to 28 dB, in contrast to Cox's method, which achieves SNR values ranging from only 14 dB to 23 dB.
https://doi.org/10.9708/jksci.2010.15.2.109

Search Result 305, Processing Time 0.025 seconds