• Title/Summary/Keyword: Sound synthesis


Efficient Foam Sound Generation with Screened Clustering Based Sound Synthesis (스크린드 군집화 기반의 사운드 합성을 이용한 효율적인 거품 사운드 생성)

  • Shin, YoungChan;Kim, Jong-Hyun
    • Proceedings of the Korean Society of Computer Information Conference / 2022.07a / pp.553-556 / 2022
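  • This paper proposes a technique that uses foam particles to efficiently synthesize sound that matches a simulated scene. The two representative ways to represent sound in a physically based simulation environment are generation and synthesis. Sound generation can produce sound for each simulated scene with a physically based approach, but the computation time and the difficulty of representing materials make it hard to produce sound for a wide variety of scenes. Sound synthesis requires sound data to be built in advance, but once the data is built, it can be reused for similar scenes. This paper therefore builds sound data for foam simulation and proposes a sound synthesis technique that, through efficient clustering of foam particles, reduces computation time while improving the realism of the sound.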


Novel Sound Energy and Reversal Mapping for Procedural Sound Synthesis in Cloth Simulation (옷감 시뮬레이션의 절차적 사운드 합성을 위한 새로운 사운드의 에너지와 반전 매핑)

  • Kim, Dong-Hui;Moon, Seong-Hyeok;Shin, Young-Chan;Kim, Jong-Hyun
    • Proceedings of the Korean Society of Computer Information Conference / 2022.07a / pp.587-590 / 2022
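  • This paper proposes a data-driven synthesis technique for efficiently producing sound suited to physically based cloth simulation. Sound in a simulation is represented broadly by either generation or synthesis, and synthesis is often used in interactive environments because it can run in real-time applications. Because it depends on data, however, it is difficult to synthesize sound that matches a desired scene, and existing methods search the sound data in only one direction, so discontinuities cause audible breaks. This paper presents a bidirectional sound synthesis technique and shows that it efficiently improves sound results that would otherwise be synthesized discontinuously.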


Design and Implementation of Korean Text-to-Speech System (다이폰을 이용한 한국어 문자-음성 변환 시스템의 설계 및 구현)

  • 정준구
    • Proceedings of the Acoustical Society of Korea Conference / 1994.06c / pp.91-94 / 1994
  • This paper describes the design and implementation of a Korean text-to-speech system. Parametric synthesis is chosen as the speech synthesis method, and PARCOR coefficients, obtained by LPC analysis, are used as the acoustic parameters. The synthesis unit is the diphone, which preserves the basic naturalness of human speech. The diphone database consists of 1228 PCM files. LPC synthesis has the drawback that it reduces the clarity of the synthesized speech when synthesizing unvoiced sounds. This paper improves the clarity of the synthesized speech by using the residual signal as the excitation signal for unvoiced sounds. In addition, to improve naturalness, the prosody of the synthesized speech is controlled through the energy and pitch patterns. The synthesis system is implemented on a PC/486 and uses a 70 Hz-4.5 kHz band-pass filter for speech input/output, an amplifier, and a TMS320C30 DSP board.
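As an illustration of the parametric synthesis described in the abstract above, the following Python sketch converts PARCOR (reflection) coefficients to direct-form LPC coefficients with the standard step-up recursion and then runs an all-pole synthesis filter. It is not the paper's implementation: the function names, the sign convention A(z) = 1 + a1 z^-1 + ... + ap z^-p, and the example coefficients are assumptions chosen for illustration.

```python
def parcor_to_lpc(k):
    """Step-up recursion: reflection coefficients k[0..p-1] -> [1, a1, ..., ap].

    Assumed convention: the order-m polynomial is
    A_m(z) = A_{m-1}(z) + k_m * z^-m * A_{m-1}(1/z).
    """
    a = [1.0]
    for km in k:
        m = len(a)            # order reached after this step
        prev = a + [0.0]      # pad the previous polynomial to length m + 1
        a = [prev[i] + km * prev[m - i] for i in range(m + 1)]
    return a


def lpc_synthesize(a, excitation):
    """All-pole filter 1/A(z): y[n] = e[n] - sum_{i>=1} a[i] * y[n - i]."""
    y = []
    for n, e in enumerate(excitation):
        acc = e
        for i in range(1, len(a)):
            if n - i >= 0:
                acc -= a[i] * y[n - i]
        y.append(acc)
    return y


# Hypothetical second-order example driven by a single impulse.
lpc = parcor_to_lpc([0.5, -0.2])
frame = lpc_synthesize(lpc, [1.0] + [0.0] * 7)
print(lpc, frame[:3])
```

The excitation is left to the caller. A pulse train would be a typical choice for voiced frames, while the abstract describes using the residual signal for unvoiced frames.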


Sound Engine for Korean Traditional Instruments Using General Purpose Digital Signal Processor (범용 디지털 신호처리기를 이용한 국악기 사운드 엔진 개발)

  • Kang, Myeong-Su;Cho, Sang-Jin;Kwon, Sun-Deok;Chong, Ui-Pil
    • The Journal of the Acoustical Society of Korea / v.28 no.3 / pp.229-238 / 2009
  • This paper describes a sound engine for two Korean traditional instruments, the Gayageum and the Taepyeongso, implemented on a TMS320F2812. Each sound is synthesized by a Gayageum or Taepyeongso model based on commuted waveguide synthesis (CWS). The sound engine has an instrument selection button, and the model for the selected instrument produces the corresponding sound at a fixed interval. Each synthesized sound sample is transmitted to a DAC (TLV5638) over SPI and played through a speaker via an audio interface. The length of the delay line determines the fundamental frequency of the desired sound. To determine the delay line length, the time needed to synthesize one sound sample is measured using a GPIO: $28.6{\mu}s$ for the Gayageum and $21{\mu}s$ for the Taepyeongso. Each sound sample is synthesized and transferred to the DAC in an interrupt service routine (ISR) of the sound engine. A timer of the TMS320F2812 has four events for generating interrupts. Here the interrupt is generated by the timer's period-match event, and the ISR is called each time the interrupt occurs, every $60{\mu}s$. Comparison with the spectra of the original sounds shows that the results represent the timbres of the instruments well, except for 'Mu, Hwang, Tae, Joong' of the Taepyeongso. The Taepyeongso produces only one sound at a time, so real-time playing takes $21{\mu}s$. Gayageum players usually use two fingers (thumb and middle finger, or thumb and index finger), so real-time playing takes $57.2{\mu}s$.
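The timing figures in the abstract above can be checked with a little arithmetic. The sketch below assumes one output sample per ISR call, so the 60 µs interrupt period implies a sample rate of about 16.7 kHz, and it uses the idealized waveguide relation N ≈ fs / f0 for the delay line length, ignoring any extra delay contributed by loop filters. The function names and the 220 Hz example pitch are hypothetical.

```python
ISR_PERIOD_US = 60.0  # interrupt period stated in the abstract


def sample_rate_hz(period_us=ISR_PERIOD_US):
    """Sample rate implied by producing one sample per ISR call (assumption)."""
    return 1e6 / period_us


def delay_line_length(f0_hz, fs_hz):
    """Idealized delay line length in samples for a target fundamental f0."""
    return round(fs_hz / f0_hz)


def fits_budget(per_voice_us, voices, period_us=ISR_PERIOD_US):
    """True if all voices can be synthesized within one ISR period."""
    return per_voice_us * voices <= period_us


fs = sample_rate_hz()
print("sample rate (Hz):", fs)
print("delay line for 220 Hz:", delay_line_length(220.0, fs))
print("Gayageum, two strings:", fits_budget(28.6, 2))    # 57.2 us of 60 us
print("Taepyeongso, one voice:", fits_budget(21.0, 1))   # 21 us of 60 us
```

Under these assumptions two simultaneous Gayageum voices use 57.2 µs of the 60 µs period, which matches the figure quoted in the abstract, while a third voice would not fit.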

Study on the Vehicle Sound Based on the Formant Filter and Musical Harmonics (포먼트 필터와 음악 화성학에 기반한 차량 음질 연구)

  • Chang, Kyoung-Jin;Park, Dong Chul
    • Transactions of the Korean Society for Noise and Vibration Engineering / v.25 no.8 / pp.525-531 / 2015
  • Driving sound is an effective means of promoting the product identity of a vehicle because it gives customers an attractive sound that reflects the vehicle's concept. Major automakers have recently been focusing on target sound setting so that the sound represents the brand image as well as the unique concept of a vehicle. This study introduces a new method of setting the target driving sound based on a formant filter and the characteristics of musical harmonics. In addition, a target sound suggested by this method is realized and verified using active noise control in a vehicle.

Timbral Analysis of the Piri Sound and Designing an Audio Filter for Yoseong Expression (요성을 중심으로 한 피리의 음색 변화 분석 및 필터 디자인)

  • Nam, Sangbong;Lee, Sun-jin;Lee, Gangseong;Lee, Donoung
    • Journal of the HCI Society of Korea / v.10 no.2 / pp.5-11 / 2015
  • Yoseong is one of the representative playing techniques of the Piri and carries the unique timbre of this Korean traditional musical instrument. This paper presents the acoustic characteristics of the Yoseong sound by analyzing the sound of the Piri and suggests audio filters that produce the Yoseong sound from the ordinary sound of the Piri.

Multi-Pulse Amplitude and Location Estimation by Maximum-Likelihood Estimation in MPE-LPC Speech Synthesis (MPE-LPC음성합성에서 Maximum-Likelihood Estimation에 의한 Multi-Pulse의 크기와 위치 추정)

  • 이기용;최홍섭;안수길
    • Journal of the Korean Institute of Telematics and Electronics / v.26 no.9 / pp.1436-1443 / 1989
  • In this paper, we propose a maximum-likelihood estimation (MLE) method to obtain the locations and amplitudes of the pulses in MPE (multi-pulse excitation)-LPC speech synthesis, which uses multi-pulses as the excitation source. The MLE method computes the values that maximize the likelihood function with respect to the unknown parameters (the amplitudes and positions of the pulses) for the observed data sequence. Thus, in the case of overlapped pulses, the method is equivalent to Ozawa's cross-correlation method, resulting in the same amount of computation and the same sound quality as the cross-correlation method. We show by computer simulation that the multi-pulses obtained by the MLE method (1) are pseudo-periodic in pitch for voiced sound, (2) are random for unvoiced sound, and (3) change from random to periodic in the interval where the original speech signal changes from unvoiced to voiced. The short-time power spectra of the original speech and of the speech synthesized using multi-pulses as the excitation source are quite similar to each other at the formants.
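For readers unfamiliar with multi-pulse excitation, the sketch below shows a generic greedy cross-correlation search of the kind the abstract above refers to. It is neither the authors' MLE method nor a faithful reproduction of Ozawa's algorithm. It assumes the target frame and the impulse response `h` of the synthesis filter are given, and at each step it places one pulse where the normalized cross-correlation with the remaining residual is largest. All names and test signals are hypothetical.

```python
def multipulse_search(target, h, num_pulses):
    """Greedy search for pulse positions and amplitudes.

    Each step picks the position m maximizing corr(m)^2 / energy(m), where
    corr is the cross-correlation of the residual with h shifted to m, then
    subtracts the pulse's contribution from the residual.
    """
    n = len(target)
    residual = list(target)
    pulses = []
    for _ in range(num_pulses):
        best = None
        for m in range(n):
            span = min(len(h), n - m)  # truncate h at the frame boundary
            corr = sum(residual[m + j] * h[j] for j in range(span))
            energy = sum(h[j] * h[j] for j in range(span))
            if energy == 0.0:
                continue
            score = corr * corr / energy
            if best is None or score > best[0]:
                best = (score, m, corr / energy)  # optimal amplitude is corr / energy
        if best is None:
            break
        _, m, gain = best
        pulses.append((m, gain))
        for j in range(min(len(h), n - m)):
            residual[m + j] -= gain * h[j]
    return pulses, residual


# Hypothetical frame: two pulses (positions 2 and 6) filtered by h = [1.0, 0.5].
h = [1.0, 0.5]
target = [0.0, 0.0, 2.0, 1.0, 0.0, 0.0, -1.0, -0.5, 0.0, 0.0]
pulses, residual = multipulse_search(target, h, 2)
print(pulses)
```

The search is quadratic in the frame length as written. It is meant to show the structure of the selection rule, not an efficient implementation.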


Lower-order ARMA Modeling of Head-Related Transfer Functions for Sound-Field Synthesis System

  • Yim, Jeong-Bin;Kim, Chun-Duck;Kang, Seong-Hoon
    • The Journal of the Acoustical Society of Korea / v.15 no.3E / pp.37-44 / 1996
  • A new method is proposed for efficiently modeling head-related transfer functions (HRTFs) without loss of any directional information. The HRTFs were measured empirically in a real room and modeled as ARMA models with common AR coefficients and different MA coefficients. Psychophysical tests conducted to assess the validity of the proposed ARMA model show that, compared with the conventional MA model, it requires fewer parameters to represent the empirical HRTFs and reduces back-to-front confusions in sound-field localization. The proposed ARMA model could therefore significantly simplify the implementation of sound-field synthesis systems.
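To make the idea of "common AR coefficients and different MA coefficients" concrete, the sketch below filters a signal through H_d(z) = B_d(z) / A(z), where the denominator A(z) is shared by every direction d and only the numerator B_d(z) changes. It also compares parameter counts against an MA-only (FIR) bank. The model orders, the number of directions, and the coefficient values are hypothetical and are not taken from the paper.

```python
def arma_filter(b, a, x):
    """y[n] = sum_i b[i] * x[n-i] - sum_{j>=1} a[j] * y[n-j], assuming a[0] == 1."""
    y = []
    for n in range(len(x)):
        acc = sum(b[i] * x[n - i] for i in range(len(b)) if n - i >= 0)
        acc -= sum(a[j] * y[n - j] for j in range(1, len(a)) if n - j >= 0)
        y.append(acc)
    return y


def fir_param_count(directions, fir_taps):
    """MA-only bank: every direction stores its own full set of taps."""
    return directions * fir_taps


def common_ar_param_count(directions, ar_order, ma_order):
    """Shared AR part stored once, plus (ma_order + 1) MA taps per direction."""
    return ar_order + directions * (ma_order + 1)


# Hypothetical two-direction bank sharing one AR polynomial.
shared_a = [1.0, -0.5]
ma_by_direction = {"front": [1.0, 0.5], "back": [0.8, -0.2]}
impulse = [1.0, 0.0, 0.0, 0.0]
for name, b in ma_by_direction.items():
    print(name, arma_filter(b, shared_a, impulse))

print(fir_param_count(72, 128), common_ar_param_count(72, 20, 20))
```

Because the AR part is stored once, the saving grows with the number of measured directions. Whether a given pair of orders preserves localization cues is an empirical question that the abstract addresses with listening tests.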


Study Concerning Preference for Noise Quality of Automotive Horn for Improvement of Perceived Quality and Improvement of New Noise Metric (감성 품질 향상을 위한 자동차 Horn의 선호 음질에 관한 연구 및 음질 요소 개발)

  • Kang, Hee-Su;Lee, Sang-Kwon;Shin, Tae-Jin;Jung, Ki-Woong;Park, Dong-Chul
    • Transactions of the Korean Society for Noise and Vibration Engineering / v.25 no.3 / pp.141-149 / 2015
  • This study investigates the sound quality of automotive horns fitted to luxury sedans. Factor analysis is conducted to define a questionnaire on horn sound quality. Ten automotive horns and ten passenger cars (nine luxury sedans and one economy-class car) are selected for this research. 'Luxury' is used in the questionnaire as an attribute of car horn sound quality. The interior noise of the ten test cars is recorded and used for the subjective analysis of the car horn sound. A new sound metric for car horn sound is presented and used as an objective sound index for predicting the subjective sound quality of car horns.