• Title/Summary/Keyword: Synthesizing sound

Search Result 30, Processing Time 0.036 seconds

Interactive Synthesizing of Sound in Virtual Environments with considering Shape and Elevation of the Terrain (가상환경에서 지형의 형태와 고도를 고려한 인터렉티브한 사운드 합성)

  • Park, Soyeon;Park, Seong-A;Kim, Jong-Hyun
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2021.07a
    • /
    • pp.593-596
    • /
    • 2021
  • 본 논문에서는 지형의 고도와 기울기를 고려하여 사운드의 확산과 회절을 인터렉티브하게 표현할 수 있는 사운드 합성 기법을 제안한다. 우리의 접근 방식은 광선 추적법(Raytracing)을 기반으로 소리의 크기를 보여줄 수 있는 사운드 강도 맵을 빠르게 계산한다. 지형의 고도와 기울기 값을 고려하여 소리의 강도 맵에 가중치를 적용하여, 결과적으로 지형의 지오메트리 분석인 큰 계산 과정을 피하고 지형을 고려한 소리의 크기를 인터렉티브하게 업데이트한다. 이 과정에서 소리의 근원지에 따른 고도 간의 격차를 계산하여 소리의 크기를 자동으로 감쇠시키며, 주변 지형의 공간 기울기를 기반으로 기울기의 차에 따라 소리의 크기를 자동으로 감쇠시킨다. 본 논문에서 제안하는 방법을 이용한 소리는 실제 높낮이가 있는 현실의 지형에서 듣는 소리의 전파 및 감쇠와 유사한 형태를 보여주며, 소리의 위치에 따라서 소리 감소 패턴이 변경되고, 또한, 지형에 따라서 소리의 크기가 제어되는 결과를 보여준다.

  • PDF

Separation of Voiced Sounds and Unvoiced Sounds for Corpus-based Korean Text-To-Speech (한국어 음성합성기의 성능 향상을 위한 합성 단위의 유무성음 분리)

  • Hong, Mun-Ki;Shin, Ji-Young;Kang, Sun-Mee
    • Speech Sciences
    • /
    • v.10 no.2
    • /
    • pp.7-25
    • /
    • 2003
  • Predicting the right prosodic elements is a key factor in improving the quality of synthesized speech. Prosodic elements include break, pitch, duration and loudness. Pitch, which is realized by Fundamental Frequency (F0), is the most important element relating to the quality of the synthesized speech. However, the previous method for predicting the F0 appears to reveal some problems. If voiced and unvoiced sounds are not correctly classified, it results in wrong prediction of pitch, wrong unit of triphone in synthesizing the voiced and unvoiced sounds, and the sound of click or vibration. This kind of feature is usual in the case of the transformation from the voiced sound to the unvoiced sound or from the unvoiced sound to the voiced sound. Such problem is not resolved by the method of grammar, and it much influences the synthesized sound. Therefore, to steadily acquire the correct value of pitch, in this paper we propose a new model for predicting and classifying the voiced and unvoiced sounds using the CART tool.

  • PDF

Design of FM sound synthesizer IC for multimedia with phase bit optimized (위상 데이터 비트수를 최적화한 멀티미디어용 FM 음원합성 IC의 설계)

  • 홍현석;김이섭
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.21 no.11
    • /
    • pp.2978-2990
    • /
    • 1996
  • With the advent of multimedia era, there are ever increasing interest in computer music and sound syntheis. An FM type sound synthesizing method makes possible the syntheis ofvarious sounds ofmusical instruments with a relatively simple hardware architecture. Therefore, in this paper, we designed a hardware architecture for real-time sound synthesizer and its logic gates. In this paper, we designed a basic sound generator for implementation of real-time logic gates, analzed characteristics of sounds synthesized in this architecture and extracted parameters of FM sounds of musical instruments by using the Csound software. The major bolkcs to build the hardware are a phase-generator, a singe-function-generator, an envelope-generator and a multiplier-part. Finally, logic circuits are designed and verified in VHDL and logic gates by 1.0um standard cell library, which will be easily implementable by the form of ASIC.

  • PDF

Physical modeling synthesizing of 25 strings Gayageum using white noise as exciter (화이트 노이즈를 익사이터로 이용하는 25현 가야금의 피지컬 모델링 신디사이징)

  • Bae, June;Kim, Jangyoung;Yang, Yoongi
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.22 no.5
    • /
    • pp.740-746
    • /
    • 2018
  • Up until now, attempts to produce computer instruments have been largely focused on two types of sampling methods and physical modeling. Since the sampling method removes much of the harmonics in the sampling process, the effect of exciter that emphasizing the harmonic of the strings are mini. However, the physical modeling method can produce a lot of harmonics, and by emphasizing the harmonics of a particular frequency band among these harmonics, it is possible to produce a sound more like a lively sound. In this paper, we propose a method of using white noise in realizing exciter emphasizing harmonics of pre - specified frequency band in prefectural physical modeling. And comparing the envelope and spectrum of the Gayageum sound, we confirmed that the physical modeling method with the exciter is more suitable for the actual Gayageum sound than the sampling method and the conventional physical modeling method.

Synthesis of 3D Sound Movement by Embedded DSP

  • Komata, Shinya;Sakamoto, Noriaki;Kobayashi, Wataru;Onoye, Takao;Shirakawa, Isao
    • Proceedings of the IEEK Conference
    • /
    • 2002.07a
    • /
    • pp.117-120
    • /
    • 2002
  • A single DSP implementation of 3D sound movement is described. With the use of a realtime 3D acoustic image localization algorithm, an efficient approach is devised for synthesizing the 3D sound movement by interpolating only two parameters of "delay" and "gain". Based on this algorithm, the realtime 3D sound synthesis is performed by a commercially available 16-bit fixed-point DSP with computational labor of 65 MIPS and memory space of 9.6k words, which demonstrates that the algorithm call be used even for the mobile applications.

  • PDF

Sound Quality Evaluation for Laundry Noise by a Virtual Laundry Noise Considering the Effect of Various Noise Sources in a Drum Washing Machine (소음원의 영향이 고려된 가상 세탁음 제작을 통한 드럼 세탁기의 음질 인덱스 구축)

  • Jeong, Jae-Eun;Yang, In-Hyung;Fawazi, Noor;Jeong, Un-Chang;Lee, Jung-Youn;Oh, Jae-Eung
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.22 no.6
    • /
    • pp.564-573
    • /
    • 2012
  • The objective of this study is to determine the effect for the sound quality according to the noise source and to build the sound quality index of the laundry noise. In order to compare laundry noise among the influence of noise sources, we made virtual laundry noises by synthesizing an actual laundry noise and each noise source such as a dropping noise, water noise, motor noise and circulation pump noise. We conducted a listening test by customers using virtual laundry noises. As a result of listening test, we found that the dropping noise has a decisive effect on the sound quality of the laundry noise. We conducted the multi regression analysis of sound quality for the laundry noise using the statistical data processing. It is verified to the reliability of the multi regression index by comparison with listening results and index results of other actual laundry noises. This study is expected to provide a guide line for improvement of the laundry noise.

A Particle-Grid Method for Efficient Sound Synthesis of Ocean Waves

  • Jong-Hyun Kim
    • Journal of the Korea Society of Computer and Information
    • /
    • v.29 no.10
    • /
    • pp.157-164
    • /
    • 2024
  • In this paper, we propose a technique that utilizes the physical properties of foam particles to synthesize foam sounds and efficiently control their size. A typical way to represent sound in physics-based simulation environments is to generate and synthesize virtual sounds. In particular, foam particles have a large number of particles, so synthesizing sounds using only particles is computationally expensive, and a way to reduce the amount of computation is to use spatial information, lattices. In this paper, we present a method for reliably mapping and clustering foam particles into a lattice space. Furthermore, we utilize this structure to control the loudness of the sound according to the location of the sound source and the audience. As a result, the method proposed in this paper proposes an efficient way to synthesize the sound of bubble particles, which utilizes the velocity and position of the bubble particles projected in the lattice space, and synthesizes the sound of bubble particles based on the position relationship of the audience and the directionality of the sound.

Synthesizing the ′Dan-So′ sound using the Physical Model (Physical 모델을 이용한 단소 음 합성)

  • 성덕현;김규년
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2001.04b
    • /
    • pp.658-660
    • /
    • 2001
  • 본 논문에서는 Physical Model을 이용해 단소의 음을 합성해 내는 방법에 대해서 연구하였다. 이러한 모델을 이용한다면 별도의 음원 데이터를 사용하지 않고도 일반적으로 사용되고 있는 Table-lookup방식보다 더 효율적으로 악기 음을 합성 할 수가 있다. 먼저 파동방정식을 이용하여 단소내부 공기의 움직임을 나타내고 Physical Model을 제시하였다. Physical Model을 구현하기 위해서 두 개의 delay line이 사용되었으며, 그 양쪽 끝에는 각각 파동의 reflectance를 모델링 하기 위해 Reflectance filter가, 입력 신호를 모델링 하기 위한 input function이 사용되었다. 합성된 음을 평가 하기 위하여 실제 녹음된 음과 주파수 및 시간 도메인 상에서의 비교가 이루어 졌고 실제 악기와 유사한 음을 합성해 내었음을 확인 할 수 있었다.

  • PDF

Desktop program production

  • Enami, Kazumasa;Fukui, Kazuo;Yagi, Nobuyuki
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 1996.06b
    • /
    • pp.77-81
    • /
    • 1996
  • In order to conform to the needs of effective program production in multimedia era, we are studying Desk Top Program Production system. With the DTPP, users can easily produce multimedia program including video, sound, and ancillary data, and freely handle video images synthesizing video components retrieved from video database. This paper describes the new program production system, DTPP and its key technologies such as cooperative program production via multimedia network, indexing and utilization of attribute information of images, and image segmentation and spatio-temporal editing.

  • PDF

Design and Implementation of Korean Tet-to-Speech System (다이폰을 이용한 한국어 문자-음성 변환 시스템의 설계 및 구현)

  • 정준구
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1994.06c
    • /
    • pp.91-94
    • /
    • 1994
  • This paper is a study on the design and implementation of the Korean Tet-to-Speech system. In this paper, parameter symthesis method is chosen for speech symthesis method and PARCOR coeffient, one of the LPC analysis, is used as acoustic parameter, We use a diphone as synthesis unit, it include a basic naturalness of human speech. Diphone DB is consisted of 1228 PCM files. LPC synthesis method has defect that decline clearness of synthesis speech, during synthesizing unvoiced sound In this paper, we improve clearness of synthesized speech, using residual signal as ecitation signal of unvoiced sound. Besides, to improve a naturalness, we control the prosody of synthesized speech through controlling the energy and pitch pattern. Synthesis system is implemented at PC/486 and use a 70Hz-4.5KHz band pass filter for speech imput/output, amplifier and TMS320c30 DSP board.

  • PDF