• Title/Summary/Keyword: Voice, Sound

Search Result 331, Processing Time 0.024 seconds

Characteristics of the auditory evaluation of good impression using speech manipulation scripts (말소리 변조 스크립트를 이용한 호감도 청취평가 특징)

  • Kwon, Soonbok
    • Phonetics and Speech Sciences
    • /
    • v.8 no.4
    • /
    • pp.131-138
    • /
    • 2016
  • This study analyzes the characteristics of good impression using speech manipulation scripts and investigates the characteristics of preferred speech voice. Fourty male and female college students participated in this study. They have been exposed to the Gyeongsang dialect spoken by their friends and family for more than 15 years. Two sample voices(1 male and 1 female), considered as giving good impression, were subject to voice analysis. Two students were asked to read the sample paragraph of 'Walking' and their voice samples were analyzed through Praat. The collected speech data were manipulated into 4 different sets by changing pitch level, degree of loudness and speech rate. First, both men and women received good impression more from pitch-lowered sound than from the original one. Second, men tended to receive good impression more from slightly louder voice than from the natural-pitched one. Third, it was shown that men often felt more drowned to a voice at slightly faster speech rate than at the original speech rate. Overall, both male and female listeners favored lower pitch over the original pitch. Men tended to prefer louder voice sound while women preferred less loud one. Men received better impression at a lower speech rate but women at a faster speech rate.

A Study on the Sound Effect for Improving Customer's Speech Recognition in the TTS-based Shop Music Broadcasting Service (TTS를 이용한 매장음원방송에서 고객의 인지도 향상을 위한 음향효과 연구)

  • Kang, Sun-Mee;Kim, Hyun-Deuc;Chang, Moon-Soo
    • Phonetics and Speech Sciences
    • /
    • v.1 no.4
    • /
    • pp.105-109
    • /
    • 2009
  • This thesis describes the method for well voice announcement using the TTS(Text-To-Speech) technology in the shop music broadcasting service. Offering a high quality TTS sound service for each shop requires a great expense. According to a report on the architectural acoustics the room acoustic indexes such as reverberation time and early decay time are closely connected with a subjective awareness about acoustics. By using the result the customers will be able to recognize better the voice announcement by applying sound effect to speech files made by TTS. The result of an aural comprehension examination has shown better about almost all of the parameters by applying reverb effect to TTS sound.

  • PDF

The effect of oral sound Daseureum of Jindo Ssitgimgut on anxiety disorder: Soul therapist Byung-cheon Park oral sound, Daseureum is revived on YouTube (https://youtu.be/k98ENbsIp7o?list=RDk98ENbsIp7o)

  • Ko, Kyung-Ja
    • CELLMED
    • /
    • v.6 no.3
    • /
    • pp.19.1-19.3
    • /
    • 2016
  • Jindo Ssitgimgut has been known as a funeral ritual for a long time in Korea. However, there is no study for music therapy on anxiety disorder. The aims of this study were to argue that Oral sound Daseureum of Jindo Ssitgimgut may have meaningful effect on anxiety disorder. Jindo Ssitgimgut is literally a cleansing soul. Jindo Ssitgimgut is designated as the Intangible Cultural Property No. 2 by the Korean government. Jindo Ssitgimgut is transmitted from generation to generation, not the descent of God. So, the accent is on art and one's sincere sympathy. So, with careful listening Youtube, this music Daseureum exhibits an exquisite balance between the human voice and the sounds do the instruments. The author think a good combination of his voice, Jing (Korean gong), and Ajaeng (Korean cello) can help with anxiety disorder.

Experimental study of the sound quality performance and improvement of magnetic fluid speaker (자성유체 스피커의 음질 성능 및 향상에 관한 실험적 연구)

  • Lee, Moo-Yeon
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.15 no.12
    • /
    • pp.6993-6997
    • /
    • 2014
  • The aim of this study was to experimentally investigate the sound quality characteristics, such as sound deflection, sound pressure level and frequency characteristics of a magnetic type speaker in an anechoic chamber to overcome the sound quality and voice-coil temperature problems. To accomplish this, the sound quality performance of the magnetic type speaker was tested according to the magnetic fluid amount and magnetic field intensity. The sound deflection, sound pressure level, and frequency characteristics were measured using the Smarrt program. As a result, at a magnetic fluid amount of 2.4 ml, the sound deflection and the sound pressure level of the magnetic type speaker were enhanced by comparing with those of the general type speaker. The frequency characteristics and the sound pressure level of the magnetic type speaker were enhanced greatly with increasing magnetic field intensity from 8.06 mT to 9.10 mT. In addition, the sound deflection of the magnetic type speaker was 0.01% lower than that of the general type speaker.

Understanding of the Western Classical Singing in Medical Point of View (서양식 성악발성법의 의학적 이해)

  • Choi, Hong-Shik;Hong, Hyun-Jun;Yum, Yong-Hyuk;Nam, Do-Hyun
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.22 no.2
    • /
    • pp.106-110
    • /
    • 2011
  • Western classical singing voices are different from those of pop song singer's singing voices as well as traditional Korean singing such as Pansori. We anlalysed the singing voices from three different categories with using free application programs available at the usual smart phones : sound level meter and Spectral View Analyzer and fiberoptic rhinolaryngoscopic evaluation. The intensity of voice produced by a classical western singer was 11 dB louder than that produced by a pop song singer. Source sound, glottic sound, as well as harmonic sound and singing resonant sound (Singer's formant) are much more prominent. When evaluated under video-rhinolaryngoscopy during singing, the resonance cavity especially oropharyngeal cavity and hypopharyngeal cavity are widely opened during singing of the western classical singer than those of the traditional Korean singer's singing. Difference of singing methods including producing the glottal sound, respiration and resonance are discussed. Possible explanation of development of 'Singer's Formant' is discussed.

  • PDF

Communication Support System for ALS Patient Based on Text Input Interface Using Eye Tracking and Deep Learning Based Sound Synthesi (눈동자 추적 기반 입력 및 딥러닝 기반 음성 합성을 적용한 루게릭 환자 의사소통 지원 시스템)

  • Park Hyunjoo;Jeong Seungdo
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.20 no.2
    • /
    • pp.27-36
    • /
    • 2024
  • Accidents or disease can lead to acquired voice dysphonia. In this case, we propose a new input interface based on eye movements to facilitate communication for patients. Unlike the existing method that presents the English alphabet as it is, we reorganized the layout of the alphabet to support the Korean alphabet and designed it so that patients can enter words by themselves using only eye movements, gaze, and blinking. The proposed interface not only reduces fatigue by minimizing eye movements, but also allows for easy and quick input through an intuitive arrangement. For natural communication, we also implemented a system that allows patients who are unable to speak to communicate with their own voice. The system works by tracking eye movements to record what the patient is trying to say, then using Glow-TTS and Multi-band MelGAN to reconstruct their own voice using the learned voice to output sound.

Tonal development and voice quality in the stops of Seoul Korean

  • Yu, Hye Jeong
    • Phonetics and Speech Sciences
    • /
    • v.10 no.4
    • /
    • pp.91-99
    • /
    • 2018
  • Korean stops are currently undergoing a tonogenetic sound change, as found in the Seoul dialect in which a merged VOT of aspirated and lax stops induces F0 to be the primary cue for distinguishing the two stops and the lax stops have lower F0 than the aspirated stops. In tonal languages, low tone is produced with a breathy voice. This study investigated whether there are changes in voice quality with respect to the tonogenetic sound change of Korean stops. Two age groups speaking the Seoul dialect participated in this study: five females and six males born in the 1940s and 1950s and nine females and eight males born in the 1980s and 1990s. This study replicated previous findings of VOT and F0 and further examined H1-H2, H1-A1, and H1-A2 to see how they correlate with the sound change. In the older and younger generations, H1-H2, H1-A1, and H1-A2 were significantly lower after the tense stops than after the aspirated and lax stops, but they were not significantly different after the aspirated and lax stops. However, the younger females exhibited some different results for H1-H2 and H1-A2 than the older generation. In the younger females, the H1-H2 mean was higher after the aspirated stops than it was after the lax stops at the vowel onset, and the H1-H2 difference increased at the vowel midpoint. Although there was an inter-speaker variation in the results of H1-H2 and H1-A1, analyses of individual speakers showed that the H1-H2 and H1-A1 were higher after the lax stops than after the aspirated stops in the younger female speakers. These results indicate that lax stops tend to be breathier than aspirated stops in the younger female speakers. They also indicate that changes in voice quality are on Korean stops with tonal sound change, but are still developing.

A Study on the Acoustic Characteristics of Sexy Voice (섹시한 음성의 음향학적 특징 연구)

  • Jeong Ok-Ran;Jo Sung-Mi
    • MALSORI
    • /
    • no.57
    • /
    • pp.73-84
    • /
    • 2006
  • The purpose of this study was to explore the acoustic characteristics of sexy voice. In this study, we measured acoustic parameters (fundamental frequency, jitter, shimmer, and nasalance) of a sustained vowel sound produced by 40 actors (20 males and 20 females) and 40 non-actors (20 males and 20 females). Digital audio recordings were made in the sustained vowel |a| for acoustic analyses using Praat (version 4.1.9) and Nasal View (version 4.5). Twenty voice pathologists participated in the listening experiment and judged the degree of sexiness on a 7-point scale. The results showed that fundamental frequency, shimmer and nasalance had significant differences between actors and non-actors. The acoustic parameters of sexy voice matched perceptual aspects of a previous study: Low fundamental frequency-low pitch and high shimmer-husky voice. On the other hand, the nasalance score did not match that of the previous study: Decreased nasalance had a higher score on sexiness scale judged by the listeners. It would be desirable to study the voice quality by analyzing and controlling more acoustic and auditory parameters for practical applications in the future.

  • PDF

Speaker Separation Based on Directional Filter and Harmonic Filter (Directional Filter와 Harmonic Filter 기반 화자 분리)

  • Baek, Seung-Eun;Kim, Jin-Young;Na, Seung-You;Choi, Seung-Ho
    • Speech Sciences
    • /
    • v.12 no.3
    • /
    • pp.125-136
    • /
    • 2005
  • Automatic speech recognition is much more difficult in real world. Speech recognition according to SIR (Signal to Interface Ratio) is difficult in situations in which noise of surrounding environment and multi-speaker exists. Therefore, study on main speaker's voice extractions a very important field in speech signal processing in binaural sound. In this paper, we used directional filter and harmonic filter among other existing methods to extract the main speaker's information in binaural sound. The main speaker's voice was extracted using directional filter, and other remaining speaker's information was removed using harmonic filter through main speaker's pitch detection. As a result, voice of the main speaker was enhanced.

  • PDF

Recognition of Individual Cattle by His and /or Her Voice

  • Yoshio, Ikeda;Yohei, Ishii
    • Proceedings of the Korean Society for Agricultural Machinery Conference
    • /
    • 1998.06b
    • /
    • pp.270-275
    • /
    • 1998
  • It was assumed that the voice of cattle is generated with the virtual white noise through the digital filter called the linear prediction filter, and filter parameters (prediction coefficients) were estimated by the maximum entropy method (MEM) , using the sound signal of the animal . The feature planes were defined by the pairs of two parameters selected appropriately from these parameters. The cattle voices were divided into three levels, that is the high, medium and low levels according to their total power equivalent to the variances of the sound signal . It was found that the straight lines could be used for recognizing tow cow and one calf for high level voices. For high and medium level voices, however, it was difficult or impossible to recognize individual cattle on the parameters planes.

  • PDF