• 제목/요약/키워드: Voice Production

검색결과 139건 처리시간 0.033초

음성인식을 이용한 개인환경의 스마트 미러 (Smart Mirror of Personal Environment using Voice Recognition)

  • 여운찬;박신후;문진완;안성원;한영오
    • 한국전자통신학회논문지
    • /
    • 제14권1호
    • /
    • pp.199-204
    • /
    • 2019
  • 본 논문에서는 개인의 일상생활에 필요한 컨텐츠를 제공하는 스마트 미러를 소개한다. 음성인식으로 지정해놓은 명령어를 입력하면 디스플레이에서 원하는 컨텐츠를 출력하는 스마트 미러를 제작하였다. 현재 제작한 스마트 미러의 컨텐츠는 시간과, 날씨, 지하철정보, 일정, 사진이 있다. 시중의 개인 가정용으로 판매하고 있는 스마트 미러는 비싼 가격으로 인해 보급이 어려운 상태이지만 본 논문에서 제시하는 스마트 미러 제작을 통해 제조 단가를 낮출 수 있으며, 음성인식으로 더 편리하게 이용할 수 있다.

병적인 소리 떨림증과 소리꾼 떨림증의 음향학적인 비교연구 (The comparative Study of the Acoustic Representation between Pansori singer's and Spasmodic dysphonia patient's Voice)

  • 홍기환;김현기;이진국;조재식
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2007년도 한국음성과학회 공동학술대회 발표논문집
    • /
    • pp.143-145
    • /
    • 2007
  • Muscle groups that are located in and around the vocal tract can produce audible changes in frequency and/or intensity of the voice. Vocal vibrato is a characteristic feature in the singing of performers trained in the western classical tradition and vibrato is generally considered to result from modulation in frequency amplitude and timbre. Vocal tremor is also characterized by periodic fluctuations in the voice frequency or intensity and vocal tremor is symptom of a neurological disease as Spasmodic dysphonia , Parkinson's disease. Vocal vibrato and Vocal tremor may have many of the same origins and mechanisms in the voice production systems. The purpose of this study is to find acostic character of Korean traditional song Pansori singer's vibrato and Spasmodic dysphonia patient's vocal tremor. twelve Pansori singers and seven Spasmodic dysponia patients participated to this study. Power spectrum and Real time Spectrogram are used to analyze the acoustic characteristics of Pansori singing and Spasmodic dysphonia patient's voice The results are as follows; First, vowel formant differences between Pansori singing and Spasmodic dysphonia patient's voice are higher F1, F3. Second, The vibrato rate show differences between Pansori singing and Spasmodic dysphonia patients;$4^{\sim}6/sec$ and $5{\sim}6/sec$ Vibrato rate of pitch is 5.7 Hz ${\sim}$ 42.4 Hz for Pansori singing , 3.8 Hz ${\sim}$ 27.9 Hz for Spasmodic dysphonia patients ;Vibrato rate of intensity range is 0.07 dB ${\sim}$ 8.26 dB for Pansori singing and 0.07 dB ${\sim}$ 4.81 dB for Spasmodic dysphonia patients

  • PDF

수직후두부분절제술 및 점막 피판과 지방 이식을 통한 성대 재건술 후의 음성분석 (The Analysis of Voice after Vertical Partial Laryngectomy with Mucosal Flap and Fat Graft Reconstruction)

  • 주형로;최인자;김진환;안회영;노영수
    • 대한후두음성언어의학회지
    • /
    • 제18권2호
    • /
    • pp.134-137
    • /
    • 2007
  • Background and Objectives: The goals of laryngeal reconstruction have been prevention of aspiration, production of a functional voice, and maintenance of an adequate airway for decannulation. It is generally believed that the reconstruction of the glottic region after vertical partial laryngectomy (VPL) can improve laryngeal function. The objective of this study is to evaluate of voice function after VPL with mucosal flap and fat graft reconstruction. Materials and Methods: From 1994 to 2006, 13 patients, who had been treated with VPL with mucosal flap and fat graft reconstruction. The voice characteristics, acoustic, aerodynamic parameter were measured in 13 patients after vertical partial laryngectomy with mucosal flap and fat graft reconstruction. Acoustic analysis was carried out using Computerized Speech Lab (CSL) and aerodynamic analysis were carried out using Aerophon II,3 months and 12 months after surgery. Results: The GRBAS scale, jitter, shimmer, NHR were improved as time goes on after surgery. But, maximum phonation time was shortened after surgery and there is no significant differences between before and after surgery in mean flow rate. Conclusion: The voice function of the mucosal flap and fat graft reconstruction after VPL were satisfactory. This can be an excellent reconstruction method after vertical partial laryngectomy.

  • PDF

고속 생산형 필름 진동판 성형기 및 금형 국산화 개발(I) - 단수 생산 진동판 성형기 - (Domestic Development of Vibrational Film Forming Machine and Die and Mold in the High Speed Production(I) - Single production forming machine -)

  • 김정현
    • 한국기계가공학회지
    • /
    • 제11권6호
    • /
    • pp.9-15
    • /
    • 2012
  • Vibrational film has been more employed in ear-phones or small type of speakers along with a wide use of portable multi-media equipments such as MP3 and MP4. However, the current hand work production process of diaphragms is inefficient. In this study, a die-and-mold and a single production forming machine are developed, and they result in a multi-production forming machine. The multi-production forming machine consists primarily of a film feeding unit and an unwinding unit. A vacuum suction device provides the film feeding unit, while the unwinding unit is obtained using an appropriate damper. The advantage of the developed single production forming machine is shown according to a proper voice test.

몽골 전통 발성 흐미의 발성 방법 분석에 대한 사례연구 (Analysis of Singing Technique of Mongolian Traditional Singing Called Khoomei)

  • 남도현;백재연;황연신;최홍식
    • 음성과학
    • /
    • 제15권3호
    • /
    • pp.145-156
    • /
    • 2008
  • The goal of this study was to investigate acoustic and physiologic characteristics of two phonation types of 'Khoomei' which is a traditional singing style of people who live around the Altai mountains or Mongolia region. It can be produced two pitches simultaneously - high melody pitch can be perceived along with a low drone pitch. Sygyt and kargyraa styles are the most popular and identifiable styles and they can be recognized as the different sounds depending on the method of voice production. Two trained Mongolians participated and have used at least 5 - 6 years. The characteristics of this voice production were measured by using flexible fiberscope, Stroboscopy, Lx Speech studio, Spead, and Doctor Speech. In Sygyt style, very high vocal fold closure (71.50%) with both true and false vocal folds contact and strong breathing support was observed. They also showed that tongue height and harmonics were increased (around 10dB) with resonance cavity movement. In contrast, it was found that Kargyraa sound had very low pitch with relaxed stomach, less laryngeal tension and lower vocal fold contact (69.50%) than hard Sygyt style sound without raising the tongue during phonation. 'Khoomei' phonation can be made by strong contact of both true and false vocal folds and by increasing the harmonics as well.

  • PDF

갑상연골 골절로 인한 성대마비의 치험례 (A Case of Thyroid Cartilage Fracture with Vocal Cord Paralysis)

  • 조진규;차창일;안회영;조중생;홍남표
    • 대한기관식도과학회:학술대회논문집
    • /
    • 대한기관식도과학회 1983년도 제17차 학술대회연제순서 및 초록
    • /
    • pp.14.2-14
    • /
    • 1983
  • 후두외상의 손상은 그 정도나 범위에 따라 차이는 인지만 주요 후유증으로는 기도폐쇄, 부종, 주위조직의 봉와직염 및 농양, 누공, 후두연골 및 연골지막염, 만성 후두협착, 성대마비, 기관발거곤란증, 성음장애 등을 들 수 있고, 일반적인 후두외상의 치료방법은 일차적으로 신속한 기도유지를 위한 처치를 한 다음 상기각 후유증에 따르는 이차 시술을 시행하는 것이 보통이다. 최근 저자들은 교통사고로 인한 후두부 및 경부의 폐쇄적 외상으로 갑상연골 골절과 좌측 성대마비, 연하장애 및 우측 쇄골 골절을 보인 환자에게서 갑상연골 정복술을 시행 후 술후 2개월에 상기 증세의 호전을 보인 예를 경험하였기에 문헌고찰과 함께 보고하는 바이다.

  • PDF

The effects of length of residence (LOR) on voice onset time (VOT)

  • Kim, Mi-Ryoung
    • 말소리와 음성과학
    • /
    • 제12권4호
    • /
    • pp.9-17
    • /
    • 2020
  • Changes in the first language (L1) sound system as a result of acquiring a second language (L2) (i.e., phonetic drift) have received considerable attention from a variety of speakers, settings, and environments. Less attention has been given to phonetic drift in adult speakers' L2 learning as their length of residence in America (LOR) increases. This study examines the effects of LOR on voice onset time (VOT) in L1 Korean stops. Three different groups of Korean adult learners of L2 English were compared to assess how malleable their L1 representations are in terms of LOR and whether there is any relationship between L1 change and L2 acquisition. The results showed that the effect of LOR was linguistically unimportant in the production of Korean stops. However, VOT merger as evidence of sound change in Korean stops were robust in the speech production of most of the female speakers across the groups. The results suggest that L2 English may not be the primary cause of L1 sound change. For generalizability, further study is necessary to see whether other acoustic cues show a similar pattern.

L1-L2 Transfer in VOT and f0 Production by Korean English Learners: L1 Sound Change and L2 Stop Production

  • Kim, Mi-Ryoung
    • 말소리와 음성과학
    • /
    • 제4권3호
    • /
    • pp.31-41
    • /
    • 2012
  • Recent studies have shown that the stop system of Korean is undergoing a sound change in terms of the two acoustic parameters, voice onset time (VOT) and fundamental frequency (f0). Because of a VOT merger of a consonantal opposition and onset-f0 interaction, the relative importance of the two parameters has been changing in Korean where f0 is a primary cue and VOT is a secondary cue in distinguishing lax from aspirated stops in speech production as well as perception. In English, however, VOT is a primary cue and f0 is a secondary cue in contrasting voiced and voiceless stops. This study examines how Korean English learners use the two acoustic parameters of L1 in producing L2 English stops and whether the sound change of acoustic parameters in L1 affects L2 speech production. The data were collected from six adult Korean English learners. Results show that Korean English learners use not only VOT but also f0 to contrast L2 voiced and voiceless stops. However, unlike VOT variations among speakers, the magnitude effect of onset consonants on f0 in L2 English was steady and robust, indicating that f0 also plays an important role in contrasting the [voice] contrast in L2 English. The results suggest that the important role of f0 in contrasting lax and aspirated stops in L1 Korean is transferred to the contrast of voiced and voiceless stops in L2 English. The results imply that, for Korean English learners, f0 rather than VOT will play an important perceptual cue in contrasting voiced and voiceless stops in L2 English.

밀폐공간 구조 요구자를 위한 더미 표준화 개발 방안 (Plan for the Development of a Standardized Dummy for Persons in Need of Rescue in a Confined Space)

  • 최서연;이동호;김형준
    • 대한안전경영과학회지
    • /
    • 제18권4호
    • /
    • pp.99-105
    • /
    • 2016
  • This study was conducted to develop a dummy in an environment similar to the human body, to prepare a standard for evaluation and to present the process of the production in order to evaluate the performance of the robot that can detect the persons needing rescue in a confined space, who are difficult for fire-fighting officials to rescue in case of fire and disaster. As a result, a standard for evaluation was developed and standardized into four parts 'Normal,' 'Risk Stage 1,' 'Risk Stage 2' and 'Risk Stage 3'based on the number of breath cycles, carbon dioxide concentration, core temperature and criteria for hearing to recognize the voice. In addition, in order to produce a dummy, fever, breathing capacity and voice output function were compared and analyzed. This study has significance that it built up basic data of the method of producing the actual dummy, by presenting characteristics and controlling methods using the waterproof insulation heating coil for the function, solenoid valve for the consecutive output of breathing capacity and USB program sound board for voice output.

A perception-based analysis of voice onset time (VOT) dissimilation in Korean

  • Hijo Kang;Mira Oh
    • 말소리와 음성과학
    • /
    • 제16권1호
    • /
    • pp.25-31
    • /
    • 2024
  • This study examines the perceptual motivation behind dissimilation. Consistent with previous arguments suggesting that dissimilation originates from perception rather than production (Coetzee, 2005; Kiparsky, 2003; Scheer, 2013), we hypothesized that an oral stop with short of voice onset time (VOT) would be recognized as non-aspirated more often when it is followed by an aspirated stop with a long VOT. This hypothesis was tested through a perception experiment in which 32 Korean listeners made judgments on the first consonant of C1VC2V words manipulated with C1 VOT and C2 types. The results revealed that aspirated-based C1 was recognized as aspirated or tense depending on the duration of VOT, while lenis-based C1 was consistently recognized as lenis. The dissimilatory effect of aspirated C2 was confirmed as anticipated, and furthermore, tense C2 increased the ratio of tense responses more than aspirated C2. These results provide evidence of a perceptual bias against recurrent aspirated stops, which may play a role in activating a dissimilatory rule or constraint in a language. The assimilatory effect of tense C2 is in consistent with findings indicating that word-initial tensification is facilitated by the following tense stop in Korean (Kang & Oh, 2016; H. Kim, 2016).