• Title/Summary/Keyword: normal voice

A comparison of acoustic measures among the microphone types for smartphone recordings in normal adults (정상 성인에서 스마트폰 녹음을 위한 마이크 유형 간 음향학적 측정치 비교)

  • Jeong In Park;Seung Jin Lee
    • Phonetics and Speech Sciences
    • /
    • v.16 no.2
    • /
    • pp.49-58
    • /
    • 2024
  • This study aimed to compare the acoustic measurements of speech samples recorded from individuals with normal voices using various devices: the Computerized Speech Lab (CSL), a unidirectional wired pin-microphone (WIRED) suitable for smartphones, the built-in omnidirectional microphone (SMART) of smartphones, and Bluetooth-connected wireless earphones, specifically the Galaxy Buds2 Pro (WIRELESS). The study included 40 normal adults (12 males and 28 females) who had not visited an otolaryngologist for respiratory diseases within the past three months. Participants performed sustained vowel /a/ phonation for four seconds and reading tasks with sentences ("Walk") and paragraphs ("Autumn") in a sound-treated booth. Recordings were made simultaneously with the four devices and synchronized to the CSL-recorded samples for analysis using the MDVP, ADSV, and VOXplot programs. Compared with CSL, the Cepstral Spectral Index of Dysphonia (CSIDV, CSIDS) and Acoustic Voice Quality Index (AVQI) values were lower in WIRED and higher in SMART. The opposite trend was observed for the L/H spectral ratios (SRV and SRS), and WIRELESS showed task-specific discrepancies. Furthermore, both the fundamental frequency (F0) and the cepstral peak prominence of the vowel samples (CPPV) had intraclass correlation coefficient (ICC) values above 0.9, indicating high reliability. F0 and CPPV were therefore considered highly reliable for voice recordings across different microphone types. However, caution should be exercised when analyzing and interpreting variables such as the SR, CSID, and AVQI, which may be influenced by the type of microphone used.
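
The ICC comparison across devices can be illustrated with a minimal sketch. The abstract does not say which ICC form was used, so the function below assumes ICC(3,1) (two-way mixed effects, consistency, single measurement); the function name and the F0 values are hypothetical and are not data from the study.

```python
from typing import Sequence


def icc_consistency(scores: Sequence[Sequence[float]]) -> float:
    """ICC(3,1): two-way mixed effects, consistency, single measurement.

    scores[i][j] is the value for speaker i recorded with device j.
    The choice of ICC(3,1) is an assumption; the abstract does not name the form.
    """
    n = len(scores)      # number of speakers (rows)
    k = len(scores[0])   # number of devices (columns)
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    # Two-way ANOVA sums of squares without replication.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (ms_rows + (k - 1) * ms_error)


# Hypothetical F0 values (Hz): three speakers, two devices, constant 1 Hz offset.
f0 = [[100.0, 101.0], [120.0, 121.0], [140.0, 141.0]]
print(round(icc_consistency(f0), 3))
```

Under the consistency definition, a constant offset between devices does not lower the coefficient; only device-by-speaker disagreement does. An absolute-agreement form would penalize the offset as well.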

A Basic Study on the Differential Diagnostic System of Laryngeal Diseases using Hierarchical Neural Networks (다단계 신경회로망을 이용한 후두질환 감별진단 시스템의 개발)

  • 전계록;김기련;권순복;예수영;이승진;왕수건
    • Journal of Biomedical Engineering Research
    • /
    • v.23 no.3
    • /
    • pp.197-205
    • /
    • 2002
  • The objective of this paper is to implement a classifier for the differential diagnosis of laryngeal diseases from acoustic signals acquired in a noisy room. For this purpose, voice signals of the vowel /a/ were collected from patients in a soundproof chamber and mixed with noise. The acoustic parameters were then analyzed, and hierarchical neural networks were applied to classify the data. The classifier had a structure of five-step hierarchical neural networks. The first neural network separated normal and benign cases from malignant laryngeal disease cases. The second network separated normal cases from benign laryngeal disease cases. The third network distinguished polyp, nodule, and palsy among the benign laryngeal cases. Glottic cancer cases were discriminated into T1, T2, T3, and T4 by the fourth and fifth networks. All the neural networks were based on the multilayer perceptron model, which classifies non-linear patterns effectively, and were trained with an error back-propagation algorithm. We chose acoustic parameters for classification by investigating the distribution of laryngeal diseases and pilot classification results for the parameters derived from MDVP. The classifier was tested with the chosen parameters to find the optimum ones. The networks were then improved by adding pre-processing steps such as linear and z-score transformation. Results showed that 90% of T1 and 100% of T2-4 cases were correctly distinguished. In addition, 88.23% of vocal polyps and 100% of normal cases, vocal nodules, and vocal cord paralysis were correctly classified from the data collected in a noisy room.
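
The five-step cascade and the z-score pre-processing described above might be sketched as follows. This is an illustration only: the stage keys, the label strings, and the stub decision rules are hypothetical stand-ins for the trained multilayer perceptrons. The abstract also does not say how T1 to T4 are divided between the fourth and fifth networks, so the split used here (T1 versus T2-T4, then T2/T3/T4) is an assumption.

```python
from statistics import mean, pstdev
from typing import Callable, Dict, List, Sequence

Features = Sequence[float]
Net = Callable[[Features], str]


def z_score(column: Sequence[float]) -> List[float]:
    """Standardize one acoustic parameter to zero mean and unit variance."""
    mu = mean(column)
    sigma = pstdev(column)
    if sigma == 0:
        return [0.0] * len(column)
    return [(x - mu) / sigma for x in column]


def classify(x: Features, nets: Dict[str, Net]) -> str:
    """Route one sample through the cascade; each stage solves one sub-problem."""
    # Stage 1: malignant vs. everything else (normal and benign).
    if nets["stage1"](x) == "malignant":
        # Stages 4 and 5 (assumed split): T1 vs. T2-T4, then T2 / T3 / T4.
        if nets["stage4"](x) == "T1":
            return "T1"
        return nets["stage5"](x)
    # Stage 2: normal vs. benign.
    if nets["stage2"](x) == "normal":
        return "normal"
    # Stage 3: polyp / nodule / palsy among the benign cases.
    return nets["stage3"](x)


# Hypothetical stand-ins for trained networks, keyed on a single feature.
stub_nets: Dict[str, Net] = {
    "stage1": lambda x: "malignant" if x[0] > 2.0 else "non-malignant",
    "stage2": lambda x: "normal" if x[0] < 0.0 else "benign",
    "stage3": lambda x: "polyp",
    "stage4": lambda x: "T1" if x[0] < 3.0 else "T2-T4",
    "stage5": lambda x: "T3",
}
print(classify([-1.0], stub_nets), classify([1.0], stub_nets), classify([2.5], stub_nets))
```

Splitting the decision this way lets each network be trained on a smaller, more homogeneous subset than a single nine-class classifier would see.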

Validity and reliability of Korean version of quality of life questionnaire related with music perception and engagement of the elderly (난청노인의 한국어판 음악지각과 참여와 관련된 삶의 질 설문지의 타당도와 신뢰도)

  • Lee, Do-Hye;Choi, Chul-Hee
    • The Journal of the Acoustical Society of Korea
    • /
    • v.41 no.1
    • /
    • pp.87-98
    • /
    • 2022
  • The purpose of this study is to develop the Korean version of the Music-Related Quality of Life questionnaire (K-MRQoL) for the elderly. The K-MRQoL consists of Musical Ability, Attitude, Activity Frequency (PART 1) and Musical Ability, Attitude, Activity Importance (PART 2). Each part consists of Music Perception with 11 items and Music Engagement with 7 items. The validity and reliability of the K-MRQoL were assessed with Pearson's correlation coefficients, Cronbach's alpha, and independent t-tests in a total of 30 elderly people with normal hearing and 30 elderly people with hearing loss from local welfare centers and nursing homes. The correlation coefficients between the total scores and PART 1 and PART 2 ranged from .701 to .948 and from .598 to .926, respectively. The internal consistency between the total and PART 1 and PART 2 ranged from .846 to .931 and from .838 to .918, respectively. The test-retest correlations were .979, .970, and .979 for the total, PART 1, and PART 2, respectively. The correlation between the K-MRQoL and the Quality of Communication Life Scale was .449. There were significant differences in the total, PART 1, and PART 2 scores between the elderly with normal hearing and those with hearing loss. This indicates that the K-MRQoL can be used as a clinical tool to evaluate music-related quality of life in the elderly with normal hearing or hearing loss.
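
For readers unfamiliar with the internal-consistency statistic reported above, the following is a minimal sketch of Cronbach's alpha using the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name and the response matrix are hypothetical and are not the study's data.

```python
from statistics import variance
from typing import Sequence


def cronbach_alpha(responses: Sequence[Sequence[float]]) -> float:
    """Cronbach's alpha for a respondents-by-items matrix.

    responses[i][j] is respondent i's score on item j.
    """
    k = len(responses[0])  # number of items
    # Sample variance of each item across respondents.
    item_vars = [variance([row[j] for row in responses]) for j in range(k)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical ratings: five respondents, three items.
ratings = [[3, 4, 3], [2, 2, 3], [5, 4, 5], [1, 2, 1], [4, 5, 4]]
print(round(cronbach_alpha(ratings), 3))
```

Alpha approaches 1 when the items rise and fall together across respondents, which is the pattern the reported values of .838 to .931 suggest.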

Realization of an IEEE 802.11g VoWLAN Terminal with Support of Adaptable Power Save and QoS During a Call (통화 중 적응적 Power Save와 QoS 지원이 가능한 IEEE 802.11g VoWLAN 단말기 구현)

  • Kwon, Sung-Su;Lee, Jong-Chul
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.31 no.10A
    • /
    • pp.1003-1013
    • /
    • 2006
  • An 802.11g VoWLAN (Voice over Wireless LAN) terminal has a serious problem: its talk time is less than 30% of that of an 802.11b terminal. It is almost impossible to reach the talk time of the 802.11b MAC transmission method because IEEE 802.11g uses OFDM modulation, a multi-carrier method that, at 54 Mbps, is faster than normal modulation. In this paper, a new concept of holdover time is proposed for the first time as a power saving method during a call with an 802.11g terminal. However, the increase in the number of engaged terminals that results from the holdover time causes a QoS problem, because the number of back-offs and then the contention window increase. To solve the QoS problem, a new approach is suggested in which the sequence number of 802.11 G.711 frames on the downlink is analyzed in the MAC of the terminal and the holdover time is changed according to the loss rate. In addition, the current consumption of 802.11b/g and the performance of MAC parameters under busy traffic caused by an increase in the number of terminals are analyzed, and real data obtained using VQT and Airopeek are analyzed.

A Study of VR Interaction for Non-contact Hair Styling (비대면 헤어 스타일링 재현을 위한 VR 인터렉션 연구)

  • Park, Sungjun;Yoo, Sangwook;Chin, Seongah
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.2
    • /
    • pp.367-372
    • /
    • 2022
  • With the recent advent of the New Normal era, realistic and non-contact technologies are receiving social attention. However, research in the hair styling field has concentrated on hair simulation: the direction of the hair itself, individual movements, and modeling. To provide an improved practice environment that meets the demands of the times, this study proposes a non-contact hair styling VR system. In the theoretical review, we studied existing haircut research, which tends to focus mainly on force-based feedback. Interactive haircut work in a virtual environment, as addressed in this paper, has not yet been studied. VR controllers capable of finger tracking support the movements needed for beauty work, enabling selection, cutting, and rotation of beauty tools, and we built a non-contact collaboration environment. We conducted two experiments on interactive hair cutting in VR. The first was a haircut operation synchronized using finger tracking and a holding hook animation, with position correction applied for accurate motion. The second was a real-time interactive cutting operation in a multi-user virtual collaboration environment, which made it possible for instructors and learners to communicate with each other through VR HMD built-in microphones and Photon Voice in non-contact situations.

Convergence Development of Video and E-learning System for Education Disabled Students (장애학생의 학습을 위한 화상과 이러닝 시스템의 융합 개발)

  • Son, Yeob-Myeong;Jung, Byeong-Soo
    • Journal of the Korea Convergence Society
    • /
    • v.6 no.4
    • /
    • pp.113-119
    • /
    • 2015
  • This paper presents an alternative educational environment for students who cannot be served by the regular school system alone. The system is intended for students with disabilities and is designed in particular so that students who have difficulty using their hands can use it. The development objective of the video e-learning system is to enable self-directed learning by students with disabilities. The e-learning system consists of a Web-based multimedia system, a video conferencing system, and a system that converts voice to text, so that hearing-impaired students and teachers can communicate 1:1 in both directions through the chat system. With the e-learning system developed in this paper, 1:1 training between teachers and students with disabilities is conducted using a two-way communication algorithm.

Efficient Braille Keyboard of Smart Phone for the Blind (전맹인을 위한 효율적인 스마트폰 점자키보드 시스템 기술)

  • Koo, Min-Su;Kim, Byung-Gyu;Shin, Hyun-Cheul
    • Convergence Security Journal
    • /
    • v.15 no.2
    • /
    • pp.11-17
    • /
    • 2015
  • According to a report by the NIA (National Information Society Agency), the digital divide is a vicious circle that widens the gap in quality of life between people who have access to information and those who do not. In a smartphone society in particular, there is a large gap between people with and without disabilities. This paper proposes a braille keyboard system for the blind to address this problem. It is designed for portability, convenience, and quick text typing, and is based on Bluetooth communication. In a text input test, all texts (Korean, English, and numbers) were entered accurately, and the typing speed was about 22 characters per minute (c/m). In addition, a text2voice function was designed to make key input more accurate. Popular applications such as calls, alarms, messages, KakaoTalk, internet, and music were operated without problems using the proposed braille keyboard system. The proposed technique is expected to help eliminate the digital divide by expanding the base of smartphone users.

The First Formant Characteristics in Vocalize of One Soprano (소프라노 1인의 모음곡 발성 시 제 1 포먼트의 변화양상)

  • Song, Yun-Kyung;Jin, Sung-Min
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.16 no.1
    • /
    • pp.10-14
    • /
    • 2005
  • Background and Objectives: Vowels are characterized on the basis of formant patterns. The first formant (F1) is determined by the high-low placement of the tongue, and the second formant (F2) by its front-back placement. The fundamental frequency (F0) of a soprano often exceeds the normal frequency of the first formant, and vocal intensity is boosted when F0 is high and a harmonic coincides with a formant. This is called formant tuning. Experienced singers have thus learned to tune their formants over a reasonable range by lowering the tongue to maximize vocal intensity. The current study therefore aimed to identify formant tuning in one experienced soprano by comparing the first formants of the vowel [i] in three types of voice production: speech, ascending scale, and vocalize. Materials and Method: All voice recordings of the vowel [i] in speech, ascending scale (from F4 note to A4 note), and vocalize ("Ridente la calma") were made with a digital audio tape recorder in a sound-treated room. The captured data were analyzed by the long-term average (LTA) power spectrum using the FFT algorithm of the Computerized Speech Lab (CSL, Kay Elemetrics, Model 4300B). Results: The first formant of the vowel [i] was 238 Hz in speech, whereas in the ascending scale it was 377 Hz, 405 Hz, and 453 Hz at the F4 (349 Hz), G4 (392 Hz), and A4 (440 Hz) notes, and 722 Hz, 820 Hz, and 918 Hz at the F5 (698 Hz), G5 (784 Hz), and A5 (880 Hz) notes, respectively. In vocalize, the first formants of [i] were 380 Hz, 398 Hz, and 453 Hz at the F4, G4, and A4 notes, and 720 Hz, 821 Hz, and 890 Hz at the F5, G5, and A5 notes, respectively. Conclusion: These results show that the first formant in the ascending scale and in vocalize stayed above the fundamental frequency at high pitch. This finding implies that the formant tuning of the vowel [i] observed in the ascending scale also occurred in vocalize.
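
The formant-tuning relationship can be checked against the ascending-scale values reported in the abstract. The sketch below finds the harmonic of F0 nearest to F1 and expresses the distance in cents; the helper name and the use of cents as the distance measure are illustrative choices, not the authors' method.

```python
import math
from typing import List, Tuple

# (note, F0 in Hz, F1 in Hz) for the ascending scale, as reported in the abstract.
ASCENDING_SCALE: List[Tuple[str, float, float]] = [
    ("F4", 349.0, 377.0),
    ("G4", 392.0, 405.0),
    ("A4", 440.0, 453.0),
    ("F5", 698.0, 722.0),
    ("G5", 784.0, 820.0),
    ("A5", 880.0, 918.0),
]


def nearest_harmonic(f0: float, formant: float) -> Tuple[int, float]:
    """Return the harmonic number closest to the formant and the offset in cents."""
    n = max(1, round(formant / f0))
    # Positive cents means the formant sits above the n-th harmonic.
    cents = 1200.0 * math.log2(formant / (n * f0))
    return n, cents


for note, f0, f1 in ASCENDING_SCALE:
    n, cents = nearest_harmonic(f0, f1)
    print(f"{note}: F1 is {cents:+.0f} cents from harmonic {n}")
```

For every note in the scale, the nearest harmonic is the fundamental itself and F1 sits slightly above it, which is consistent with the conclusion that F1 tracks F0 at these pitches rather than staying at its 238 Hz speech value.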

The Development and Application of Web-Based Learning System for Correct Use of Internet Communication Words in Elementary Schools ("바른말 고운말" 교실 웹기반 학습시스템 개발 및 적용)

  • Yoon, Hee-Soo;Kim, Dong-Ho
    • Journal of The Korean Association of Information Education
    • /
    • v.8 no.2
    • /
    • pp.191-201
    • /
    • 2004
  • With the widespread use of personal computers and the expansion of network access, internet use has become popular, and communication by text message is now far more common than communication by voice or image. The side effects of internet communication language include gaps between social classes, breakdown of communication between generations, abusive expressions, and obstacles to the mental development of juveniles. These appear in the form of slang and vulgar words and have a negative effect on mother-tongue education and on children's everyday language use. To deal with these problems, we developed a new web-based education system, the "Barun Mal, Goeun Mal" class, based on an analysis of learners' requirements, and applied it to real classes to verify its effectiveness. We found that the system increased learners' interest and educational effectiveness and contributed to the proper use of language.

Vein Wrapping Technique for Nerve Reconstruction in Patients with Thyroid Cancer Invading the Recurrent Laryngeal Nerve

  • Yoo, Young-Moon;Lee, Il-Jae;Lim, Hyo-Seob;Kim, Joo-Hyoung;Park, Myong-Chul
    • Archives of Plastic Surgery
    • /
    • v.39 no.1
    • /
    • pp.71-75
    • /
    • 2012
  • Recurrent laryngeal nerve paralysis is the most common and serious complication after thyroid cancer surgery. The objective of this study was to report the advantages of the vein wrapping technique for nerve reconstruction in patients with thyroid cancer invading the recurrent laryngeal nerve and its effects on postoperative phonatory function. The subjects were three patients who underwent resection of the recurrent laryngeal nerve during surgical extirpation of papillary thyroid cancer. Free ansa cervicalis nerve graft or direct neurorrhaphy with a vein wrapping technique was used to facilitate nerve regeneration, protect the anastomosed nerve site mechanically, and prevent neuroma formation. One-year postoperative laryngoscopic examination revealed good vocal cord mobility. Maximum phonation time ($19.5{\pm}0.3$ sec) was longer than a previously-reported value in conventional reconstruction patients ($18.8{\pm}6.6$ sec). The present phonation efficiency index ($7.88{\pm}0.78$) was higher than that previously calculated in conventional reconstruction ($7.59{\pm}2.82$). The mean value of the Voice Handicap Index-10 was 6, which was within the normal range. This study demonstrates improvement in phonation indices measured 1 year after recurrent laryngeal nerve reconstruction. Our results confirm that the vein wrapping technique has theoretical advantages and could be favored over conventional reconstruction techniques for invenerate nerve injuries.