• Title/Summary/Keyword: Speech pattern


Voice range profile in premutation, mutation, and postmutation of men (변성이전, 변성 및 변성이후 남성의 발성범위 프로파일)

  • Kim, Jaeock;Lee, Seung Jin
    • Phonetics and Speech Sciences / v.13 no.4 / pp.89-100 / 2021
  • This study compared voice range profiles (VRPs) obtained with the glissando and simplified VRP methods in 57 men in the premutation (8-13 years), mutation (11-16 years), and postmutation (10-24 years) stages. The difference between the modal and falsetto areas measured with the two VRP methods was also compared. The average fundamental frequency (F0) followed the order premutation > mutation > postmutation. The maximum F0 (F0max), the F0 range (F0range), the maximum intensity (Imax), and the intensity range (Irange) were lowest in the mutation stage, and these variables were higher in the falsetto area than in the modal area with both methods. In addition, most VRP variables were higher with the glissando method than with the simplified VRP, but the differences were not significant. The study shows that in men in the mutation stage, temporary anatomical and physiological changes of the larynx alter the mechanism of vocal fold vibration, so the VRP shows a pattern different from that of the other age groups. Both the glissando VRP and the simplified VRP are suitable for clinical practice when performed by experienced examiners. When measuring a VRP, it is necessary to measure not only the falsetto area but also the modal area.

The Proposal of the Fuzzed Lyapunov Dimension at Speech Signal (음성에 대한 퍼지-리아프노프 차원의 제안)

  • In, Joon-Hawn;Yoo, Byong-Wook;Ryu, Seok-Han;Jung, Myong-Jin;Kim, Chang-Seok
    • Journal of the Korean Institute of Telematics and Electronics T / v.36T no.4 / pp.30-37 / 1999
  • This study proposes the Fuzzy Lyapunov dimension, which evaluates the quantitative variation of an attractor, and uses it to evaluate speaker recognition. The proposed dimension is shown to be superior in discriminating between standard reference pattern attractors and, with respect to the test pattern attractor, is verified to be a speaker recognition parameter that absorbs pattern variation. To evaluate the Fuzzy Lyapunov dimension as a speaker recognition parameter, the misrecognition caused by discrimination error was estimated for each speaker and standard reference pattern, and the validity of the parameter was tested experimentally. The speaker recognition experiment yielded a recognition rate of 97.0%, confirming that the Fuzzy Lyapunov dimension is suitable as a speaker recognition parameter.


Acoustic characteristics of speech-language pathologists related to their subjective vocal fatigue (언어재활사의 주관적 음성피로도와 관련된 음향적 특성)

  • Jeon, Hyewon;Kim, Jiyoun;Seong, Cheoljae
    • Phonetics and Speech Sciences / v.14 no.3 / pp.87-101 / 2022
  • In addition to administering a questionnaire (J-survey) on subjective vocal fatigue, voice samples were collected before and after speech-language pathology sessions from 50 female speech-language pathologists in their 20s and 30s in the Daejeon and Chungnam areas. We identified significant differences in Korean Vocal Fatigue Index scores between the fatigue and non-fatigue groups, with the most prominent differences in sections one and two. Regarding acoustic phonetic characteristics, both groups showed a pattern in which low-frequency band energy was relatively low and high-frequency band energy increased after the treatment sessions. This trend was well reflected in the low-to-high ratio of vowels, the LTAS slope, the energy in the third formant, and the energy in the 4,000-8,000 Hz range. A difference between the groups was observed only in the vowel energy of the low-frequency band (0-4,000 Hz) before treatment, with the non-fatigue group having a higher value than the fatigue group. This characteristic could be interpreted as a result of voice abuse and higher muscle tonus caused by long-term voice work. The perturbation parameter shimmer local was lowered in the non-fatigue group after treatment, and the noise-to-harmonics ratio (NHR) was lowered in both groups following treatment. The decrease in NHR and the fall in shimmer local could be attributed to vocal cord hypertension, but the effective voice use of speech-language pathologists may also have contributed to this effect, especially in the non-fatigue group. In the non-fatigue group, the rahmonics-to-noise ratio increased significantly after treatment, indicating that the harmonic structure was more stable after treatment.

Design of Parallel Input Pattern and Synchronization Method for Multimodal Interaction (멀티모달 인터랙션을 위한 사용자 병렬 모달리티 입력방식 및 입력 동기화 방법 설계)

  • Im, Mi-Jeong;Park, Beom
    • Journal of the Ergonomics Society of Korea / v.25 no.2 / pp.135-146 / 2006
  • Multimodal interfaces are recognition-based technologies that interpret and encode hand gestures, eye gaze, movement patterns, speech, physical location, and other natural human behaviors. A modality is the type of communication channel used for interaction. It also covers the way an idea is expressed or perceived, or the manner in which an action is performed. Multimodal interfaces are the technologies that constitute multimodal interaction processes, which occur consciously or unconsciously during communication between human and computer. The input/output forms of multimodal interfaces therefore differ from those of existing interfaces. Moreover, different people show different cognitive styles, and individual preferences play a role in the selection of one input mode over another. To develop an effective design for multimodal user interfaces, the input/output structure therefore needs to be formulated through research on human cognition. This paper analyzes the characteristics of each human modality and suggests combination types of modalities and dual-coding for formulating multimodal interaction. It then designs a multimodal language and an input synchronization method according to the granularity of input synchronization. To effectively guide the development of next-generation multimodal interfaces, substantial cognitive modeling will be needed to understand the temporal and semantic relations between different modalities, their joint functionality, and their overall potential for supporting computation in different forms. This paper is expected to show multimodal interface designers how to organize and integrate human input modalities when interacting with multimodal interfaces.

Isolated Word Recognition using Modified Dynamic Averaging Method (변형된 Dynamic Averaging 방법을 이용한 단독어인식)

  • Jeoung, Eui-Bung;Ko, Young-Hyuk;Lee, Jong-Arc
    • The Journal of the Acoustical Society of Korea / v.10 no.2 / pp.23-28 / 1991
  • This paper studies speaker-independent isolated word recognition and proposes a DTW speech recognition system that uses a modified dynamic averaging method to build the reference patterns. 57 city names are selected as the recognition vocabulary, and 2th LPC cepstrum coefficients are used as the feature parameter. For comparison, in addition to the recognition experiment using the modified dynamic averaging method, recognition experiments using the causal method, the dynamic averaging method, the linear averaging method, and the clustering method are performed with the same data under the same conditions. The experimental results show that DTW with the modified dynamic averaging method achieves the best recognition rate, 97.6 percent.
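
  The DTW matching step named in the abstract above can be illustrated with a minimal sketch. This is a generic textbook DTW, not the authors' system: the Euclidean frame distance, the unconstrained step pattern, and the names `frame_distance`, `dtw_distance`, and `recognize` are all assumptions made for illustration, and the modified dynamic averaging used to build reference patterns is not reproduced here.

```python
from math import inf, sqrt


def frame_distance(a, b):
    """Euclidean distance between two feature frames (an assumed local cost)."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dtw_distance(seq_a, seq_b):
    """Accumulated cost of the best DTW alignment between two frame sequences."""
    n, m = len(seq_a), len(seq_b)
    # cost[i][j] = best accumulated cost aligning seq_a[:i] with seq_b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = frame_distance(seq_a[i - 1], seq_b[j - 1])
            # Allowed predecessor steps: vertical, horizontal, diagonal
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def recognize(test, references):
    """Return the label of the reference pattern closest to the test utterance."""
    return min(references, key=lambda word: dtw_distance(test, references[word]))


# Hypothetical one-dimensional "features" standing in for cepstrum frames.
references = {
    "seoul": [[0.0], [1.0], [2.0]],
    "busan": [[5.0], [5.0], [5.0]],
}
test_utterance = [[0.0], [0.0], [1.0], [2.0]]
best_match = recognize(test_utterance, references)
```

  In this toy setup the test utterance is a time-stretched copy of the first reference, so warping absorbs the duration difference. A real recognizer would use multi-dimensional cepstral frames and usually a path constraint.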


A Method on the Improvement of Speaker Enrolling Speed for a Multilayer Perceptron Based Speaker Verification System through Reducing Learning Data (다층신경망 기반 화자증명 시스템에서 학습 데이터 감축을 통한 화자등록속도 향상방법)

  • 이백영;황병원;이태승
    • The Journal of the Acoustical Society of Korea / v.21 no.6 / pp.585-591 / 2002
  • While the multilayer perceptron (MLP) provides several advantages over existing pattern recognition methods, it requires a relatively long time for learning. This prolongs speaker enrollment in a speaker verification system that uses the MLP as a classifier. This paper proposes a method that shortens the enrollment time by adopting the cohort speakers method used in existing parametric systems and reducing the number of background speakers required to train the MLP. The effect of the method is confirmed by an experiment that applies it to a continuant and MLP-based speaker verification system.

A Characteristic EEG Pattern of Angelman Syndrome

  • Yoon, Joong-Soo;Song, Woon-Heung;Choi, Hwa-Sik
    • Korean Journal of Clinical Laboratory Science / v.42 no.2 / pp.97-102 / 2010
  • Two new female cases of Angelman syndrome (AS) are described, diagnosed on the basis of clinical features (dysmorphic facial features, severe mental retardation with absent speech, peculiar jerky movements, ataxic gait, and paroxysms of inappropriate laughter) and neurophysiological findings. Neither failure to detect the deletion of the long arm of chromosome 15 nor the absence of epileptic seizures was considered sufficient to exclude a diagnosis of AS. Feeding problems, developmental delay, and early signs of ataxia, especially tremor on handling objects and unstable posture when seated, proved effective as clinical markers for early diagnosis of AS. Most authors agree on the existence of three main EEG patterns in AS, which may appear in isolation or in various combinations in the same patient. The pattern most frequently observed in children has prolonged runs of high-amplitude rhythmic 2-3 Hz activity, predominantly over the frontal region, with superimposed interictal epileptiform discharges. High-amplitude rhythmic 4-6 Hz activity, prominent in the occipital regions, with spikes, which can be facilitated by eye closure, is often seen in children under the age of 12 years. The EEG findings are characteristic of AS when seen in the appropriate clinical context and can help identify AS patients at an early age, when genetic counselling may be particularly important.


Analyzing the Acoustic Elements and Emotion Recognition from Speech Signal Based on DRNN (음향적 요소분석과 DRNN을 이용한 음성신호의 감성 인식)

  • Sim, Kwee-Bo;Park, Chang-Hyun;Joo, Young-Hoon
    • Journal of the Korean Institute of Intelligent Systems / v.13 no.1 / pp.45-50 / 2003
  • Robot technology has recently developed remarkably, and emotion recognition is necessary to make a robot that people feel close to. This paper presents a simulator, and its simulation results, that recognizes or classifies emotions by learning the pitch pattern. Because pitch alone is not sufficient for recognizing emotion, we added acoustic elements and therefore analyze the relation between emotion and those elements. The simulator is composed of a DRNN (Dynamic Recurrent Neural Network) and feature extraction. The DRNN learns the pitch pattern.

The Perception-Based study of a weak syllable in English Words with Weak-Strong pattern by Korean Learners(I) (약강구조 영어 단어에 대한 초급 및 고급 영어학습자의 약음절 지각과 반응시간(I))

  • Kim, Hee-Sung;Shin, Ji-Young;Kim, Kee-Ho
    • Proceedings of the KSPS conference / 2005.11a / pp.73-77 / 2005
  • The purpose of this study is to observe how Korean learners of English perceive a weak syllable in words with a WS (weak-strong) syllable pattern. In an automated discrimination task using E-Prime, stimuli with same-word pairs (a-a, b-b) produced a higher proportion of correct answers and faster reaction times than stimuli with different-word pairs (a-b, b-a). Specifically, in the a-b or b-a stimulus structure, two factors were important in distinguishing a word with a weak syllable from one without: the familiarity (word frequency) of the stressed word following the weak syllable, and whether the weak syllable had a coda. Although Koreans with high English proficiency had faster reaction times than those with low proficiency, all Korean learners had some difficulty perceiving the weak syllable at the beginning of the word.


A Study of the Chewing Patterns in Patients with Temporomandibular Disorders by Electrognathography (Electrognathography를 이용한 측두하악장애환자의 저작양태에 관한 연구)

  • Moon-Gyu Kim;Kyung-Soo Han
    • Journal of Oral Medicine and Pain / v.20 no.2 / pp.291-306 / 1995
  • Mandibular movement is composed of border movement and functional movement. Border movement, such as maximal mouth opening, hinge opening, and lateral eccentric movement, has good reproducibility, and functional movement, such as chewing, swallowing, and speech, is also reproducible. For chewing movement in particular, individual reproducibility has been confirmed by many studies. The study of chewing patterns nevertheless remains controversial. As a new approach to raising diagnostic value, numeric parameters and morphologic characteristics could be used to evaluate the chewing pattern. This study was performed to investigate the differences between the chewing patterns of controls and of patients with temporomandibular disorders. Sixty-three patients with temporomandibular disorders participated in this study and were divided into unilaterally affected and bilaterally affected subjects. The unilaterally affected subjects were then classified into a closed lock group, a disk displacement with reduction group, and a degenerative joint disease group. To record the chewing pattern, subjects were asked to chew one piece of presoftened chewing gum on both sides, and the chewing movement was recorded with Electrognathography (Bio-Research Associates Inc., U.S.A.). The tooth contact pattern for occlusal stability (total left-right statistics) was also recorded with T-Scan (Tekscan Co., U.S.A.). The data related to chewing pattern and total left-right statistics were statistically analyzed with the SAS/STAT program. The obtained results were as follows: 1. In the patient group, the mean A-P distance and the ratio of A-P distance to vertical distance were larger than in the control group, whereas the lateral distance on the affected side and the closing velocity on the unaffected side were smaller than in the control group. 2. In unilaterally affected patients, the chewing pattern on the other side tended toward more restricted movement and slower velocity in the closed lock group and the degenerative joint disease group than in the control group and the disk displacement with reduction group. 3. In patients with bilateral degenerative joint disease, the contralateral side tended to show a larger range of motion and slower chewing velocity than the preferred chewing side. 4. Patients with mouth opening restricted to below 35 mm had higher total left-right statistics than patients with mouth opening above 35 mm. The closed lock group also had higher total left-right statistics than the disk displacement with reduction group, the degenerative joint disease group, and the control group. 5. There was some difference in the morphologic characteristics of the chewing pattern between the control group and the affected side of the unilateral patient group, but no difference between the control group and the unaffected side of the unilateral patient group. 6. In unilaterally affected patients, there were positive correlations between vertical distance and A-P distance, between vertical distance and chewing velocity, between A-P distance and chewing velocity, and between opening velocity and closing velocity.
