• Title/Summary/Keyword: speech rates

Search Result 271, Processing Time 0.023 seconds

A Study on Adaptive Model Updating and a Priori Threshold Decision for Speaker Verification System (화자 확인 시스템을 위한 적응적 모델 갱신과 사전 문턱치 결정에 관한 연구)

  • 진세훈;이재희;강철호
    • The Journal of the Acoustical Society of Korea
    • /
    • v.19 no.5
    • /
    • pp.20-26
    • /
    • 2000
  • In speaker verification system the HMM(hidden Markov model) parameter updating using small amount of data and the priori threshold decision are crucial factor for dealing with long-term variability in people voices. In the paper we present the speaker model updating technique which can be adaptable to the session-to-intra speaker variability and the priori threshold determining technique. The proposed technique decreases verification error rates which the session-to-session intra-speaker variability can bring by adapting new speech data to speaker model parameter through Baum Welch re-estimation. And in this study the proposed priori threshold determining technique is decided by a hybrid score measurement which combines the world model based technique and the cohen model based technique together. The results show that the proposed technique can lead a better performance and the difference of performance is small between the posteriori threshold decision based approach and the proposed priori threshold decision based approach.

  • PDF

Implementation of Adaptive Multi Rate (AMR) Vocoder for the Asynchronous IMT-2000 Mobile ASIC (IMT-2000 비동기식 단말기용 ASIC을 위한 적응형 다중 비트율 (AMR) 보코더의 구현)

  • 변경진;최민석;한민수;김경수
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.1
    • /
    • pp.56-61
    • /
    • 2001
  • This paper presents the real-time implementation of an AMR (Adaptive Multi Rate) vocoder which is included in the asynchronous International Mobile Telecommunication (IMT)-2000 mobile ASIC. The implemented AMR vocoder is a multi-rate coder with 8 modes operating at bit rates from 12.2kbps down to 4.75kbps. Not only the encoder and the decoder as basic functions of the vocoder are implemented, but VAD (Voice Activity Detection), SCR (Source Controlled Rate) operation and frame structuring blocks for the system interface are also implemented in this vocoder. The DSP for AMR vocoder implementation is a 16bit fixed-point DSP which is based on the TeakLite core and consists of memory block, serial interface block, register files for the parallel interface with CPU, and interrupt control logic. Through the implementation, we reduce the maximum operating complexity to 24MIPS by efficiently managing the memory structure. The AMR vocoder is verified throughout all the test vectors provided by 3GPP, and stable operation in the real-time testing board is also proved.

  • PDF

Reasons influencing the preferences of prospective patients and orthodontists for different orthodontic appliances

  • Maranon-Vasquez, Guido Artemio;Barreto, Luisa Schubach da Costa;Pithon, Matheus Melo;Nojima, Lincoln Issamu;Nojima, Matilde da Cunha Goncalves;Araujo, Monica Tirre de Souza;de Souza, Margareth Maria Gomes
    • The korean journal of orthodontics
    • /
    • v.51 no.2
    • /
    • pp.115-125
    • /
    • 2021
  • Objective: To evaluate the reasons influencing the preferences for a certain type of orthodontic appliance over another among prospective patients (PP) and orthodontists. Methods: A total of 49 PP and 51 orthodontists were asked about their preferences for the following appliances: clear aligners (CA), lingual metallic brackets (LMB), polycrystalline and monocrystalline ceramic brackets, and buccal metallic brackets (BMB). The participants rated the importance of 17 potential reasons that would explain their choices. The reasons that contributed most to these preferences were identified. Non-parametric tests (Fisher's exact, χ2 and Mann-Whitney tests) and multivariate analyses (regression and discriminant analysis) were used to assess the data (α = 0.05). Results: CA and BMB were the most chosen appliances by PP and orthodontists, respectively. LMB was the most rejected option among both groups of participants (p < 0.001). Rates of the importance of pain/discomfort, smile esthetics, finishing details, and feeding/speech impairment showed the highest differences between PP and orthodontists (p < 0.0005). Discriminant analyses showed that individuals who considered treatment time and smile esthetics as more important were more likely to prefer CA, while those who prioritized finishing details and cost were more likely to choose BMB (p < 0.05). Conclusions: Reasons related to comfort and quality of life during use were considered as more important by PP, while those related to the results and clinical performance of the appliances were considered as more relevant by orthodontists.

Reconstruction of Pharyngolaryngeal Defects with the Ileocolon Free Flap: A Comprehensive Review and How to Optimize Outcomes

  • Escandon, Joseph M.;Santamaria, Eric;Prieto, Peter A.;Duarte-Bateman, Daniela;Ciudad, Pedro;Pencek, Megan;Langstein, Howard N.;Chen, Hung-Chi;Manrique, Oscar J.
    • Archives of Plastic Surgery
    • /
    • v.49 no.3
    • /
    • pp.378-396
    • /
    • 2022
  • Several reconstructive methods have been reported to restore the continuity of the aerodigestive tract following resection of pharyngeal and hypopharyngeal cancers. However, high complication rates have been reported after voice prosthesis insertion. In this setting, the ileocolon free flap (ICFF) offers a tubularized flap for reconstruction of the hypopharynx while providing a natural phonation tube. Herein, we systematically reviewed the current evidence on the use of the ICFF for reconstruction of the aerodigestive tract. A systematic literature search was conducted across PubMed MEDLINE, Web of Science, ScienceDirect, Scopus, and Ovid MEDLINE(R). Data on the technical considerations and surgical and functional outcomes were extracted. Twenty-one studies were included. The mean age and follow-up were 54.65 years and 24.72 months, respectively. An isoperistaltic or antiperistaltic standard ICFF, patch flap, or chimeric seromuscular-ICFF can be used depending on the patients' needs. The seromuscular chimeric flap is useful to augment the closure of the distal anastomotic site. The maximum phonation time, frequency, and sound pressure level (dB) were higher with ileal segments of 7 to 15 cm. The incidence of postoperative leakage ranged from 0 to 13.3%, and the majority was occurring at the coloesophageal junction. The revision rate of the microanastomosis ranged from 0 to 16.6%. The ICFF provides a reliable and versatile alternative for reconstruction of middle-size defects of the aerodigestive tract. Its three-dimensional configuration and functional anatomy encourage early speech and deglutition without a prosthetic valve and minimal donor-site morbidity.

Detects depression-related emotions in user input sentences (사용자 입력 문장에서 우울 관련 감정 탐지)

  • Oh, Jaedong;Oh, Hayoung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.26 no.12
    • /
    • pp.1759-1768
    • /
    • 2022
  • This paper proposes a model to detect depression-related emotions in a user's speech using wellness dialogue scripts provided by AI Hub, topic-specific daily conversation datasets, and chatbot datasets published on Github. There are 18 emotions, including depression and lethargy, in depression-related emotions, and emotion classification tasks are performed using KoBERT and KOELECTRA models that show high performance in language models. For model-specific performance comparisons, we build diverse datasets and compare classification results while adjusting batch sizes and learning rates for models that perform well. Furthermore, a person performs a multi-classification task by selecting all labels whose output values are higher than a specific threshold as the correct answer, in order to reflect feeling multiple emotions at the same time. The model with the best performance derived through this process is called the Depression model, and the model is then used to classify depression-related emotions for user utterances.

Effects of Nutrient Intake on Oral Health and Chewing Difficulty by Age Group (연령층별 구강건강과 저작불편이 영양소 섭취에 미치는 영향)

  • Kim, Seol-Hee
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.19 no.2
    • /
    • pp.202-209
    • /
    • 2018
  • This study analyzed the effects of the nutrient intake on oral health and chewing difficulty according to the age group. The subjects were 5,855 participants of the third Korea National Health and Nutrition Examination Survey(KNHANES VI), 2015, Korea Centers for Disease Control and prevention and aged 20 years and over. The data were analyzed using SPSS Ver 21.0, classified as the difficulty in chewing group (DC) and no difficulty in chewing group (NDC). As a result, the DC rates were 5 times higher in the 60+ year age group (39.5%) than in the 20-39 year age group (8.1%). The DC group were experience periodontal disease (33.4%), dental caries (30.1%), diabetes (41.8%), myocardial infarction (57.3%), arthritis (44.0%), asthma (48.0%), and depression (41.9%). In addition, 86% of the DC group were experiencing speech problems. The DC group had significantly lower intakes (1446.59g), than the NDC group (1666.62g), and the protein, carbohydrate, dietary fiber and other dietary intake were significantly lower. These findings suggest that the chewing difficulty is related to the nutrient intake, and psychological status in the elderly DC group. Therefore, the care of chewing difficulties is essential for the elderly to maintain a healthy lifestyle. Accordingly, oral care and myofunctional therapy are needed to maintain oral health.

Audio Stream Delivery Using AMR(Adaptive Multi-Rate) Coder with Forward Error Correction in the Internet (인터넷 환경에서 FEC 기능이 추가된 AMR음성 부호화기를 이용한 오디오 스트림 전송)

  • 김은중;이인성
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.26 no.12A
    • /
    • pp.2027-2035
    • /
    • 2001
  • In this paper, we present an audio stream delivery using the AMR (Adaptive Multi-Rate) coder that was adopted by ETSI and 3GPP as a standard vocoder for next generation IMT-2000 service in which includes combined sender (FEC) and receiver reconstruction technique in the Internet. By use of the media-specific FEC scheme, the possibility to recover lost packets can be much increased due to the addition of repair data to a main data stream, by which the contents of lost packets can be recovered. The AMR codec is based on the code-excited linear predictive (CELP) coding model. So we use a frame erasure concealment for CELP-based coders. The proposed scheme is evaluated with ITU-T G.729 (CS-ACELP) coder and AMR - 12.2 kbit/s through the SNR (Signal to Noise Ratio) and the MOS (Mean Opinion Score) test. The proposed scheme provides 1.1 higher in Mean Opinion Score value and 5.61 dB higher than AMR - 12.2 kbit/s in terms of SNR in 10% packet loss, and maintains the communicab1e quality speech at frame erasure rates lop to 20%.

  • PDF

An Analysis on the Suicide Concept, its Religious Circuit and Construction Way: Focused on the cases of the Korean Catholic and Protestant Churches (자살 관념의 종교적 회로와 구성 방식에 관한 분석: 한국 가톨릭교회와 개신교를 중심으로)

  • Park, Sang Un
    • The Critical Review of Religion and Culture
    • /
    • no.31
    • /
    • pp.255-287
    • /
    • 2017
  • This paper analyzes the religious circuit of suicidal concept based on verbal expression and ritual acts, which are found in the suicide discourse of Korean Catholic Church and Protestant Church. In the relationship of suicide and religion, it is easily overlooked the religious circuit and its construction that forms the concept of suicide among the religious laymen. It is assumed that the belief system of traditional religions prohibits suicide and the laymen accordingly construct a perception or concept of suicide along with this belief system. Various studies on this subject have proved it. However, in order to understand the religious way of constructing the concept of suicide on a personal level, it is necessary to pay attention to the religious environment in which the concepts and emotions of suicide circulate. The laymen do not passively and perfectly accept the finely established suicide concept provided by the doctrine or the theology. Rather, the laymen tend to collect the pieces of concept over the suicide that are drifting in the religious environment of his/her daily routine life and to make an concept of suicide in an incomplete form. We can find the unstable and imperfect traits of such a suicide concept through the experience of suicide survivors who have a religious background. For the suicide survivors with religious beliefs, they resist the formal doctrinal and theological provisions to suicide, or try to understand the notion of suicide in their own contexts. In terms of linguistic expressions and ritual acts relating to suicide, the attentions are differently directed in the public and the private domain among the religious groups. Considering on the high rates of suicide in Korean society, the Korean Catholic Churches are increasingly tolerant over the suicide and accept it in the public sphere. It is unlikely when comparing to the negative attitudes of the suicide in the past. However, such tolerance does not go beyond the doctrinal and ethical judgment that defines suicide as a serious sin. The once-committed lay believer's speech and gestures usually contain the various emotions, such as sadness, grief, anxiety, regretfulness, eagerness, and pain in the private spheres. The language and gestures with these emotions have been activated in the religious circuits of suicide, being extended to the religious apparatus for the person who died of suicide. In case of Protestantism, the institutional organizations, such as the particular denominations and the individual-churchism of the Korean Protestant Churches, and their own interpretations of the Bible have in the private sphere strongly effected on the linguistic expressions and the rituals related to the suicide. The religious-ethical judgment of the suicide is varied how the suicide is interpreted by the theologians and the pastors. And the ritual acts for healing the complex feelings and the psychological wounds of the suicide survivors are not actively explored and adopted yet. It makes harder to approach and heal the protestant followers since they emphasize the innermost belief and the salvation assurance faith.

A Study on the vocabulary and Problem-Solving Ability of Adolescents with Developmental Disabilities on Leisure and Recreation (발달장애 청소년의 여가 및 레크레이션에 관한 어휘 및 문제해결 능력 연구)

  • Wha-Soo Kim;Eun-Hong Kim;Ji-Won Yang;Ji-Woo Lee;Ju-Hyeon Lee
    • The Journal of the Convergence on Culture Technology
    • /
    • v.10 no.2
    • /
    • pp.107-119
    • /
    • 2024
  • The purpose of this study is to examine and analyze the vocabulary and problem-solving ability characteristics of adolescents with developmental disabilities related to leisure and recreation and use them as basic data in education and support of recreation activities for adolescents with developmental disabilities. The study participants were comprised of adolescents with developmental disabilities, divided into two groups based on their receptive language age: those under 10 years old and those 10 years and older. The results obtained through this study are as follows. First, there was a significant difference in leisure and recreation vocabulary between the two groups according to receptive language age. Second, there was a significant difference in problem-solving ability between the two groups based on their receptive language age. Third, the analysis of the correlation between leisure and recreation vocabulary and problem-solving abilities within each group revealed that the under 10 years old group showed the highest correlation in basic vocabulary and basic problem-solving abilities, while the 10 years and older group exhibited the highest correlation in intermediate and advanced levels of problem-solving abilities. Fourth, the analysis of incorrect responses to leisure and recreation vocabulary showed a high rate of selecting vocabulary related to similar topics as incorrect answers. Additionally, the analysis of overreactions to problem-solving abilities indicated an increasing tendency of incorrect responses in items requiring context comprehension. Additionally, the analysis of incorrect responses to problem-solving abilities indicated a tendency of higher error rates in items requiring context comprehension. The results of this study provide insights for discussing directions in communication-related skills education for the smooth recreation life of adolescents with developmental disabilities. Accordingly, it is expected to be utilized as foundational information for educational and support programs aimed at the successful recreation activities of adolescents with developmental disabilities.

A Study of Psychometric Function Curve for Korean Standard Monosyllabic Word Lists for Preschoolers (KS-MWL-P) (한국표준 학령전기용 단음절어표 (Korean Standard Monosyllabic Word Lists for Preschoolers, KS-MWL-P)의 심리음향기능곡선 연구)

  • Shin, Hyun-Wook;Kim, Jin-Sook
    • The Journal of the Acoustical Society of Korea
    • /
    • v.28 no.6
    • /
    • pp.534-541
    • /
    • 2009
  • Word recognition test (WRT) for the children can be useful for diagnosing the degree of communication disability, prescribing hearing instruments, planning aural rehabilitation and speech therapy, and determination of site of lesions. The Korean standard monosyllabic word lists for preschoolers (KS-MWL-P) were developed considering the criteria given by the literatures. However, the authors of KS-MWL-P suggested more children should be included to verify homogeneity of the lists using psychometric function curve since only 8 children participated in the developing process. The purpose of this study was to explore the homogeneity of KS-MWL-P for supplementing the limitations of the lists employing psychometric analysis. To 23 preschoolers who have normal-hearing, 100 monosyllabic KS-MWL-P words were examined with the pictures. Psychometric function curve with linear slopes of 20% and 80%'s correct rates through accounting recognition scores of each monosyllabic word at variable intensities from -10 to 40 dBHL was obtained and analyzed. As a result, s-shaped psychometric function curve was presented with increasing correct rate depending on intensity and showed no statistical significant differences among each word and list. The congruous graph shapes among lists also indicated good homogeneity and the list 1,2,3,4's average slopes were 4.48, 3.86, 4.65, 4.50. It was verified that the homogeneity was suitable because the analysis of variance showed no statistical significance among lists (p>0.05). However, KS-MWL-P's order of slope according to the order of the number of items, $1{\sim}10$, $1{\sim}20$, $1{\sim}25$ showed no difference with the p-value of 0.93, 0.59, 0.91, 0.70 for the lists 1,2,3, and 4, respectively. Although KS-MWL-P was assumed that the lower-numbered items were easy for testing younger ages, this study's results could not agree with the author's conclusion. Considering this matter, rearranging of the number of items should be performed according to the analysis of slope suggested by this study for testing younger children with easier items. Other than this, in conclusion, KS-MWL-P was proved to be useful for clinical and rehabilitative evaluating and training tools for preschoolers.