• Title/Summary/Keyword: Speech Quality

Search Result 803, Processing Time 0.031 seconds

Audio Stream Delivery Using AMR(Adaptive Multi-Rate) Coder with Forward Error Correction in the Internet (인터넷 환경에서 FEC 기능이 추가된 AMR음성 부호화기를 이용한 오디오 스트림 전송)

  • 김은중;이인성
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.26 no.12A
    • /
    • pp.2027-2035
    • /
    • 2001
  • In this paper, we present an audio stream delivery using the AMR (Adaptive Multi-Rate) coder that was adopted by ETSI and 3GPP as a standard vocoder for next generation IMT-2000 service in which includes combined sender (FEC) and receiver reconstruction technique in the Internet. By use of the media-specific FEC scheme, the possibility to recover lost packets can be much increased due to the addition of repair data to a main data stream, by which the contents of lost packets can be recovered. The AMR codec is based on the code-excited linear predictive (CELP) coding model. So we use a frame erasure concealment for CELP-based coders. The proposed scheme is evaluated with ITU-T G.729 (CS-ACELP) coder and AMR - 12.2 kbit/s through the SNR (Signal to Noise Ratio) and the MOS (Mean Opinion Score) test. The proposed scheme provides 1.1 higher in Mean Opinion Score value and 5.61 dB higher than AMR - 12.2 kbit/s in terms of SNR in 10% packet loss, and maintains the communicab1e quality speech at frame erasure rates lop to 20%.

  • PDF

Improving QoS using Cellular-IP/PRC in Wireless Internet Environment (Cellular-IP/PRC에서 핸드오프 상태 머신에 의한 QoS 개선)

  • Kim Dong-Hyun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.9 no.6
    • /
    • pp.1302-1308
    • /
    • 2005
  • Propose Cellular-IP/PRC network with united paging and Cellular IP special duality that use roof information administration cache to secure lake acceptance method in wireless Internet environment and QoS in lesser extent cell environment. When speech quality is secured considering increment of interference to receive in case of suppose that proposed acceptance method grooves base radio station capacity of transfer node is plenty, and moat of contiguity cell transfer node was accepted at groove base radio station with a blow, groove base radio station new trench lake acceptance method based on transmission of a message electric power estimate of transfer node be. Do it so that may apply composing PC(Paging Cache) and RC(Routing Cache) that was used to manage paging and router in radio Internet network in integral management and all nodes as one PRC(Paging Router Cache), and add hand off state machine in transfer node so that can manage hand off of transfer node and Roaming state efficiently, and studies so that achieve connection function at node. Analyze benevolent person who influence on telephone traffic in system environment and forecasts each link currency rank and imbalance degree, forecast most close and important lake interception probability and lake falling off probability, GoS(Grade of Service), efficiency of cell capacity in QoS because applies algorithm proposing based on algorithm use gun send-receive electric power that judge by looking downward link whether currency book was limited and accepts or intercept lake and handles and displays QoS performance improvement.

Differentiation of Adductor-Type Spasmodic Dysphonia from Muscle Tension Dysphonia Using Spectrogram (스펙트로그램을 이용한 내전형 연축성 발성 장애와 근긴장성 발성 장애의 감별)

  • Noh, Seung Ho;Kim, So Yean;Cho, Jae Kyung;Lee, Sang Hyuk;Jin, Sung Min
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.28 no.2
    • /
    • pp.100-105
    • /
    • 2017
  • Background and Objectives : Adductor type spasmodic dysphonia (ADSD) is neurogenic disorder and focal laryngeal dystonia, while muscle tension dysphonia (MTD) is caused by functional voice disorder. Both ADSD and MTD may be associated with excessive supraglottic contraction and compensation, resulting in a strained voice quality with spastic voice breaks. The aim of this study was to determine the utility of spectrogram analysis in the differentiation of ADSD from MTD. Materials and Methods : From 2015 through 2017, 17 patients of ADSD and 20 of MTD, underwent acoustic recording and phonatory function studies, were enrolled. Jitter (frequency perturbation), Shimmer (amplitude perturbation) were obtained using MDVP (Multi-dimensional Voice Program) and GRBAS scale was used for perceptual evaluation. The two speech therapist evaluated a wide band (11,250 Hz) spectrogram by blind test using 4 scales (0-3 point) for four spectral findings, abrupt voice breaks, irregular wide spaced vertical striations, well defined formants and high frequency spectral noise. Results : Jitter, Shimmer and GRBAS were not found different between two groups with no significant correlation (p>0.05). Abrupt voice breaks and irregular wide spaced vertical striations of ADSD were significantly higher than those of MTD with strong correlation (p<0.01). High frequency spectral noise of MTD were higher than those of ADSD with strong correlation (p<0.01). Well defined formants were not found different between two groups. Conclusion : The wide band spectrograms provided visual perceptual information can differentiate ADSD from MTD. Spectrogram analysis is a useful diagnostic tool for differentiating ADSD from MTD where perceptual analysis and clinical evaluation alone are insufficient.

  • PDF

Effects of Maternal Role Education Program on the Mother-Infant Interaction and Infant Development (영아기 어머니역할 교육 프로그램이 모아상호작용과 영아발달에 미치는 효과)

  • Bang Kyung Sook
    • Child Health Nursing Research
    • /
    • v.7 no.1
    • /
    • pp.21-34
    • /
    • 2001
  • The impact of childhood experience has lifelong significance on subsequent health and development. Especially, the experience of infant is mostly affected by the quality of parental care and rearing environment. But the new mothers usually do not know what to do because of the lack of experience in these days. Therefore, an educational program regarding maternal role would be necessary. This study was conducted to evaluate the effectiveness of the maternal role education program for mother-infant interaction, child-rearing environment, and infant development. Non-equivalent control group time-series design was used, and Barnard's mother-infant interaction model was used as a conceptual framework of this study. The subjects were the healthy infants weighing over 2,500gm at birth, whose gestational age was more than 37 weeks, and their mothers. The final sample consisted of 19 mother-infant dyads for intervention group and 18 dyads for control group. Data were collected from March 15th to September 3rd in 1999. For the intervention group, programmed education which focused on mother-infant interaction, breast feeding, and infant care was provided before discharge. Telephone counselling was provided within one week after discharge. Home visiting for maternal role education was provided twice, one month and three months postpartum. For the control group, home visiting was also conducted but only for data collection. The data were analyzed using chi-square test and t-test to test the equivalence of two groups, and the effectiveness of intervention program was determined with repeated measure ANCOVA and t-test. The results were as follows: 1. Significant differences were found in mother-infant interaction between two groups(p=.000). It indicates that intervention program was effective in improving mother- infant interaction. In subscale analysis, four out of six subscale showed significant differences between the groups: sensitivity to cues (p=.000), social-emotional growth fostering (p=.000), cognitive growth fostering(p=.000) in mothers, and responsiveness to caregiver (p=.019) in infants. 2. The difference in the mean score of childrearing environment (HOME) between the intervention group and control group was significant(p=.003). When each subscale of HOME was examined individually, intervention group showed significantly higher scores in the diversity of stimulation(p=.000), and mother's involvement(p=.001). 3. Three-month-Infants of the intervention group showed higher GQ in the Griffiths mental development scale(p=.026). In subscale analysis, significant differences were found in the personal-social(p=.005), and the hearing and speech(p=.003). In conclusion, the maternal role education program proved to be effective in promoting the mother-infant interaction, organizing the childrearing environment, and fostering the infant development. These results are very meaningful that we found maternal role education necessary for normal infants' mothers, and that nurses can make a great contribution in promoting health of infants and mothers.

  • PDF

A study on the lip shape recognition algorithm using 3-D Model (3차원 모델을 이용한 입모양 인식 알고리즘에 관한 연구)

  • 배철수
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.3 no.1
    • /
    • pp.59-68
    • /
    • 1999
  • Recently, research and developmental direction of communication system is concurrent adopting voice data and face image in speaking to provide more higher recognition rate then in the case of only voice data. Therefore, we present a method of lipreading in speech image sequence by using the 3-D facial shape model. The method use a feature information of the face image such as the opening-level of lip, the movement of jaw, and the projection height of lip. At first, we adjust the 3-D face model to speeching face image sequence. Then, to get a feature information we compute variance quantity from adjusted 3-D shape model of image sequence and use the variance quality of the adjusted 3-D model as recognition parameters. We use the intensity inclination values which obtaining from the variance in 3-D feature points as the separation of recognition units from the sequential image. After then, we use discrete HMM algorithm at recognition process, depending on multiple observation sequence which considers the variance of 3-D feature point fully. As a result of recognition experiment with the 8 Korean vowels and 2 Korean consonants, we have about 80% of recognition rate for the plosives and vowels. We propose that usability with visual distinguishing factor that using feature vector because as a result of recognition experiment for recognition parameter with the 10 korean vowels, obtaining high recognition rate.

  • PDF

The Verification of Korean Version Swallowing Disturbance Questionnaire (K-SDQ) (한국판 삼킴 곤란 척도(K-SDQ)의 번안본 검증)

  • Jung, SoWoon;Kim, JungWan
    • 재활복지
    • /
    • v.22 no.4
    • /
    • pp.43-58
    • /
    • 2018
  • Swallowing disorders that can affect nutrient intakes and quality of life are commonly shown among the elderly as well as patients with neurogenic disorder. This study verifies the reliability and validity of the Swallowing Disturbance Questionnaire (SDQ), a subjective swallowing disability assessment tool, modified for Koreans' eating habit and cultural sentiment, against 105 stroke patients, in order to help identify early swallowing problems of the elderly. Reliability of internal consistency in the Korean version of SDQ is .601, test-retest reliability is .97, and concurrent validity is .956. Based on 8 points of cut-off score, 46.8% of sensitivity and 81.6% of specificity. Comparing the results of video fluoroscopic study (VFSS), an objective swallowing disorder test with those of Korean version of SDQ, negative predictive value (NPV) and positive predictive value (PPV) was shown as 81% and 53%. The Korean version of SDQ is expected to be a useful testing tool to discriminate swallowing disorders in stroke patients. It has great clinical significance in that swallowing difficulties shown by subjects can be sorted out to request a diagnostic assessment before clinical evaluation by a rehabilitation therapist or ruling out unnecessary exposure to additional tests by accurately identifying stroke patients without swallowing problems.

Acoustic Analysis and Auditory-Perceptual Assessment for Diagnosis of Functional Dysphonia (기능성 음성장애의 진단을 위한 음향학적, 청지각적 평가)

  • Kim, Geun-Hyo;Lee, Yeon-Yoo;Bae, In-Ho;Lee, Jae-Seok;Lee, Chang-Yoon;Park, Hee-June;Lee, Byung-Joo;Kwon, Soon-Bok
    • Journal of Clinical Otolaryngology Head and Neck Surgery
    • /
    • v.29 no.2
    • /
    • pp.212-222
    • /
    • 2018
  • Background and Objectives : The purpose of this study was to compare the measured values of acoustic and auditory perceptual assessments between normal and functional dysphonia (FD) groups. Materials and Methods : 102 subjects with FD and 59 normal voice groups were participated in this study. Mid-vowel portion of the sustained vowel /a/ and two sentences of 'Sanchaek' were edited, concatenated, and analyzed by Praat script. And then auditory-perceptual (AP) rating was completed by three listeners. Results : The FD group showed higher acoustic voice quality index version 2.02 and version 3.01 (AVQIv2 and AVQIv3), slope, Hammarberg index (HAM), grade (G) and overall severity (OS), values than normal group. Additionally, smoothed cepstral peak prominence in Praat (PraatCPPS), tilt, low-to high spectral band energies (L/H ratio), long-term average spectrum (LTAS) in FD group were lower than normal voice group. And the correlation among measured values ranged from -0.250 to 0.960. In ROC curve analysis, cutoff values of AVQIv2, AVQIv3, PraatCPPS, slope, tilt, L/H ratio, HAM, and LTAS were 3.270, 2.013, 13.838, -22.286, -9.754, 369.043, 27.912, and 34.523, respectively, and the AUC of each analysis was over .890 in AVQIv2, AVQIv3, and PraatCPPS, over 0.731 in HAM, tilt, and slope, over 0.605 in LTAS and L/H ratio. Conclusions : In conclusion, AVQI and CPPS showed the highest predictive power for distinguishing between normal and FD groups. Acoustic analyses and AP rating as noninvasive examination can reinforce the screening capability of FD and help to establish efficient diagnosis and treatment process plan for FD.

Effects of Cognitive Impairment on Self-reported Hearing Handicap in Older Adults with Early-stage Presbycusis (초기 노인성 난청자에서 인지장애가 일상생활 듣기 어려움에 미치는 영향)

  • Lee, Soo Jung
    • 한국노년학
    • /
    • v.38 no.1
    • /
    • pp.1-14
    • /
    • 2018
  • Everyday hearing handicap caused by presbycusis ultimately reduces quality of life in older adults. The aim of this study was to explore effects of cognitive impairment on self-reported hearing handicap in older adults with early-stage presbycusis. We compared K-HHIE scores between 40 elderly subjects with mild cognitive impairment (MCI) and age- and hearing-threshold matched 40 cognitively normal elderly (CNE) subjects. The results are as follows: 1) The MCI group scored significantly higher than the CNE group on the social/situational and emotional sections, and in total. 2) The MCI group scored significantly higher than the CNE group on all four subscales, and the most significant group difference was on the first subscale relating to interpersonal relationships and social handicaps. 3) Both groups scored highest on the item 8 (problems hearing whispering sounds) and item 15 (problems hearing TV or radio sounds). Besides those two items, the MCI group also scored high on the item 21 (problems hearing in a restaurant), item 6 (problems hearing when attending a party), item 3 (avoiding groups of people), and item 20 (personal or social restrictions). Our findings suggest that, among older adults with early-stage presbycusis, older adults with cognitive impairment tend to report greater everyday hearing handicap than their peers with normal cognitive function. Especially, they show significant problems hearing in background noise or multi-talker situations, which cause social restrictions and social/emotional loneliness.

Use of Digital Educational Resources in the Training of Future Specialists in the EU Countries

  • Plakhotnik, Olga;Zlatnikov, Valentyn;Matviienko, Olena;Bezliudnyi, Oleksandr;Havrylenko, Anna;Yashchuk, Olena;Andrusyk, Pavlo
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.10
    • /
    • pp.17-24
    • /
    • 2022
  • The article proves that the main goal of informatization of higher education institutions in the EU countries is to improve the quality of education of future specialists by introducing digital educational resources into the education process. The main tasks of informatization of education are defined. Digital educational resources are interpreted as a set of data in digital form that is applicable for use in the learning process; it is an information source containing graphic, text, digital, speech, music, video, photo and other information aimed at implementing the goals and objectives of modern education; educational resources on the Internet, electronic textbooks, educational programs, electronic libraries, etc. The creation of digital educational resources is defined as one of the main directions of informatization of all forms and levels of Education. Types of digital educational resources by educational functions are considered. The factors that determine the effectiveness of using digital educational resources in the educational process are identified. The use of digital educational resources in the training of future specialists in the EU countries is considered in detail. European countries note that digital educational resources in professional use allow you to implement a fundamentally new approach to teaching and education, which is based on broad communication, free exchange of opinions, ideas, information of participants in a joint project, on a completely natural desire to learn new things, expand their horizons; is based on real research methods (scientific or creative laboratories), allowing you to learn the laws of nature, the basics of techniques, technology, social phenomena in their dynamics, in the process of solving vital problems, features of various types of creativity in the process of joint activities of a group of participants; promotes the acquisition by teachers of various related skills that can be very useful in their professional activities, including the skills of using computer equipment and various digital technologies.

Design of CNN-based Braille Conversion and Voice Output Device for the Blind (시각장애인을 위한 CNN 기반의 점자 변환 및 음성 출력 장치 설계)

  • Seung-Bin Park;Bong-Hyun Kim
    • Journal of Internet of Things and Convergence
    • /
    • v.9 no.3
    • /
    • pp.87-92
    • /
    • 2023
  • As times develop, information becomes more diverse and methods of obtaining it become more diverse. About 80% of the amount of information gained in life is acquired through the visual sense. However, visually impaired people have limited ability to interpret visual materials. That's why Braille, a text for the blind, appeared. However, the Braille decoding rate of the blind is only 5%, and as the demand of the blind who want various forms of platforms or materials increases over time, development and product production for the blind are taking place. An example of product production is braille books, which seem to have more disadvantages than advantages, and unlike non-disabled people, it is true that access to information is still very difficult. In this paper, we designed a CNN-based Braille conversion and voice output device to make it easier for visually impaired people to obtain information than conventional methods. The device aims to improve the quality of life by allowing books, text images, or handwritten images that are not made in Braille to be converted into Braille through camera recognition, and designing a function that can be converted into voice according to the needs of the blind.