• Title/Summary/Keyword: Speech quality

Search results: 805

A Call Processing Method for the VoIP Wideband High Quality Speech Codec (VoIP 계층형 광대역 고품질 음성 코덱 협상 처리 기술 분석)

  • Kang, T.G.;Kim, D.Y.;Kim, Y.S.
    • Electronics and Telecommunications Trends / v.19 no.5 s.89 / pp.114-124 / 2004
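  • VoIP technology will be used in the converged wired/wireless network (BcN), which integrates wired networks, wireless mobile communication networks, and the Internet. The VoIP layered wideband high-quality speech codec, adopted as a TTA standard in July 2004, uses G.711, G.723.1, and G.729 in its core layer, so codec negotiation is carried out by configuring 10 PT values. As a result, the codec interoperates not only with itself but also with G.711, G.723.1, and G.729. For the newly standardized codec to be usable in networks, call processing for it must also be standardized. This article describes the standard technologies needed for that purpose, the relationship between the codec and call processing, and codec negotiation techniques based on those standards. The negotiation techniques are divided into a PSTN/MSC interworking scheme and an All IP scheme. In the All IP scheme, call processing functions for compatibility are provided at the originating side, the terminating side, the MGC, and the terminating server. With the call processing techniques described here, the VoIP layered wideband high-quality speech codec can be used without modifying the functions of existing network equipment.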
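
The negotiation idea in this abstract can be illustrated with a minimal sketch. It assumes a simple "first common codec in the offerer's preference order" rule, which is not taken from the article. The static payload types for G.711 (PCMU/PCMA), G.723, and G.729 are the standard RTP assignments; the name `WB-LAYERED` and its dynamic payload type 96 are hypothetical placeholders, since the TTA standard's actual 10 PT values are not given here.

```python
# Static RTP payload types for the core-layer codecs (standard assignments).
STATIC_PT = {"PCMU": 0, "G723": 4, "PCMA": 8, "G729": 18}

# Hypothetical entry for the layered wideband codec. The real PT values
# are defined by the TTA standard and are NOT reproduced here.
WIDEBAND_PT = {"WB-LAYERED": 96}


def negotiate(offer, answerer_supported):
    """Return the first codec in the offerer's preference order that the
    answering side also supports, or None if there is no common codec."""
    for codec in offer:
        if codec in answerer_supported:
            return codec
    return None


def payload_type(codec):
    """Look up the payload type number for a codec name."""
    table = {**STATIC_PT, **WIDEBAND_PT}
    return table[codec]


# A wideband terminal offers its own codec first, then the core-layer codecs.
wideband_offer = ["WB-LAYERED", "G729", "G723", "PCMU", "PCMA"]

legacy_terminal = {"G729", "PCMU"}                 # unmodified equipment
wideband_terminal = {"WB-LAYERED", "G729", "PCMU"}

chosen_legacy = negotiate(wideband_offer, legacy_terminal)
chosen_wideband = negotiate(wideband_offer, wideband_terminal)
```

Under these assumptions, a call to unmodified legacy equipment falls back to a core-layer codec, while two wideband terminals select the wideband codec.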

Detecting and correcting errors in Korean POS-tagged corpora (한국어 품사 부착 말뭉치의 오류 검출 및 수정)

  • Choi, Myung-Gil;Seo, Hyung-Won;Kwon, Hong-Seok;Kim, Jae-Hoon
    • Journal of Advanced Marine Engineering and Technology / v.37 no.2 / pp.227-235 / 2013
  • The quality of the part-of-speech (POS) annotation in a corpus plays an important role in developing POS taggers. There are, however, several kinds of errors in Korean POS-tagged corpora such as the Sejong Corpus, including annotation errors, spelling errors, and insertion and/or deletion of unexpected characters. In this paper, we propose a method for detecting annotation errors using error patterns, and we develop a tool for correcting them effectively. Based on the proposed method, we hand-corrected annotation errors in the Sejong POS Tagged Corpus using the developed tool. As a result, correction was at least 9 times faster than without any tool. We therefore conclude that the proposed method is effective for correcting annotation errors in POS-tagged corpora.

Velopharyngeal Insufficiency Accompanied with Hypertrophic Tonsils: A Case Report (편도비대를 동반한 구개인두부전 환자의 치험례)

  • Kim, Eun Key;Koh, Kyung Suck;Park, Mi Kyong
    • Archives of Plastic Surgery / v.32 no.5 / pp.660-662 / 2005
  • It is well documented that adenoidectomy can cause hypernasality in certain cases, but it is not clear whether enlarged tonsils affect the quality of speech. Hypertrophied tonsils may cause and complicate velopharyngeal incompetency. Very large tonsils prevent the lateral pharyngeal walls from moving medially and interfere with velar elevation, resulting in hypernasality. Hyponasality develops as the tonsils encroach on the nasopharyngeal space. Voluminous tonsils also interfere with airflow in the oropharyngeal passage and produce cul-de-sac resonance, or a muffled sound. The authors present a case of velopharyngeal insufficiency accompanied by hypertrophic tonsils. Improved lateral pharyngeal wall constriction and velar elevation after tonsillectomy minimized the velopharyngeal gap. Accordingly, sphincter pharyngoplasty and palatal lengthening, rather than a pharyngeal flap, resolved the hypernasality. Tonsillectomy prior to pharyngeal flap surgery tends to reduce postoperative airway problems. Sometimes, however, tonsillectomy alone suffices without a pharyngeal flap. A staged surgical approach with evaluation at intervals of at least six weeks is recommended.

Medialization Thyroplasty with Silastic- Decision Making & Practical Points (Silastic을 이용한 내전 갑상성형술-적용 및 술기)

  • Choi, Hong-Shik
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.18 no.1 / pp.7-10 / 2007
  • Unilateral vocal fold paralysis resulting in glottal incompetence can cause significant morbidity attributable to impaired speech, swallowing, and ability to protect the airway. The treatment of unilateral vocal cord paralysis has a long history, marked by technical innovations and improvements. These methods typically use endoscopic injection or implants to augment the volume of the affected vocal fold. The first known treatment, reported by Brunnings in 1911, was paraffin injection. The first thyroplasty medializing the paralysed vocal cord was performed by Payr in 1915; here, a cartilage door-flap was created from the thyroid ala to obtain better voice quality. In the 1970s, Isshiki systematized and developed Payr's external medialization. He later modified his original technique and achieved safer and better results. Many other methods for external medialization were introduced during the 1980s and 1990s. Several materials have been used for medialization laryngoplasty: silicone block, cartilage, Gore-Tex (polytetrafluoroethylene), titanium, etc. Among them, silicone block is the most widely used. Type I thyroplasty in combination with arytenoid adduction is a proven technique for medialization of the paralysed vocal fold. This paper discusses the author's personal experience with silicone block type I thyroplasty: decision making, practical points, long-term results, and complications of the procedure.

A Study on subtitle synchronization calibration to enhance hearing-impaired persons' viewing convenience of e-sports contents or game streamer contents (청각장애인의 이스포츠 중계방송 및 게임 스트리머 콘텐츠 시청 편의성 증대를 위한 자막 동기화 보정 연구)

  • Shin, Dong-Hwan;Kim, Jeong-Soo;Kim, Chang-Won
    • Journal of Korea Game Society / v.19 no.1 / pp.73-84 / 2019
  • This study suggests ways to improve the quality of the subtitle service provided for deaf viewers of e-sports broadcast content and game streamer content. Subtitle files for broadcast content are generally typed live by stenographers, so a delay of 3 to 5 seconds relative to the original content is inevitable. The study therefore proposes an automatic synchronization calibration system based on speech recognition technology. An experiment applying the system to content confirmed that the subtitle synchronization error could be reduced to less than 1 second.
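
One way such a calibration could work is sketched below. This is an illustrative assumption, not the system described in the paper: each delayed subtitle cue is matched by text similarity against speech-recognition segments that start shortly before it, and the cue is moved to the matched segment's start time. The function name `realign`, the tuple layout, and the thresholds `max_delay` and `min_ratio` are all hypothetical.

```python
from difflib import SequenceMatcher


def realign(cues, asr_segments, max_delay=6.0, min_ratio=0.6):
    """Shift each subtitle cue back to the start of the best-matching
    speech-recognition segment.

    cues         -- list of (start_seconds, text) typed by a stenographer
    asr_segments -- list of (start_seconds, text) from a speech recognizer
    max_delay    -- how far back (seconds) to search; captions lag the audio
    min_ratio    -- minimum text similarity needed to accept a match
    """
    realigned = []
    for cue_start, cue_text in cues:
        best_ratio, best_start = 0.0, None
        for asr_start, asr_text in asr_segments:
            # Only consider speech that began at or before the cue time.
            if not (cue_start - max_delay <= asr_start <= cue_start):
                continue
            ratio = SequenceMatcher(
                None, cue_text.lower(), asr_text.lower()
            ).ratio()
            if ratio > best_ratio:
                best_ratio, best_start = ratio, asr_start
        if best_start is not None and best_ratio >= min_ratio:
            realigned.append((best_start, cue_text))
        else:
            # No confident match: leave the cue where it was.
            realigned.append((cue_start, cue_text))
    return realigned


# Made-up example data: captions arrive about 3.5 s after the speech.
cues = [(13.5, "First blood for the blue team"),
        (19.0, "They take the dragon"),
        (30.0, "gg")]
asr = [(10.0, "first blood for the blue team"),
       (15.2, "they take the dragon"),
       (21.0, "what a play")]
result = realign(cues, asr)
```

A real system would also need to handle recognition errors, overlapping speech, and cue end times, none of which this sketch addresses.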

Behavioral Problems in Patients with Prader-Willi Syndrome

  • Park, Sung Won
    • Journal of mucopolysaccharidosis and rare diseases / v.5 no.1 / pp.29-33 / 2021
  • Prader-Willi Syndrome (PWS) is a neurodevelopmental genomic imprinting disorder involving a lack of gene expression from the paternal chromosome 15q11-q13 region. This is typically due to paternal 15q11-q13 deletions (approximately 60% of cases) or maternal uniparental disomy 15, in which both copies of chromosome 15 come from the mother (about 35% of cases). An imprinting center controls the expression of imprinted genes in the chromosome 15q11-q13 region. PWS is characterized by mental retardation and distinct physical, behavioral, and psychiatric features. Characteristic behavioral disturbances in PWS include excessive interest in food, skin picking, difficulty with changes in routine, temper tantrums, obsessive and compulsive behaviors, and mood fluctuations. Individuals with PWS typically have intellectual disabilities (borderline to mild/moderate mental retardation) and exhibit a higher overall level of behavioral disturbance than individuals with similar intellectual disabilities. This condition severely limits social adaptation and quality of life. Different factors have been linked to the intensity and form of these behavioral disturbances, but there is no consensus regarding the cause. Consequently, management strategies remain controversial and new data are needed. PWS is a multisystem disorder. Family members, caregivers, physicians, dieticians, and speech-language pathologists all play an important role in the management and treatment of symptoms in an individual with PWS. Here we analyze behavioral problems in children and adults with PWS by age and review appropriate management and treatment strategies for these symptoms.

Relationship between depressive experience and unmet dental needs in the elderly (노인의 우울 경험과 미충족 치과의료 경험의 관계)

  • Kim, Sun-Mi;Jung, Mi-Hee;Ahn, Eunsuk
    • Journal of Korean Academy of Dental Administration / v.8 no.1 / pp.30-36 / 2020
  • This study was conducted on 1,725 elderly people over 65 years of age using 2018 data from the 7th Korea National Health and Nutrition Examination Survey (KNHANES). The analysis considers the general characteristics of the elderly and their oral health status (chewing discomfort, speech problems, etc.) to confirm the relationship between unmet dental care experience and depressive experience. The results showed that depressive experience among the elderly led to unmet dental care experience, and that income level and complaints of chewing discomfort also had an effect. Based on these results, oral health policies should be developed to address unmet dental care experience, taking into account the socio-economic level and depressive experiences of the elderly. Such policy development is expected to improve not only the oral health of the elderly but also their quality of life through health promotion.

A Study on the Syntagma & Paradigm by Repetition, Variation and Contrast in Ads

  • Choi, Seong-hoon
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology / v.7 no.9 / pp.1-12 / 2017
  • This study explores the potential meanings of print advertisements. Linguistic features such as repetition, variation, contrast, and phonological structure in the verbal texts of ads can give rise to shades of meaning or slight variations in advertising. The language of advertising is not only a language of words. It is also a language of images, colors, and pictures. Pictures and words combine to form the advertisement's visual text. While the words are very important in delivering the sales message, the visual text cannot be ignored. Part of the visual text is the paralanguage of the ad. Paralanguage is the meaningful behaviour accompanying language, such as voice quality, gestures, facial expressions, and touch in speech, and the choice of typeface and letter size in writing. Foregrounding is the throwing into relief of the linguistic sign against the background of the norms of ordinary language. This paper discusses the advertisements within the framework of paradigmatic and syntagmatic relationships. The ads examined are confined to Marlboro ads and were reselected using purposive sampling.

Implementation of Enhanced Vision for an Autonomous Map-based Robot Navigation

  • Roland, Cubahiro;Choi, Donggyu;Kim, Minyoung;Jang, Jongwook
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2021.10a / pp.41-43 / 2021
  • Robot Operating System (ROS) has been a prominent and successful framework in the robotics industry and academia. However, the framework has long been focused on, and limited to, robot navigation and manipulation of objects in the environment. This focus leaves out other important fields such as speech recognition and vision. Our goal is to take advantage of ROS's capacity to integrate additional libraries of programming functions aimed at real-time computer vision with a depth-image camera. In this paper we focus on implementing upgraded vision with a depth camera, which provides high-quality data for a more accurate understanding of the environment. The varied data from the cameras are then incorporated into the ROS communication structure for any potential use. In this particular case, the system uses OpenCV libraries to process the camera data and give the robot a face-detection capability while it navigates an indoor environment. The whole system has been implemented and tested on TurtleBot3 and Raspberry Pi 4.

On a Processing Time Reduction of Cepstrum-Based Pitch Alteration in Time-Frequency Hybrid Domain (켑스트럼 기반 혼성영역 피치변경법의 처리시간 단축에 관한 연구)

  • Jo, Wang-Rae;Kim, Jong-Kuk;Bae, Myung-Jin
    • The Journal of the Acoustical Society of Korea / v.29 no.1 / pp.41-47 / 2010
  • Pitch alteration techniques for voice conversion are classified into time-domain, frequency-domain, and hybrid-domain methods. The hybrid-domain method has the merit of producing clear and natural pitch-altered speech, but its major drawback is a long processing time. In this paper, we propose a new method that reduces the processing time of pitch alteration in the time-frequency hybrid domain. We omit the bit-reversing process of the FFT and IFFT when changing the processing domain. As a result, the processing time is reduced by 86.26% compared with the conventional method, at the same quality.
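
Why the bit-reversal step can be dropped is easiest to see in a small sketch. The code below is an assumption-laden illustration, not the authors' implementation: a radix-2 decimation-in-frequency FFT naturally leaves its output in bit-reversed order, and a decimation-in-time inverse FFT naturally consumes bit-reversed input, so chaining them returns the signal in natural order with no reordering pass. The function names are hypothetical, the input length must be a power of two, and any processing between the two transforms would have to work on the bit-reversed layout. The sketch does not reproduce the pitch alteration itself or the reported 86.26% figure.

```python
import cmath


def fft_dif_to_bitrev(x):
    """Radix-2 decimation-in-frequency FFT.
    Input in natural order, output left in bit-reversed order."""
    a = [complex(v) for v in x]
    n = len(a)
    span = n // 2
    while span >= 1:
        for start in range(0, n, 2 * span):
            for k in range(span):
                w = cmath.exp(-2j * cmath.pi * k / (2 * span))
                u, v = a[start + k], a[start + k + span]
                a[start + k] = u + v
                a[start + k + span] = (u - v) * w
        span //= 2
    return a


def ifft_dit_from_bitrev(spectrum_bitrev):
    """Radix-2 decimation-in-time inverse FFT.
    Input in bit-reversed order, output in natural order."""
    a = [complex(v) for v in spectrum_bitrev]
    n = len(a)
    span = 1
    while span < n:
        for start in range(0, n, 2 * span):
            for k in range(span):
                w = cmath.exp(2j * cmath.pi * k / (2 * span))
                u, v = a[start + k], a[start + k + span] * w
                a[start + k] = u + v
                a[start + k + span] = u - v
        span *= 2
    return [v / n for v in a]


# Round trip without any bit-reversal permutation in between.
signal = [1, 2, 3, 4, 0, -1, -2, 5]
spectrum = fft_dif_to_bitrev(signal)
recovered = ifft_dit_from_bitrev(spectrum)
```

Each inverse stage undoes the matching forward stage, so the saving comes from skipping two permutation passes over the frame; how much that matters in practice depends on the frame size and platform.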