• Title/Summary/Keyword: Speech quality

Search results: 805 items (processing time: 0.026 seconds)

AN ALGORITHM FOR CLASSIFYING EMOTION OF SENTENCES AND A METHOD TO DIVIDE A TEXT INTO SOME SCENES BASED ON THE EMOTION OF SENTENCES

  • Fukoshi, Hirotaka;Sugimoto, Futoshi;Yoneyama, Masahide
    • 한국방송∙미디어공학회:학술대회논문집
    • /
    • 한국방송공학회 (Korean Society of Broadcast Engineers), IWAIT 2009
    • /
    • pp.773-777
    • /
    • 2009
  • In recent years, speech synthesis has developed rapidly, and technologies such as reading e-mail aloud or voice guidance in car navigation systems are now used in many everyday situations. However, the synthesized voice is monotonous, like a news reading. A text such as a novel is better read in a voice rich in emotional expression. We have therefore been developing a system that automatically reads aloud novels in which emotions are expressed comparatively clearly, such as juvenile literature. To make a computer read a text with an emotionally expressive voice, the emotion expressed in each sentence must first be identified. One conceivable approach is to interpret the meaning of the text with artificial intelligence techniques, but this is very difficult with current technology. We therefore propose a simpler method that determines only the emotion of each sentence in a novel. The method assigns an emotion to a sentence according to the emotion carried by particular words: the verb in a Japanese verb sentence, and the adjective and adverb in an adjective sentence. The emotional characteristics of these words are prepared beforehand as an emotional word dictionary. Seven emotion categories are used: "joy," "sorrow," "anger," "surprise," "terror," "aversion," and "neutral."
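
The dictionary lookup described in this abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the dictionary entries are invented English stand-ins (the authors' dictionary covers Japanese verbs, adjectives, and adverbs and is not reproduced in the abstract), the majority vote among matched words is one possible decision rule, and the scene split simply groups consecutive sentences that share an emotion, since the abstract does not say how scenes are formed.

```python
from collections import Counter

# Hypothetical emotional word dictionary (illustrative entries only).
EMOTION_WORDS = {
    "laugh": "joy",
    "happy": "joy",
    "cry": "sorrow",
    "shout": "anger",
    "startled": "surprise",
    "tremble": "terror",
    "hate": "aversion",
}


def classify_sentence(tokens):
    """Return the most frequent dictionary emotion among the tokens.

    Sentences with no dictionary word are labeled "neutral".
    The majority vote is an assumed rule, not the authors' stated one.
    """
    counts = Counter(EMOTION_WORDS[t] for t in tokens if t in EMOTION_WORDS)
    if not counts:
        return "neutral"
    return counts.most_common(1)[0][0]


def split_into_scenes(emotions):
    """Group consecutive sentences with the same emotion.

    Returns a list of (first_index, last_index, emotion) tuples.
    """
    scenes = []
    for i, emotion in enumerate(emotions):
        if scenes and scenes[-1][2] == emotion:
            start = scenes[-1][0]
            scenes[-1] = (start, i, emotion)  # extend the current scene
        else:
            scenes.append((i, i, emotion))  # start a new scene
    return scenes
```

In a real system the tokens would come from a Japanese morphological analyzer restricted to the parts of speech named in the abstract; plain word lists are used here only to keep the sketch self-contained.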


성대마비의 음성장애 측정을 위한 청지각적 및 음향학적 평가 (Auditory-Perceptual and Acoustic Evaluation in Measuring Dysphonia Severity of Vocal Cord Paralysis)

  • 김근효;이연우;박희준;배인호;이병주;권순복
    • 대한후두음성언어의학회지
    • /
    • Vol. 28, No. 2
    • /
    • pp.106-111
    • /
    • 2017
  • Background and Objectives: The purpose of this study was to investigate the criterion-related concurrent validity of two standardized auditory-perceptual assessments and the Acoustic Voice Quality Index (AVQI) for measuring dysphonia severity in patients with vocal cord paralysis (VCP). Materials and Methods: A total of 210 patients with VCP and 236 subjects with normal voices were asked to sustain the vowel [a:] and to read aloud the Korean text "Walk". A 2-second mid-vowel portion of the sustained vowel and two sentences (26 syllables) were recorded. The voice samples were then edited, concatenated, and analyzed with a Praat script. Two standardized auditory-perceptual assessments (GRBAS and CAPE-V) were performed by three raters. Results: The VCP group showed higher AVQI, Grade (G), and Overall Severity (OS) values than the normal voice group. The correlations among AVQI, G, and OS ranged from 0.904 to 0.926. In ROC curve analysis, the cutoff values of AVQI, G, and OS were <3.79, <0.00, and <30.00, respectively, and the AUC of each analysis was above 0.89. Conclusion: AVQI and auditory-perceptual evaluation can improve early screening of VCP voice and help establish an effective diagnosis and treatment plan for VCP-related dysphonia.
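
As a rough illustration of the ROC analysis mentioned in this abstract, the sketch below computes an AUC and picks a cutoff from a list of scores and binary labels. It is a generic example, not the authors' procedure: the abstract does not say how the cutoffs were chosen, so the Youden index (sensitivity + specificity - 1) is an assumption, and the function names are hypothetical.

```python
def auc(scores, labels):
    """Area under the ROC curve.

    Computed as the probability that a randomly chosen positive case
    scores higher than a randomly chosen negative case (ties count 0.5).
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def roc_points(scores, labels):
    """Return (threshold, sensitivity, specificity) for each distinct score.

    A case is predicted positive when its score is >= the threshold.
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    points = []
    for thr in sorted(set(scores)):
        tp = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= thr)
        tn = sum(1 for s, y in zip(scores, labels) if y == 0 and s < thr)
        points.append((thr, tp / n_pos, tn / n_neg))
    return points


def youden_cutoff(scores, labels):
    """Threshold that maximizes sensitivity + specificity - 1 (assumed rule)."""
    best = max(roc_points(scores, labels), key=lambda p: p[1] + p[2] - 1)
    return best[0]
```

Here `labels` would be 1 for the patient group and 0 for the normal voice group, and `scores` could be any severity measure for which higher values indicate more severe dysphonia.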


일측 성대마비 환자에 대해 음성치료와 성대주입술의 초기 치료 효과 비교 연구 (Comparison of Initial Therapeutic Effects of Voice Therapy and Injection Laryngoplasty for Unilateral Vocal Cord Paralysis Patients)

  • 이창윤;안수연;장현;손희영
    • 대한후두음성언어의학회지
    • /
    • Vol. 28, No. 2
    • /
    • pp.112-117
    • /
    • 2017
  • Background and Objectives: The purpose of this study was to classify patients with unilateral vocal fold paralysis according to the fixed position of the paralyzed fold and to analyze the effects of two early treatment methods, voice therapy and injection laryngoplasty. Materials and Methods: Twenty patients, classified as full abduction or slight abduction according to the position of paralysis, were treated with injection laryngoplasty, and 23 patients were treated with voice therapy. Results were evaluated by acoustic analysis, electroglottography (EGG), and cepstrum analysis before and after therapy. Voice therapy aimed to improve laryngeal movement and glottal contact while reducing excessive supraglottic tension and making use of breathing. Results: Significant improvement was found in the acoustic, cepstrum, and EGG parameters after treatment in both groups. There was no significant difference between the two groups when the pre- to post-treatment changes of injection laryngoplasty and voice therapy were compared. Conclusion: The initial treatments for unilateral vocal cord paralysis are injection laryngoplasty and voice therapy. However, there is no precise standard for which method should be applied first. Therefore, in this study, we classified patients according to their paralysis position and then applied the two methods. The results suggest that voice therapy and injection laryngoplasty at the initial stage are both very useful for improving voice quality and laryngeal function in vocal fold paralysis.


조선 궁궐 건축물의 음향성능 측정 및 평가 - 편전 및 침전을 중심으로 - (Measurement and Evaluation of the Acoustic Performance in the Royal Palace Buildings of Joseon Dynasty - Focused on Pyeonjeon and Chimjeon -)

  • 김남욱;김명준;한욱
    • 한국소음진동공학회논문집
    • /
    • Vol. 19, No. 12
    • /
    • pp.1269-1280
    • /
    • 2009
  • This study was performed to build a database of the acoustic performance of royal palace buildings and to examine their characteristics more scientifically. The palaces studied were Changdeokgung and Gyeongbokgung. The sound insulation performance between adjacent rooms and of the facades, and the room acoustics of Pyeonjeon and Chimjeon, which are representative buildings in a royal palace, were examined through field measurements. The measured RT ($T_{mf}$) values in the Pyeonjeon buildings were 0.78 s in Seonjeongjeon and 1.03 s in Sajeongjeon. Considering their volumes, the RTs of both Pyeonjeon buildings were judged suitable for speech and lectures. The RT ($T_{mf}$) values in Chimjeon ranged from 0.29 to 0.55 s, which indicates that acoustic energy in the rooms was reduced by sound transmission through the mulberry paper (Hanji) of the traditional windows and doors. For sound insulation, the single-number quantities ($D_{ls,2m,nT,w}$) of the building facades in Pyeonjeon and Chimjeon were 4 to 20 dB, and the single-number quantities ($D_{p,w}$) between adjacent rooms in Chimjeon were 3 to 18 dB. The sound insulation performance of traditional building elements such as windows and doors depended strongly on their number of layers and area.

2.5세에 진단된 헌터증후군 1례 (A Case of Hunter Syndrome Diagnosed at Age of 2.5 Year)

  • 최미란;권영희;진동규;이지은
    • 대한유전성대사질환학회지
    • /
    • Vol. 14, No. 2
    • /
    • pp.178-181
    • /
    • 2014
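  • Hunter syndrome (mucopolysaccharidosis type II) is a genetic disorder in which deficiency of iduronate-2-sulfatase, an enzyme that catalyzes the degradation of glycosaminoglycans, causes precursors such as heparan sulfate and dermatan sulfate to accumulate in the lysosomes of cells in tissues and organs, leading to degenerative lesions. Enzyme replacement therapy can currently improve symptoms and slow disease progression, but treatment is difficult once central nervous system symptoms have appeared, so early suspicion, diagnosis, and initiation of treatment are of the utmost importance. It is therefore necessary to understand the clinical features of patients diagnosed at a young age, and the authors report their experience with a patient diagnosed at the young age of 2.5 years.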

편도외 농양 환자의 발화시 조음 및 음성의 변화 (The Acoustic Characteristics of Articulation and Phonation in Peritonsillar Abscess)

  • 최현진;송윤경;여장옥;허세형;진성민
    • 대한후두음성언어의학회지
    • /
    • Vol. 19, No. 2
    • /
    • pp.133-135
    • /
    • 2008
  • Background and Objectives: Voice changes can occur in peritonsillar abscess and are commonly described as a "muffled voice". The aim of this study was to investigate the changes in the acoustic features of the voice before and after treatment in patients with peritonsillar abscess. Materials and Method: Twelve patients with peritonsillar abscess were enrolled in the study. Acoustic analysis of the sustained Korean vowels /a/, /i/, and /u/ was performed before and after treatment. Results: In patients with peritonsillar abscess, the first formant frequency (F1) and second formant frequency (F2) of /a/ were decreased. The back-low vowel /a/ tended to be articulated like the back-high vowel /u/. F1 of /i/ and /u/ was increased, while F2 was decreased. The front-high vowel /i/ tended to be articulated like the back-low vowel /a/. The third, fourth, and fifth formant frequencies (F3, F4, F5) of /a/, /i/, and /u/ were decreased, although not to a statistically significant degree. Conclusion: The anatomical and functional changes of the oropharynx caused by peritonsillar abscess can alter resonance and speech quality. We suggest that these changes could be the cause of the "muffled voice" in patients with peritonsillar abscess.


후두미세수술 후 음향지표의 변화와 환자의 만족도 비교 (Change of Acoustic Parameter and Voice Handicap Index after Laryngeal Microsurgery)

  • 김범석;신지훈;김기용;이용섭;김경래;태경
    • 대한후두음성언어의학회지
    • /
    • Vol. 19, No. 2
    • /
    • pp.142-145
    • /
    • 2008
  • Background and Objectives: The aim of this study was to evaluate the change in patients' subjective voice handicap index (VHI) and acoustic parameters before and after laryngeal microsurgery for benign vocal cord disease. Materials and Method: We retrospectively analyzed 78 patients who underwent laryngeal microsurgery for benign vocal cord disease from January 2004 to February 2007. There were 28 cases of vocal polyp, 40 of vocal nodule, 5 of intracordal cyst, and 5 of Reinke's edema. Jitter, shimmer, and harmonics-to-noise ratio (HNR) were analyzed before surgery and 2 to 3 months after surgery using the Doctor's speech science program. The voice handicap index introduced by the Pittsburgh Voice Center was used to examine the patients' subjective change in voice quality. Results: The acoustic parameters jitter, shimmer, and HNR improved after surgery in patients with vocal polyp and vocal nodule. The acoustic parameters did not improve to a statistically significant degree in patients with Reinke's edema. Only jitter improved significantly in patients with intracordal cyst (p<0.05). The VHI improved significantly after surgery. The changes in jitter and shimmer were significantly correlated with the change in VHI after surgery. Conclusion: The acoustic parameters and VHI improved significantly after laryngeal microsurgery in patients with benign vocal cord disease.
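
For readers unfamiliar with the perturbation measures named in this abstract, the sketch below shows one common definition: the mean absolute difference between consecutive cycles divided by the mean value, expressed as a percentage. Applied to glottal cycle periods it gives a local jitter; applied to cycle peak amplitudes it gives a local shimmer. This is an assumption for illustration only. The abstract does not state which definitions the analysis program used, and the function name is hypothetical.

```python
def local_perturbation(values):
    """Cycle-to-cycle perturbation as a percentage of the mean value.

    values: consecutive cycle periods (for jitter) or consecutive
    cycle peak amplitudes (for shimmer), as positive numbers.
    """
    if len(values) < 2:
        raise ValueError("at least two cycles are required")
    # Absolute difference between each cycle and the next one.
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_value = sum(values) / len(values)
    return 100.0 * mean_diff / mean_value
```

A perfectly periodic signal gives 0, and larger values indicate a less stable voice, which is why a decrease after surgery is read as improvement.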


후두기관 분리술로 치료한 만성 흡인 15례 (Laryngotracheal Separation in Patient with Chronic Intractable Aspiration)

  • 공일규;안수연;김봉직;정은정;이명철;;성명훈;김광현
    • 대한기관식도과학회지
    • /
    • Vol. 13, No. 1
    • /
    • pp.23-28
    • /
    • 2007
  • Background and Objectives: Intractable aspiration in patients with impaired protective function of the larynx often results in multiple episodes of aspiration pneumonia, repeated hospitalizations, and expensive nursing care. The authors have reported the preliminary results of laryngotracheal separation (LTS) in patients with chronic intractable aspiration. The purpose of this study was to report the follow-up outcomes of patients who underwent LTS. Materials and Methods: A retrospective review of 15 patients who underwent LTS between 1996 and 2006 was conducted. Ages ranged from 3 to 72 years. Results: Eight patients had morbid aspiration as a consequence of acquired neurologic injuries and seven as a consequence of congenital neurologic injuries. Two patients had a postoperative fistula, which was well controlled with local wound care. Following LTS, aspiration was effectively controlled in all patients, and eight were able to tolerate a regular diet. Conclusion: LTS is a low-risk, successful, definitive procedure that decreases the potential for aspiration, pulmonary complications, and duration of hospitalization and increases quality of life, especially in patients with irreversible upper airway dysfunction and poor speech potential.


인터넷상에서 지적재산권 분쟁에 따른 준거법 적용에 관한 논점 (A study on the Governing Law to Application under the Intellectual Property Right Disputes in Internet)

  • 박종삼
    • 한국중재학회지:중재연구
    • /
    • Vol. 14, No. 1
    • /
    • pp.133-156
    • /
    • 2004
  • The rapid development of the internet might not have occurred without the techniques of linking and framing, which give users flexible and easy access to other websites. These techniques have enabled internet users to navigate the internet efficiently and sort through the products, services, and information available on it. The advent of the global information structure and the so-called EC revolution raise countless new issues and questions. There are no limitations regulating expression in cyberspace, owing to the internet's qualities of anonymity, diversity, and spontaneity. The freedom of speech has therefore expanded in both time and space, which was impossible with the old communication system. Although online technology raises many new legal issues, the law available to help us resolve them, at least today, is largely based on the world as it existed before online commerce became a reality. The challenge is thus to predict how these new legal issues may be resolved using current law. As a result of the drastic change in the environment for international trade that has taken place in parallel with the global information technology revolution, the scope of issues to be resolved by conflict of laws principles has expanded remarkably, and various issues that are entirely new in type and nature have arisen. Furthermore, the prior act was regarded as insufficient in that it lacked rules on international adjudicatory governing law, whereas the public expected private international law to function as the basic law of international legal relations, encompassing rules on governing law, given the increase in international disputes. Accordingly, private international law has also attracted more attention in Korea.


3차원 모델을 이용한 입모양 인식 알고리즘에 관한 연구 (A study on the lip shape recognition algorithm using 3-D Model)

  • 남기환;배철수
    • 한국정보통신학회논문지
    • /
    • Vol. 6, No. 5
    • /
    • pp.783-788
    • /
    • 2002
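  • Recent research and development in communication systems applies both the audio information of the voice and the visual information of the speaking face, which yields a higher recognition rate than providing audio information alone. This study therefore aims to implement, on an ordinary personal computer, recognition of lip shapes, which offer the highest visual discriminability in speechreading, one of the alternative means of communication for the hearing impaired. Unlike existing methods, this paper uses a 3-D model to recognize lip shapes in image sequences of a speaker, providing 3-D feature information such as the degree of mouth opening, jaw movement, and lip protrusion. To obtain these features, a 3-D shape model is fitted to the input video, and the amount of change at each feature point of the fitted 3-D shape model is used as a recognition parameter. The video is segmented into recognition units according to the gradient of the intensity obtained from the changes in the 3-D feature points, and recognition is performed by using each 3-D feature vector as a recognition parameter of a discrete HMM recognizer.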