• Title/Summary/Keyword: 발성특성 (phonation characteristics)

Search results: 217 items (processing time: 0.022 s)

최적경로와 가중직교인자를 이용한 화자인식 (Speaker Recognition Using Optimal Path and Weighted Orthogonal Parameters)

  • 박승규;배철수
    • 한국음향학회지 / Vol. 11, No. 2 / pp.68-72 / 1992
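  • Recently, many researchers have performed speaker recognition with statistical processing methods based on the KLT. However, the limited degree to which statistical processing captures speaker individuality, together with the dynamic variation in speaking rate, degrades the speaker recognition rate. To emphasize individuality in each speaker's orthogonal parameters, this study investigates a speaker recognition method that uses weighted orthogonal parameters, weighted by the speaker's eigenvalues, together with the optimal path of DTW, which normalizes the dynamic temporal characteristics of speech. To verify the method, three approaches were compared: speaker recognition by conventional statistical processing, by the optimal path, and by the optimal path with weighted orthogonal parameters. The proposed method achieved a higher speaker recognition rate than the conventional method, confirming its effectiveness.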

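The abstract above mentions the DTW optimal path but does not give the algorithm. As a minimal sketch only, the following textbook dynamic time warping on scalar sequences shows how an optimal alignment path can be recovered by backtracking. The function name `dtw_path` and the absolute-difference local distance are assumptions for illustration and are not taken from the paper; the weighted orthogonal parameters are not modelled here.

```python
def dtw_path(a, b):
    """Align two numeric sequences with classic DTW.

    Returns (total_cost, path), where path is a list of (i, j) index pairs.
    Hypothetical helper for illustration; not the authors' implementation.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal accumulated distance aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])  # assumed local distance
            cost[i][j] = d + min(cost[i - 1][j],      # step in a only
                                 cost[i][j - 1],      # step in b only
                                 cost[i - 1][j - 1])  # step in both
    # Backtrack from the end, always moving to the cheapest predecessor.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        candidates = [
            (cost[i - 1][j - 1], i - 1, j - 1),
            (cost[i - 1][j], i - 1, j),
            (cost[i][j - 1], i, j - 1),
        ]
        _, i, j = min(candidates)
    path.reverse()
    return cost[n][m], path


if __name__ == "__main__":
    total, path = dtw_path([1, 2, 3], [1, 2, 2, 3])
    print(total, path)
```

In a speaker recognition setting the scalars would be replaced by feature vectors and the local distance by a vector distance; that substitution is left out to keep the sketch short.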

후두적출술 환자의 발성법에 따른 음향학적 특성 (Acoustic Characteristics of Patients with Total Laryngectomees via Voice Rehabilitation Techniques)

  • 장효령;심희정;고도흥
    • 말소리와 음성과학 / Vol. 5, No. 4 / pp.25-32 / 2013
  • This study examines the acoustic characteristics of three voice rehabilitation techniques, the electrolarynx (EL), standard esophageal (SE) speech, and tracheoesophageal (TE) speech, used by 17 laryngectomized patients. Voice quality was analyzed with MDVP. To compare acoustic characteristics, patients were asked to produce the vowel /a/. The acoustic analysis covered fundamental frequency (f0), jitter, shimmer, and noise-to-harmonic ratio (NHR). There were no statistically significant differences between the average measurements of SE and TE speakers, the same tendency found in previous studies. There was a significant difference between SE and EL speakers. On the other hand, there were no statistically significant differences between the average measurements of TE and EL speakers on any acoustic measure. This research contributes to establishing a baseline for speech characteristics in voice rehabilitation of laryngectomized patients. In future work, the present findings and issues should be considered in the context of gender: the number of women diagnosed with laryngeal cancer continues to rise, and their acoustic characteristics may differ from those of men.
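
The abstract above lists jitter and shimmer among its measures without defining them. As a rough sketch, one commonly used "local" definition takes the mean absolute difference between consecutive cycles relative to the mean, in percent (periods for jitter, peak amplitudes for shimmer). The function name and the sample values below are made up for illustration, and MDVP's own parameter definitions may differ in detail.

```python
def local_perturbation(values):
    """Mean absolute cycle-to-cycle difference, relative to the mean, in percent.

    Applied to glottal periods this gives a local jitter estimate; applied to
    per-cycle peak amplitudes it gives a local shimmer estimate.
    """
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_value = sum(values) / len(values)
    return 100.0 * mean_diff / mean_value


if __name__ == "__main__":
    periods_ms = [8.0, 8.2, 7.9, 8.1]       # made-up glottal periods
    amplitudes = [0.50, 0.48, 0.52, 0.49]   # made-up peak amplitudes
    print("jitter  (%):", local_perturbation(periods_ms))
    print("shimmer (%):", local_perturbation(amplitudes))
```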

KAERI 소각시설의 실용화를 위한 방사학적 안전성 분석 (Radiological Safety Analysis for Practical Operation of the KAERI Incineration Facility)

  • 양희철;김정국;김창회;박원만;김봉환;김준형;오원진;박현수
    • 한국원자력학회:학술대회논문집 / 한국원자력학회 1998년도 춘계학술발표회논문집(2) (Proceedings of the Korean Nuclear Society 1998 Spring Meeting, Vol. 2) / pp.409-414 / 1998
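  • After five years of technology demonstration and safety review, the demonstration incineration facility at the Korea Atomic Energy Research Institute (KAERI) was licensed as a facility for incinerating combustible $\beta$/$\gamma$ waste generated on site. Based on the results of demonstration burns of isotope-containing simulated waste and of combustible waste from nuclear power plants, the radiological hazards of annual emissions and of hypothetical accidents were assessed, confirming that the hazard of incinerating low-level waste on site is negligibly small. The demonstration tests showed that the main radionuclides released are the semi-volatile Cs-137 and Cs-134, which are highly volatile in the high-temperature incinerator and present at high concentrations in low-level waste. For incineration of waste at 0.109 mCi/kg with the same nuclide composition as power-plant combustible waste, the emitted concentrations of Cs-137 and Cs-134 were estimated to slightly exceed 10% of the permissible concentration in air. Test burns with non-radioactive CsCl were used to examine particle formation and removal of volatilized Cs during off-gas cooling in the low-temperature off-gas treatment system. The volatilized gaseous Cs mostly formed submicron particles during dry off-gas cooling, but less than 5% fell in the transition-regime size range, so the removal efficiency of the bag filter, the main filtration unit, was 99.9% or higher.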


이중음성을 보인 변성발성장애 환자 음성의 음향학적 특성 및 치험례 -증 례 보 고- (Diplophonia in Mutational Falsetto : Acoustic Characteristics and Treatment -A Case Report-)

  • 임재열;임성은;이성은;최홍식
    • 대한후두음성언어의학회지 / Vol. 15, No. 1 / pp.47-51 / 2004
  • Normally, as a result of increased laryngeal growth, the male voice drops about one octave in pitch during adolescence. Failure of the voice to drop in pitch is considered a clinically significant voice disorder, 'mutational dysphonia'. The aim of this article is to evaluate the changes brought about by voice therapy in an 18-year-old patient with mutational dysphonia, using EGG measures from the Lx Speech Studio program (Laryngograph Ltd, UK) together with acoustic and aerodynamic studies. The results from Lx Speech Studio showed a bimodal distribution of DFx (Hz), DQx (%), and QxFx, and diplophonic characteristics. After voice therapy combined with the manual compression method, the distributions of DFx, DQx, and QxFx became uniform, with a dramatic reduction of the higher pitch level. These findings suggest that EGG measures help in choosing treatment options, monitoring the efficacy of therapy, and estimating the prognosis of disease.


복잡계로서의 건축개념과 조형적 특성에 관한 연구(I) (The Architectural Concepts and Design Properties as a Complex System)

  • 김주미
    • 한국실내디자인학회논문집 / No. 22 / pp.123-131 / 2000
  • The purpose of this study is to propose a new design concept and its properties within a new paradigm. Contemporary students of architectural design seem to be redefining the mechanistic and reductive approach to design method based on Euclidean geometry. The organic space-time and holistic viewpoint that form the background of this study are radically different from those of modern design. The study consists of three sections, as follows. First, it presents the concept of a complex system and the properties of complexity found in the new natural sciences, and tries to combine that new geometry with architectural design to provide a methodological basis for morphogenesis and transformation. Second, complexity in architecture is defined as fractal shape, folded space, and an irreducible organic system that cannot be fully understood through the modernist idea of architecture. Third, complexity in architecture is a strategy based on the electronic paradigm that would enable the emergence of creative possibilities. Complexity theory offers new insights that not only explain natural laws but also define dynamic architecture. Finally, this study places great emphasis on an organic world-view of spatial organization, which I hope will contribute to generating a greater number of creative possibilities for design.


광류와 조음 발성 특성을 이용한 립리딩 알고리즘 (A Lip-reading Algorithm Using Optical Flow and Properties of Articulatory Phonation)

  • 이미애
    • 한국멀티미디어학회논문지 / Vol. 21, No. 7 / pp.745-754 / 2018
  • Language is an essential tool for verbal and emotional communication among human beings, enabling them to engage in social interactions. Most hearing-impaired people can speak; however, they do not receive feedback on their pronunciation. This results in impaired communication owing to incorrect pronunciation, which causes difficulties in their social interactions. If hearing-impaired people could receive continuous feedback on their pronunciation and phonation through lip-reading training, they could communicate more effectively with people without hearing disabilities, anytime and anywhere, without the use of sign language. In this study, the mouth area is detected from videos of learners speaking monosyllabic words. The grayscale information of the detected mouth area is used to estimate a velocity vector using optical flow. This information is then quantified as feature values to classify vowels. Subsequently, a system is proposed that classifies monosyllables by algebraic computation of geometric feature values of the lips using the characteristics of articulatory phonation. Additionally, the system provides feedback by comparing the information obtained from the sample categories with the experimental results.

녹음 환경의 차이에 따른 화자의 음원 특성 비교: 발성유형지수 k를 중심으로 (Comparison of Speaker's Source Characteristics in Different Recording Environments by Using Phonation Type Index k)

  • 이후동;강선미;박한상;장문수
    • 음성과학 / Vol. 10, No. 3 / pp.213-224 / 2003
  • Spoken sound includes not only the speaker's source but also the characteristics of the vocal tract and speech radiation. This paper is based on the theory of Park[1], who proposes the Phonation Type Index (PTI) k, a variable that captures the characteristics of the speaker's source while excluding those of the speaker's vocal tract and speech radiation. Following Park's theory, we collect data under different recording environments with an expanded data set, and analyze the data to see whether PTI k shows good discriminating power as a variable for speaker recognition. In the experiment, each of 5 male speakers reads 8 sentences ten times in a recording room and in an office; we extract PTI k for each speaker and measure its discriminating power for each speaker. The results show that PTI k has excellent power to discriminate between speakers. We also confirm that PTI k shows similar results even when the recording environment is changed.


돼지의 수.포유 행동 I. 수유 행동에서 모돈(랜드레이스$\times$요크셔) 발성음의 특성 (Nursing and Suckling Behaviour in Domestic Pigs 1. Characteristics of the Grunting Sound of the Sow(Landrace $\times$ Yorkshire) during Nursing Behaviour)

  • 장홍희;연성찬
    • 한국임상수의학회지 / Vol. 19, No. 2 / pp.191-194 / 2002
  • The nursing vocalization of domestic pigs (Landrace $\times$ Yorkshire) was investigated with respect to common features. All vocalizations uttered during nursing by 5 sows at 5 days after farrowing were recorded, and 305 grunts were processed in a spectrograph. The sow's repeated grunting during nursing can be regarded as a contact call and as a signal from the mother to start and synchronize the suckling behavior of the piglets. Analysis in the time domain revealed the gross structure of the call, whereas the fine structure of single grunts was investigated in the frequency domain. Nursing interval, duration of nursing behavior, duration of grunt, grunt rate per 10 seconds, fundamental frequency, the first to fourth formants, and spectrum were investigated. The results showed that the mean interval between successive nursings was 25, 4.6 min and the duration of nursing behavior was 3.2 $\pm$ 0.7 min. The average duration of a grunt was 203.9 $\pm$ 63.6 ms. The formant contours could be identified. Nursing behavior might be disturbed by the grunts of an alien sow.

대학박물관과 정보화 (University museum and informatization)

  • 이정호
    • 고문화 / No. 57 / pp.301-314 / 2001
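  • 1) Traditional university museums fulfill their educational function through media such as exhibitions and catalogues. This carries the burden that the intended purpose can be achieved only by drawing visitors into the physical space of the museum. With the advent of the information society, however, museums can shed much of the burden of drawing users into a physical space. This becomes possible by distributing the museum's existence, characteristics, and exhibition information on the Internet through the communication channel of a cyber museum. It amounts to a shift in the traditional paradigm of the university museum. 2) The technical background of museum informatization is the convergence of personal computers with highly developed information and communication technologies, advances in databases and multimedia, and changes in knowledge delivery and information management brought about by hypertext. 3) To informatize a university museum, it is necessary to identify the resources it holds for informatization, build the accumulated resources into a database, and on that basis provide a search service to users. Some technical proficiency is also needed for the continued maintenance of the service. Finally, the resources are repackaged, for example as images, and provided to users to improve accessibility, with care taken that the information obtained can be productively turned into knowledge. 4) In carrying out museum informatization, attention should be paid to dehumanization, which has been pointed out as a drawback of the information society; it is also worth pursuing diversity while trying real-time information delivery.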


내전형 연축성 발성장애의 연속 발화 특성 (Characteristics of Connected Speech in ADSD)

  • 황연신;김재옥;최홍식
    • 말소리와 음성과학 / Vol. 1, No. 1 / pp.93-98 / 2009
  • The aim of this study was to investigate the voice characteristics of adductor spasmodic dysphonia (ADSD) through electroglottographic and acoustic examination at the sentence level. Clinical records of 86 female ADSD patients (aged $20{\sim}50$ years) and control records of 86 normal females (aged $20{\sim}40$ years) were recorded with Speech Studio (Laryngograph Ltd., UK). An independent t-test was used to compare the ADSD and normal groups. The results were as follows. (1) Fundamental frequency ($F_0$) was significantly lower in the ADSD group than in the normal group. (2) Irregularity of frequency and closed quotient (CQ) were significantly higher in the ADSD group. (3) Voiceless duration was increased and voiced duration was significantly decreased in the ADSD group. (4) Fricative duration was increased in the ADSD group, but not significantly. In conclusion, a strained, tight, and choked voice shows an increase in CQ, a tremulous voice shows an increase in irregularity of frequency, and a less feminine voice shows a decrease in $F_0$. The increase in voiceless and fricative duration and the decrease in voiced duration are related to reduced speech intelligibility.

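The abstract above compares the two groups with an independent t-test. As a small sketch under the equal-variance (pooled) assumption, the test statistic and degrees of freedom can be computed as below. The function name `pooled_t` and the sample numbers are hypothetical and are not data from the study; obtaining a p-value additionally requires the t distribution, for which a statistics library would normally be used.

```python
from math import sqrt
from statistics import mean, variance


def pooled_t(x, y):
    """Student's two-sample t statistic with pooled variance.

    Returns (t, degrees_of_freedom). Assumes independent samples with
    equal population variances; illustrative helper only.
    """
    nx, ny = len(x), len(y)
    if nx < 2 or ny < 2:
        raise ValueError("each group needs at least two observations")
    df = nx + ny - 2
    # Pooled estimate of the common variance (sample variances use n - 1).
    sp2 = ((nx - 1) * variance(x) + (ny - 1) * variance(y)) / df
    t = (mean(x) - mean(y)) / sqrt(sp2 * (1.0 / nx + 1.0 / ny))
    return t, df


if __name__ == "__main__":
    # Made-up F0 values in Hz for two small groups.
    patients = [182.0, 175.5, 190.2, 178.9]
    controls = [205.1, 210.4, 198.7, 215.0]
    print(pooled_t(patients, controls))
```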