통합 검색 | Korea Science

웨이브렛 변환을 이용한 음성신호의 성문폐쇄시점 검출 (Detection of Glottal Closure Instant for Voiced Speech Using Wavelet Transform)

배건성
- 음성과학
- /
- 제7권3호
- /
- pp.153-165
- /
- 2000
During the phonation of voiced sounds, instants exist where the glottis is opened or closed, due to the periodic vibration of the vocal cord. When closed, this is called the glottal closure instant(GCI) or epoch.. The correct detection of the GCI is one of the important problems in speech processing for pitch detection, pitch synchronous analysis, and so on. Recently, it has been shown that the local maxima points of the wavelet transformed speech signal correspond to the GCIs of speech signal. In this paper, we investigate the accuracy of Gels estimated from this wavelet transformed speech signal. For this purpose we compare them with the negative peak points of the differentiated EGG signal that represents the actual GCIs of speech signal.
PDF

웨이브렛 변환을 이용한 음성신호의 유성음/무성음/묵음 분류 (Voiced/Unvoiced/Silence Classification웨 of Speech Signal Using Wavelet Transform)

손영호;배건성
- 음성과학
- /
- 제4권2호
- /
- pp.41-54
- /
- 1998
Speech signals are, depending on the characteristics of waveform, classified as voiced sound, unvoiced sound, and silence. Voiced sound, produced by an air flow generated by the vibration of the vocal cords, is quasi-periodic, while unvoiced sound, produced by a turbulent air flow passed through some constriction in the vocal tract, is noise-like. Silence represents the ambient noise signal during the absence of speech. The need for deciding whether a given segment of a speech waveform should be classified as voiced, unvoiced, or silence has arisen in many speech analysis systems. In this paper, a voiced/unvoiced/silence classification algorithm using spectral change in the wavelet transformed signal is proposed and then, experimental results are demonstrated with our discussions.
PDF

성문특성이 제거된 성도특성 추출에 관한 연구 (A Study on Extract of Vocal Tract Characteristic after Concealing the Vocal Cord Property)

임지선
- 한국산학기술학회:학술대회논문집
- /
- 한국산학기술학회 2010년도 춘계학술발표논문집 1부
- /
- pp.253-256
- /
- 2010
Since the amplitude of voiced fall off at about -20dB/decade, dynamic range is often compressed prior to spectral analysis so that details at weak, high frequencies may be visible. Preemphasizing the speech, either by differentiating the analog speech $s_a$(t) prior to A/D conversion or by differencing the discrete-time s(n)=$s_a$(nT), compensating for falloff at high frequencies. The most common form of preemphasis is y(n)=s(n)-As(n-1), where A typically lies between 0.9 and 1.0 and reflects the degree of pre-emphasis. In this paper, we proposed that A is adjusted at each time by measuring the slope of envelope in frequency domain.
PDF

후두질환에 따른 자음의 음성발현시간의 특성 (The Characteristics of Voice Onset Time of the Korean Stops in the Benign Laryngeal Disorders)

홍기환;이화욱;김진성;이은정;소상수;최동일;양윤수
- 대한후두음성언어의학회지
- /
- 제17권2호
- /
- pp.98-102
- /
- 2006
Background and Objectives : Voice onset time(VOT) is defined as the time interval from oral release of a stop consonant to the onset of glottal pulsing in the following vowel. VOT is a temporal characteristics of stop consonants that reflects the complex timing of glottal articulation relative to supraglottal articulation. Stop consonants are characterized by creation of a pressure difference across a complete occlusion in the vocal tract, followed by a sudden release 'burst' due to opening that occlusion. The objects of this study is to evaluate a usefulness of voice onset time in the assessment of voice disorderd patients. Subjects : Subjects were 20 adults with normal voice and with benign laryngeal disorders. Subjects with voice disorders represented the following vocal pathologies : vocal polyp, vocal nodule, Reinke's edema and unilateral vocal fold paralysis(UVFP). Control subjects were matched for age (21-40 yews old) and sex(male) with the voice disorders subjects and had normal vocal qualities with no history of voice disorders. Methods : Each voice-disordered and matched control subject read the test passages containing three types of Korean bilabial consonants. VOT measures were made for the initial $/p/p^h/\;and\;/p'/$. VOT was measured using acoustic waveform or wide band spectrogram. Results : For each voiceless stop consonants, there was a significant difference in VOT between the voice disordered and normal subjects. The mean VOTs of the lax stops in UVFP was significantly shorter than those of control subjects in the UVFP. The mean VOTs of the aspirated stops in the vocal polyp and nodule were longer than those of control subjects, but not significant. The mean VOTs of the glottalized in voice disordered groups were longer than those of control subjects, and significant statistically in the UVFP. Conclusions : VOT may be a clinically useful acoustic parameter in the assessment of voice disordered patients, especially in the unilateral vocal fold paralysis.
PDF

선형 예측법에 의한 음성신호의 분석과 그 응용 방안 (Analysis of Speech Signals by linear prediction and It's Application)

김명규
- 대한전자공학회논문지
- /
- 제18권4호
- /
- pp.27-33
- /
- 1981
한국어의 주요 단모음에 대하여 음정을 변화시켜 가면서 전형 예측법에 따른 모형 스펙트럼과 성도의 형태를 추정함으로써 음의 고저에 따른 특성 변화를 분석하였다. 또 분석한 결과를 이용하여 음소의 조합에 의한 음성합성방안을 제시하고 실험에 의해 그 타당성을 입증하였다.
PDF

VTN을 이용한 화자 정규화에 관한 연구 (A Study on Speaker Normalization using VTN)

손창희;손종목;배건성
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 2001년도 제14회 신호처리 합동 학술대회 논문집
- /
- pp.499-502
- /
- 2001
본 연구에서는 화자에 따라 서로 다른 성도의 길이에 의해 발생하는 음성인식 시스템의 성능 저하를 줄이기 위하여, VTN(Vocal Tract Normalization)을 음성인식 시스템에 적용하고, 주소 인식 실험을 통하여 인식 성능을 평가하였다. 또, VTN을 CMN과 동시에 적용하여 인식 실험을 하였다. 실험에서는 화자간 성도길이의 차이를 반영하기 위하여 13개의 Warping 계수에 대해 필터 뱅크를 이용한 선형 Warping 방법을 적용하였다. 실험결과, Baseline 인식 시스템에 비하여 VTN을 적용하면, WER(Word Error Rate)이 1.24% 감소하였고, CMN과 VTN을 동시에 적용한 실험에서는 Baseline 인식 시스템과 비교하여 WER이 0.33% 감소 하였지만 VTN을 적용한 실험결과와 비교하면 오히려 0.91% 증가하였다.
PDF

목소리 특성의 주관적 평가와 음성 특징과의 상관관계 기초연구 (A Preliminary Study on Correlation between Voice Characteristics and Speech Features)

한성만;김상범;김종열;권철홍
- 말소리와 음성과학
- /
- 제3권4호
- /
- pp.85-91
- /
- 2011
Sasang constitution medicine utilizes voice characteristics to diagnose a person's constitution. To classify Sasang constitutional groups using speech information technology, this study aims at establishing the relationship between Sasang constitutional groups and their corresponding voice characteristics by investigating various speech feature variables. The speech variables include features related to speech source and vocal tract filter. Experimental results show that statistically significant correlation between voice characteristics and some speech feature variables is observed.
PDF

진동센서를 이용한 객관적 비강공명 측정장치의 개발에 대한 연구 (Development of an Objective Measuring Device for the Nasal Resonance using the Vibratory Sensor)

박용재;최홍식;김광문;홍원표
- 대한음성언어의학회:학술대회논문집
- /
- 대한음성언어의학회 1994년도 제2회 학술대회 연제순서 및 초록집
- /
- pp.84-84
- /
- 1994
사람의 음성은 성대에서 성대음이 발성되어 성도(vocal tract)에서 공명되고 여과(filter)되어 생성된다. 성도로는 후두로부터 하인두강, 중인두강, 구강으로 이어지는 주된 통로와 하인두강, 중인두강, 상인두강, 비강으로 이어지는 보조적인 통로가 있다. 보통의 모음 발성 시에는 구강으로 통하는 통로가 주로 공명강으로 작용되며 비강 통로는 별 작용을 하지 않지만, 'ㄴ, ㅁ, o, ' 등의 비 자음을 발성할 때에는 비강통로가 주 공명강으로 작용된다. (중략)
PDF

우리말 9개 모음에서 음 대와 성도내 좁힘의 관계에 관한 연구 (Relationship between formants and constriction area of vocal tract in 9 Korean standard vowels)

서경식;김광문;최홍식;정태섭;곽도식;이현복
- 대한음성언어의학회:학술대회논문집
- /
- 대한음성언어의학회 1993년도 제1회 학술대회 연제순서 및 초록집
- /
- pp.17-17
- /
- 1993
한국어 모음 발성시 책은 Videovelopharyngogram과 동시에 녹음된 음성을 분석하여, 각 모음별로 성도내 좁힘에 대한 성문으로부터의 거리를 측정하고, 음 대를 구하여 그 상관관계를 알아보았다. 측정 인원은 표준말을 사용하는 것으로 판정된 성인 남녀 각 5명으로 하고, 측정방법으로 Simens Pantoscop 를 이용하여 Videovelopharyngogram을 얻고, DT282-F-6 SE board로 digitized된 음성을 CSpeech version 3. 의 software로 분석하였다.(중략)
PDF

Chasing ideas in phonetics

Ladefoged, Peter
- 음성과학
- /
- 제5권2호
- /
- pp.7-16
- /
- 1999
Starting as a poet, I learned about the sounds of words with David Abercrombie. Then, remembering my background in physics, I moved to studying acoustic phonetics and speech synthesis. From there I learned about psychology and how. to test perceptual theories. A meeting with a physiologist led to work on the use of the respiratory muscles in speech. Later I landed in Africa teaching English phonetics and learning about African languages. When I went to UCLA to set up a lab I was able to find bright students who helped make computer models of the vocal tract and taught me linguistic theory. And I was able to continue wandering around the world, describing the sounds of a wide range of languages.
PDF

검색결과 172건 처리시간 0.023초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)