Search | Korea Science

Automatic Vowel Onset Point Detection Based on Auditory Frequency Response (청각 주파수 응답에 기반한 자동 모음 개시 지점 탐지)

Zang, Xian;Kim, Hag-Tae;Chong, Kil-To
- Journal of the Korea Academia-Industrial cooperation Society
- /
- v.13 no.1
- /
- pp.333-342
- /
- 2012
This paper presents a vowel onset point (VOP) detection method based on the human auditory system. This method maps the "perceptual" frequency scale, i.e. Mel scale onto a linear acoustic frequency, and then establishes a series of Triangular Mel-weighted Filter Bank simulate the function of band pass filtering in human ear. This nonlinear critical-band filter bank helps greatly reduce the data dimensionality, and eliminate the effect of harmonic waves to make the formants more prominent in the nonlinear spaced Mel spectrum. The sum of mel spectrum peaks energy is extracted as feature for each frame, and the instinct at which the energy amplitude starts rising sharply is detected as VOP, by convolving with Gabor window. For the single-word database which contains 12 vowels articulated with different kinds of consonants, the experimental results showed a good average detection rate of 72.73%, higher than other vowel detection methods based on short-time energy and zero-crossing rate.
https://doi.org/10.5762/KAIS.2012.13.1.333 인용 PDF KSCI

An Analysis of the Vowel Formants of the Young Males in the Buckeye Corpus (벅아이 코퍼스에서의 젊은 성인 남성의 모음 포먼트 분석)

Yoon, Kyu-Chul;Noh, Hye-Uk
- Phonetics and Speech Sciences
- /
- v.4 no.2
- /
- pp.41-49
- /
- 2012
The purpose of this paper is to extract the vowel formants of the ten young male speakers from the Buckeye Corpus of Conversational Speech [1] and to analyze them in comparison to earlier works in terms of various phonetic factors that are expected to affect the realization of the formant distribution. The first two formant frequency values were automatically extracted with a Praat script along with such factors as the place of articulation, the content versus function word information, syllabic stress information, the location in a word, location in utterance, speech rate of three consecutive words, and the word frequency in the corpus. The results indicated that the formant patterns from the corpus were very different from those of earlier works although the overall pattern was similar and that the factors were strongly responsible for the realization of the two formants. The purpose of this paper is to extract the vowel formants of the ten young male speakers from the Buckeye Corpus of Conversational Speech [1] and to analyze them in comparison to earlier works in terms of various phonetic factors that are expected to affect the realization of the formant distribution. The first two formant frequency values were automatically extracted with a Praat script along with such factors as the place of articulation, the content versus function word information, the syllabic stress information, the location in a word, the location in an utterance, the speech rate of the three consecutive words, and the word frequency in the corpus. The result indicated that the formant patterns from the corpus were very different from those of earlier works although the overall pattern was similar and that the factors were strongly responsible for the realization of the two formants.
https://doi.org/10.13064/KSSS.2012.4.2.041 인용 PDF

The Articulation Characteristics of the Profound Hearing-Impaired Children with Reference to Formant Bandwidth (심도 청각장애 아동의 조음 특성: 포먼트 대역폭을 중심으로)

Choi, Eunah
- Phonetics and Speech Sciences
- /
- v.6 no.2
- /
- pp.55-64
- /
- 2014
This study measured formant bandwidths of profound hearing impaired children and examined the characteristics of their articulation. For this study, 10 cochlear implanted children(CI), 10 hearing aid children(HA) and 10 normal hearing children(NH) were asked to read 7 Korean vowels(/ɑ, ʌ, o, u, ɯ, i, ɛ/). The subjects' readings were recorded by NasalView and analyzed by Praat. The analysis of the formant bandwidths explains the degree of vocal fold opening and the characteristics of radiation. Through the analysis of formant bandwidth, we can see that the hearing-impaired maintain vocal fold tension when they speak high vowels and characteristics of radiation. Narrower B1 means better maintain vocal fold tension, wider B2 means more front and wider B3 means the rounder lips. CI's B1 was widest and NH's was narrowest. And females' B1 was wider than males'. Among vowels, B1 of /a/ was widest, and B1 of /i/ was narrowest. In the case of B2, HA and NH's B2 was wider than CI's. Females' B2 was wider than males'. And B2 of /i/ was widest, and B2 of /ʌ/ was narrowest. In the case of B3, NH's was widest, and CI's was narrowest. Males' was wider than females'. Among vowels, B3 of /o/ was widest, and B3 of /ɛ/ was narrowest. As a result, first, through the analysis of B1, we can find that NH and males could better maintain vocal fold tension than the hearing-impaired or females, and all children articulate /i/ with vocal fold tension than other vowels. Second, through the analysis of B2, NH and HA articulate vowels with the weaker rounded than CI does. And females articulate vowels with the weaker rounded than males do. Third, through the analysis of B3, NH articulate vowels with the rounder than HA or CI do, and males articulate vowels with the rounder than females do. Through the results, we can expect that the analysis of formant bandwidth will be applied to the therapy of articulation for the hearing-impaired with hearing aids or cochlear implant.
https://doi.org/10.13064/KSSS.2014.6.2.055 인용 PDF KSCI

A Lingual Sound Analysis based on Oriental Medicine Auscultation for Heart Diseases Diagnosis (심장(心臟) 질환(疾患) 진단(診斷)을 위한 한의학적 청진(聽診) 기반의 설음(舌音) 분석)

Kim, Bong-Hyun;Cho, Dong-Uk;Her, Sung-Ho
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.34 no.8B
- /
- pp.830-838
- /
- 2009
Oriental medicine lacks diagnosis data in fixed quantity possible to express visually to patients by depending on clinician's intuition than Western medicine that continues to development by various diagnosis devices. For that, this paper intends to examine relation between heart and voice signal regarded as center organ and source of life and mind in order to implement objectification through the visualization of oriental diagnosis method above all. According to because the heart is related to the tongue among five organs, by thinking with sounds, we would design the way of identifying existence of heart diseases focused on the fact that lingual sound pronunciation of heart patient is inexact. For this, we achieved a comparison, analysis of statistical bandwidth and morphological modeling of the second formants frequency about a lingual sound for their voice constituted subject group of heart diseases and normal people. Finally, we analyzed interrelationship to the result of experiment by designed method.
PDF KSCI

Face Feature Extraction for Child Ocular Inspection and Diagnosis of Colics by Crying Analysis (소아 망진을 위한 얼굴 특징 추출 및 영아 산통 진단을 위한 울음소리 분석)

Cho Dong-Uk;Kim Bong-Hyun
- The KIPS Transactions:PartB
- /
- v.13B no.2 s.105
- /
- pp.97-104
- /
- 2006
There is no method to control for the child efficiently when disease happens who cannot be able to express his symptoms. Therefore, doctor's diagnosis depends on inquiring from child's patients, that leads to wrong diagnosis result. For this, in this paper, we would like to develop child ocular inspection, auscultation diagnosis instruments, using Oriental medicine principle that living body signal of five organs and six hallow organs which reflects patients face and voice We would like to get more accurate diagnosis result for child's symptoms from doctor's intuition on the basis of diagnostic sight visualization, objectification, quantization itself. This paper develops color revision, YCbCr application, and face color selection and five sensory organs and nose or apex extraction method etc, in child ocular inspection by first work achievement sequence among the whole development systems. Also, in occasion of child auscultation, crying characteristics of colics through pitch, intensity and formant analysis is numerized and objectifies doctor's intuition through this. Finally, experiments are performed to verify the effectiveness of the proposed methods.
https://doi.org/10.3745/KIPSTB.2006.13B.2.097 인용 PDF KSCI

A SPECTROGRAPHICAL STUDY OF KOREAN VOWELS

LEE H.B.;Zhi M.J.
- MALSORI
- /
- no.6
- /
- pp.4-12
- /
- 1983
이 논문은 음향 분석기를 이용하여 한국어의 단순모음 8개를 음향 음성학적으로 분석하고 그 결과를, 이 현복의 1971년 논문 "현대 서울말의 모음 음가"에서 기분 모음을 기준으로 하여 기술한 단순 모음의 소리값과 비교하는 데어 목적이 있다. 특히, 한국어의 모음 1)길고 세게 날 때, 2)짧고 세게 날 때, 그리고 3) 여리게 날 때의 세가지 환경에 따라 변이음의 음가가 달리 나타난다는 이 현복의 이론을 음향 음성학적으로 확인해 보는 것이 연구를 하는 주요 관심사이다. 이 실험에 사용된 자료는 위에 말한 이 현복의 논문과 "한국어 음성학"(김선기, 1937, 1971; 영문)에 제시된 낱말로 이루어져 있으며, 이를 스웨덴에 유학중인 지 민제가 자신의 목소리로 직접 녹음하여 위메오 대한 음성학과의 음향 음성학 실험실에서 음향분석기로 분석한 다음, 각 모음의 제1 및 제2포인트를 측정하여 리를 토대로 음향도를 만들었다. 이 실험 결과는 다음과 같이 요약할 수 있다. : 1)그림 2,3과 포먼트 표에서 보인 바와 같이, 모음 /이, 에, 오, 으/는 각각 이 현복의 주장대로 환경에 따라 세 개의 분명히 다른 음가를 나타내고 있다. 2) 한편 모음 /애, 아, 우, 어/는 모음의 길이에 따라 다만 두 종류의 음가 변동이 나타날 뿐이며 강세의 유무에 따른 음가 차이는 드러나지 않았다. 3) 이 현복의 주장대로 모음 /에/와 /애/mss 음가의 차이가 크지 않으므로 음운 대립이 무디어질 수 있음을 이번 실험 결과로 확인 하였다. 특히 강세가 없는 /에/는 강세가 있는 /애/와 소리값이 거의 같았다. 4) 이 현복은 표준말에서 /어/의 음가가 세대에 따라 다르며, 안정된 세대의 말씨에서는 /어:/가 /어/에 비해 높고 중앙화한 소리값을 지닌다는 주장을 하였다. 그러나 이 실험 연구에서는 녹음한 이가 젊은 세대이어서 인지 그러한 현상이 나타나지 않았고, 다만 /어:/는 /어/보다 높이만이 높은 것으로 나타났다. 5) 이번 실험 연구에서 모음의 소리값이 장단과 강세에 따라 달라진다는 이 현복의 주장이 대체로 증명된 셈이나, 종합적이고 확고한 결론을 내리려면 좀 더 광범한 실험 연구가 필요하다고 본다. 특히 안정된 세대의 말씨를 직접 녹음하여 음향 음성학적으로 분석함이 필요하다.
PDF

Analysis of the First Formant Bandwidth and Vocal Vibration bu Kidney Ear Point Irritation (신장 이혈(耳穴) 자극에 따른 제 1 포먼트 대역폭 및 성대 진동 분석)

Seo, Kyoung-Won;Kang, Deok-Hyun;Bae, Jung-Su;Jang, Yong-Jo;Yean, Yong-Hem;Lim, Soon-Yong;Min, Ji-Seon;Kim, Bong-Hyun;Ka, Min-Kyoung;Cho, Dong-Uk
- Proceedings of the Korea Information Processing Society Conference
- /
- 2010.11a
- /
- pp.1430-1433
- /
- 2010
삶의 지표가 상승함에 따라 선진국에서는 건강에 대한 관심도가 높아져 질병이 발생되기 전에 조기에 진단하여 예방하는 건강 패턴이 행해지고 있는 실정이다. 이와 같은 예방 분야를 반영한 것이 대체의학이며 이침 요법은 부작용이 적은 방법으로 널리 사용되고 있다. 이침 요법은 교육과정을 거친 후 자가 진단을 통해 응급처지가 가능한 것으로 실생활에서 손쉽게 이용되고 있다. 따라서 본 논문에서는 신장에 해당하는 이(耳)혈 상응점을 자극하여 신장과 관련된 음성 요소의 변화를 측정하였다. 이를 위해 신장 이혈 상응점을 자극하기 전과 후의 음성을 수집하여 음성 분석 요소 중 제 1 Formant Bandwidth와 Jitter, Shimmer값을 적용하여 비교, 분석하였다. 결과적으로 신장 이혈 상응점 자극에 의해 성대 진동의 변화율이 낮아져 발음의 정확성을 나타내는 결론을 도출하였다.
https://doi.org/10.3745/PKIPS.y2010m11a.1430 인용 PDF

A Study on the Correlation Between Sasang Constitution and Sound Characteristics Used Harmonics and Formant Bandwidth (Harmonics(배음)와 Formant Bandwidth(포먼트 폭)를 이용한 음성특성(音聲特性)과 사상체질간(四象體質間)의 상관성(相關性) 연구(硏究))

Park, Sung-Jin;Kim, Dal-Rae
- Journal of Sasang Constitutional Medicine
- /
- v.16 no.1
- /
- pp.61-73
- /
- 2004
This study was prepared to investigate the correlation between Sasang constitutional groups and voice characteristics using voice analysis system(in this study, CSL). I focused on the voice characteristics in terms of harmonics, Formant frequency and Formant Bandwidth. The subjects were 71 males. I classified them into three groups, that is Soeumin group, Soyangin group and Taeumin group. The classification method of Constitution used two ways, QSCCII(Questionnarie for the Sasang Constitution Classification II) and Interview with a specialist in Sasang Constitution. So 71 people were categorized into 31 Soeumin(people), 18 Soyangin(people) and 22 Taeumin(people). Pitch is approximately similar to the fundamental frequency(F0) in voices. Shimmer in dB gives an evaluation of the period-to-period variability of the peak-to-peak amplitude within the analyzed voice sample. FFT(Fast Fourier Transform) method in CSL can display sampled voices into harmonics. H1 is the first peak and h2 is the second peak in the harmonics. The amplitude difference of h1 and h2(h1-h2) can be explained as the speaker's phonation type, And Formant frequency and bandwidth can be explained as the speaker's vocal tract. So I checked the harmonics and Formant frequency and Bandwidth as the voice parameters. First I have captured /e/ voices from all subjects using microphone. And then I analyzed /e/ voices with CSL. Power Spectrum and Formant History is the menu in the CSL which can display harmonics and Formant frequency and bandwidth. The results about the correlation between Sasang Constitutional Groups and voice parameters are as follows; 1. There is no significant amplitude difference of harmonics(h1-h2) among three groups. 2. There is the significant difference between Soeumin Group and Soyangin Group in Formant Frequency 1 and Formant Bandwidth 1(p<0.05). Any other parameters have no significance. I assume that Soyangin Group has clearer and brighter voice than Soeumin Group according to the Formant Bandwidth difference. And I think its result has coincidence with the context of "Dongyi Suse Bowon" and "Sasangimhejinam".
PDF

A Signal Processing Technique for Predictive Fault Detection based on Vibration Data (진동 데이터 기반 설비고장예지를 위한 신호처리기법)

Song, Ye Won;Lee, Hong Seong;Park, Hoonseok;Kim, Young Jin;Jung, Jae-Yoon
- The Journal of Society for e-Business Studies
- /
- v.23 no.2
- /
- pp.111-121
- /
- 2018
Many problems in rotating machinery such as aircraft engines, wind turbines and motors are caused by bearing defects. The abnormalities of the bearing can be detected by analyzing signal data such as vibration or noise, proper pre-processing through a few signal processing techniques is required to analyze their frequencies. In this paper, we introduce the condition monitoring method for diagnosing the failure of the rotating machines by analyzing the vibration signal of the bearing. From the collected signal data, the normal states are trained, and then normal or abnormal state data are classified based on the trained normal state. For preprocessing, a Hamming window is applied to eliminate leakage generated in this process, and the cepstrum analysis is performed to obtain the original signal of the signal data, called the formant. From the vibration data of the IMS bearing dataset, we have extracted 6 statistic indicators using the cepstral coefficients and showed that the application of the Mahalanobis distance classifier can monitor the bearing status and detect the failure in advance.
https://doi.org/10.7838/jsebs.2018.23.2.111 인용 PDF KSCI

Effects of Butorphanol on Behavior after Intestinal Anastomosis in Dogs (Butorphanol의 투여가 장문합술 후 개의 행동에 미치는 영향)

Koo Ja-min;Lee Hee-chun;Chang Hong-hee;Seong Yong-jeung;Lee Hyo-jong;Yeon Seong-chan
- Journal of Veterinary Clinics
- /
- v.22 no.1
- /
- pp.6-15
- /
- 2005
This study was performed to investigate non-invasive behavioral pain assessment of dogs after surgery, and the analgesic effects of butorphanol after intestinal anastomosis in dogs. In this study, five dogs in the Control Group were anesthetized, but did not undergo surgery. Five dogs in the Analgesic Group were undergone intestinal anastomosis and treated with butorphanol. Five dogs in the Non-analgesic Group were also undergone intestinal anastomosis without analgesic treatment. The dogs in the Analgesic Group received butorphanol (0.4 mg/kg, IM) before and immediately after operation, while dogs in Control and Non-analgesic Groups received isovolumetric doses of sterile saline. The behavior of dogs were videotaped for 400 mins after anesthesia, during which time a researcher interacted with the dog once per each 80 mins. At each interaction, the researcher recorded behavioral pain score, using University of Melbourne Pain Scale. Interactive and non-interactive behaviors were observed and quantitated by a single observer using focal continuous sampling method. Vocalizations were obtained during 400 mins after anesthesia, and duration of call, intensity, pitch, 1-4 Formant were analyzed. Surgery affected an increasing of pain score. During interactions with researcher, greeting behaviors were decreased after surgery. Differences between Analgesic group given analgesic or that given a placebo drug were readily understood using quantitative behavioral measurements and vocalization. Significant difference between Analgesic group given butorphanol or that the given placebo drug was apparent(p< 0.05).
PDF KSCI

Search Result 96, Processing Time 0.035 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)