Search | Korea Science

A Comparative Study of the Speech Signal Parameters for the Consonants of Pyongyang and Seoul Dialects - Focused on "ㅅ/ㅆ" (평양 지역어와 서울 지역어의 자음에 대한 음성신호 파라미터들의 비교 연구 - "ㅅ/ ㅆ"을 중심으로)

So, Shin-Ae;Lee, Kang-Hee;You, Kwang-Bock;Lim, Ha-Young
- Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology
- /
- v.8 no.6
- /
- pp.927-937
- /
- 2018
In this paper the comparative study of the consonants of Pyongyang and Seoul dialects of Korean is performed from the perspective of the signal processing which can be regarded as the basis of engineering applications. Until today, the most of speech signal studies were primarily focused on the vowels which are playing important role in the language evolution. In any language, however, the number of consonants is greater than the number of vowels. Therefore, the research of consonants is also important. In this paper, with the vowel study of the Pyongyang dialect, which was conducted by phonological research and experimental phonetic methods, the consonant studies are processed based on an engineering operation. The alveolar consonant, which has demonstrated many differences in the phonetic value between Pyongyang and Seoul dialects, was used as the experimental data. The major parameters of the speech signal analysis - formant frequency, pitch, spectrogram - are measured. The phonetic values between the two dialects were compared with respect to /시/ and /씨/ of Korean language. This study can be used as the basis for the voice recognition and the voice synthesis in the future.
https://doi.org/10.21742/AJMAHS.2018.06.59 인용

Automatic Vowel Onset Point Detection Based on Auditory Frequency Response (청각 주파수 응답에 기반한 자동 모음 개시 지점 탐지)

Zang, Xian;Kim, Hag-Tae;Chong, Kil-To
- Journal of the Korea Academia-Industrial cooperation Society
- /
- v.13 no.1
- /
- pp.333-342
- /
- 2012
This paper presents a vowel onset point (VOP) detection method based on the human auditory system. This method maps the "perceptual" frequency scale, i.e. Mel scale onto a linear acoustic frequency, and then establishes a series of Triangular Mel-weighted Filter Bank simulate the function of band pass filtering in human ear. This nonlinear critical-band filter bank helps greatly reduce the data dimensionality, and eliminate the effect of harmonic waves to make the formants more prominent in the nonlinear spaced Mel spectrum. The sum of mel spectrum peaks energy is extracted as feature for each frame, and the instinct at which the energy amplitude starts rising sharply is detected as VOP, by convolving with Gabor window. For the single-word database which contains 12 vowels articulated with different kinds of consonants, the experimental results showed a good average detection rate of 72.73%, higher than other vowel detection methods based on short-time energy and zero-crossing rate.
https://doi.org/10.5762/KAIS.2012.13.1.333 인용 PDF KSCI

Formant frequency changes of female voice /a/, /i/, /u/ in real ear (실이에서 여자 음성 /ㅏ/, /ㅣ/, /ㅜ/의 포먼트 주파수 변화)

Heo, Seungdeok;Kang, Huira
- Phonetics and Speech Sciences
- /
- v.9 no.1
- /
- pp.49-53
- /
- 2017
Formant frequencies depend on the position of tongue, the shape of lips, and larynx. In the auditory system, the external ear canal is an open-end resonator, which can modify the voice characteristics. This study investigates the effect of the real ear on formant frequencies. Fifteen subjects ranging from 22 to 30 years of age participated in the study. This study employed three corner vowels: the low central vowel /a/, the high front vowel /i/, and the high back vowel /u/. For this study, the voice of a well-educated undergraduate who majored in speech-language pathology, was recorded with a high performance condenser microphone placed in the upper pinna and in the ear canal. Paired t-test showed that there were significant difference in the formant frequencies of F1, F2, F3, and F4 between the free field and the real ear. For /a/, all formant frequencies decreased significantly in the real ear. For /i/, F2 increased and F3 and F4 decreased. For /u/, F1 and F2 increased, but F3 and F4 decreased. It seems that these voice modifications in the real ear contribute to interpreting voice quality and understanding speech, timbre, and individual characteristics, which are influenced by the shape of the outer ear and external ear canal in such a way that formant frequencies become centralized in the vowel space.
https://doi.org/10.13064/KSSS.2017.9.1.049 인용 PDF KSCI

A Study on the Acoustic Characteristics Parameter of Resonance Cavity and Phonation in Liver Diseases (간 질환이 공명강과 발성에 미치는 음성분석학적 특징 요소 연구)

Lim, Soon-Yong;Lim, Sung-Su;Youn, Yong-Heum;Min, Ji-Sun;Song, Han-Sol;Kim, Bong-Hyun;Ka, Min-Kyoung;Cho, Dong-Uk
- Proceedings of the Korea Information Processing Society Conference
- /
- 2011.04a
- /
- pp.1093-1096
- /
- 2011
현대 의료 분야는 질병의 진단과 치료뿐만 아니라 질병의 예방 및 건강증진을 위한 관리, 유지의 역할도 중요하게 대두되고 있다. 즉, 질병의 조기 발견과 진단으로 예방 및 관리를 생활화하고 건강수준을 높이는 방향을 제시하는 등 건강증진을 유도하는 계기를 증대시키고 있다. 따라서 본 논문에서는 간질환이 음성에 미치는 영향을 연구하기 위해 간 질환자를 대상으로 공명강과 발성의 변화를 측정하는 실험을 수행하였다. 이를 위해 간 질환자를 피실험자 집단으로 구성하여 간질환으로 인해 입원했을 때와 치료 후에 퇴원했을 때의 음성을 각각 수집하여 음성 분석 요소 중 제3포먼트 주파수 대역폭과 무성음 추출 패턴수를 측정하여 간 질환으로 인해 공명강과 발성에 미치는 영향을 분석하는 연구를 수행하였다.
https://doi.org/10.3745/PKIPS.y2011m04a.1093 인용 PDF

A Lingual Sound Analysis based on Oriental Medicine Auscultation for Heart Diseases Diagnosis (심장(心臟) 질환(疾患) 진단(診斷)을 위한 한의학적 청진(聽診) 기반의 설음(舌音) 분석)

Kim, Bong-Hyun;Cho, Dong-Uk;Her, Sung-Ho
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.34 no.8B
- /
- pp.830-838
- /
- 2009
Oriental medicine lacks diagnosis data in fixed quantity possible to express visually to patients by depending on clinician's intuition than Western medicine that continues to development by various diagnosis devices. For that, this paper intends to examine relation between heart and voice signal regarded as center organ and source of life and mind in order to implement objectification through the visualization of oriental diagnosis method above all. According to because the heart is related to the tongue among five organs, by thinking with sounds, we would design the way of identifying existence of heart diseases focused on the fact that lingual sound pronunciation of heart patient is inexact. For this, we achieved a comparison, analysis of statistical bandwidth and morphological modeling of the second formants frequency about a lingual sound for their voice constituted subject group of heart diseases and normal people. Finally, we analyzed interrelationship to the result of experiment by designed method.
PDF KSCI

A Signal Processing Technique for Predictive Fault Detection based on Vibration Data (진동 데이터 기반 설비고장예지를 위한 신호처리기법)

Song, Ye Won;Lee, Hong Seong;Park, Hoonseok;Kim, Young Jin;Jung, Jae-Yoon
- The Journal of Society for e-Business Studies
- /
- v.23 no.2
- /
- pp.111-121
- /
- 2018
Many problems in rotating machinery such as aircraft engines, wind turbines and motors are caused by bearing defects. The abnormalities of the bearing can be detected by analyzing signal data such as vibration or noise, proper pre-processing through a few signal processing techniques is required to analyze their frequencies. In this paper, we introduce the condition monitoring method for diagnosing the failure of the rotating machines by analyzing the vibration signal of the bearing. From the collected signal data, the normal states are trained, and then normal or abnormal state data are classified based on the trained normal state. For preprocessing, a Hamming window is applied to eliminate leakage generated in this process, and the cepstrum analysis is performed to obtain the original signal of the signal data, called the formant. From the vibration data of the IMS bearing dataset, we have extracted 6 statistic indicators using the cepstral coefficients and showed that the application of the Mahalanobis distance classifier can monitor the bearing status and detect the failure in advance.
https://doi.org/10.7838/jsebs.2018.23.2.111 인용 PDF KSCI

Effective Feature Vector for Isolated-Word Recognizer using Vocal Cord Signal (성대신호 기반의 명령어인식기를 위한 특징벡터 연구)

Jung, Young-Giu;Han, Mun-Sung;Lee, Sang-Jo
- Journal of KIISE:Software and Applications
- /
- v.34 no.3
- /
- pp.226-234
- /
- 2007
In this paper, we develop a speech recognition system using a throat microphone. The use of this kind of microphone minimizes the impact of environmental noise. However, because of the absence of high frequencies and the partially loss of formant frequencies, previous systems developed with those devices have shown a lower recognition rate than systems which use standard microphone signals. This problem has led to researchers using throat microphone signals as supplementary data sources supporting standard microphone signals. In this paper, we present a high performance ASR system which we developed using only a throat microphone by taking advantage of Korean Phonological Feature Theory and a detailed throat signal analysis. Analyzing the spectrum and the result of FFT of the throat microphone signal, we find that the conventional MFCC feature vector that uses a critical pass filter does not characterize the throat microphone signals well. We also describe the conditions of the feature extraction algorithm which make it best suited for throat microphone signal analysis. The conditions involve (1) a sensitive band-pass filter and (2) use of feature vector which is suitable for voice/non-voice classification. We experimentally show that the ZCPA algorithm designed to meet these conditions improves the recognizer's performance by approximately 16%. And we find that an additional noise-canceling algorithm such as RAST A results in 2% more performance improvement.
PDF KSCI

A comparative study of the acoustic characteristics of the vowel /a/ between children with spastic and dyskinetic cerebral palsy (경직형과 불수의운동형 뇌성마비아동의 /아/ 모음 음향학적 비교)

Jeong, Pil Yeon;Sim, Hyun Sub
- Phonetics and Speech Sciences
- /
- v.12 no.1
- /
- pp.65-74
- /
- 2020
This study aims to compare the acoustic characteristics of vowel phonation in children with spastic and dyskinetic cerebral palsy (CP). Thirty-four children aged 4-12 years with CP participated in the study (spastic 26, dyskinetic 8). Voice samples for the acoustic analysis were extracted from a sustained vowel /a/. All acoustic measures were made using Praat. Group differences were compared by an independent t-test or Welch-Aspin test, if the equivalence assumption was not met. The results of this study are as follow. First, maximum phonation time(MPT) was significantly shorter for the dyskinetic CP than for the spastic CP. Second, shimmer percent was significantly increased in the dyskinetic CP than in the spastic CP. Lastly, there were no significant group differences in both the first formant and the second formant. These findings indicate that the dyskinetic CP has a poorer respiratory capacity and poorer laryngeal function than the spastic CP. On the other hand, both groups have a comparable ability to articulate the vowel /a/. The results of the present study help speech language pathologists identify the speech motor control ability of children with two types of CP (spastic and dyskinetic) and help to make an intervention plan associated with a specific type of CP.
https://doi.org/10.13064/KSSS.2020.12.1.065 인용 PDF KSCI

Influence of Sexual Desire Caused by Watching Phonography on Human Body (음란물 시청으로 야기된 성욕이 인체에 미치는 영향)

Kim, Bong Hyun;Cho, Dong Uk;Kim, Hee Dae;Lee, Bum Joo;Park, Young;Jeong, Yeon Man
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.42 no.4
- /
- pp.831-837
- /
- 2017
The development of various electronic media such as the Internet and smart phones, each kinds of media informations has been accompanied by the fact that various types of media information are provided from one media, and on the other hand, various dysfunctions including smart phone addiction are also caused by a very large social problem. Especially, one of the biggest dysfunctions is the social crime problem such as sex crime caused by increased sexual desire according to watch the phonography, and even if it is not a social crime, watching the phonography has influenced bad mental and physical on human body. In this paper, we try to analyze what kind of change occurs in the voice in order to investigate what kind of bad influence it has on the human body after watching the phonography. In other words, the voice in the human body is the place where the human body signal is most expressed with the face. Therefore, the purpose of this study is to investigate the effects on the organs of the human body by comparing the change of voice before and after watching phonography. Experimental results showed that the stress hormone was increased by the inability to resolve sexual desire after watching the phonography, which resulted in an increase in the bandwidth of the 3rd formant frequency.
https://doi.org/10.7840/kics.2017.42.4.831 인용 PDF KSCI

An Identification of the Healing Effect of Rain Sound According to the Gender and Personal - Adjusted Rain Sound Making (성별에 따른 빗소리의 힐링 효과 규명 및 개인 맞춤형 빗소리 제작)

Lee, Bum Joo;Cho, Dong Uk;Cho, Sang Hyun;Song, Young Bin;Jeong, Yeon Man
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.41 no.10
- /
- pp.1263-1269
- /
- 2016
Stress has become one of the largest health risk to shorten the life time of health. Accordingly, in order to increase the life time of health, stress relief can be very important. Many social expenses and economic commitment have been inputted for this purpose, but their effectiveness compared to the current situation is not very high. In this paper, we carried out an identification work of rain sound which is similar to the white noise that can stabilize the body and mind of the person by analyzing the variations of 3rd formant frequency bandwidth. Also, for relieving stress, the sounds of rain that is easily accessible at a relatively among the sounds of nature instead of consuming a lot of money and time were selected for solving these problems. In addition, we identified the effectiveness of the stress relief about the sound of rain and research on whether there is a difference between men and women in their 20's or not was performed. Finally, we discussed the personal - adjusted rain making to maximize the effectiveness of stress relief.
https://doi.org/10.7840/kics.2016.41.10.1263 인용 PDF KSCI

Search Result 30, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)