Search | Korea Science

English Phoneme Recognition using Segmental-Feature HMM (분절 특징 HMM을 이용한 영어 음소 인식)

Yun, Young-Sun
- Journal of KIISE:Software and Applications
- /
- v.29 no.3
- /
- pp.167-179
- /
- 2002
In this paper, we propose a new acoustic model for characterizing segmental features and an algorithm based upon a general framework of hidden Markov models (HMMs) in order to compensate the weakness of HMM assumptions. The segmental features are represented as a trajectory of observed vector sequences by a polynomial regression function because the single frame feature cannot represent the temporal dynamics of speech signals effectively. To apply the segmental features to pattern classification, we adopted segmental HMM(SHMM) which is known as the effective method to represent the trend of speech signals. SHMM separates observation probability of the given state into extra- and intra-segmental variations that show the long-term and short-term variabilities, respectively. To consider the segmental characteristics in acoustic model, we present segmental-feature HMM(SFHMM) by modifying the SHMM. The SFHMM therefore represents the external- and internal-variation as the observation probability of the trajectory in a given state and trajectory estimation error for the given segment, respectively. We conducted several experiments on the TIMIT database to establish the effectiveness of the proposed method and the characteristics of the segmental features. From the experimental results, we conclude that the proposed method is valuable, if its number of parameters is greater than that of conventional HMM, in the flexible and informative feature representation and the performance improvement.
PDF KSCI

Remote Seabed Classification Based on the Characteristics of the Acoustic Response of Echo Sounder: Preliminary Result of the Suyoung Bay, Busan (측심기의 음향반사 특성을 이용한 해저퇴적물의 원격분류: 부산 수영만의 예비결과)

Kim Gil Young;Kim Dae Choul;Kim Yang Eun;Lee Kwang Hoon;Park Soo Chul;Park Jong Won;Seo Young Kyo
- Korean Journal of Fisheries and Aquatic Sciences
- /
- v.35 no.3
- /
- pp.273-281
- /
- 2002
Determination of sediment type is generally based on ground truthing. This method, however, provides information only for the limited sites. Recent developments of remote classification of seafloor sediments made it possible to obtain continuous profiles of sediment types. QTC View system, which is an acoustic instrument providing digital real-time seabed classification, was used to classify seafloor sediment types in the Suyoung Bay, Pusan. QTC View was connected to 50 kHz echo sounder, All parameters of QTC View and echo sounder are uniformly kept during survey. By ground truthing, the sediments are classified into seven types, such as slightly gravelly sand, slightly gravelly sandy mud, gravelly muddy sand, clayey sand, sandy mud, slightly gravelly muddy sand, and rocky bottom. By the first remote classification using QTC View, four sediment types are clearly identified, such as slightly gravelly sand, gravelly mud, slightly gravelly muddy sand, and rocky bottom. These are similar to the result of the second survey. Also the result of remote classification matches well with that of ground truthing, but for sediment type determined by minor component. Therefore, QTC View can effectively be used for remote classification of seafloor sediments.
https://doi.org/10.5657/kfas.2002.35.3.273 인용 PDF KSCI

A study on A-pillar & wiper wind noise estimation using response surface methodology at design stage (반응면 기법을 이용한 A필라/와이퍼 풍절음 예측 연구)

Rim, Sungnam;Shin, Seongryong;Shin, Hyunsu
- The Journal of the Acoustical Society of Korea
- /
- v.37 no.5
- /
- pp.292-299
- /
- 2018
The vehicle exterior design is the main parameter of aerodynamic wind noise, but the modification of it is nearly impossible at a proto-type stage. Therefore, it is very important to verify exterior design and estimate the correct wind noise level at the early vehicle design stages. The numerical simulations of aerodynamic wind noises around A-pillar and wiper were developed for specific vehicle exterior designs, but could not be directly used for the discussions with designers because these need complex modeling and simulation process. This study proposes new approach to A-pillar and wiper wind noise estimation at design stage using response surface methodology of modeFRONTIER, of which database is composed of PowerFLOW simulation, PowerCLAY modeling, SEA-Baced (Statistical Energy Analysis-Based) interior noise simulation, and turbulent acoustic power simulation. New design parameters are defined and their contributions are analyzed. A state-of-the-art, easy and reliable CAT (Computer Aided Test) tool for A-pillar and wiper wind noise are acquired from this study, which shows high usefulness in car development.
https://doi.org/10.7776/ASK.2018.37.5.292 인용 PDF KSCI

Assessment of the Damage in High Performance Fiber-Reinforced Cement Composite under Compressive Loading Using Acoustic Emission (AE기법에 의한 압축력을 받는 고인성 섬유보강 시멘트 복합체의 손상 평가)

Kim, Sun-Woo;Yun, Hyun-Do
- Journal of the Korea Concrete Institute
- /
- v.21 no.5
- /
- pp.589-597
- /
- 2009
High Performance Fiber-reinforced Cement Composite (HPFRCC) shows the multiple crack and damage tolerance capacity due to the interfacial bonding of the fibers to the cement matrix. For practical application, it is needed to investigate the fractural behavior of HPFRCC and understand the micro-mechanism of cement matrix with reinforcing fiber. This study is devoted to the investigation of the AE signals in HPFRCC under monotonic and cyclic uniaxial compressive loading, and total four series were tested. The major experimental parameters include the type and volume fraction of fiber (PE, PVA, SC), the hybrid type and loading pattern. The test results showed that the damage progress by compressive behavior of the HPFRCC is a characteristic for the hybrid fiber type and volume fraction. It is found from acoustic emission (AE) parameter value, that the second and third compressive load cycles resulted in successive decrease of the amplitude as compared with the first compressive load cycle. Also, the AE Kaiser effect existed in HPFRCC specimens up to 80% of its ultimate strength. These observations suggested that the AE Kaiser effect has good potential to be used as a new tool to monitor the loading history of HPFRCC.
https://doi.org/10.4334/JKCI.2009.21.5.589 인용 PDF KSCI

Evaluation of the Stress Corrosion Cracking Behavior of Inconel G00 Alloy by Acoustic Emission (음향 방출에 의한 인코넬 600 합금의 응력 부식 균열 거동 평가)

Sung, Key-Yong;Kim, In-Sup;Yoon, Young-Ku
- Journal of the Korean Society for Nondestructive Testing
- /
- v.16 no.3
- /
- pp.174-183
- /
- 1996
Acoustic emission(AE) response during stress corrosion cracking(SCC) of Inconel 600 alloy has been monitored to study the AE detectability of crack generation and growth by comparing the crack behavior with AE parameters processed, and to evaluate the applicability as a nondestructive evaluation(AE) by measuring the minimum crack size detectable with AE. Variously heat-treated specimens were tensioned by constant extension rate test(CERT) in various extension rate to give rise to the different SCC behavior of specimens. The AE amplitude level generated from intergranular stress-corrosion cracking(IGSCC) is higher than those from ductile fracture and mechanical deformation, which means the AE amplitude can be a significant parameter for distinguishing the An source. AE can also provide the effective means to identify the transition from the small crack initiation and formation of dominant cracks to the dominant crack growth. Minimum crack size detectable with AE is supposed to be approximately 200 to $400{\mu}m$ in length and below $100{\mu}m$ in depth. The test results show that AE technique has a capability for detecting the early stage of IGSCC growth and the potential for practical application as a NDE.
PDF

Differentiation of Vocal Cyst and Polyp by High-Piched Phonation Characteristics (성대낭종과 성대폴립 간의 고음발성 양상의 차이)

Lee, Jong-Ik;Jeong, Go-Eun;Kim, Seong-Tae;Kim, Sang-Yeon;Nam, Soon-Yuhl;Kim, Sang-Yoon;Roh, Jong-Lyel;Choi, Seung-Ho
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
- /
- v.23 no.1
- /
- pp.48-51
- /
- 2012
Background and Objectives : Vocal fold cyst is generally treated by surgical resection, it has a difference with vocal fold polyp, treated by conservative management first. Decrease in mucosal waves is known as main diagnostic criteria of vocal fold cyst. Sometimes there is a difficulty for diffrential diagnosis between cyst and polyp only by endoscopic examination. The purpose of the study is to identify the objective features of vocal cyst and polyp on the basis of voice analysis for the proper differential diagnosis, especially at high pitched phonation. Materials and Method : The voice analysis was done in 15 focal fold cyst patients and 42 vocal fold polyp. Parameters of perceptual assessment, acoustic and aerodynamic measure, and voice range profile were compared between two groups. Results : Vocal fold cyst patients showed significantly reduced MPT by acoustic and aerodynamic analysis, narrowed frequency-range and low maximun frequency by voice range profile analysis compared with vocal fold polyp patient. Maximun frequency 381 Hz is established for cut off value, differential diagnosis between cyst and polyp (ROC analysis, sensitivity 60%, specificity 68%). Conclusion : Voice analysis is helpful for differential diagnosis between vocal fold cyst and polyp, especially there is a difficulty for distinguish cyst from polyp at clinical situation by endoscopic examination. The result of decreased maximum frequncy at vocal fold cyst supports incomplete high-pitched phonation and falsetto regester at vocal fold cyst patients due to decreased mucosal wave, compared with vocal fold polyp patients.
PDF

Compromised feature normalization method for deep neural network based speech recognition (심층신경망 기반의 음성인식을 위한 절충된 특징 정규화 방식)

Kim, Min Sik;Kim, Hyung Soon
- Phonetics and Speech Sciences
- /
- v.12 no.3
- /
- pp.65-71
- /
- 2020
Feature normalization is a method to reduce the effect of environmental mismatch between the training and test conditions through the normalization of statistical characteristics of acoustic feature parameters. It demonstrates excellent performance improvement in the traditional Gaussian mixture model-hidden Markov model (GMM-HMM)-based speech recognition system. However, in a deep neural network (DNN)-based speech recognition system, minimizing the effects of environmental mismatch does not necessarily lead to the best performance improvement. In this paper, we attribute the cause of this phenomenon to information loss due to excessive feature normalization. We investigate whether there is a feature normalization method that maximizes the speech recognition performance by properly reducing the impact of environmental mismatch, while preserving useful information for training acoustic models. To this end, we introduce the mean and exponentiated variance normalization (MEVN), which is a compromise between the mean normalization (MN) and the mean and variance normalization (MVN), and compare the performance of DNN-based speech recognition system in noisy and reverberant environments according to the degree of variance normalization. Experimental results reveal that a slight performance improvement is obtained with the MEVN over the MN and the MVN, depending on the degree of variance normalization.
https://doi.org/10.13064/KSSS.2020.12.3.065 인용 PDF KSCI

Development of Battery-free SAW Integrated Microsensor for Real Time Simultaneous Measurement of Humidity and $CO_2$ component (습도와 $CO_2$ 농도의 실시간 동시감지를 위한 무전원 SAW 기반 집적 센서 개발)

Lim, Chun-Bae;Lee, Kee-Keun;Wang, Wen;Yang, Sang-Sik
- Journal of the Microelectronics and Packaging Society
- /
- v.16 no.1
- /
- pp.13-19
- /
- 2009
A 440MHz wireless and passive surface acoustic wave (SAW) based chemical sensor was developed on a $41^{\circ}YX\;LiNbO_3$ piezoelectric substrate for simultaneous measurement of $CO_2$ gas and relative humidity (RH) using a reflective delay line pattern as the sensor element. The reflective delay line is composed of an interdigital transducer (IDT) and several shorted grating reflectors. A Teflon AF 2400 and a hydrophilic $SiO_2$ layer were used as $CO_2$ and water vapor sensitive films. The coupling of mode (COM) modeling was conducted to determine optimal device parameters prior to fabrication. According to simulation results, the device was fabricated and then wirelessly measured using the network analyzer. The measured reflective coefficient $S_{11}$ in the time domain showed high signal/noise (S/N) ratio, small signal attenuation, and few spurious peaks. In the $CO_2$ and humidity testing, high sensitivity ($2^{\circ}/ppm$ for $CO_2$ detection and $7.45^{\circ}/%$RH for humidity sensing), good linearity and repeatability were observed in the $CO_2$ concentration ranges of $75{\sim}375ppm$ and humidity levels of $20{\sim}80%$RH. Temperature and humidity compensations were also investigated during the sensitivity evaluation process.
PDF

Perception of Japanese word-initial stops by native listeners (모어청자에 의한 일본어 어두 폐쇄음의 지각)

Byun, Hi-Gyung
- Phonetics and Speech Sciences
- /
- v.13 no.3
- /
- pp.53-64
- /
- 2021
It is known that the voicing contrast for Japanese word-initial stops is primarily realized as differences in the voice onset time (VOT). However, recent studies have reported that voiced stops are more often produced with a positive VOT than with a negative VOT among the younger generation nationwide. It is also known that post-stop F0 is associated with the stop contrast, but the degree of F0 use differs from region to region. This study explores whether the difference in post-stop F0 functions as a perceptual cue to the stop contrast along with VOT. Fifty-five college students who are native listeners from four different regions participated in two or three perception tests. The results show that VOT is a primary cue to the voiced-voiceless distinction of word-initial stops, but that the effect of post-stop F0 on the stop contrast is marginal. The post-stop F0 is involved in perception only when VOT is ambiguous, such that a sound with high F0 is more often perceived as a voiceless stop, but not vice versa. The results of this study indicate that the acoustic parameters associated with the stop contrast are not the same in production and perception, and suggest that other factors such as context, which is not an acoustic characteristic, may also be involved in the stop contrast.
https://doi.org/10.13064/KSSS.2021.13.3.053 인용 PDF KSCI

Analysis of the Effect of Intralesional Steroid Injection on the Voice During Laryngeal Microsurgery (후두 미세수술 중 병변 내 스테로이드 주입이 음성에 미치는 효과 분석)

Jae Seon, Park;Hyun Seok, Kang;In Buhm, Lee;Sung Min, Jin;Sang Hyuk, Lee
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
- /
- v.33 no.3
- /
- pp.166-171
- /
- 2022
Background and Objectives Vocal fold (VF) scar is known to be the most common cause of dysphonia after laryngeal microsurgery (LMS). Steroids reduce postoperative scar formation by inhibiting inflammation and collagen deposition. However, the clinical evidence of whether steroids are helpful in reducing VF scar formation after LMS is still lacking. The purpose of this study is to determine whether intralesional VF steroid injection after LMS helps to reduce postoperative scar formation and voice quality. Materials and Method This study was conducted on 80 patients who underwent LMS for VF polyp, Reinke's edema, and leukoplakia. Among them, 40 patients who underwent VF steroid injection after LMS were set as the injection group, and patients who had similar sex, age, and lesion size and who underwent LMS alone were set as the control group. In each group, stroboscopy, multi-dimensional voice program, Aerophone II, and voice handicap index (VHI) were performed before and 1 month after surgery, and the results were statistically analyzed. Results There were no statistically significant differences in the distribution of sex, age, symptom duration, occupation and smoking status between each group. Both groups consisted of VF polyp (n=21), Reinke's edema (n=11), and leukoplakia (n=9). On stroboscopy, the lesion disappeared after surgery, and the amplitude and mucosal wave were symmetrical on both sides of the VFs in all patients. Acoustic parameters and VHI significantly improved after surgery in all patients. However, there was no significant difference between the injection and control group in most of the results. Conclusion There was no significant difference in the results of stroboscopy, acoustic, aerodynamic, and subjective evaluation before and after surgery in the injection group and the control group.
https://doi.org/10.22469/jkslp.2022.33.3.166 인용 PDF KSCI

Search Result 846, Processing Time 0.023 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)