Search | Korea Science

CASA-based Front-end Using Two-channel Speech for the Performance Improvement of Speech Recognition in Noisy Environments (잡음환경에서의 음성인식 성능 향상을 위한 이중채널 음성의 CASA 기반 전처리 방법)

Park, Ji-Hun;Yoon, Jae-Sam;Kim, Hong-Kook
- Proceedings of the IEEK Conference / 2007.07a / pp.289-290 / 2007
To improve the performance of a speech recognition system in the presence of noise, we propose a noise-robust front-end that uses two-channel speech signals and separates speech from noise based on computational auditory scene analysis (CASA). The main cues for the separation are the interaural time difference (ITD) and the interaural level difference (ILD) between the two channel signals. From the separated speech components, 39 cepstral coefficients are then extracted. Speech recognition experiments show that the proposed front-end outperforms the ETSI front-end with single-channel speech.
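
The two cues named in this abstract can be illustrated with a minimal sketch. It is not the authors' front-end: the function name `estimate_itd_ild`, the frame-wise processing, and the `max_lag` search range are assumptions made for illustration. ITD is taken as the lag that maximizes the cross-correlation of the two channels, and ILD as their energy ratio in decibels.

```python
import math

def estimate_itd_ild(left, right, sample_rate, max_lag):
    """Return (itd_seconds, ild_db) for two equal-length channel frames.

    ITD: the lag (converted to seconds) that maximizes the cross-correlation.
    ILD: the energy ratio of the left to the right channel, in decibels.
    Hypothetical helper for illustration; not taken from the paper.
    """
    n = len(left)
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        # Correlate left[i] with right[i + lag] over the overlapping samples.
        score = sum(left[i] * right[i + lag]
                    for i in range(max(0, -lag), min(n, n - lag)))
        if score > best_score:
            best_lag, best_score = lag, score
    eps = 1e-12  # guards against log(0) and division by zero on silent frames
    energy_left = sum(x * x for x in left) + eps
    energy_right = sum(x * x for x in right) + eps
    ild_db = 10.0 * math.log10(energy_left / energy_right)
    return best_lag / sample_rate, ild_db
```

With this sign convention, a positive ITD means the right channel lags the left one. A CASA-style system would typically compute such cues per frequency channel and per frame before deciding which time-frequency units belong to the target speech.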

User Needs for Haptic Communication of VR Fashion Product Shopping

Kim, Jongsun;Ha, Jisoo
- Fashion & Textile Research Journal / v.21 no.4 / pp.401-411 / 2019
Non-contact judgment and evaluation of products are increasingly needed amid rapid environmental change in fashion, which creates an urgent need for services that allow users to judge and experience the tactile sense of a fashion product without actual contact. Technological development is required to provide users with synesthetic experiences that integrate the visual, the tactile, and the auditory. There is also a need for research on increasing immersion so that users receive ICT-related experiences communicated through fashion images. This study analyzed Korean users' demands for haptic communication technology in immersive VR fashion product shopping. Accordingly, it defined haptic communication through literature research, investigated immersion in the VR environment, and conducted in-depth interviews on haptic communication applicable to VR shopping. Findings show that hedonic reactions driven by fantasy, emotion, and fun are an important motive in selecting VR shopping. Based on the offline store shopping experience, VR fashion product shopping was divided into four steps: move to store, search in store, search of product, and purchase. The study defined haptic communication for each step and analyzed the types of haptic feedback to be implemented. The results provide basic data for developing haptic communication technology that can enhance the sense of presence and immersion, and can help lay the groundwork for pilot studies on the convergence of the virtual and the real.
https://doi.org/10.5805/SFTI.2019.21.4.401

Communication of Young Black-Tailed Gulls, Larus crassirostris, in response to Parents Behavior

Chung, Hoon;Cheong, Seok-Wan;Park, Shi-Ryong
- Animal cells and systems / v.8 no.4 / pp.295-300 / 2004
In the breeding colony of the black-tailed gull, nests of conspecific neighbors are located very close together, so chicks are permanently exposed to sound and visual stimuli produced by adult conspecifics approaching their nests. The chicks may therefore need to learn to respond appropriately to their parents' approach. In this study we experimentally manipulated the sensory stimulation that parents potentially provide to their offspring. Chicks incubated in the laboratory were exposed to a mew call of a conspecific adult. They were then tested in three situations differing in sensory stimulation: 1) visual stimulation only, 2) auditory stimulation only, and 3) simultaneous visual and auditory stimulation. We observed the occurrence of different responses by the chicks, which were categorized into three behaviors (begging call response, chirirah call, and pecking behavior). We also investigated the intensity of the chicks' calls in response to the different stimulations and the degree of response with age. Chicks exposed to auditory stimulation only made significantly more chirirah calls. The intensities (dB) of the mew call and the chicks' chirirah call were directly correlated. On the other hand, when chicks only saw the stuffed adult gull, they responded significantly more with a begging call and pecking behavior. Under costimulation, the chicks responded with a begging call and pecking, but less frequently than under visual stimulation only. The results suggest that young black-tailed gulls use call repertoires to respond properly to their parents' behavior. Such results suggest an evolutionary process for increasing their survival rate in a group breeding site.

The Effect of Parent Involvement Auditory Training Program on Communication Ability of Children with Hearing Impairments (부모 듣기 지도 프로그램이 청각장애아동의 언어 능력과 의사소통 행동에 미치는 영향)

CHAE, Jung-Hee;HUH, Myung-Jin;PARK, Chan-Hee
- Journal of Fisheries and Marine Sciences Education / v.28 no.3 / pp.818-830 / 2016
The purpose of this study is to examine the effects on the communication skills of hearing-impaired children of a parents' listening guidance program, which helps parents understand their hearing-impaired children and learn how to guide listening at home. The research subjects were 3 hearing-impaired children with no accompanying intellectual, emotional, or behavioral problems, and listening guidance was provided to their parents for 3 months through the program. Changes in the children's communication skills were observed by comparing results before and after the education. First, the receptive language skills of the hearing-impaired children improved after the parents' listening guidance. Second, their expressive language skills also improved after the guidance. Third, in the children's communication behavior, phonation and speech production increased together with gesture after the guidance. In conclusion, the parents' listening guidance program is considered to have a positive influence on the communication behavior of hearing-impaired children.
https://doi.org/10.13000/JFMSE.2016.28.3.818

Speech Enhancement in Noisy Speech Using Neural Network (신경회로망을 사용한 잡음이 중첩된 음성 강조)

Choi, Jae-Seung
- Journal of the Institute of Electronics Engineers of Korea SP / v.42 no.5 s.305 / pp.165-172 / 2005
For speech recognition in a noisy environment, it is necessary to construct a system that reduces the noise and enhances the speech. For speech enhancement, it is effective to imitate the human auditory system, which has an excellent spectrum analysis mechanism. Accordingly, this paper proposes an adaptive method based on the auditory mechanism called lateral inhibition. The method first estimates the noise intensity using a neural network, then adaptively adjusts both the lateral inhibition coefficients and the amplitude adjustment coefficient according to the noise intensity of each input frame. Spectral distortion measurements confirm that the proposed method is effective for speech degraded by white noise, colored noise, and road noise.

Isolated-Word Speech Recognition in Telephone Environment Using Perceptual Auditory Characteristic (인지적 청각 특성을 이용한 고립 단어 전화 음성 인식)

Choi, Hyung-Ki;Park, Ki-Young;Kim, Chong-Kyo
- Journal of the Institute of Electronics Engineers of Korea TE / v.39 no.2 / pp.60-65 / 2002
In this paper, we propose the GFCC (gammatone filter frequency cepstrum coefficient) parameter, which is based on auditory characteristics, to achieve a better speech recognition rate. Speech recognition experiments are performed on isolated words acquired from the telephone network. To compare the GFCC parameter with other parameters, recognition experiments are also carried out using the MFCC and LPCC parameters. In addition, each parameter is tested with and without CMS (cepstral mean subtraction), which compensates for channel distortion in the telephone network. The experimental results show that the recognition rate using the GFCC parameter is better than that of the other parameters.
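
Cepstral mean subtraction, the channel compensation step mentioned above, can be sketched in a few lines. The function name and the list-of-frames input layout are assumptions for illustration; the paper's own implementation is not described in the abstract. The idea is that a stationary channel is convolutional in time and therefore roughly additive in the cepstral domain, so subtracting the per-coefficient mean over an utterance removes it.

```python
def cepstral_mean_subtraction(frames):
    """Subtract the per-coefficient mean taken over all frames of one utterance.

    frames: list of equal-length cepstral vectors (e.g. GFCC, MFCC, or LPCC).
    Returns a new list of mean-normalized vectors; the input is left unchanged.
    """
    if not frames:
        return []
    num_frames = len(frames)
    dim = len(frames[0])
    # Mean of each cepstral coefficient across time.
    means = [sum(frame[k] for frame in frames) / num_frames for k in range(dim)]
    return [[frame[k] - means[k] for k in range(dim)] for frame in frames]
```

Because a constant cepstral offset shifts the mean by the same amount, two recordings of the same utterance that differ only by such an offset yield identical features after this step.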

Analysis of Nonlinear Time Series by Bispectrum Methods and its Applications (바이스펙트럼에 의한 비선형 시계열 신호 해석과 그 응용)

Kim, Eung-Su;Lee, Yu-Jeong
- The Transactions of the Korea Information Processing Society / v.6 no.5 / pp.1312-1322 / 1999
The world of linearity, which is regular, predictable, and independent of time sequence, makes up only a very small part of natural phenomena. In fact, the signals generated by the natural phenomena we encounter show only slight linearity. It is therefore very difficult to understand and analyze natural phenomena using only predictable, regular linear systems. For these reasons, research is being actively carried out on nonlinear signals, which were previously excluded from analysis because they were regarded as noise. The countless signals generated by a nonlinear system carry information about that system, and analyzing them to extract this information can be useful in many fields. Hence, in this paper we use a higher-order spectrum, specifically the bispectrum. We first demonstrate the validity of the approach by applying the bispectrum to the logistic map, a typical chaotic signal. We then apply it to actual EEG signals recorded under auditory stimuli, show that higher-order spectra are a very useful tool for analyzing nonlinear signals, and present the results of the EEG analysis.
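
As a rough illustration of the two ingredients named in this abstract, the sketch below generates a logistic-map sequence and computes a direct, segment-averaged bispectrum estimate B(f1, f2) = E[X(f1) X(f2) X*(f1 + f2)]. The function names, the naive DFT, and the non-overlapping segmentation are assumptions chosen for brevity; the abstract does not say which estimator the authors used.

```python
import cmath
import math

def logistic_map(r, x0, n):
    """Generate n samples of x[k+1] = r * x[k] * (1 - x[k]), starting at x0."""
    xs, x = [], x0
    for _ in range(n):
        xs.append(x)
        x = r * x * (1.0 - x)
    return xs

def dft(segment):
    """Naive O(N^2) discrete Fourier transform; adequate for short segments."""
    n = len(segment)
    return [sum(segment[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]

def bispectrum(signal, seg_len):
    """Average X[f1] * X[f2] * conj(X[f1 + f2]) over non-overlapping segments."""
    num_segs = len(signal) // seg_len
    if num_segs == 0:
        raise ValueError("signal is shorter than one segment")
    half = seg_len // 2  # keeps f1 + f2 below seg_len
    acc = [[0j] * half for _ in range(half)]
    for s in range(num_segs):
        seg = signal[s * seg_len:(s + 1) * seg_len]
        mean = sum(seg) / seg_len
        spec = dft([v - mean for v in seg])  # remove the DC offset first
        for f1 in range(half):
            for f2 in range(half):
                acc[f1][f2] += spec[f1] * spec[f2] * spec[f1 + f2].conjugate()
    return [[v / num_segs for v in row] for row in acc]
```

A call such as `bispectrum(logistic_map(4.0, 0.3, 512), 32)` applies the estimator to the map in its chaotic regime (r = 4). Large magnitudes at a bin pair (f1, f2) indicate quadratic phase coupling between the components at f1, f2, and f1 + f2, which a power spectrum alone cannot reveal.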

The Influence of YouTube "Mukbang" Content Characteristics on Viewers' Satisfaction and Word-of-Mouth Intentions

Jeong Sun LEE;Seunghyeon LEE;Seong Soo CHA
- The Journal of Industrial Distribution & Business / v.15 no.9 / pp.1-9 / 2024
Purpose: This study examines the impact of YouTube mukbang content characteristics on viewer satisfaction and word-of-mouth behavior. Drawing from theories in media psychology, consumer behavior, and communication studies, we investigate five key content characteristics: credibility, entertainment value, informativeness, visual appeal, and auditory quality. Research design, data and methodology: Using structural equation modeling with data from 206 mukbang viewers, we test hypothesized relationships between these characteristics, viewer satisfaction, and word-of-mouth behavior. Results: The results reveal that credibility and informativeness significantly and positively influence viewer satisfaction, while entertainment value, visual appeal, and auditory quality show no significant effect. Viewer satisfaction positively impacts word-of-mouth behavior. These findings challenge conventional assumptions about video content consumption and highlight the unique nature of mukbang viewing. Conclusions: The study contributes to digital content consumption literature by providing empirical evidence of factors influencing viewer engagement in the mukbang context. It offers practical insights for content creators, marketers, and platform developers, emphasizing the importance of informative and credible content in driving viewer satisfaction and promoting positive word-of-mouth. By extending established media theories to this emerging form of digital entertainment, our research paves the way for future studies. The study's limitations, including its cross-sectional nature and specific cultural context, suggest directions for future research.
https://doi.org/10.13106/jidb.2024.vol15.no9.1

A Research on the Characteristics of Virtual Reality Stores -Focused on Hyundai VR Store and eBay VR Department Store- (가상현실 점포의 특성에 관한 연구 -현대백화점 VR 스토어와 eBay VR 백화점 사례를 중심으로-)

Jang, Ju Yeun;Chun, Jaehoon
- Journal of the Korean Society of Clothing and Textiles / v.42 no.4 / pp.671-688 / 2018
This study investigates the characteristics of VR stores that have emerged as a new fashion communication medium. Two case studies, on the Hyundai and eBay VR department stores, were conducted along with a discussion of the function and meaning of the fashion VR store. The results showed that both stores provide novel shopping experiences; however, the two differed in production method and level of technology implementation. Functional aspects such as shopping efficiency and purchasing service were insufficient in both stores. Instead, the stores compensated by means of product rotation, a recommendation system, voice guidance, or linkage with an online shopping mall. In experiential aspects, both stores provided a strong sense of immersion. The Hyundai VR store enhanced immersion with high-resolution images of a real offline store; however, it lacked the ability to provide multisensory stimulation such as kinetic sense or auditory stimulation. The eBay VR Department store intensified the immersion experience by providing auditory stimulation as well as visual stimulation that enhanced the sense of speed and distance through the use of animation. However, the extent of experience was limited in terms of agency and transformation because of the low interactivity found in both store systems.
https://doi.org/10.5850/JKSCT.2018.42.4.671

Speech Segmentation using Weighted Cross-correlation in CASA System (계산적 청각 장면 분석 시스템에서 가중치 상호상관계수를 이용한 음성 분리)

Kim, JungHo;Kang, ChulHo
- Journal of the Institute of Electronics and Information Engineers / v.51 no.5 / pp.188-194 / 2014
The feature extraction mechanism of the CASA (Computational Auditory Scene Analysis) system uses time continuity and frequency channel similarity to compose a correlogram of auditory elements. In segmentation, a binary mask is composed using the cross-correlation function; units labeled 1 (speech) share the same periodicity and synchronization. A drawback is that when there is a delay between autocorrelation signals with the same periodicity, the unit is still classified as speech. In this paper, we propose an algorithm that uses weighted cross-correlation in segmentation to improve the discrimination of channel similarity. We conducted experiments to evaluate the speech segregation performance of the CASA system in background noise environments (siren, machine, white, car, crowd) at SNRs of 5 dB and 0 dB, and compared the proposed algorithm with the conventional algorithm. The proposed algorithm improved performance by 2.75 dB at an SNR of 5 dB and by 4.84 dB at an SNR of 0 dB in background noise environments.
https://doi.org/10.5573/ieie.2014.51.5.188
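
The conventional segmentation step that this abstract starts from can be sketched as follows: compute an autocorrelation per frequency channel, correlate the autocorrelations of adjacent channels, and threshold the result into a binary mask. This is only the unweighted baseline under stated assumptions. The function names, the 0.9 threshold, and the one-frame-per-channel input are hypothetical, and the paper's weighting scheme is not given in the abstract and is not reproduced here.

```python
import math

def autocorrelation(frame, max_lag):
    """Unnormalized autocorrelation of one channel frame for lags 0..max_lag-1."""
    n = len(frame)
    return [sum(frame[t] * frame[t + lag] for t in range(n - lag))
            for lag in range(max_lag)]

def cross_channel_correlation(acf_a, acf_b):
    """Normalized (zero-mean) correlation between two autocorrelation functions."""
    n = len(acf_a)
    mean_a = sum(acf_a) / n
    mean_b = sum(acf_b) / n
    da = [v - mean_a for v in acf_a]
    db = [v - mean_b for v in acf_b]
    denom = math.sqrt(sum(v * v for v in da) * sum(v * v for v in db))
    if denom == 0.0:
        return 0.0  # a flat autocorrelation carries no periodicity information
    return sum(x * y for x, y in zip(da, db)) / denom

def binary_mask(channel_frames, max_lag, threshold=0.9):
    """Label channel c with 1 when its periodicity matches that of channel c + 1.

    channel_frames: one frame of filterbank output per frequency channel.
    Returns len(channel_frames) - 1 labels; threshold is an assumed value.
    """
    acfs = [autocorrelation(frame, max_lag) for frame in channel_frames]
    return [1 if cross_channel_correlation(acfs[c], acfs[c + 1]) >= threshold else 0
            for c in range(len(acfs) - 1)]
```

Adjacent channels driven by the same periodic source produce nearly identical autocorrelation shapes and are grouped together, while channels dominated by a different periodicity fall below the threshold.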

