• 제목/요약/키워드: Speech Interference

검색결과 67건 처리시간 0.023초

위상 모델 기반의 소프트 마스크를 이용한 단일 채널 음성분리 (Single-Channel Speech Separation Using Phase Model-Based Soft Mask)

  • 이윤경;권오욱
    • 한국음향학회지
    • /
    • 제29권2호
    • /
    • pp.141-147
    • /
    • 2010
  • 본 논문은 혼합 음성 신호로부터 크기와 위상 정보를 모두 고려하여 목표 음성 신호를 추출하고 향상하는 음성 분리 알고리듬을 제안한다.기존 연구에서는 혼합된 음성 신호의 로그 전력 스펙트럼 값이 시간-주파수 영역에서 서로 독립이라고 가정한 통계적 모델을 적용하기 때문에 음성 분리 결과 파형에 불연속을 야기한다. 본 논문에서는 이러한 불연속을 감소시키기 위하여 시간-주파수 영역에서의 스무딩 필터를 적용한다. 음성 분리 성능을 더욱 향상시키기 위하여 음성 신호의 크기와 함께 위상 정보를 고려하는 통계적 모델을 제안한다. 실혐 결과, 제안된 알고리즘이 기존의 크기 정보만을 사용한 알고리즘에 비하여 1.5 dB의 화자대간섭비 (SIR)를 개선하는 것으로 나타난다.

Vowel Fundamental Frequency in Manner Differentiation of Korean Stops and Affricates

  • Jang, Tae-Yeoub
    • 음성과학
    • /
    • 제7권1호
    • /
    • pp.217-232
    • /
    • 2000
  • In this study, I investigate the role of post-consonantal fundamental frequency (F0) as a cue for automatic distinction of types of Korean stops and affricates. Rather than examining data obtained by restricting contexts to a minimum to prevent the interference of irrelevant factors, a relatively natural speaker independent speech corpus is analysed. Automatic and statistical approaches are adopted to annotate data, to minimise speaker variability, and to evaluate the results. In spite of possible loss of information during those automatic analyses, statistics obtained suggest that vowel F0 is a useful cue for distinguishing manners of articulation of Korean non-continuant obstruents having the same place of articulation, especially of lax and aspirated stops and affricates. On the basis of the statistics, automatic classification is attempted over the relevant consonants in a specific context where the micro-prosodic effects appear to be maximised. The results confirm the usefulness of this effect in application for Korean phone recognition.

  • PDF

A Study on the Production of a Stop Plus Nasal Sequence in English Words by Korean Learners

  • Seo, Mi-Sun;Kim, Hee-Sung;Shin, Ji-Young;Kim, Kee-Ho
    • 음성과학
    • /
    • 제12권3호
    • /
    • pp.165-173
    • /
    • 2005
  • This paper investigates the influence of the Korean phonology on the production of English words including a stop plus nasal sequence through production experiments with a beginner and an advanced group of Korean English learners. The results of the production experiments show that both the beginner and the advanced group of Korean English learners were under the influence of the Korean phonological rule realizing a stop as a nasal before a nasal when they pronounced a stop plus nasal sequences in English words. The extent of L1 interference was greater in the beginner group than in the advanced group.

  • PDF

Production and Perception of English /r/ and /l/ by Korean Learners of English: An Experimental Study

  • Kang, Hyeon-Seok
    • 음성과학
    • /
    • 제6권
    • /
    • pp.7-24
    • /
    • 1999
  • Eleven Korean learners of English took part in an experiment where the production and perception of English /r/ and /l/ in four different word positions was investigated. Overall the subjects made more errors on /l/ in both production and identification tests. The frequency of the subjects' errors was also sensitive to word positions in which the two English liquids occur. Especially the subjects made noticeably fewer errors in intervocalic medial position. It is suggested that the Korean subjects' acquisitional pattern in this particular case of foreign phone learning can be explained more by language particular 'interference' effects rather than 'universal' acoustic arguments such as those given in Dissosway et a1. (1982) and Sheldon and Strange (1982). The results of the experiment also support the minority position among second language educators that in some cases of non-native phone acquisition, learners' production abilities can be developed earlier than their perceptual abilities.

  • PDF

건설공사장 소음의 위해성 평가 (Assessment of Annoyance Cause by Different Types of Construction Noises)

  • 이성찬;전진용
    • 한국소음진동공학회논문집
    • /
    • 제23권12호
    • /
    • pp.1111-1116
    • /
    • 2013
  • In the present study, annoyance caused by diverse construction noises was evaluated through surveys. A survey with a total of 110 construction workers was carried out to investigate annoyance from construction noises at different construction phases. The most effective item of adverse by construction noise was annoyance and the most annoyed construction stage was destruction construction. There were high correlation between work performance and work safety as well as work safety and speech interference. Participant's noise sensitivity were significant between low and high group at work performance, work safety, speech interference.

건설공사장 소음의 위해성 평가 (Assessment of Annoyance cause by Different Types of Construction Noises)

  • 이성찬;이평직;전진용;정진연;임정빈
    • 한국소음진동공학회:학술대회논문집
    • /
    • 한국소음진동공학회 2011년도 추계학술대회 논문집
    • /
    • pp.408-413
    • /
    • 2011
  • In the present study, annoyance caused by diverse construction noises was evaluated through surveys and laboratory experiments. A survey with a total of 110 construction workers was carried out to investigate annoyance from construction noises at different construction phases. The most effective item of adverse by construction noise was annoyance and the most annoyed construction stage was destruction construction. Between work performance and work safety had correlation as well as between work safety and speech interference. Participant's noise sensitivity were significant between low and high group at work performance, work safety, speech interference.

  • PDF

주기적 Sample Skipping과 표준화주파수 축소에 의한 TDM 회선증가방식에서의 불특정 해석 (Distortion Analysis for two TDM Channel Expansion Methodsperiodic Sample Skipping and Sampling Frequency Reduction)

  • 안병성;이재균
    • 대한전자공학회논문지
    • /
    • 제12권3호
    • /
    • pp.30-36
    • /
    • 1975
  • TDM 회선을증가하기 위한 두가지 방식-주기적 sample skipping방법과 표본화주파수 축소 방법-에 대한 불특성을 해석비교하였다. 신호는 통계적으로 stationary인 random신호로 가정했으며, 선로의 잡음과 각랑 상호 간의 간섭효과는 고려하지 않았다. 음성신호에 대한 구체적 비교 결과, 주기회 sample skipping방법이 실제적 설계조건에서 훌륭한 선택이 될 수 있음을 보였다. Distortions are analyzed and compared for two TDM channel expansion methods- periodic sample skipping and sampling frequency reduction. Signal is assumed to be stationary random signal with zero·mean. Channel noise and interference are not considered in the analysis. For speech signal, it is shown that the periodic sample skipping method could be a better choice under practical design constraints.

  • PDF

Multi-channel Speech Enhancement Using Blind Source Separation and Cross-channel Wiener Filtering

  • Jang, Gil-Jin;Choi, Chang-Kyu;Lee, Yong-Beom;Kim, Jeong-Su;Kim, Sang-Ryong
    • The Journal of the Acoustical Society of Korea
    • /
    • 제23권2E호
    • /
    • pp.56-67
    • /
    • 2004
  • Despite abundant research outcomes of blind source separation (BSS) in many types of simulated environments, their performances are still not satisfactory to be applied to the real environments. The major obstacle may seem the finite filter length of the assumed mixing model and the nonlinear sensor noises. This paper presents a two-step speech enhancement method with multiple microphone inputs. The first step performs a frequency-domain BSS algorithm to produce multiple outputs without any prior knowledge of the mixed source signals. The second step further removes the remaining cross-channel interference by a spectral cancellation approach using a probabilistic source absence/presence detection technique. The desired primary source is detected every frame of the signal, and the secondary source is estimated in the power spectral domain using the other BSS output as a reference interfering source. Then the estimated secondary source is subtracted to reduce the cross-channel interference. Our experimental results show good separation enhancement performances on the real recordings of speech and music signals compared to the conventional BSS methods.

L2 억양에 나타나는 L1억양의 긍정적 전이와 부정적 전이 양상 - 일본인 한국어 학습자들을 중심으로 - (Positive and negative transfer of first language in producing second language - Focusing on Japanese learners of Korean -)

  • 윤영숙
    • 말소리와 음성과학
    • /
    • 제8권4호
    • /
    • pp.71-78
    • /
    • 2016
  • The purpose of this study is to investigate the effect of Japanese(L1) on the production of Korean accentual phrases(L2). Korean and Japanese have a similar prosodic structure. But different from Korean, Japanese is a pitch accent language. So each word has its own pitch accent. And pitch accents are maintained in the sentence intonation. This difference will have a negative influence on the production of Korean sentence intonation. For this study 4 Korean natives speakers and 10 advanced Japanese learners of Korean participated in the production test. The material analysed constituted 11 Korean sentences, six of which contain formally identical Sino-Korean and Sino-Japanese words. The results show that the initial pitch pattern of Korean accentual phrases was affected by Japanese pitch accent types and this interference was greater for formally identical Sino-Korean and Sino-Japanese words. But besides initial tones of accentual phrase, some positive interference was observed in the internal tonal pattern of accentual phrase. In the phonetic realization, the internal pitch range and initial pitch rising of accentual phrases was greater for Japanese learners of Korean than native speakers of Korean.

운전 상황에서 휴대폰 음성인터페이스의 사용성 향상에 관한 연구 (Usability Improvement for the Speech Interface of Mobile Phones While Driving)

  • 강윤환;정성욱;정가훈;최재호;정의승
    • 대한산업공학회지
    • /
    • 제35권1호
    • /
    • pp.109-118
    • /
    • 2009
  • While driving, the manual use of a mobile phone is heavily restricted due to the interference with the primary driving task. An alternative would be the use of speech interface. The current study aims to provide a guideline to implementation of a speech interface to the mobile phone. To do so, an expert evaluation was made and it revealed that a speech interface requires less workload, less performance degradation of the driving task than that of the keypad interface. To make speech interfaces more usable, new improvements are suggested. Subjective workload can be reduced and user satisfaction can be improved without degrading the primary task performance, for instance, by letting the user interrupt the speech of the phone, eliminating the repetitive words, letting the user know clearly what makes an error, providing a way to go back to the previous state, reducing the usage of keypad buttons and reducing the amount of the information on the screen.