• 제목/요약/키워드: Auditory condition

검색결과 105건 처리시간 0.034초

참여형 멀티미디어 시스템 사용자 감성평가를 위한 다차원 심물리학적 척도 체계 (Development of Multiple-modality Psychophysical Scaling System for Evaluating Subjective User Perception of the Participatory Multimedia System)

  • 나종관;박민용
    • 대한인간공학회지
    • /
    • 제23권3호
    • /
    • pp.89-99
    • /
    • 2004
  • A comprehensive psychophysical scaling system, multiple-modality magnitude estimation system (MMES) has been designed to measure subjective multidimensional human perception. Unlike paper-based magnitude estimation systems, the MMES has an additional auditory peripheral cue that varies with corresponding visual magnitude. As the simplest, purely psychological case, bimodal divided-attention conditions were simulated to establish the superiority of the MMES. Subjects were given brief presentations of pairs of simultaneous stimuli consisting of visual line-lengths and auditory white-noise levels. In the visual or auditory focused-attention conditions, only the line-lengths or the noise levels perceived should be reported respectively. On the other hand, in the divided-attention conditions, both the line-lengths and the noise levels should be reported. There were no significant differences among the different attention conditions. Human performance was better when the proportion of magnitude in stimulus pairs were identically presented. The additional auditory cues in the MMES improved the correlations between the magnitude of stimuli and MMES values in the divided-attention conditions.

CASA 기반 음성분리 성능 향상을 위한 형태 분석 기술의 응용 (Application of Shape Analysis Techniques for Improved CASA-Based Speech Separation)

  • 이윤경;권오욱
    • 대한음성학회지:말소리
    • /
    • 제65호
    • /
    • pp.153-168
    • /
    • 2008
  • We propose a new method to apply shape analysis techniques to a computational auditory scene analysis (CASA)-based speech separation system. The conventional CASA-based speech separation system extracts speech signals from a mixture of speech and noise signals. In the proposed method, we complement the missing speech signals by applying the shape analysis techniques such as labelling and distance function. In the speech separation experiment, the proposed method improves signal-to-noise ratio by 6.6 dB. When the proposed method is used as a front-end of speech recognizers, it improves recognition accuracy by 22% for the speech-shaped stationary noise condition and 7.2% for the two-talker noise condition at the target-to-masker ratio than or equal to -3 dB.

  • PDF

자율주행 차량의 제어권 인수요구 정보양상과 도로 형태에 따른 운전자의 제어권 인수시간과 정신적 작업부하 차이 (The Effects of Control Takeover Request Modality of Automated Vehicle and Road Type on Driver's Takeover Time and Mental Workload)

  • 윤남경;이재식
    • 감성과학
    • /
    • 제26권4호
    • /
    • pp.51-70
    • /
    • 2023
  • 본 연구에서는 운전 시뮬레이션을 사용하여 자율주행 환경을 구현한 후 3-수준 자율주행 조건에서 자율주행 차량 (automated vehicle: AV)으로부터 운전자에게 전달되는 제어권 인수 요구(takeover request: TOR) 정보의 양상(시각, 청각 및 시각+청각) 및 도로 형태(직선도로와 곡선도로)에 따라 운전자의 제어권 인수 시간(takeover time: TOT) 및 정신적 작업부하(제어권 인수 이후에 운전자들이 경험한 주관적 작업부하와 심장박동수에서의 변화)가 어떻게 차별화되는지 분석하였다. 본 연구의 결과를 요약하면 다음과 같다. 먼저, AV로부터 TOR이 제시된 이후 실험참가자들이 보인 TOT에 대한 분석 결과, TOR 정보양상의 측면에서는 시각 정보가 가장 빠른 TOT를 이끌어 낸 반면 청각정보 조건에서 가장 느렸고, 도로 형태 측면에서는 직선도로 조건에 비해 곡선도로 조건에서의 TOT가 유의하게 더 느렸으며, 특히 청각 정보 조건에서 도로 형태에 따른 TOT에서의 차이가 가장 컸다. 둘째, 정신적 작업부하에 대한 분석 결과, TOR 정보가 시각 혹은 시각+청각적으로 제시된 조건에 비해 청각적으로 제시된 조건에서 주관적 작업부하 측정치와 심장박동수 변화 크기 모두 전반적으로 더 낮았고 특히, 심장박동수 변화의 경우 이러한 경향은 곡선도로 조건에서만 관찰되었다. 이러한 결과는 TOR 정보의 양상과 도로 형태에 따라 운전자의 TOT와 정신적 작업부하 수준이 달라질 수 있고, 특히 TOT가 빠를수록 정신적 작업부하 수준은 상대적으로 더 높아질 수 있음을 시사한다.

행동 및 생리측정기반 개인 간 다중 감각정서 반응일치성 (Consistency between Individuals of Affective Responses for Multiple Modalities based on Behavioral and Physiological Data)

  • 장준혁;김종완
    • 감성과학
    • /
    • 제26권1호
    • /
    • pp.43-54
    • /
    • 2023
  • 본 연구는 참가자 간 상관(Intersubject correlation: ISC)기법을 통해 정서 유발 자극에 대한 한 참가자의 반응과 그 참가자를 제외한 나머지 참가자들의 반응 간 일치성이 각 정서표상 범주(지배가, 각성가, 정서가)와 다양한 감각양상(청각, 시각, 촉각)에서 어떠한 차이가 있는지 밝히고자 하였다. 참가자 간 상관을 계산하기 위해 사용된 데이터는 참가자들의 청각, 시각, 촉각 자극에 대한 생리 측정치와 정서 평정치로 구성되었으며, 한 참가자의 데이터 세트와 나머지 참가자들의 데이터 세트의 평균으로 구분한 뒤 가능한 모든 자극 쌍에 대해 상관을 구하는 방식으로 참가자 간 상관을 계산하였다. 연구 결과, 지배가를 기준으로 재정렬한 데이터 세트에 대한 참가자들의 반응 일치성은 청각 감각양상 조건보다 시각 감각양상 조건에서 높은 ISC 값을 얻었다. 다음으로 각성가로 재정렬한 데이터 세트의 경우 시각 감각양상과 청각 감각양상에서 차이가 있음은 같았지만, 지배가 기준으로 재정렬한 데이터 세트와 결과가 상반되었다. 마지막으로, 정서가를 기준으로 재정렬된 데이터 세트는 모든 감각양상에서 부정적인 데이터 세트들이 긍정적인 데이터 세트보다 참가자들의 반응 일치성이 더 높았다. 모든 데이터 세트에서 정서표상 범주의 높고 낮음과 상관없이 촉각 감각양상에서 높은 ISC 값을 얻었다. 본 연구의 결과는 참가자 간 상관의 다양한 감각양상과 정서표상에 대한 반응의 일치성이 의미하는 바에 대한 해석을 제시하며, ISC 분석 방법이 참가자 반응의 차이에 대한 패턴을 측정하는 유용한 도구가 될 가능성을 제시하였다.

난청인의 주파수 선택도 둔화현상이 음질에 미치는 영향 평가 (The Assessment on the Sound Quality of Reduced Frequency Selectivity of Hearing Impaired People)

  • 안홍섭;박규석;전유용;송영록;이상민
    • 전기학회논문지
    • /
    • 제60권6호
    • /
    • pp.1196-1203
    • /
    • 2011
  • The reduced frequency selectivity is a typical phenomenon of sensorineural hearing loss. In this paper, we compared two modeling methods for reduced frequency selectivity of hearing impaired people. The two models of reduced frequency selectivity were made using LPC(linear prediction coding) algorithm and bandwidth control algorithm based on ERB(equivalent rectangular bandwidth) of auditory filter, respectively. To compare the effectiveness of two models, we compared the result of PESQ (perceptual evaluation of speech quality) and LLR(log likelihood ratio) using 36 Korean words of two syllables. To verify the effect on noise condition, we mixed white and babble noise with 0dB and -3dB SNR to speech words. As the result, it is confirmed that the PESQ score of bandwidth control algorithm is higher than the score of LPC algorithm, on the other hands, and the LLR score of LPC algorithm is lower than the score of bandwidth control algorithm. It means that both non-linearity and widen auditory filter characteristics caused by reduced frequency selectivity could be more reflected in bandwidth control algorithm than in LPC algorithm.

유아의 다중과제 수행과 심리적 불응기: PRP 패러다임 과제를 중심으로 (Multiple Task Performance and Psychological Refractory Period in Children: Focusing on PRP Paradigm Tasks)

  • 김보경;이순형
    • 아동학회지
    • /
    • 제38권3호
    • /
    • pp.75-90
    • /
    • 2017
  • Objective: This study aimed to identify children's cognitive processing and performance characteristics while multiple task performance. It confirmed whether their multiple task performance and psychological refractory period (PRP) varied by task condition (stimulus onset asynchrony [SOA] and task difficulty) and stimulus modality. Methods: Seventy 5-year-olds were recruited. Multi-task tools were developed using the E-prime software. The children were required to respond to two stimuli (visual or auditory) presented with microscopic time difference and their response times (RTs) were recorded. Results: As the SOA increased, the RTs in the first task increased, while the RTs in the second task and PRP decreased. The RTs of the first and second tasks, and the PRP for difficult tasks, were significantly longer than those for easy tasks were. Additionally, there was an interaction effect between the SOA and task difficulty. Although there was no main effect of stimulus modality, task difficulty moderated the modality effect. In the high difficulty condition, the RTs of the first and second tasks and PRP for the visual-visual task were significantly longer than those for auditory-auditory task were. Conclusion: These results inform theoretical discussions on children's multi-task mechanism, and the loss of multiple task performance. Additionally, they provide practical implications and information on the composition of multi-tasks suitable for children in educational environments.

Comparison of McGurk Effect across Three Consonant-Vowel Combinations in Kannada

  • Devaraju, Dhatri S;U, Ajith Kumar;Maruthy, Santosh
    • Journal of Audiology & Otology
    • /
    • 제23권1호
    • /
    • pp.39-48
    • /
    • 2019
  • Background and Objectives: The influence of visual stimulus on the auditory component in the perception of auditory-visual (AV) consonant-vowel syllables has been demonstrated in different languages. Inherent properties of unimodal stimuli are known to modulate AV integration. The present study investigated how the amount of McGurk effect (an outcome of AV integration) varies across three different consonant combinations in Kannada language. The importance of unimodal syllable identification on the amount of McGurk effect was also seen. Subjects and Methods: Twenty-eight individuals performed an AV identification task with ba/ga, pa/ka and ma/ṇa consonant combinations in AV congruent, AV incongruent (McGurk combination), audio alone and visual alone condition. Cluster analysis was performed using the identification scores for the incongruent stimuli, to classify the individuals into two groups; one with high and the other with low McGurk scores. The differences in the audio alone and visual alone scores between these groups were compared. Results: The results showed significantly higher McGurk scores for ma/ṇa compared to ba/ga and pa/ka combinations in both high and low McGurk score groups. No significant difference was noted between ba/ga and pa/ka combinations in either group. Identification of /ṇa/ presented in the visual alone condition correlated negatively with the higher McGurk scores. Conclusions: The results suggest that the final percept following the AV integration is not exclusively explained by the unimodal identification of the syllables. But there are other factors which may also contribute to making inferences about the final percept.

Comparison of McGurk Effect across Three Consonant-Vowel Combinations in Kannada

  • Devaraju, Dhatri S;U, Ajith Kumar;Maruthy, Santosh
    • 대한청각학회지
    • /
    • 제23권1호
    • /
    • pp.39-48
    • /
    • 2019
  • Background and Objectives: The influence of visual stimulus on the auditory component in the perception of auditory-visual (AV) consonant-vowel syllables has been demonstrated in different languages. Inherent properties of unimodal stimuli are known to modulate AV integration. The present study investigated how the amount of McGurk effect (an outcome of AV integration) varies across three different consonant combinations in Kannada language. The importance of unimodal syllable identification on the amount of McGurk effect was also seen. Subjects and Methods: Twenty-eight individuals performed an AV identification task with ba/ga, pa/ka and ma/ṇa consonant combinations in AV congruent, AV incongruent (McGurk combination), audio alone and visual alone condition. Cluster analysis was performed using the identification scores for the incongruent stimuli, to classify the individuals into two groups; one with high and the other with low McGurk scores. The differences in the audio alone and visual alone scores between these groups were compared. Results: The results showed significantly higher McGurk scores for ma/ṇa compared to ba/ga and pa/ka combinations in both high and low McGurk score groups. No significant difference was noted between ba/ga and pa/ka combinations in either group. Identification of /ṇa/ presented in the visual alone condition correlated negatively with the higher McGurk scores. Conclusions: The results suggest that the final percept following the AV integration is not exclusively explained by the unimodal identification of the syllables. But there are other factors which may also contribute to making inferences about the final percept.

해양목장 대상 어류의 음향순치에 관한 기초적 연구 -1. 돌돔의 청각 특성- (A Basic Study on Acoustic Conditioning of Fish Suitable for a Marine Ranch -1. The Sound Sensitivity of Japanese Parrot Fish Oplegnathus fasciatus-)

  • 김성호;이창헌;서두옥;김용주
    • 한국수산과학회지
    • /
    • 제35권6호
    • /
    • pp.563-567
    • /
    • 2002
  • Developing base data on luring fish schools into netting position by the use of underwater audible sound on japanese parrot fish Oplegnathus fasciatus found in the coastal waters of Jeju Island, S. Korea. Auditory threshold was determined by the heartbeat condition technique using pure tones coupled with a delayed electric shock. The audible range of japanese parrot fish extended from 80 Hz to 500 Hz with a peak sensitivity at 200 Hz. The mean auditory thresholds at the frequencies of 80 Hz, 100 Hz, 200 Hz,300 Hz and 500 Hz were 104 dB, 95 dB, 91 dB, 99 dB and 113 dB, respectively. As the frequency became higher than 200 Hz, the auditory threshold increased almost linearly with increasing frequency. Critical ratios of fishes measured in the presence of masking noise in the spectrum level range of 69$\~$78 dB (0 dB re 1$\mu$Pa/$\sqrt{Hz}$) ranged from 21 dB to 40 dB at test frequencies. The noise spectrum level at the start of masking was about 70 dB within the test frequency range. The sound pressure level of 100$\~$200 Hz recognized by japanese parrot fish under the ambient noise is above 91 dB and the critical ratio for them is above 21 dB.

청취자가 이동이 가능한 청각 디스플레이 시스템 개발 (A development of the virtual auditory display system that allows listeners to move in a 3D space)

  • 강대기;이채봉
    • 융합신호처리학회논문지
    • /
    • 제13권1호
    • /
    • pp.1-5
    • /
    • 2012
  • 본 연구에서는 청취자가 3차원 공간에서 자유로이 움직일 수 있는 음장 재생 시스템을 개발하였다. 시스템은 방음실($4.7m(W){\times}2.8m(D){\times}3.0m(H)$)에 설치하였으며 개인용 컴퓨터, 음의 제시장치 그리고 3차원 초음파 센서로 구성하였다. 본 시스템에서는 실내 전체를 감지하는 3차원 초음파 센서로부터 청취자의 위치 정보 및 자세각 정보를 얻어, 컴퓨터에서 나오는 음신호에 머리전달함수(HRTFs)를 삽입하여 음의 제시 장치로부터 출력하도록 설정하였다. 실험에 사용한 머리전달함수는 청취자의 머리 정변에서 1.5m 떨어진 지점에서 각각 측정하였다. 그리고 시스템의 성능을 평가하기 위해 가상 음원의 위치를 랜덤으로 변화시키고, 청취자에게는 자유로운 이동을 허락한 상태에서 음원의 위치를 찾는 실험을 수행하였다. 그 결과, 3차원 공간에 있어서 제시된 음원의 위치에서 청취자가 추정한 위치까지의 거리 오차는 약 30cm 이내로 나타났다.