• Title/Summary/Keyword: Listener


Effect of Digital Noise Reduction of Hearing Aids on Music and Speech Perception

  • Kim, Hyo Jeong;Lee, Jae Hee;Shim, Hyun Joon
    • Korean Journal of Audiology / v.24 no.4 / pp.180-190 / 2020
  • Background and Objectives: Although many studies have evaluated the effect of the digital noise reduction (DNR) algorithm of hearing aids (HAs) on speech recognition, there are few studies on the effect of DNR on music perception. Therefore, we aimed to evaluate the effect of DNR on music, in addition to speech perception, using objective and subjective measurements. Subjects and Methods: Sixteen HA users participated in this study (58.00±10.44 years; 3 males and 13 females). The objective assessment of speech and music perception was based on the Korean version of the Clinical Assessment of Music Perception test and word and sentence recognition scores. Meanwhile, for the subjective assessment, the quality rating of speech and music as well as self-reported HA benefits were evaluated. Results: There was no improvement conferred with DNR of HAs on the objective assessment tests of speech and music perception. The pitch discrimination at 262 Hz in the DNR-off condition was better than that in the unaided condition (p=0.024); however, the unaided condition and the DNR-on conditions did not differ. In the Korean music background questionnaire, responses regarding ease of communication were better in the DNR-on condition than in the DNR-off condition (p=0.029). Conclusions: Speech and music perception or sound quality did not improve with the activation of DNR. However, DNR positively influenced the listener's subjective listening comfort. The DNR-off condition in HAs may be beneficial for pitch discrimination at some frequencies.

Towards Low Complexity Model for Audio Event Detection

  • Saleem, Muhammad;Shah, Syed Muhammad Shehram;Saba, Erum;Pirzada, Nasrullah;Ahmed, Masood
    • International Journal of Computer Science & Network Security / v.22 no.9 / pp.175-182 / 2022
  • In daily life we encounter information in many formats, such as multimedia and text, whether reading the news, listening to the radio, or watching videos. Finding a specific type of content can nevertheless be difficult. For example, a listener who wants jazz may find that every radio channel is playing pop music mixed with advertisements, and give up the search. An automatic audio classification system can solve this problem. Deep learning (DL) models can perform such classification, but they are expensive and difficult to deploy on edge devices such as the Nano BLE Sense or Raspberry Pi because they typically require substantial computational power, such as a graphics processing unit (GPU). To address this, we propose a low-complexity DL model for Audio Event Detection (AED). We extracted Mel-spectrograms of dimension 128×431×1 from the audio signals and applied normalization. Three data augmentation methods were applied: frequency masking, time masking, and mixup. We designed a Convolutional Neural Network (CNN) with spatial dropout, batch normalization, and separable 2D convolutions, inspired by VGGnet [1]. We further reduced the model size by applying float16 quantization to the trained model. Experiments were conducted on the updated dataset provided by the Detection and Classification of Acoustic Scenes and Events (DCASE) 2020 challenge. Our model achieved a validation loss of 0.33 and an accuracy of 90.34% with a model size of 132.50 KB.
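
The three augmentation methods named in this abstract can be sketched in a few lines. This is a minimal illustration on a synthetic 128×431 array, not the authors' pipeline: the mask widths (16 mel bins, 40 frames), the mixup parameter `alpha = 0.2`, the two-class one-hot labels, and all function names are assumptions chosen for the example.

```python
import random

def mask_rows(spec, max_width, rng):
    """Frequency masking: zero one random band of up to max_width mel bins (rows)."""
    width = rng.randint(0, max_width)
    start = rng.randint(0, len(spec) - width)
    return [[0.0] * len(row) if start <= i < start + width else list(row)
            for i, row in enumerate(spec)]

def mask_cols(spec, max_width, rng):
    """Time masking: zero one random band of up to max_width frames (columns)."""
    width = rng.randint(0, max_width)
    start = rng.randint(0, len(spec[0]) - width)
    return [[0.0 if start <= j < start + width else v for j, v in enumerate(row)]
            for row in spec]

def mixup(x1, y1, x2, y2, alpha, rng):
    """Mixup: blend two examples and their labels with weight lam ~ Beta(alpha, alpha)."""
    lam = rng.betavariate(alpha, alpha)
    x = [[lam * a + (1 - lam) * b for a, b in zip(r1, r2)] for r1, r2 in zip(x1, x2)]
    y = [lam * a + (1 - lam) * b for a, b in zip(y1, y2)]
    return x, y

rng = random.Random(0)
N_MELS, N_FRAMES = 128, 431  # spectrogram size quoted in the abstract

# Synthetic stand-ins for two normalized Mel-spectrograms (assumed data).
spec_a = [[rng.random() for _ in range(N_FRAMES)] for _ in range(N_MELS)]
spec_b = [[rng.random() for _ in range(N_FRAMES)] for _ in range(N_MELS)]

augmented = mask_cols(mask_rows(spec_a, 16, rng), 40, rng)
mixed_x, mixed_y = mixup(augmented, [1.0, 0.0], spec_b, [0.0, 1.0], 0.2, rng)
```

Because mixup blends the labels with the same weight as the inputs, `mixed_y` remains a valid probability vector that sums to 1.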

Using Interaction for an Experiential Story 'The Three Little Pigs and Wolf' - for iPad - (인터랙션을 활용한 체험형 동화 '아기 돼지 삼형제와 늑대' - ipad를 중심으로 -)

  • Kim, Hyunhee
    • Design Convergence Study / v.14 no.3 / pp.1-15 / 2015
  • Storytelling, which is part of human nature, has changed over millions of years. The development of technology and media has shaped storytelling into various forms, and with the recent spread of smart devices, the influence of interactive storytelling has grown significantly. Technology that allows diverse and natural user input has transformed the listener into a user and allows the user to 'experience' the story rather than 'hear' it. In line with this trend, this study designs and implements an interactive tale for children on the iPad platform. Focusing on interaction, the story is designed mainly for 3- to 7-year-olds and contains various multimedia and interaction elements that use built-in technology, such as multi-touch and the microphone, to allow user input that aligns with the context of the story. Focusing on children's experience of and empathy with the characters, 'Three Little Pigs and the Wolf' contains 22 steps and was published in the iTunes Store.

A Study on Enhancement of 3D Sound Using Improved HRTFS (개선된 머리전달함수를 이용한 3차원 입체음향 성능 개선 연구)

  • Koo, Kyo-Sik;Cha, Hyung-Tai
    • The Journal of the Acoustical Society of Korea / v.28 no.6 / pp.557-565 / 2009
  • To perceive the direction and distance of a sound, we rely on several cues. The Head Related Transfer Function (HRTF) describes how sound travels from a source to the ears of the listener, including differences in level, phase, and frequency spectrum. In two-channel reproduction systems, the HRTF is applied in many algorithms that create 3D sound. However, it is difficult to localize a sound source in certain regions, a problem known as the cone of confusion. In this paper, we propose a new algorithm to reduce confusion in sound image localization. Differences in frequency spectrum and psychoacoustic theory are used to boost the spectral cues that distinguish each direction. Informal listening tests were carried out to confirm the performance of the algorithm. The results show that the algorithm improves 3D sound in a two-channel, headphone-based system, and that the sound quality of the improved 3D sound is better than that of conventional methods.
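
As background for this abstract, two-channel HRTF rendering amounts to convolving a mono signal with one impulse response per ear. The sketch below is a generic illustration of that step only, not the enhancement algorithm the paper proposes: the two impulse responses are toy values (assumptions) that model a source on the listener's left as a 3-sample delay and a halved level at the right ear, whereas real systems use measured responses.

```python
def convolve(signal, kernel):
    """Direct-form linear convolution of two sample lists."""
    out = [0.0] * (len(signal) + len(kernel) - 1)
    for i, s in enumerate(signal):
        for j, k in enumerate(kernel):
            out[i + j] += s * k
    return out

# Toy head-related impulse responses (assumed, not measured data).
hrir_left = [1.0, 0.0, 0.0, 0.0]   # near ear: immediate arrival, full level
hrir_right = [0.0, 0.0, 0.0, 0.5]  # far ear: 3 samples later, half level

mono = [1.0, 0.5, 0.25]

left = convolve(mono, hrir_left)    # [1.0, 0.5, 0.25, 0.0, 0.0, 0.0]
right = convolve(mono, hrir_right)  # [0.0, 0.0, 0.0, 0.5, 0.25, 0.125]
```

These toy responses differ only in delay and level, which a mirrored front and back position on the cone of confusion would share. Telling such positions apart depends on the spectral cues that the abstract says the algorithm boosts.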

Speech Reinforcement Based on Soft Decision Under Far-End Noise Environments (원단 잡음 환경에서 Soft Decision에 기반한 새로운 음성 강화 기법)

  • Choi, Jae-Hun;Chang, Joon-Hyuk
    • The Journal of the Acoustical Society of Korea / v.27 no.7 / pp.379-385 / 2008
  • In this paper, we propose an effective speech reinforcement technique for environments with both near-end and far-end noise. Because the intelligibility of far-end speech for the near-end listener is significantly reduced by near-end noise, a far-end speech reinforcement approach is needed to counter this phenomenon. Specifically, based on the estimated background noise spectrum at the near end, we reinforce the far-end speech spectrum in a way that covers the more general cases of near-end background noise. We also propose a novel approach that reinforces only the actual speech component, and not the noise component, of the noisy far-end speech signal. The proposed algorithm is evaluated with the CCR (Comparison Category Rating) test, one of the methods for subjective determination of transmission quality in ITU-T P.800, under various noise environments, and it performs better than the conventional method.

HRTF Enhancement Algorithm for Stereo Sound Systems (스테레오 시스템을 위한 머리전달함수의 개선)

  • Koo, Kyo-Sik;Cha, Hyung-Tai
    • The Journal of the Acoustical Society of Korea / v.27 no.4 / pp.207-214 / 2008
  • 3D sound is usually created with either two-channel or multichannel sound systems. Because of cost and space constraints, two-channel systems are preferred over multichannel ones. With headphones or two speakers, the most typical way to create 3D sound effects is the head related transfer function (HRTF), which describes how sound travels from a source to the ears of the listener. However, it is difficult to localize a sound source in certain regions, a problem known as the cone of confusion. In this paper, we propose a new algorithm to reduce confusion in sound image localization. HRTF grouping and psychoacoustic theory are used to boost the spectral cues through the spectrum differences between directions. Informal listening tests show that the proposed method improves front-back sound localization compared with conventional methods.

Development of Micro Thermal Image Acquisition System (마이크로 열화상 계측 시스템의 IOT 모듈화 개발)

  • Lee, Jun-Yeob;Oh, Jong-woo;Lee, DongHoon
    • Proceedings of the Korean Society for Agricultural Machinery Conference / 2017.04a / pp.169-169 / 2017
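  • A factor that must be considered when analyzing the thermal environment inside a smart pig house is the change in the radiant energy of the livestock. This energy is a target of thermal environment control and, recursively, also a driver of thermal environment change. To analyze it, we developed a thermal image acquisition system that can be easily deployed within the facility. Alongside the ultra-compact micro thermal imaging system, we carried out modular development using IoT (Internet of Things) technology. The thermal imaging sensor is the Lepton™ (500-0690-00, FLIR, Goleta, CA), which offers a resolution of $0.05\,^{\circ}\mathrm{C}$ in the LWIR (longwave infrared) band of $8\,\mu\mathrm{m}$ to $14\,\mu\mathrm{m}$. It communicates at high speed with the microprocessor (NanoPi NEO Air, FriendlyARM, CA, USA) over SPI (Serial Peripheral Interface) at 2 MHz, which allows measurement at 9 Hz. To extend the communication functions of the unit measurement system, which consists of the thermal sensor and a microcomputer, we designed a three-stage information delivery scenario: 1) the module measures thermal images on its own and stores them in built-in memory; 2) a nearby user interface connects to the standalone module 1, and thermal images are transmitted in real time and displayed on screen; 3) in parallel with the module 2 user display, the image is shown on a mobile device over local Wi-Fi. To build this hierarchical, modular measurement system, we installed the open-source software Hostapd 2.5 (http://w1.fi/hostapd) on module 1. Where no external Internet connection is available, module 1 provides AP (Access Point) functionality by itself and can manage connections from a nearby module 2 and mobile device 3. Module 2 can either connect in turn to multiple module 1 units or act as the AP itself and accept connections from module 1 units; the choice depends on the measurement matrix configuration of the system. Modules 1 and 2 were both developed to run TCP/IP Listener and Client services in parallel. To implement the user interface on mobile device 3, socket communication was linked to a general-purpose Android GUI program. One thermal frame is 9,600 bytes ($= 80 \times 60 \times 2$ bytes), and Wi-Fi transmission required a variable number of transfers, roughly 2 to 6. With the sensor measurement system and the information transmission system configured in parallel, all elements of the modular system could generally sustain 9 Hz, the maximum measurement rate provided by the sensor. Future work using this system will study the dynamics between the thermal radiation of individual animals and the thermal environment in the pig house.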
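
The frame arithmetic and the variable number of transfers per frame can be illustrated with a short receive loop. This is a sketch under stated assumptions, not the authors' software: it uses a local `socket.socketpair()` in place of a real Wi-Fi link, and the helper name `recv_frame` and the synthetic payload are hypothetical.

```python
import socket
import threading

FRAME_WIDTH, FRAME_HEIGHT, BYTES_PER_PIXEL = 80, 60, 2
FRAME_BYTES = FRAME_WIDTH * FRAME_HEIGHT * BYTES_PER_PIXEL  # 9,600 bytes per frame
FRAME_RATE_HZ = 9
BYTES_PER_SECOND = FRAME_BYTES * FRAME_RATE_HZ  # 86,400 bytes/s at 9 Hz

def recv_frame(conn, size=FRAME_BYTES):
    """Read exactly `size` bytes and count the recv() calls that were needed."""
    buf = bytearray()
    calls = 0
    while len(buf) < size:
        chunk = conn.recv(size - len(buf))
        if not chunk:
            raise ConnectionError("socket closed mid-frame")
        buf.extend(chunk)
        calls += 1
    return bytes(buf), calls

# Local stand-in for the module-to-module link (assumption: no real network here).
sender, receiver = socket.socketpair()
payload = bytes(i % 256 for i in range(FRAME_BYTES))  # synthetic frame data

# Send from a thread so a small socket buffer cannot block the receiver.
worker = threading.Thread(target=sender.sendall, args=(payload,))
worker.start()
frame, calls = recv_frame(receiver)
worker.join()
sender.close()
receiver.close()
```

A stream socket does not preserve message boundaries, so one frame may arrive in several pieces. The loop keeps reading until all 9,600 bytes are in, which is one plausible reading of the 2 to 6 transfers per frame reported in the abstract.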


The Value of Film as Material for Learning a Foreign Language: Using Posh Discourse (영상자료가 지니는 외국어 학습 자료로서의 가치 : 공손한 언어를 중심으로)

  • Kim, Hye-Jeong
    • The Journal of the Korea Contents Association / v.16 no.2 / pp.643-651 / 2016
  • This study considers the value of English-language films as material for learning a foreign language, focusing on posh discourse. In daily life, when we decline an invitation or convey unpleasant information to a listener, we use polite expressions and choose our words carefully. English language learners need to learn polite expressions in order to interact peacefully with others; doing so can minimize the conflict that is inherent in social relationships. This study uses the British drama Downton Abbey, which is about the aristocracy, analyzes the posh discourse used in it, and argues that students need to learn such discourse explicitly. It is important to learn the polite expressions of this authentic drama in a real classroom. The study suggests that students work in groups to create a short video and try to understand the characters' personalities. Movies, TV dramas, and sitcoms provide rich content that shows the various functions of the language that students want to learn. As a source of learning material, film can help improve students' motivation and interest in learning a foreign language.

Spatial Audio Signal Processing Technology Using Multi-Channel 3D Microphone (멀티채널 3차원 마이크를 이용한 입체음향 처리 기술)

  • Kang Kyeongok;Lee Taejin
    • The Journal of the Acoustical Society of Korea / v.24 no.2 / pp.68-77 / 2005
  • The purpose of a spatial audio system is to give a listener the impression of being present in the recorded environment when its sound is reproduced. A dummy head microphone is generally used for this purpose. Because of its human-like shape, a dummy head microphone can reproduce spatial images through headphone reproduction. However, its shape and size restrict its general use, and it is difficult to convert its output into a multi-channel signal for multi-channel environments. In this paper, we therefore propose a multi-channel 3D microphone technology. The multi-channel 3D microphone acquires spatial audio using five microphones placed around the horizontal plane of a rigid sphere, and through post-processing it can produce signals for headphone, stereo, stereo dipole, 4-channel, and 5-channel reproduction environments. Because the computation is complex, we implemented a hardware-based post-processing system. To verify the performance of the multi-channel 3D microphone, localization experiments were performed. The results show that front/back confusion, one of the common limitations of conventional dummy head technology, can be reduced dramatically.

The Influence of Paired Think-Aloud Problem Solving on Interactions among PCK Components Considered in the Processes of Making Written Test Items by Pre-Service Chemistry Teachers (해결자·청취자 활동이 예비 화학교사의 지필평가 문항 제작 과정에서 고려된 교과교육학 지식(PCK) 구성 요소 사이의 상호작용에 미치는 영향)

  • Park, Jaesung;Kang, Hunsik;Han, JaeYoung
    • Journal of The Korean Association For Science Education / v.37 no.3 / pp.429-440 / 2017
  • This study investigated the influence of paired think-aloud problem solving on interactions among the pedagogical content knowledge (PCK) components considered by pre-service chemistry teachers in the process of making written test items. The item-making processes of four small groups, each consisting of two pre-service chemistry teachers using paired think-aloud problem solving, were recorded and transcribed. The analysis revealed that, of the five PCK components, 'assessment in science education' was used most frequently in making written test items, regardless of role (solver or listener). 'Subject matter knowledge' and 'students' were also used frequently, although less than 'assessment in science education'. However, 'curriculum for science education' and 'instructional strategies and instruction for science education' were used only a little. In terms of integration, integrations of various types between two or three components were found frequently, and integrations among four or five components were found occasionally. However, integrations of 'curriculum for science education' with the other components were found less frequently, and integrations of 'instructional strategies and instruction for science education' with other components were hardly found. The usefulness, limitations, and effective use of paired think-aloud problem solving as a strategy for improving pre-service teachers' competency in making written test items and their PCK are discussed on the basis of the results.