• Title/Summary/Keyword: sound information

Reference Channel Input-Based Speech Enhancement for Noise-Robust Recognition in Intelligent TV Applications (지능형 TV의 음성인식을 위한 참조 잡음 기반 음성개선)

  • Jeong, Sangbae
    • Journal of the Korea Institute of Information and Communication Engineering / v.17 no.2 / pp.280-286 / 2013
  • This paper proposes a noise reduction system for the speech interface in intelligent TV applications. TV speaker sound is a serious noise source that degrades recognition performance, so the noise reduction algorithm uses the direct TV sound as the reference noise input. In the proposed algorithm, transfer functions are estimated to compensate for the difference between the direct TV sound and the sound recorded by the microphone installed on the TV frame. The noise power spectrum in the received signal is then calculated to perform Wiener filter-based noise cancellation. Additionally, a postprocessing step is applied to reduce the remaining noise. Experimental results show that the proposed algorithm achieves an 88% recognition rate for isolated Korean words at 5 dB input SNR.
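
As a rough illustration of the Wiener filter-based cancellation step described in the abstract above, the sketch below computes a per-bin gain from a reference noise spectrum. It is not the author's implementation: the function name `wiener_gains`, the gain floor, and the assumption that the transfer function is already available as a squared magnitude per frequency bin are all hypothetical.

```python
def wiener_gains(received_power, reference_power, transfer_mag_sq, floor=0.05):
    """Per-frequency-bin Wiener-style gains using a reference noise input.

    received_power  : power spectrum of the microphone signal, one value per bin
    reference_power : power spectrum of the direct TV sound (reference noise)
    transfer_mag_sq : assumed |H|^2 per bin, mapping the reference to the microphone
    floor           : hypothetical minimum gain, used to limit residual artifacts
    """
    gains = []
    for y, r, h2 in zip(received_power, reference_power, transfer_mag_sq):
        noise = h2 * r                  # estimated noise power at the microphone
        if y <= 0.0:
            gains.append(floor)         # no energy in this bin
            continue
        gain = (y - noise) / y          # estimated speech power / received power
        gains.append(max(floor, min(1.0, gain)))
    return gains


if __name__ == "__main__":
    received = [4.0, 2.0, 1.0]
    reference = [1.0, 4.0, 0.0]
    h_sq = [1.0, 0.5, 1.0]
    print(wiener_gains(received, reference, h_sq))  # [0.75, 0.05, 1.0]
```

Each gain would multiply the corresponding bin of the received spectrum before resynthesis. The postprocessing step mentioned in the abstract is not modeled here.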

Implementation of Sound Source Location Detector (음원 위치 검출기의 구현)

  • 이종혁;김진천
    • Journal of the Korea Institute of Information and Communication Engineering / v.4 no.5 / pp.1017-1025 / 2000
  • The human auditory system has been shown to possess remarkable abilities in the localization and tracking of sound sources. Localization is the result of processing two primary acoustic cues at the two ears: the interaural time difference (ITD) and the interaural intensity difference (IID). In this paper, we propose the TEPILD (Time Energy Previous Integration Location Detector) model. The TEPILD model consists of a time function generator, an energy function generator, a previous location generator, and an azimuth detector. The time function generator processes the ITD, and the energy function generator processes the IID. The total average accuracy rate is 99.2%. These results are encouraging and show that the proposed model can be applied to a sound source location detector.

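
The ITD cue mentioned in the abstract above can be illustrated with a plain cross-correlation search. This is only a sketch under stated assumptions, not the TEPILD model itself: the function name `estimate_itd_samples` and the brute-force lag search are hypothetical choices, and the IID, previous-location, and azimuth stages are not modeled.

```python
def estimate_itd_samples(left, right, max_lag):
    """Return the lag (in samples) that maximizes the cross-correlation.

    A positive result means the right-ear signal is a delayed copy of the
    left-ear signal, i.e. the sound reached the left ear first.
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i in range(len(left)):
            j = i + lag
            if 0 <= j < len(right):
                score += left[i] * right[j]
        if score > best_score:          # keep the first lag with the highest score
            best_lag, best_score = lag, score
    return best_lag


if __name__ == "__main__":
    left = [0, 0, 1, 2, 1, 0, 0, 0, 0, 0]
    right = [0, 0, 0, 0, 0, 1, 2, 1, 0, 0]   # same pulse, 3 samples later
    print(estimate_itd_samples(left, right, max_lag=5))  # 3
```

Dividing the returned lag by the sampling rate gives the ITD in seconds.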

The Analysis of Equalizer for Improving Sound Quality of Smartphones (스마트폰의 음질 향상을 위한 Equalizer 분석)

  • Lee, Myung-hwan;Ryu, Chang-su
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2013.05a / pp.190-193 / 2013
  • Before smartphones were released, music playback on cellphones was very limited, so a separate MP3 player was needed. Once smartphones appeared, multimedia playback became one of their important functions, covering everything an MP3 player does. Although playing music and multimedia on a single device is more convenient, sound quality remains a problem. This paper discusses the EQ functions of smartphone music players and then aims to achieve high-fidelity sound through balance adjustment using the RightMark Audio Analyzer program.

Sensibility Evaluation of Internet Shoppers with the Sportswear Rustling Sounds (스포츠의류 마찰음 정보 제공에 따른 인터넷 구매자의 감성평가)

  • Baek, Gyeong-Rang;Jo, Gil-Su
    • Proceedings of the Korean Society for Emotion and Sensibility Conference / 2009.05a / pp.177-180 / 2009
  • This study investigates how consumers perceive different fabrics when provided with a video clip containing the rustling sounds of the fabric. We used sportswear products currently on the market and evaluated the emotional response of internet shoppers by measuring their physiological and psychological responses. Three kinds of vapor-permeable water-repellent fabric were selected to generate video clips, each containing the fabric rustling sound and images of exercise activities performed while wearing sportswear made of the respective fabric. The new experimental website contained the video clips and was compared with the original website, which served as a control. Thirty subjects who had experience buying clothing online took part in the physiological and psychological evaluation of the video clips. Electroencephalography (EEG) was used to measure the physiological response, while the psychological response consisted of evaluating accurate perception of the fabric, satisfaction, and consumer interest. When video clips with the fabric's rustling sound were offered on the website, subjects answered that they could get more accurate and rapid information for their purchase decision than when shopping without such information. However, the rustling sounds somewhat annoyed customers, as shown by the psychological and physiological responses. Our study is a critical step in evaluating the consumer's emotional response to sportswear fabric, which will increase sales, reduce the return rate, and aid the development of new sportswear fabrics and the further evolution of the industry.

SVM-based Drone Sound Recognition using the Combination of HLA and WPT Techniques in Practical Noisy Environment

  • He, Yujing;Ahmad, Ishtiaq;Shi, Lin;Chang, KyungHi
    • KSII Transactions on Internet and Information Systems (TIIS) / v.13 no.10 / pp.5078-5094 / 2019
  • In recent years, the development of drone technologies has promoted the widespread commercial application of drones. However, the ability of drones to carry explosives and other destructive materials may pose serious threats to public safety. To reduce these threats from illegal drones, acoustic feature extraction and classification technologies are introduced for drone sound identification. In this paper, we introduce the acoustic feature vector extraction method of harmonic line association (HLA) and subband power feature extraction based on the wavelet packet transform (WPT). We propose a feature vector extraction method that combines HLA and WPT to extract more sophisticated characteristics of sound. Moreover, to identify drone sounds, support vector machine (SVM) classification with parameters optimized by a genetic algorithm (GA) is applied to the extracted feature vector. The sounds of four drones and other kinds of sounds found in outdoor environments are used to evaluate the performance of the proposed method. The experimental results show that with the proposed method, the identification probability reaches up to 100% in trials, and robustness against noise is also significantly improved.

Sound Watermarking Technique based on Blackfin Processor (블랙핀 프로세서 기반의 사운드 워터마킹 기법)

  • Kim, Ye-il;Seo, Jung-hee;Park, Hung-bog
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2015.10a / pp.757-758 / 2015
  • The digital watermark is one of the important techniques for solving copyright authentication problems in digital media. Research on digital watermarks is rapidly increasing across various media. A watermark system can resist some attacks, such as signal attacks, geometric attacks, and protocol attacks. However, the robustness of watermarks still needs to be improved. This paper suggests a watermarking technique in which a watermark is embedded in, and extracted from, a coefficient of a wavelet-based frequency band, for the protection of property rights and the authentication of Blackfin processor-based digital sound. Implementing the suggested sound watermarking in hardware makes it possible to confirm the commercial viability of property-rights protection and the watermark robustness that results from developing high-level programs.

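
To make the idea of embedding a watermark in a wavelet coefficient and extracting it again more concrete, here is a minimal sketch. The abstract above does not state which wavelet or embedding rule the authors use, so the one-level Haar transform, the quantization-based embedding rule, the `step` value, and all function names are assumptions chosen for illustration only. The input is assumed to have an even number of samples.

```python
import math

SQRT2 = math.sqrt(2.0)


def haar_forward(samples):
    """One-level Haar transform: returns (approximation, detail) coefficients."""
    pairs = [(samples[i], samples[i + 1]) for i in range(0, len(samples) - 1, 2)]
    approx = [(a + b) / SQRT2 for a, b in pairs]
    detail = [(a - b) / SQRT2 for a, b in pairs]
    return approx, detail


def haar_inverse(approx, detail):
    """Inverse of haar_forward."""
    out = []
    for a, d in zip(approx, detail):
        out.append((a + d) / SQRT2)
        out.append((a - d) / SQRT2)
    return out


def embed_bits(samples, bits, step=0.1):
    """Embed one bit per approximation coefficient by quantizing it."""
    approx, detail = haar_forward(samples)
    if len(bits) > len(approx):
        raise ValueError("not enough coefficients for the watermark")
    for k, bit in enumerate(bits):
        offset = bit * step / 2.0       # bit 1 uses a lattice shifted by step/2
        approx[k] = round((approx[k] - offset) / step) * step + offset
    return haar_inverse(approx, detail)


def extract_bits(samples, count, step=0.1):
    """Recover the bits from the parity of the quantized coefficients."""
    approx, _ = haar_forward(samples)
    return [round(approx[k] / (step / 2.0)) % 2 for k in range(count)]


if __name__ == "__main__":
    audio = [0.3, -0.2, 0.5, 0.1, -0.4, 0.25, 0.0, 0.7]
    marked = embed_bits(audio, [1, 0, 1, 1])
    print(extract_bits(marked, 4))  # [1, 0, 1, 1]
```

A larger `step` makes the embedded bits survive stronger distortion but changes the audio more, which is the usual robustness versus fidelity trade-off in watermarking.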

Enhancement of Sound Image Localization on Vertical Plane for Three-Dimensional Acoustic Synthesis (3차원 음향 합성을 위한 수직면에서의 음상 정위 향상)

  • 김동현;정하영;김기만
    • Journal of the Korea Institute of Information and Communication Engineering / v.3 no.3 / pp.541-546 / 1999
  • The head-related transfer function (HRTF), which expresses the acoustic process from the sound source to the human ears in the free field, contains critical information from which the location of the source can be traced. It also makes it possible to realize a multi-dimensional acoustic system that can approximately generate non-existent sound sources. The use of a non-individual, common HRTF degrades localization ability, causing errors such as front-back judgment errors and elevation judgment errors. In this paper, we reduce the error on the vertical plane by increasing the spectral notch level. Subjective tests showed that the proposed method can improve the ability to locate stationary and moving sources.

Cardiac Valve Disease Suspicion Monitoring System Using Heart Sound (심음을 이용한 심장 판막 질환 의심 모니터링 시스템)

  • Kim, Jin-Hwan;Noh, Yun-Hong;Jeong, Do-Un
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2017.05a / pp.627-628 / 2017
  • Recently, the rapid growth of the smart healthcare industry has brought great interest in health, and various devices capable of monitoring health status in daily life are being developed. In this study, we implemented a system that monitors cardiac activity and anomalous cardiac signals using heart sound measurement, in a manner different from conventional electrocardiogram or pulse wave measurement. We then evaluated measurements on a subject using the implemented heart sound system. As a result, we confirmed the possibility of monitoring cardiac activity during daily life through heart sound measurement.

Underwater Acoustic Research Trends with Machine Learning: Ocean Parameter Inversion Applications

  • Yang, Haesang;Lee, Keunhwa;Choo, Youngmin;Kim, Kookhyun
    • Journal of Ocean Engineering and Technology / v.34 no.5 / pp.371-376 / 2020
  • Underwater acoustics, which is the study of the phenomena related to sound waves in water, has been applied mainly in research on the use of sound navigation and ranging (SONAR) systems for communication, target detection, investigation of marine resources and environments, and noise measurement and analysis. Underwater acoustics is mainly applied in the field of remote sensing, wherein information on a target object is acquired indirectly from acoustic data. Presently, machine learning, which has recently been applied successfully in a variety of research fields, is being utilized extensively in remote sensing to obtain and extract information. In the earlier parts of this work, we examined the research trends involving the machine learning techniques and theories that are mainly used in underwater acoustics, as well as their applications in active/passive SONAR systems (Yang et al., 2020a; Yang et al., 2020b; Yang et al., 2020c). As a follow-up, this paper reviews machine learning applications for the inversion of ocean parameters such as sound speed profiles and sediment geoacoustic parameters.

Ranging Algorithm of Underwater Acoustic Wave with Look-up Table (Look-up table을 이용한 수중 음향파 거리 추정 알고리즘)

  • Cheon, Ju-Hyun;Moon, Seung-Hyun;Lee, Ho-Kyoung
    • Journal of the Institute of Electronics and Information Engineers / v.52 no.4 / pp.23-29 / 2015
  • In this paper, we introduce an underwater ranging algorithm with a look-up table (LUT), obtained by modifying the existing method that uses the changes in the angles of acoustic rays under the SSP (Sound Speed Profile). We compare the horizontal distance errors and the calculation times of the two methods. Our new algorithm exploits a Time of Arrival (ToA) to horizontal distance table based on the SSP. It computes faster than the previous method at the cost of a slight increase in the distance estimation error.
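
The look-up step described in the abstract above can be sketched as a table search with linear interpolation. This is an illustration under assumptions rather than the authors' algorithm: the function name `distance_from_toa`, the clamping at the table edges, and the table values (placeholders, not computed from any real SSP) are all hypothetical.

```python
from bisect import bisect_left


def distance_from_toa(toa_s, table):
    """Interpolate horizontal distance (m) from a ToA (s) using a sorted table.

    table: list of (toa_seconds, horizontal_distance_m) pairs sorted by ToA,
           assumed to be precomputed offline from the sound speed profile.
    """
    times = [t for t, _ in table]
    if toa_s <= times[0]:
        return table[0][1]              # clamp below the table range
    if toa_s >= times[-1]:
        return table[-1][1]             # clamp above the table range
    hi = bisect_left(times, toa_s)      # first entry with ToA >= toa_s
    t0, d0 = table[hi - 1]
    t1, d1 = table[hi]
    return d0 + (d1 - d0) * (toa_s - t0) / (t1 - t0)


if __name__ == "__main__":
    # Placeholder values for illustration only.
    lut = [(0.10, 150.0), (0.20, 300.0), (0.30, 449.0)]
    print(distance_from_toa(0.15, lut))
```

The speed gain comes from replacing per-query ray computation with a binary search, and the interpolation between table entries is one source of the small extra error the abstract mentions.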