• Title/Summary/Keyword: 오디오신호 (audio signal)

Modeling of Acoustic Echo Canceller Using Subband Adaptive Signal Processing (서브밴드 적응신호처리를 이용한 음향 에코제거기의 모델링)

  • Kim, Chun-Duck; Sim, Dong-Youn; Chung, Ho-Moon; Lee, Jun-Ku; Cha, Kyung-Hwan
    • The Journal of the Acoustical Society of Korea / v.16 no.5 / pp.43-49 / 1997
  • Echo cancellers in TV or audio conference systems generally struggle to operate in real time in closed rooms with long reverberation times, because the system needs considerable time to adapt its filter coefficients to environmental changes. To address this problem, this paper proposes a new subband adaptive filtering method that uses the polyphase filter banks of the MPEG (Moving Picture Experts Group) audio system. The method divides the input and output signal spectra into several frequency bands and adaptively filters each band with the ES-NLMS (Exponential Step-Normalized Least Mean Square) algorithm. The optimal number of subbands is determined by computer simulation. According to the simulation results, the ERLE of the subband model is 2 dB lower than that of the conventional full-band model, while its computational load is reduced by about 88%.
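
The adaptive filtering and the ERLE figure mentioned in this abstract can be illustrated with a minimal sketch. It assumes a plain full-band NLMS filter, not the paper's ES-NLMS update or its polyphase subband structure, and the echo path, step size, and function names are hypothetical values chosen only for illustration.

```python
import math
import random

def nlms_echo_canceller(far_end, mic, taps=8, mu=0.5, eps=1e-8):
    """Full-band NLMS: return the residual echo e[n] and the final weights."""
    w = [0.0] * taps      # adaptive filter coefficients
    buf = [0.0] * taps    # most recent far-end samples, newest first
    errors = []
    for x, d in zip(far_end, mic):
        buf = [x] + buf[:-1]
        y = sum(wi * xi for wi, xi in zip(w, buf))   # estimated echo
        e = d - y                                    # residual after cancellation
        norm = sum(xi * xi for xi in buf) + eps      # input power normalization
        w = [wi + mu * e * xi / norm for wi, xi in zip(w, buf)]
        errors.append(e)
    return errors, w

def erle_db(mic, errors):
    """Echo return loss enhancement: 10*log10(echo power / residual power)."""
    p_d = sum(d * d for d in mic)
    p_e = max(sum(e * e for e in errors), 1e-30)     # guard against log(inf)
    return 10.0 * math.log10(p_d / p_e)

# Synthetic, noise-free test: white far-end signal through a short echo path
# (hypothetical coefficients, not taken from the paper).
random.seed(0)
echo_path = [0.6, -0.3, 0.1, 0.05]
far = [random.gauss(0.0, 1.0) for _ in range(4000)]
mic = [sum(h * far[n - k] for k, h in enumerate(echo_path) if n - k >= 0)
       for n in range(len(far))]

errors, w = nlms_echo_canceller(far, mic)
tail_erle = erle_db(mic[2000:], errors[2000:])       # ERLE after convergence
```

In a subband design, one such filter would run per band at a reduced sampling rate, which is where the computational saving reported in the abstract would come from.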

Output Improvement of Two-dimensional Audio Actuators by Corona Surface Treatments to Increase Adhesive Properties of Piezoelectric Materials (코로나 표면 처리의 접착력 향상에 의한 이차원 오디오 시스템의 출력 개선)

  • Um, Kee-Hong
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.12 no.5 / pp.91-97 / 2012
  • The performance of electrical and electronic devices has recently been improving while their sizes shrink. Among sound-generating systems, two-dimensional speakers have been developed to replace conventional three-dimensional ones. Piezoelectric materials vibrate mechanically when a voltage is applied to them from outside. Early film speakers had limited output power because the chemical properties of the materials made it difficult to form conducting polymer films on their surfaces. We adopted a corona surface treatment to improve the output characteristics by increasing the adhesion of the coating material to the surface of the piezo film's core material. The results showed improved output power over a wider range of operating frequencies.

A Viewer Preference Model Based on Physiological Feedback (CogTV를 위한 생체신호기반 시청자 선호도 모델)

  • Park, Tae-Suh; Kim, Byoung-Hee; Zhang, Byoung-Tak
    • Journal of the Korean Institute of Intelligent Systems / v.24 no.3 / pp.316-322 / 2014
  • A movie recommendation system is proposed that learns a viewer's preference model from the multimodal features of video content and the implicit responses they evoke in the viewer, recorded in synchronization. In this system, facial expression, body posture, and physiological signals are measured to estimate the viewer's affective states in response to stimuli consisting of low-level and affective features from the video, audio, and text streams. Experimental results show that a viewer's arousal response, measured by electrodermal activity, can be predicted from the auditory and text features of a video stimulus, which allows the interestingness of the video to be estimated.

MB-OFDM UWB Technology for Increasing Transmission Reach of Wireless Speaker Systems (차세대 무선 스피커 시스템의 전송거리 증대를 위한 MB-OFDM UWB 기술)

  • Kim, Do-Hoon; Wee, Jung-Wook; Lee, Hyeon-Seok; Lee, Chung-Yong
    • Journal of the Institute of Electronics Engineers of Korea TC / v.48 no.6 / pp.1-5 / 2011
  • We present a multi-band orthogonal frequency division multiplexing ultra-wideband (MB-OFDM UWB) technology for increasing the transmission reach of wireless speaker systems. The proposed scheme adopts Reed-Solomon coding to protect against random errors and shows an SNR gain, particularly in the low bit error rate (BER) region. Because the receiver sensitivity improves, the maximum reach of MB-OFDM UWB can be increased. To improve simulation accuracy, the simulation environment includes most effects of realistic channels, such as additive white Gaussian noise (AWGN), the CM1 channel model, sampling frequency offset (SFO), and carrier frequency offset (CFO). The simulation results show that the proposed scheme gives an SNR gain of up to 2 dB and increases the transmission reach up to 12.6 m.
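
The link between an SNR gain and a longer reach can be sketched with a simple path-loss calculation. The abstract states neither the baseline reach nor the propagation model, so the 10 m baseline and the free-space path-loss exponent of 2 below are assumptions, and `reach_after_gain` is a hypothetical helper.

```python
def reach_after_gain(base_reach_m, snr_gain_db, path_loss_exponent=2.0):
    """Scale the reach by the extra path loss that an SNR gain can absorb.

    Received power falls off as d**(-n), so a gain of G dB extends the
    reach by a factor of 10**(G / (10 * n)).
    """
    return base_reach_m * 10 ** (snr_gain_db / (10.0 * path_loss_exponent))

# Assumed values: 10 m baseline reach, free-space exponent n = 2.
reach = reach_after_gain(10.0, 2.0)
print(f"{reach:.1f} m")
```

Under these assumptions a 2 dB gain gives roughly 12.6 m, which is consistent with the figure in the abstract, although the paper's own derivation may differ.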

Combining deep learning-based online beamforming with spectral subtraction for speech recognition in noisy environments (잡음 환경에서의 음성인식을 위한 온라인 빔포밍과 스펙트럼 감산의 결합)

  • Yoon, Sung-Wook; Kwon, Oh-Wook
    • The Journal of the Acoustical Society of Korea / v.40 no.5 / pp.439-451 / 2021
  • We propose a deep learning-based beamformer combined with spectral subtraction for continuous speech recognition in noisy environments. Conventional beamforming systems have mostly been evaluated on pre-segmented audio signals, typically generated by continuously mixing speech and noise on a computer. In real environments, however, speech utterances occur sparsely along the time axis, so conventional beamforming systems degrade when noise-only signals without speech are input. To alleviate this drawback, we combine an online beamforming algorithm with spectral subtraction. We construct a Continuous Speech Enhancement (CSE) evaluation set to evaluate the online beamforming algorithm in noisy environments. The evaluation set is built by mixing sparsely occurring speech utterances from the CHiME3 evaluation set with continuously played CHiME3 background noise and MUSDB background music. Using a Kaldi-based toolkit and the Google web speech recognizer as speech recognition back-ends, we confirm that the proposed online beamforming algorithm with spectral subtraction performs better than the baseline online algorithm.
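
The spectral subtraction step named in this abstract can be sketched on magnitude spectra. This is a generic textbook form, not the authors' implementation: the over-subtraction factor `alpha`, the spectral floor, and the toy frame values are all assumptions, and the beamformer is omitted.

```python
def estimate_noise(noise_frames):
    """Average the magnitude spectra of frames known to contain only noise."""
    n = len(noise_frames)
    return [sum(bins) / n for bins in zip(*noise_frames)]

def spectral_subtract(frame, noise, alpha=1.0, floor=0.01):
    """Subtract the noise estimate per bin, never going below floor * input."""
    return [max(m - alpha * nz, floor * m) for m, nz in zip(frame, noise)]

# Toy three-bin magnitude spectra (hypothetical values).
noise = estimate_noise([[1.0, 1.0, 2.0], [1.0, 3.0, 2.0]])
clean = spectral_subtract([5.0, 2.5, 1.0], noise)
```

The floor keeps a bin from going negative when the noise estimate exceeds the observed magnitude, as happens in the third bin here.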

Commercial 4K UHD Streaming Device over 5G Mobile Communication Network (5G 이동통신망을 통한 상용 4K UHD 스트리밍 장치)

  • Junghoon, Paik; Yongsuk, Kim
    • Journal of Broadcast Engineering / v.27 no.6 / pp.914-922 / 2022
  • In this paper, we construct a commercial 4K UHD (Ultra High Definition) streaming device that uses a 5G mobile communication network as its transport channel and conduct a streaming performance test. To support adaptive streaming, it uses RTP (Real-time Transport Protocol), which can monitor transmission quality, as the transmission protocol. In addition, it can adjust the encoding rate of the video signal so that encoding is optimized for changes in the bandwidth of the transmission channel. The performance test confirms that the H.265 encoding rate for the 4K UHD signal is 48.69 Mbps, the average glass-to-glass delay is 293.60 ms, and the average time difference between video and audio for lip sync is 120 ms. The test results show that the device can be used for 4K UHD streaming over a 5G mobile communication network.
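
The idea of adjusting the encoding rate to the channel can be illustrated with a minimal sketch. The abstract does not describe the control law, so the loss-driven back-off rule, the thresholds, and the function name below are hypothetical. Only the 48.69 Mbps figure comes from the abstract, and using it as a ceiling is itself an assumption.

```python
def next_bitrate_mbps(current, loss_fraction, floor=5.0, ceiling=48.69,
                      loss_high=0.02, backoff=0.85, probe_step=1.0):
    """Choose the next encoder bitrate from the latest reported loss fraction.

    Back off multiplicatively when the reported loss is high, otherwise
    probe upward additively, staying within [floor, ceiling].
    """
    if loss_fraction > loss_high:
        return max(floor, current * backoff)
    return min(ceiling, current + probe_step)

# Example: 5% reported loss at 40 Mbps triggers a back-off.
reduced = next_bitrate_mbps(40.0, 0.05)
# Example: no loss near the ceiling clamps to the ceiling.
capped = next_bitrate_mbps(48.5, 0.0)
```

In an RTP deployment the loss fraction would typically come from RTCP receiver reports, though whether the device uses that field is not stated in the abstract.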

Development of FPGA-based failure detection equipment for SMART TV embedded camera (FPGA를 이용한 SMART TV용 내장형 카메라 불량 검출 장비 개발)

  • Lee, Jun Seo; Kim, Whan Woo; Kim, Ji-Hoon
    • Journal of Korea Society of Industrial Information Systems / v.18 no.5 / pp.45-50 / 2013
  • As the SMART TV market expands, cameras are being embedded in TVs to provide a variety of user experiences. However, this leads to camera failures caused by problems in the TV power-up sequence, which conventional test equipment usually cannot detect. Such failures can be detected with new equipment that regenerates the control signals for the audio interface, but this approach is expensive and time-consuming. This paper proposes an FPGA (Field Programmable Gate Array)-based failure detection system for SMART TVs that reduces both the cost and the time of testing.

A Study On Audial and Visual Information Transfer System for the disable and general Audience (시청각 장애관객 및 일반 관객을 위한 오디얼, 비주얼 정보 전달 시스템 연구)

  • Lee, Dong-Hun; Jang, Tae-Soo; Shin, Ho
    • 한국HCI학회:학술대회논문집 / 2009.02a / pp.756-762 / 2009
  • Growing interest in the performing arts as leisure and culture has brought a quantitative increase in public venues. This reflects an effort to bring cultural benefits both to people with disabilities and to general audience members who want detailed information. However, few technologies and services exist to help them. A method that uses audial and visual information transfer technology is therefore needed to overcome this limitation. This study presents a method that applies audial and visual transfer technology in a way that is useful for both audience members with disabilities and the general audience.

An Efficient Content-based Retrieval System using High-Dimensional Index Structure Image Database (대규모 이미지 데이터베이스에서 고차원 색인 구조를 이용한 효율적인 내용 기반 검색 시스템)

  • Lee, Dong-Ho; Park, Ju-Hong; Jeong, Jin-Wan; Kim, Hyeong
    • Journal of KIISE:Software and Applications / v.26 no.1 / pp.52-65 / 1999
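  • Multimedia data such as images, video, and audio are large and unstructured compared with conventional plain-text data, which makes them difficult to search. This paper presents a content-based retrieval system that efficiently and quickly finds the images a user wants in a large image database. To this end, we propose a technique for efficiently extracting the feature vectors used in content-based retrieval from image data, together with a similarity measure, based on the wavelet transform, which has recently been widely used in signal analysis and image compression because of its many advantages. We then show that content-based queries and retrieval performed with this feature extraction method and similarity measure produce no false dismissals, that is, cases in which an object satisfying the search condition is mistakenly not retrieved. In addition, to support fast content-based retrieval in large image databases, we present an image indexing method based on the X-tree, which provides efficient indexing for high-dimensional data, and show through various experiments that it retrieves image data faster than conventional sequential search or R*-tree indexing. Finally, we demonstrate the retrieval effectiveness of the proposed content-based image retrieval system using the retrieval effectiveness measure proposed in QBIC.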
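
The "no false dismissals" property can be illustrated with a minimal sketch. It assumes an orthonormal 1-D Haar transform and Euclidean distance. The abstract does not name the wavelet or the similarity measure, so both are stand-ins, and the sample vectors are hypothetical.

```python
import math

def haar(signal):
    """Orthonormal 1-D Haar transform; len(signal) must be a power of two."""
    out = [float(v) for v in signal]
    n = len(out)
    while n > 1:
        half = n // 2
        avg = [(out[2 * i] + out[2 * i + 1]) / math.sqrt(2) for i in range(half)]
        dif = [(out[2 * i] - out[2 * i + 1]) / math.sqrt(2) for i in range(half)]
        out[:n] = avg + dif      # coarse averages first, then details
        n = half
    return out

def dist(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

a = [9, 7, 3, 5, 6, 10, 2, 6]
b = [8, 8, 4, 4, 7, 9, 1, 7]

full = dist(a, b)                         # distance in the original space
reduced = dist(haar(a)[:4], haar(b)[:4])  # distance on 4 coarse coefficients
```

Because the transform is orthonormal, it preserves Euclidean distance, and dropping coefficients can only shrink it. Any object within a query radius in the original space is therefore also within that radius in the reduced feature space, so an index built on the truncated features returns a superset of the true answers that can then be refined.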

A Study on the Error Correction Algorithm for Digital Audio Systems (디지탈 오디오 시스템에서의 오류정정 알고리듬에 관한 연구)

  • Jun, Kyong-Il; Kim, Nam-Wook; Kim, Yong-Deak
    • Journal of the Korean Institute of Telematics and Electronics / v.26 no.7 / pp.90-97 / 1989
  • In this paper, we form a two-dimensional codeword, called a doubly-encoded code, from two Reed-Solomon codes, C1(32, 28) with minimum distance 5 and C2(32, 26) with minimum distance 7, and simulate the error-correcting process on a computer using a modeled R-DAT (Rotary-head Digital Audio Tape) system. As a result, the error rate per symbol decreased by about 0.05, and the simulation validated newly developed digital signal processing techniques such as error correction using the Berlekamp-Massey algorithm in the frequency domain.
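
The minimum distances quoted for C1 and C2 follow from the fact that Reed-Solomon codes meet the Singleton bound, d = n - k + 1. The short sketch below checks those numbers and derives the correction capability of each code. The helper name `rs_capability` is hypothetical, and the decoding strategy across the two codes is not taken from the paper.

```python
def rs_capability(n, k):
    """Return (minimum distance, correctable errors, correctable erasures)
    for an (n, k) Reed-Solomon code."""
    d_min = n - k + 1              # Singleton bound, met with equality by RS codes
    errors = (d_min - 1) // 2      # symbol errors at unknown positions
    erasures = d_min - 1           # symbol errors at known positions
    return d_min, errors, erasures

c1 = rs_capability(32, 28)   # (5, 2, 4)
c2 = rs_capability(32, 26)   # (7, 3, 6)
```

C1 can therefore correct up to 2 symbol errors per codeword and C2 up to 3, or up to 4 and 6 erasures respectively when the error positions are known.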