• 제목/요약/키워드: Audio Comparison

검색결과 89건 처리시간 0.019초

Angle-Based Virtual Source Location Representation for Spatial Audio Coding

  • Beack, Seung-Kwon;Seo, Jeong-Il;Moon, Han-Gil;Kang, Kyeong-Ok;Hahn, Min-Soo
    • ETRI Journal
    • /
    • 제28권2호
    • /
    • pp.219-222
    • /
    • 2006
  • Virtual source location information (VSLI) has been newly utilized as a spatial cue for compact representation of multichannel audio. This information is represented as the azimuth of the virtual source vector. The superiority of VSLI is confirmed by comparison of the spectral distances, average bit rates, and subjective assessment with a conventional cue.

  • PDF

동영상 정보제공이 위내시경 대상자의 신체적 불편감, 불안 및 간호 만족도에 미치는 효과 (The Effects of Video-audio Information Provision on Physical Discomfort, Anxiety, and Nursing Satisfaction of the Clients for Gastroscopy)

  • 권영은;김분한
    • 성인간호학회지
    • /
    • 제25권2호
    • /
    • pp.231-239
    • /
    • 2013
  • Purpose: This study was conducted to identify the effects of video-audio information provision on physical discomfort, anxiety and nursing satisfaction of the clients for gastroscopy. Methods: The study design was nonequivalent control group pre-post test design. The subjects were 50 patients who visited H hospital health examination center for gastroscopy. Video-audio information developed by the authors was used as educational material for the treatment group. The data were collected between September 15 and November 15, 2010. The study instruments were the State-Trait Anxiety Inventory, the Physical Discomfort Scale, and the Nursing Satisfaction Scale. Results: The level of anxiety and physical discomfort in the treatment group were not significantly different from that in the comparison group (t=-0.28, p=.781; t=-0.34, p=.741). The level of clients' satisfaction with nursing care in the treatment group was significantly higher than in the comparison group (t=-4.12, p<.001). Conclusion: Use of video-audio information was effective in the increase in satisfaction with care. Therefore, it could be useful in the nursing practice, and be utilized as a way of nursing intervention to improve nursing satisfaction.

다중 오디오 특징을 이용한 유해 동영상의 판별 (Classification of Phornographic Video with using the Features of Multiple Audio)

  • 김정수;정명범;성보경;권진만;구광효;고일주
    • 한국HCI학회:학술대회논문집
    • /
    • 한국HCI학회 2009년도 학술대회
    • /
    • pp.522-525
    • /
    • 2009
  • 본 논문에서는 인터넷의 역기능으로 현대 사회에 큰 문제를 야기 시키는 음란성 유해 동영상을 내용기반으로 판별하기 위한 방법을 제안하였다. 유해 동영상에서 오디오 데이터를 이용하여 특징을 추출하였다. 사용된 오디오 특징은 주파수 스펙트럼, 자기상관, MFCC이다. 음란성의 내용이 될 수 있는 소리의 특징을 추출하였고 동영상 전체 오디오에서 해당 소리의 특징과 일치하는지를 측정하여 유해성을 판별하였다. 제안한 방법의 실험은 각 특징마다 유해 판별 측정 결과와 다중 특징을 이용한 측정 결과를 비교 수행하였다. 하나의 오디오 특징만을 추출하여 사용하였을 때 보다 다중 특징의 사용이 좋은 결과를 얻을 수 있었다.

  • PDF

Arithmetic unit를 사용한 저전력 MPEG audio필터 구현 (Low-power MPEG audio filter implementation using Arithmetic Unit)

  • 장영범;이원상
    • 대한전자공학회논문지SP
    • /
    • 제41권5호
    • /
    • pp.283-290
    • /
    • 2004
  • 이 논문에서는 MPEG audio 알고리즘의 필터뱅크를 덧셈을 사용하여 저전력으로 구현할 수 있는 구조를 제안하였다. 제안된 구조는 CSD(Canonic Signed Digit) 형의 계수를 사용하며, 입력신호 샘플을 최대로 공유함으로서 사용되는 덧셈기의 수를 최소화하였다. 제안된 구조는 알고리즘에서 사용된 공통입력 공유, 선형위상 대칭 필터계수를 이용한 공유, 공통입력을 이용한 블록 공유, CSD 형의 계수와 공통패턴 공유를 통하여 사용되는 덧셈의 수를 최소화할 수 있음을 보였다. Verilog-HDL 코딩을 통하여 시뮬레이션을 수행한 결과, 제안된 구조는 기존의 곱셈기 구조의 구현면적과 비교하여 60.3%를 감소시킬 수 있음을 보였다. 또한 제안된 구조의 전력소모는 곱셈기 구조와 비교하여 93.9%를 감소시킬 수 있음을 보였다. 따라서 고속의 곱셈기가 내장된 DSP 프로세서를 사용하지 않고도, Arithmetic Unit나 마이크로 프로세서를 사용하여 효과적으로 MPEG audio 필터뱅크를 구현할 수 있음을 보였다.

스피커의 특성을 고려한 음향 전력 증폭기 구동 방식의 비교: 전압 구동 방식과 전류 구동 방식 (Comparison of the Driving Modes of an Audio Power Amplifier Considering the Characteristics of the Loudspeaker: Voltage Drive vs. Current Drive)

  • 은창수;이유칠
    • 한국멀티미디어학회논문지
    • /
    • 제20권9호
    • /
    • pp.1551-1558
    • /
    • 2017
  • Audio power amplifiers have been designed based on the premise that the impedance of loudspeakers is fixed at nominal 4 ohms or 8 ohms. However, it is known that the impedance varies with frequency and takes on the nominal value at some limited frequencies. The principle of the loudspeaker operation reveals that the sound pressure produced by the loudspeaker is proportional to the current flowing in the voice coil, not the voltage between the two terminals. We take the characteristics of the loudspeaker into account and compare the frequency responses of the loudspeaker in voltage-drive mode and current-drive mode via computer simulations, to conclude that the audio amplifier drive mode should be re-considered in an effort to improve the sound quality.

Comparison between audio-only and audiovisual biofeedback for regulating patients' respiration during four-dimensional radiotherapy

  • Yu, Jesang;Choi, Ji Hoon;Ma, Sun Young;Jeung, Tae Sig;Lim, Sangwook
    • Radiation Oncology Journal
    • /
    • 제33권3호
    • /
    • pp.250-255
    • /
    • 2015
  • Purpose: To compare audio-only biofeedback to conventional audiovisual biofeedback for regulating patients' respiration during four-dimensional radiotherapy, limiting damage to healthy surrounding tissues caused by organ movement. Materials and Methods: Six healthy volunteers were assisted by audiovisual or audio-only biofeedback systems to regulate their respirations. Volunteers breathed through a mask developed for this study by following computer-generated guiding curves displayed on a screen, combined with instructional sounds. They then performed breathing following instructional sounds only. The guiding signals and the volunteers' respiratory signals were logged at 20 samples per second. Results: The standard deviations between the guiding and respiratory curves for the audiovisual and audio-only biofeedback systems were 21.55% and 23.19%, respectively; the average correlation coefficients were 0.9778 and 0.9756, respectively. The regularities between audiovisual and audio-only biofeedback for six volunteers' respirations were same statistically from the paired t-test. Conclusion: The difference between the audiovisual and audio-only biofeedback methods was not significant. Audio-only biofeedback has many advantages, as patients do not require a mask and can quickly adapt to this method in the clinic.

Robust Person Identification Using Optimal Reliability in Audio-Visual Information Fusion

  • Tariquzzaman, Md.;Kim, Jin-Young;Na, Seung-You;Choi, Seung-Ho
    • The Journal of the Acoustical Society of Korea
    • /
    • 제28권3E호
    • /
    • pp.109-117
    • /
    • 2009
  • Identity recognition in real environment with a reliable mode is a key issue in human computer interaction (HCI). In this paper, we present a robust person identification system considering score-based optimal reliability measure of audio-visual modalities. We propose an extension of the modified reliability function by introducing optimizing parameters for both of audio and visual modalities. For degradation of visual signals, we have applied JPEG compression to test images. In addition, for creating mismatch in between enrollment and test session, acoustic Babble noises and artificial illumination have been added to test audio and visual signals, respectively. Local PCA has been used on both modalities to reduce the dimension of feature vector. We have applied a swarm intelligence algorithm, i.e., particle swarm optimization for optimizing the modified convection function's optimizing parameters. The overall person identification experiments are performed using VidTimit DB. Experimental results show that our proposed optimal reliability measures have effectively enhanced the identification accuracy of 7.73% and 8.18% at different illumination direction to visual signal and consequent Babble noises to audio signal, respectively, in comparison with the best classifier system in the fusion system and maintained the modality reliability statistics in terms of its performance; it thus verified the consistency of the proposed extension.

Audio Steganography Method Using Least Significant Bit (LSB) Encoding Technique

  • Alarood, Alaa Abdulsalm;Alghamdi, Ahmed Mohammed;Alzahrani, Ahmed Omar;Alzahrani, Abdulrahman;Alsolami, Eesa
    • International Journal of Computer Science & Network Security
    • /
    • 제22권7호
    • /
    • pp.427-442
    • /
    • 2022
  • MP3 is one of the most widely used file formats for encoding and representing audio data. One of the reasons for this popularity is their significant ability to reduce audio file sizes in comparison to other encoding techniques. Additionally, other reasons also include ease of implementation, its availability and good technical support. Steganography is the art of shielding the communication between two parties from the eyes of attackers. In steganography, a secret message in the form of a copyright mark, concealed communication, or serial number can be embedded in an innocuous file (e.g., computer code, video film, or audio recording), making it impossible for the wrong party to access the hidden message during the exchange of data. This paper describes a new steganography algorithm for encoding secret messages in MP3 audio files using an improved least significant bit (LSB) technique with high embedding capacity. Test results obtained shows that the efficiency of this technique is higher compared to other LSB techniques.

오디오 신호를 위한 표본화율 변환 알고리듬 성능 비교 (A Performance Comparison of Sampling Rate Conversion Algorithms for Audio Signal)

  • 이용희;김인철
    • 방송공학회논문지
    • /
    • 제9권4호
    • /
    • pp.384-390
    • /
    • 2004
  • 본 논문에서는 44.1KHz에서 48KHz로 표본화 주파수를 변환하는 알고리듬들의 성능을 비교한다. 비교 한 기법은 다위상 구조로 구현된 기본적인 기법, sinc 함수를 이용한 기법, 다위상 구조의 다단계 구현 기법, 그리고 B-spline을 이용한 기법 등이다. 먼저, 공정한 비교를 위해 이 4가지 기법을 이용한 표본화율 변환기들을 고품질 조건하에 재설계하고, 이들의 H/W 복잡도를 메모리 요구량과 계산량 측면에서 비교하였다. 그 결과, 메모리 요구량 측면에서는 B-spline을 이용한 기법이 가장 우수하였지만, 계산량 측면에서는 기본적인 기법과 sinc 함수를 이용한 기법이 가장 우수함을 확인할 수 있었다.

오디오 신호를 위한 표본화율 변환 알고리듬 성능 비교 (A Performance Comparison of Sampling Rate Conversion Algorithms for Audio Signal)

  • 이용희;김인철
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2002년도 하계종합학술대회 논문집(4)
    • /
    • pp.187-190
    • /
    • 2002
  • 본 논문에서는 지금까지 소개된 44.1KHz compact disc (CD)에서 48KHz digital audio tape (DAT)로의 표본화율 변환기법들에 대해서 가청 주파수 대역에서 100dB 이상의 dynamic range와 ±5x10­4dB 이하의 리플 크기를 유지할 수 있도록 각 기법들을 재설계하였으며, 메모리 요구량 및 계산량에 대해서 살펴보고자한다.

  • PDF