• Title/Summary/Keyword: Digital audio


An Audio Watermarking Technique Using BPSK with Variable Carrier Frequency (가변 반송파 BPSK를 이용한 오디오 워터마킹 기법)

  • 이형욱;박세형;문용민;한상우;신재호
    • Proceedings of the IEEK Conference / 2000.06d / pp.110-113 / 2000
  • In this paper, we consider digital audio watermarking that is robust to compression and does not require the original audio data. We specifically address audio watermarking using BPSK with a variable carrier frequency. This technique makes the watermark embedded in the audio data robust against compression attacks such as MPEG and AC-3.

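As a rough illustration of the modulation named in this abstract, the sketch below maps watermark bits to a BPSK waveform, adds it to a host signal at low amplitude, and recovers the bits by correlating with the known carrier (no original audio needed). The carrier frequency is fixed here; the paper's rule for varying the carrier is not described in the abstract and is not reproduced. All function names, the embedding strength, and the bit duration are hypothetical choices for this example.

```python
import math


def bpsk_waveform(bits, carrier_hz, sample_rate, samples_per_bit):
    """Map each bit to a carrier burst with phase 0 (bit 1) or pi (bit 0)."""
    wave = []
    for i, bit in enumerate(bits):
        sign = 1.0 if bit else -1.0  # a pi phase shift equals a sign flip
        for n in range(samples_per_bit):
            t = (i * samples_per_bit + n) / sample_rate
            wave.append(sign * math.sin(2.0 * math.pi * carrier_hz * t))
    return wave


def embed_bpsk(host, bits, carrier_hz, sample_rate, samples_per_bit, strength):
    """Add the scaled BPSK waveform to the start of the host signal."""
    mark = bpsk_waveform(bits, carrier_hz, sample_rate, samples_per_bit)
    if len(mark) > len(host):
        raise ValueError("host signal is too short for the watermark")
    marked = list(host)
    for i, m in enumerate(mark):
        marked[i] += strength * m
    return marked


def detect_bpsk(signal, n_bits, carrier_hz, sample_rate, samples_per_bit):
    """Recover bits by correlating each bit interval with the known carrier."""
    bits = []
    for i in range(n_bits):
        acc = 0.0
        for n in range(samples_per_bit):
            idx = i * samples_per_bit + n
            t = idx / sample_rate
            acc += signal[idx] * math.sin(2.0 * math.pi * carrier_hz * t)
        bits.append(1 if acc > 0 else 0)  # sign of the correlation decides the bit
    return bits
```

Correlation detection of this kind works best when the host has little energy at the carrier frequency over each bit interval; a practical scheme would also need synchronization and a perceptual limit on the embedding strength, which this sketch leaves out.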

Development of a Digital Down-mixer to Convert 5.1 Channel Audio Signals to Stereo Signals (5.1 채널 오디오 신호를 스테레오 신호로 변환하는 디지털 다운믹서 개발)

  • Jeon, Kwang-Sub;Cheong, Ho-Yong;Lee, Seung-Yo
    • The Transactions of The Korean Institute of Electrical Engineers / v.62 no.12 / pp.1764-1770 / 2013
  • The 5.1-channel audio signals used in television systems are unsuitable for radio broadcasting, which uses stereo audio. An audio down-mixer is therefore needed to convert 5.1 multi-channel audio signals to stereo signals for radio broadcasting. This paper describes the development of such a down-mixer. As mixer inputs, the developed down-mixer accepts either audio signals separated from video signals that include sound, or individual signals supplied over three AES/EBU channels carrying the Left (L), Right (R), Left Surround (Ls), Right Surround (Rs), Center (C), and Low Frequency Effect (Lfe) sounds.
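To make the conversion concrete, here is a minimal sketch of a 5.1-to-stereo down-mix for one sample frame. It uses the widely cited convention of adding the center and surround channels at about -3 dB (a gain of 1/sqrt(2)) and omitting the LFE channel by default. The abstract does not state the coefficients used in the developed device, so the gains, the channel order (L, R, C, LFE, Ls, Rs), and the function name are assumptions for illustration.

```python
import math


def downmix_51_to_stereo(frame, center_gain=math.sqrt(0.5),
                         surround_gain=math.sqrt(0.5), lfe_gain=0.0,
                         normalize=True):
    """Down-mix one 5.1 frame (L, R, C, LFE, Ls, Rs) to a stereo pair (Lo, Ro)."""
    l, r, c, lfe, ls, rs = frame  # assumed channel order
    lo = l + center_gain * c + surround_gain * ls + lfe_gain * lfe
    ro = r + center_gain * c + surround_gain * rs + lfe_gain * lfe
    if normalize:
        # Scale so that full-scale input on every channel cannot exceed full scale.
        scale = 1.0 / (1.0 + center_gain + surround_gain + lfe_gain)
        lo *= scale
        ro *= scale
    return lo, ro
```

Normalizing trades overall loudness for guaranteed headroom; a real down-mixer might instead apply a limiter or a fixed attenuation.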

Development of Integrated Mixer Controller for Digital Public Address (디지털전관방송을 위한 통합믹서컨트롤러 개발)

  • Cho, Juphil;Kim, Kwan-Woong;Kim, Daeik
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.17 no.1 / pp.19-24 / 2017
  • With the advancement of IT, innovative products that combine IT with public address (PA) systems are being developed. In this paper, we present a hybrid mixer controller for digital PA systems. We developed an integrated mixer controller that combines the digital mixer of an existing digital PA system with the functions of a digital integrated controller. The developed controller provides a multichannel mixer with 16 audio input channels and 8 output channels, and it includes an equalizer, a matrix, and a limiter for processing digital audio signals. It also offers an internet connection for controlling the overall PA system and for remotely monitoring the mixer's processing status.
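The matrix and limiter stages mentioned in this abstract can be pictured with a small sketch: each output channel is a weighted sum of the input channels, followed by a clamp to a maximum level. The hard clip below is a deliberate simplification (practical limiters reduce gain smoothly with attack and release times), and the function name and gain layout are hypothetical, not taken from the paper.

```python
def mix_matrix(inputs, gains, limit=1.0):
    """Mix one sample frame: outputs[j] = sum_i gains[j][i] * inputs[i], then clip."""
    outputs = []
    for row in gains:
        if len(row) != len(inputs):
            raise ValueError("each gain row needs one entry per input channel")
        total = sum(g * x for g, x in zip(row, inputs))
        # Hard limiter: clamp the mixed sample to the range [-limit, +limit].
        outputs.append(max(-limit, min(limit, total)))
    return outputs
```

For the 16-input, 8-output configuration described above, `gains` would be a list of 8 rows with 16 gain values each.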

An Efficient Representation Method for ICLD with Robustness to Spectral Distortion

  • Beack, Seung-Kwon;Seo, Jeong-Il;Kang, Kyung-Ok;Hanh, Min-Soo
    • ETRI Journal / v.27 no.3 / pp.330-333 / 2005
  • The Inter-Channel Level Difference (ICLD) is a cue parameter used to estimate spectral information in binaural cue coding, which has recently been in the spotlight as a multichannel audio signal compression technique. Even though the ICLD is an essential parameter, it is generally distorted by quantization. In this paper, a new modified ICLD representation method that minimizes the quantization distortion is proposed by adopting a flexible determination of the reference channel and unidirectional quantization. Our experimental results confirm that the proposed method improves the multichannel audio output quality even at a reduced bit rate.

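For readers unfamiliar with the parameter, the sketch below computes an ICLD as the power ratio between a channel and a reference channel, expressed in dB, and then applies a uniform quantizer to show where quantization distortion enters. It works on whole time-domain blocks for brevity, whereas a codec would compute the value per sub-band. The 1.5 dB step size and all function names are assumptions; the paper's reference-channel selection and unidirectional quantizer are not reproduced here.

```python
import math


def mean_power(samples):
    """Average power of a block of samples."""
    return sum(x * x for x in samples) / len(samples)


def icld_db(channel, reference, eps=1e-12):
    """Level difference of `channel` relative to `reference`, in dB."""
    return 10.0 * math.log10((mean_power(channel) + eps) /
                             (mean_power(reference) + eps))


def icld_all(channels, ref_index):
    """ICLD of every channel against the channel at `ref_index`."""
    reference = channels[ref_index]
    return [icld_db(ch, reference) for ch in channels]


def quantize_db(value_db, step_db=1.5):
    """Uniform quantizer; the error is at most half a step."""
    return round(value_db / step_db) * step_db
```

One observation that follows directly from the definition: if the reference is the channel with the highest power, every ICLD is zero or negative, so only one sign needs to be represented.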

DTV Lip-Sync Test Using Embedded Audio-Video Time Indexed Signals (숨겨진 오디오 비디오 시간 인덱스 신호를 사용한 DTV 립싱크 테스트)

  • 한찬호;송규익
    • Journal of the Institute of Electronics Engineers of Korea SP / v.41 no.3 / pp.155-162 / 2004
  • This paper concentrates on a lip synchronization (lip sync) test for DTV with respect to audio and video signals using a finite digital bitstream. We propose a new lip sync test method that does not affect the current program, using transient effect area test signals (TATS) and audio-video time index lip sync test signals (TILS). The experimental results show that the time difference between the audio and video signals can be easily measured at any time from a captured oscilloscope waveform.

Digital Watermarking Using Psychoacoustic Model

  • Poomdaeng, S.;Toomnark, S.;Amornraksa, T.
    • Proceedings of the IEEK Conference / 2002.07b / pp.872-875 / 2002
  • A digital watermarking technique that applies a psychoacoustic model to audio signals is proposed in this paper. In the watermarking scheme, a pseudo-random bit stream used as the watermark signal is embedded into the audio signal, for both speech and music. The strength of the embedded signal is set according to the human auditory system so that the disturbances to the host audio signal are inaudible to human ears. The experimental results show that the quality of the watermarked audio signal, in terms of signal-to-noise ratio, can be improved by up to 3.2 dB.

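The quality figure quoted in this abstract is a signal-to-noise ratio, where the "noise" is the difference the watermark introduces into the host signal. A minimal sketch of that measurement follows; the function name is hypothetical and the abstract does not say how the authors windowed or averaged their measurement.

```python
import math


def snr_db(host, marked):
    """SNR in dB, treating (marked - host) as the noise introduced by embedding."""
    if len(host) != len(marked):
        raise ValueError("signals must have the same length")
    signal_power = sum(x * x for x in host)
    noise_power = sum((y - x) ** 2 for x, y in zip(host, marked))
    if noise_power == 0.0:
        return math.inf  # identical signals: no embedding distortion
    return 10.0 * math.log10(signal_power / noise_power)
```

On this scale, a 3 dB improvement corresponds to roughly halving the embedding noise power for the same host signal.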

Reversible Watermarking for Audio Using Recompression Method (재압축 기술을 이용한 오디오 파일에서의 가역 정보은닉)

  • Whang, Ho Young;Kim, Hyoung Joong
    • Journal of Digital Contents Society / v.14 no.2 / pp.199-206 / 2013
  • Various data compression methods have been developed to handle data within limited storage capacity and transmission speed. Recompression, one of the most recent of these technologies, can embed data regardless of the information entropy of the data. It separates the original multimedia data into blocks and embeds a 0 or 1 according to whether each block is flipped. In this paper, this technology is applied to audio files to implement reversible watermarking for audio.

CoNSIST : Consist of New methodologies on AASIST, leveraging Squeeze-and-Excitation, Positional Encoding, and Re-formulated HS-GAL

  • Jae-Hoon Ha;Joo-Won Mun;Sang-Yup Lee
    • Annual Conference of KIPS / 2024.05a / pp.692-695 / 2024
  • With the recent advancements in artificial intelligence (AI), the performance of deep learning-based audio deepfake technology has significantly improved. This technology has been exploited for criminal activities, leading to various cases of victimization. To prevent such illicit outcomes, this paper proposes CoNSIST, an improved deep learning-based audio deepfake detection model that incorporates three additional components into the graph-based end-to-end model AASIST: (i) Squeeze and Excitation, (ii) Positional Encoding, and (iii) Reformulated HS-GAL. This incorporation is expected to enable more effective feature extraction, elimination of unnecessary operations, and consideration of more diverse information, thereby improving the performance of the original AASIST. The results of multiple experiments indicate that CoNSIST improves audio deepfake detection performance compared to existing models.

A Color Image Watermarking Method for Embedding Audio Signal

  • Kim Sang Jin;Kim Chung Hwa
    • Proceedings of the IEEK Conference / 2004.08c / pp.631-635 / 2004
  • The rapid development of digital media and communication networks has created an urgent need for data certification technology to protect intellectual property rights (IPR). This paper proposes a new watermarking method that embeds the content owner's audio signal in order to protect the IPR of color images. Because this method extends the existing static model and embeds a large amount of audio data, it has the advantage of being able to restore a signal that has been transformed by attacks. The watermarking has three basic stages: 1) encode the owner's analogue ID audio signal using PCM and create a new 3D audio watermark; 2) interleave the 3D audio watermark by linear bit expansion; and 3) apply a wavelet transform to the Y signal of the color image and embed the interleaved audio watermark in the low-frequency band of the transform domain. The results demonstrate that the proposed embedding of an audio signal in a color image enhances robustness against lossy JPEG compression, standard image compression, and image cropping and rotation, which remove part of the image.


A Proposal for High-Resolution Encoding System with Backward Compatibility in CDDA (상용 CDDA와 하위 호환성을 가지는 고해상도 부호화방석의 제안)

  • Moon, Dong-Wook;Kim, Lark-Kyo;Nam, Moon-Hyun
    • Proceedings of the KIEE Conference / 2004.11c / pp.150-152 / 2004
  • The conventional CDDA (Compact Disc Digital Audio) system is limited by its sampling frequency and quantization bit depth, 44.1 kHz and 16 bits respectively. New media such as DVD-Audio have therefore been developed for high-resolution audio recording. However, CDDA is still a widely used medium for high-fidelity audio, because the new media are complex and difficult to manufacture. In this paper, we design a new encoding system for high-resolution audio signals that is backward compatible with conventional CDDA. By evaluating the encoding and decoding processes, we describe the practicability of the proposed system.

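The limitations cited in this abstract follow from simple arithmetic on the CDDA parameters. The sketch below computes the highest representable frequency (half the sampling rate), the raw PCM bit rate, and the theoretical signal-to-quantization-noise ratio of an ideal uniform quantizer driven by a full-scale sine (about 6.02 dB per bit plus 1.76 dB). The function name is hypothetical, and the two-channel count is the standard CDDA stereo layout rather than something stated in the abstract.

```python
import math


def pcm_limits(sample_rate_hz, bits, channels):
    """Basic limits of a linear PCM format."""
    return {
        # Highest frequency representable without aliasing.
        "nyquist_hz": sample_rate_hz / 2.0,
        # Raw audio payload rate, before any error-correction overhead.
        "bit_rate_bps": sample_rate_hz * bits * channels,
        # Ideal SQNR for a full-scale sine: 20*log10(2^bits) + 10*log10(1.5).
        "sqnr_db": 20.0 * math.log10(2 ** bits) + 10.0 * math.log10(1.5),
    }


cdda = pcm_limits(44100, 16, 2)
```

Calling the same function with, for example, 96000 Hz and 24 bits shows how much headroom a higher-resolution format adds over CDDA in both bandwidth and quantization noise.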