• Title/Summary/Keyword: audio frequency

Search Result 376, Processing Time 0.022 seconds

Authentication of a smart phone user using audio frequency analysis (음향 주파수 분석을 이용한 스마트폰 사용자 인증)

  • Kim, Jin-Bok;Song, Jeong-Eun;Lee, Mun-Kyu
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.22 no.2
    • /
    • pp.327-336
    • /
    • 2012
  • In this paper, we propose user authentication methods using a microphone and a speaker in smart phones. The proposed methods guarantee that the user is located close to the target device by transmitting the challenge via an audio channel. We propose two authentication methods; user authentication for a PC or a website using a smart phone as a hardware token, and user authentication to log on to a smart phone using a PC as a token. Because our methods use typical peripheral devices such as a microphone and a speaker, they do not require any special-purpose hardware equipment. In addition, the elderly and the handicapped can easily use our methods because the methods are activated by simple operations.

A 3D Audio Core-Codec Employing an Improved Buffer Control Method (향상된 버퍼 제어 방법을 사용한 3D 오디오 핵심 부호화기)

  • Kim, Rin Chul
    • Journal of Broadcast Engineering
    • /
    • v.25 no.2
    • /
    • pp.233-241
    • /
    • 2020
  • In this paper, a new buffer control method is proposed for improving the performance of the frequency domain part of the 3D audio (3DA) core codec. For the proposed buffer control method, we first combine the 3DA RM9 with the 3GPP AAC buffer control method which includes the psychoacoustic model and rate-distortion control process with the spectral hole avoidance algorithm. Then, we revise the 3GPP buffer control method so as to achieve a faithful bit allocation to the frames with higher activity. With the MUSHRA test, we prove that the proposed buffer control method demonstrates better performance than the 3DA RM9 and 3GPP AAC.

A New Robust Acoustic Crosstalk Cancellation Method with Sum and Difference Filter in 3D Audio System (3차원 오디오 시스템에서 합과 차 여파기를 이용한 새로운 광대억 간섭신호 제거 방법)

  • 김래훈;임준석;성굉모
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.4
    • /
    • pp.17-21
    • /
    • 2001
  • There are some methods to enhance the ‘sweet spot’in loudspeaker-based 3D audio systems. Most of them can be only applied to narrow frequency band inherently. In this paper, we introduce the more robust 3D sound reproduction system which has far wider robust bandwidth. The system applies a sum and difference filter to the conventional three loudspeaker-based one.

  • PDF

Classification of Phornographic Videos Using Audio Information (오디오 신호를 이용한 음란 동영상 판별)

  • Kim, Bong-Wan;Choi, Dae-Lim;Bang, Man-Won;Lee, Yong-Ju
    • Proceedings of the KSPS conference
    • /
    • 2007.05a
    • /
    • pp.207-210
    • /
    • 2007
  • As the Internet is prevalent in our life, harmful contents have been increasing on the Internet, which has become a very serious problem. Among them, pornographic video is harmful as poison to our children. To prevent such an event, there are many filtering systems which are based on the keyword based methods or image based methods. The main purpose of this paper is to devise a system that classifies the pornographic videos based on the audio information. We use Mel-Cepstrum Modulation Energy (MCME) which is modulation energy calculated on the time trajectory of the Mel-Frequency cepstral coefficients (MFCC) and MFCC as the feature vector and Gaussian Mixture Model (GMM) as the classifier. With the experiments, the proposed system classified the 97.5% of pornographic data and 99.5% of non-pornographic data. We expect the proposed method can be used as a component of the more accurate classification system which uses video information and audio information simultaneously.

  • PDF

Automatic melody extraction algorithm using a convolutional neural network

  • Lee, Jongseol;Jang, Dalwon;Yoon, Kyoungro
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.11 no.12
    • /
    • pp.6038-6053
    • /
    • 2017
  • In this study, we propose an automatic melody extraction algorithm using deep learning. In this algorithm, feature images, generated using the energy of frequency band, are extracted from polyphonic audio files and a deep learning technique, a convolutional neural network (CNN), is applied on the feature images. In the training data, a short frame of polyphonic music is labeled as a musical note and a classifier based on CNN is learned in order to determine a pitch value of a short frame of audio signal. We want to build a novel structure of melody extraction, thus the proposed algorithm has a simple structure and instead of using various signal processing techniques for melody extraction, we use only a CNN to find a melody from a polyphonic audio. Despite of simple structure, the promising results are obtained in the experiments. Compared with state-of-the-art algorithms, the proposed algorithm did not give the best result, but comparable results were obtained and we believe they could be improved with the appropriate training data. In this paper, melody extraction and the proposed algorithm are introduced first, and the proposed algorithm is then further explained in detail. Finally, we present our experiment and the comparison of results follows.

A Study on the Car Audio Sound Quality Enhancement under Vehicle Noise and Its Subjective Evaluation (차량 주행소음을 고려한 자동차 오디오 음질 개선 및 주관적 음질평가 연구)

    • The Journal of the Acoustical Society of Korea
    • /
    • v.18 no.8
    • /
    • pp.108-115
    • /
    • 1999
  • In this study we suggested a digital filter method to enhance car audio sound quality against the sound distortion due to cabin's acoustic characteristics and car driving noises. The digital filters designed were based on the characteristics on car driving noises and cabin acoustic characteristics. Car driving noises were analyzed by two ways; one is an objective method, octave band frequency analysis method. The other is a subjective method; sensory evaluation method, NCB method. On these results, seven sets of modified coefficients of eleven band digital filters were obtained. To find optimum audio sound quality among nine sound samples filtered by designing seven types of digital filters, which were mixed car driving noises at 100km/h, subjective evaluation method was used, paired comparison method; Scheffe' seven point method.

  • PDF

Frequency Band Selection Exited Linear Prediction Wideband Speech/Audio Coding Using SBR (SBR을 이용한 주파수 밴드선택 여기 선형예측 광대역 음성/오디오 부호화)

  • Jang, Sunghoon;Lee, Insung
    • The Journal of the Acoustical Society of Korea
    • /
    • v.32 no.6
    • /
    • pp.556-562
    • /
    • 2013
  • This paper is aimed to improve performance of Band-Selection speech/audio Coder reconstucted band spectrum that is not sent by the comfort noise. To improve the performance, we use the Spectral Band Replication(SBR) technique instead of substitution of Comfort noise. To synthesize SBR signal, the SBR algorithm is referenced in selected signals and the spectrum synthesized by SBR is injected to non-selected band. Each sub-band spectrum has been energy-weighted by real audio signal. We propose the enhanced the Band-Selection Coder that utilizes synthesized SBR signal from selected signal instead of comfort noise.

Audio Source Separation Method Based on Beamspace-domain Multichannel Non-negative Matrix Factorization, Part I: Beamspace-domain Multichannel Non-negative Matrix Factorization system (빔공간-영역 다채널 비음수 행렬 분해 알고리즘을 이용한 음원 분리 기법 Part I: 빔공간-영역 다채널 비음수 행렬 분해 시스템)

  • Lee, Seok-Jin;Park, Sang-Ha;Sung, Koeng-Mo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.31 no.5
    • /
    • pp.317-331
    • /
    • 2012
  • In this paper, we develop a multichannel blind source separation algorithm based on a beamspace transform and the multichannel non-negative matrix factorization (NMF) method. The NMF algorithm is a famous algorithm which is used to solve the source separation problems. In this paper, we consider a beamspace-time-frequency domain data model for multichannel NMF method, and enhance the conventional method using a beamspace transform. Our decomposition algorithm is applied to audio source separation, using a dataset from the international Signal Separation Evaluation Campaign 2010 (SiSEC 2010) for evaluation.

Design and Implementation of Multimedia Retrieval a System (멀티미디어 검색 시스템의 설계 및 구현)

  • 노승민;황인준
    • Journal of KIISE:Databases
    • /
    • v.30 no.5
    • /
    • pp.494-506
    • /
    • 2003
  • Recently, explosive popularity of multimedia information has triggered the need for retrieving multimedia contents efficiently from the database including audio, video and images. In this paper, we propose an XML-based retrieval scheme and a data model that complement the weak aspects of annotation and conent based retrieval methods. The Property and hierarchy structure of image and video data are represented and manipulated based on the Multimedia Description Schema (MDS) that conforms to the MPEG-7 standard. For audio contents, pitch contours extracted from their acoustic features are converted into UDR string. Especially, to improve the retrieval performance, user's access pattern and frequency are utilized in the construction of an index. We have implemented a prototype system and evaluated its performance through various experiments.

A Performance Comparison of Sampling Rate Conversion Algorithms for Audio Signal (오디오 신호를 위한 표본화율 변환 알고리듬 성능 비교)

  • Lee Yong-Hee;Kim Rin-Chul
    • Journal of Broadcast Engineering
    • /
    • v.9 no.4 s.25
    • /
    • pp.384-390
    • /
    • 2004
  • In this paper we compare the performance of 4 different algorithms for converting the sampling frequency of an audio from 44.1KHz to 48KHz. The algorithms considered here include the basic polyphase method. sine function based method. multi-stage method. and B-spline based method. For a fair comparison, the sampling rate converters using the 4 algorithms are redesigned under a high fidelity condition. Then, their H/W complexities are compared in terms of the computational complexity and the memory size. As a result, it is shown that the basic polyphase method and sine function based method outperform the other two in terms of the computational complexity, while the B-spline based method requires less memory than the others.