• Title/Summary/Keyword: 음원 탐색

Search Result 28, Processing Time 0.025 seconds

Investigation of Timbre-related Music Feature Learning using Separated Vocal Signals (분리된 보컬을 활용한 음색기반 음악 특성 탐색 연구)

  • Lee, Seungjin
    • Journal of Broadcast Engineering
    • /
    • v.24 no.6
    • /
    • pp.1024-1034
    • /
    • 2019
  • Preference for music is determined by a variety of factors, and identifying characteristics that reflect specific factors is important for music recommendations. In this paper, we propose a method to extract the singing voice related music features reflecting various musical characteristics by using a model learned for singer identification. The model can be trained using a music source containing a background accompaniment, but it may provide degraded singer identification performance. In order to mitigate this problem, this study performs a preliminary work to separate the background accompaniment, and creates a data set composed of separated vocals by using the proven model structure that appeared in SiSEC, Signal Separation and Evaluation Campaign. Finally, we use the separated vocals to discover the singing voice related music features that reflect the singer's voice. We compare the effects of source separation against existing methods that use music source without source separation.

Drone Location Tracking with Circular Microphone Array by HMM (HMM에 의한 원형 마이크로폰 어레이 적용 드론 위치 추적)

  • Jeong, HyoungChan;Lim, WonHo;Guo, Junfeng;Ahmad, Isitiaq;Chang, KyungHi
    • Journal of Advanced Navigation Technology
    • /
    • v.24 no.5
    • /
    • pp.393-407
    • /
    • 2020
  • In order to reduce the threat by illegal unmanned aerial vehicles, a tracking system based on sound was implemented. There are three main points to the drone acoustic tracking method. First, it scans the space through variable beam formation to find a sound source and records the sound using a microphone array. Second, it classifies it into a hidden Markov model (HMM) to find out whether the sound source exists or not, and finally, the sound source is In the case of a drone, a sound source recorded and stored as a tracking reference signal based on an adaptive beam pattern is used. The simulation was performed in both the ideal condition without background noise and interference sound and the non-ideal condition with background noise and interference sound, and evaluated the tracking performance of illegal drones. The drone tracking system designed the criteria for determining the presence or absence of a drone according to the improvement of the search distance performance according to the microphone array performance and the degree of sound pattern matching, and reflected in the design of the speech reading circuit.

Source finding in reflection and refraction environment using based on ray tracing method TRM (음선 추적법 기반 TRM을 이용한 반사 및 굴절 환경 속의 소음원 탐색에 대한 연구)

  • Moon, Sang Il;Lee, Jae Hyung;Choi, Jong Soo
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2014.10a
    • /
    • pp.727-732
    • /
    • 2014
  • The goal is to find the position of the sound source with the TRM at reflections and refractions environment. The Fermat's principle applied to the ray tracing method are expected to follow the acoustic path in order to obtain acoustic distance and transmittance to. Utilizing them in the TRM was confirmed rear reflectance affect on estimated position, resolution and side lobe. And the TRM performance were superior to one of the beam forming techniques.

  • PDF

Adaptive depth control algorithm for sound tracing (사운드 트레이싱을 위한 적응형 깊이 조절 알고리즘)

  • Kim, Eunjae;Yun, Juwon;Chung, Woonam;Kim, Youngsik;Park, Woo-Chan
    • Journal of the Korea Computer Graphics Society
    • /
    • v.24 no.5
    • /
    • pp.21-30
    • /
    • 2018
  • In this paper, we use Sound-tracing, a 3D sound technology based on ray-tracing that uses geometric method as auditory technology to enhance realism. The Sound-tracing is costly in the sound propagation stage. In order to reduce the sound propagation cost, we propose a method to calculate the average effective frame number of previous frames using the frame coherence property and to adjust the depth according to the space based on the calculated number. Experimental results show that the path loss rate is 0.72% and the traversal & Intersection test calculation amount is decreased by 85.13% and the frame rate is increased by 4.48% when the sound source is indoors, compared with the result of the case without depth control. When the sound source was outdoors, the path loss was 0% and the traversal & Intersection test calculation amount is decreased by 25.01% and the frame rate increased by 7.85%. This allowed the rendering performance to be increased while minimizing the path loss rate.

Musical Instrument Recognition for the Categorization of UCC Music Source (UCC 음원분류를 위한 연주악기 분류에 대한 연구)

  • Kwon, Soon-Il;Park, Wan-Joo
    • The KIPS Transactions:PartB
    • /
    • v.17B no.2
    • /
    • pp.107-114
    • /
    • 2010
  • A guitar, a piano, and a violin are popular musical instruments for User Created Contents(UCC). However the patterns of audio signal generated by a guitar and a piano are too similar to differentiate. The difference between two musical instruments can be found by analyzing the frequency variation per each band near signal peaks. The distribution of probability on the existence of signal peaks based on Cumulative Histogram were applied to musical instrument recognition. Experiments with statistical models of the frequency variation per each band near signal peaks showed the 14% improvement of musical instrument recognition.

A Study on TSIUVC Approximate-Synthesis Method using Least Mean Square and Frequency Division (주파수 분할 및 최소 자승법을 이용한 TSIUVC 근사합성법에 관한 연구)

  • 이시우
    • Journal of Korea Multimedia Society
    • /
    • v.6 no.3
    • /
    • pp.462-468
    • /
    • 2003
  • In a speech coding system using excitation source of voiced and unvoiced, it would be involved a distortion of speech quality in case coexist with a voiced and an unvoiced consonants in a frame. So, I propose TSIUVC(Transition Segment Including Unvoiced Consonant) searching and extraction method in order to uncoexistent with a voiced and unvoiced consonants in a frame. This paper present a new method of TSIUVC approximate-synthesis by using Least Mean Square and frequency band division. As a result, this method obtain a high quality approximation-synthesis waveforms within TSIUVC by using frequency information of 0.547KHz below and 2.813KHz above. The important thing is that the maximum error signal can be made with low distortion approximation-synthesis waveform within TSIUVC. This method has the capability of being applied to a new speech coding of Voiced/Silence/TSIUVC, speech analysis and speech synthesis.

  • PDF

Optimal Directivity Synthesis of Linear array Sources (선형배열음원의 최적 지향성합성)

  • Jeong, Eui-Cheol;Kim, Sang-Yun;Kim, On;Cho, Ki-Ryang
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.37 no.4A
    • /
    • pp.250-259
    • /
    • 2012
  • This paper compared and investigated the choice of optimal algorithm affects on the directivity synthesis of linear array in the satisfaction to the design specification of the desired directivity, convergence characteristic, and adaptability. Optimal algorithms use a quasi-Newton method(DFP and BFGS method) for realizing the desired directivity, used a quasi-ideal beam, steering beam, and a multi-beam, chosen as desired directivity. In the numerical result, this paper verified the effectiveness of the quasi-Newton method to the directivity synthesis, and offered a solving approach of occurred problems in the numerical simulation process.

A Study on 8kbps FBD-MPC Method Considering Low Bit Rate (Low Bit Rate을 고려한 8kbps FBD-MPC 방식에 관한 연구)

  • Lee, See-Woo
    • Journal of Digital Convergence
    • /
    • v.12 no.6
    • /
    • pp.271-276
    • /
    • 2014
  • In a speech coding system using excitation source of voiced and unvoiced, it would be involved a distortion of speech quality in case coexist with a voiced and unvoiced consonants in a frame. In this paper, I propose a method of 8kbps Multi-Pulse Speech Coding(FBD-MPC: Frequency Band Division MPC) by using TSIUVC(Transition Segment Including Unvoiced Consonant) searching, extraction and approximation-synthesis method in a frequency domain. I evaluate the 8kbps MPC and FBD-MPC. As a result, SNRseg of FBD-MPC was improved 0.5dB for female voice and 0.2dB for male voice respectively. Compared to the MPC, SNRseg of FBD-MPC has been improved that I was able to control the distortion of the speech waveform finally. And so, I expect to be able to this method for cellular phone and smart phone using excitation source of low bit rate.

High Resolution Wideband Local Polynomial Approximation Beamforming for Moving Sources (이동하는 음원에 적합한 고분해능 광대역 LPA 빔형성기법)

  • Park Do-Hyun;Park Gyu-Tae;Lee Jung-Hoon;Lee Su-Hvoung;Lee Kyun-Kyung
    • The Journal of the Acoustical Society of Korea
    • /
    • v.24 no.1
    • /
    • pp.1-10
    • /
    • 2005
  • This paper presents a wideband LPA (local polynomial approximation) beamforming algorithm that is appropriate for wideband moving sources. The Proposed wideband LPA algorithm adopts STMV (steered minimum variance) method that utilizes a steered covariance matrix obtained from multiple frequency components in one data snapshot, instead of multiple data snapshots in one frequency bin. The wideband LPA cost function is formed using STMV weight vector. The Proposed algorithm searches for the instantaneous DOA and angular velocity that maximize the wideband LPA cost function. resulting in a higher resolution performance than that of a DS LPA beamforming algorithm. Several simulations using artificial data and sea trial data are used to demonstrate the performance of the Proposed algorithm.

Possibility of Debt Financing by Korean Entertainment Companies : Case of SM Entertainment and YG Entertainment (한국 엔터테인먼트 기업의 부채금융 가능성 탐색 - SM엔터와 YG엔터 사례를 중심으로)

  • Kim, Daewon;Kim, Seongcheol
    • The Journal of the Korea Contents Association
    • /
    • v.14 no.10
    • /
    • pp.227-236
    • /
    • 2014
  • The purpose of this paper is to explore the possibility for Korean entertainment companies to enter into debt financing. In particular, this study focuses on the possibility of issuing corporate bond and the asset backed securities (ABS) by two leading entertainment companies in Korea: SM Entertainment (SM) and YG Entertainment (YG). Depth interview with specialists such as investment bankers (IB), bond brokers, and financial directors and executives in entertainment companies was done. The results show that IB's opinion on issuing corporate bonds by SM and YG is positive. However, they may need to meet four requirements including maintaining stable cash-flow, diversifying sales source, enhancing accounting and legal transparencies and verifying managerial capabilities. In addition, Psy' s 'Gangnam style', his global hit song, turns out to have high potential as a base asset for ABS.