• Title/Summary/Keyword: MUSIC Algorithm

Search Result 347, Processing Time 0.029 seconds

A Study on the Comparision of One-Dimensional Scattering Extraction Algorithms for Radar Target Identification (레이더 표적 구분을 위한 1차원 산란점 추출 기법 알고리즘들의 성능에 관한 비교 연구)

  • Jung, Ho-Ryung;Seo, Dong-Kyu;Kim, Kyung-Tae;Kim, Hyo-Tae
    • Proceedings of the Korea Electromagnetic Engineering Society Conference
    • /
    • 2003.11a
    • /
    • pp.193-197
    • /
    • 2003
  • Radar target identification can be achieved by using various radar signatures, such as one-dimensional(1-D) range profile, 2-D radar images, and 1-D or 2-D scattering centers on a target. In this letter, five 1-D scattering center extraction methods are discussed - TLS(Total Least Square)-Prony, Fast Root-MUSIC (Multiple Signal Classification), Matrix-Pencil, GEESE(GEneralized Eigenvalues utilizing Signal-subspace Eigenvalues), TLS-ESPRIT(Total Least Squares - Estimation of Signal Parameters via Rotational Invariance Technique), These methods are compared in the context of estimation accuracy as well as a computational efficiency using a noisy data. Finally these methods are applied to the target classification experiment with the measured data in the POSTECH compact range facility.

  • PDF

Quality Assessment and Predistortion Evaluation of the Multi-channel Audio Codec according to the bitrate changing (압축율 변화에 따른 멀티채널 오디오의 품질 및 Predistortion 의 영향 평가)

  • Cha, Kyung-Hwan;Jang, Dae-Young;Kim, Sung-Han;Kim, Chun-Duck
    • The Journal of the Acoustical Society of Korea
    • /
    • v.15 no.2
    • /
    • pp.55-60
    • /
    • 1996
  • This paper describes the subjective assessment of the multi-channel audio quality according to the bitrate changing and evaluates the predistortion effect to avoid the unmasked noise after matrixing/dematrxing process in transmission and regeneration of the multi-channel audio. The simulation is processed by the perceptual coding that is MPEG-2 Audio layer II algorithm. We evaluate the quality improvement about predistortion using or not by 384, 320, 256, 128kbps. As the result of the double blind subjective assessment, 5 Grade-Impairment Scale is scored under minus one to 320kbps and so audio quality is evaluated to be perceptible, but not annoying in 3/2 channel. The effect of the predistortion is improved one level in 128kbps and especially speech test material I better improved than music test materials.

  • PDF

Automated Recognition of Printed Music using Fuzzy ART Algorithm and Performance (퍼지 ART 알고리즘을 이용한 인쇄 악보의 자동 인식과 연주)

  • Kim, Mi-Jeong;Kim, Kwang-Baek;Park, Choong-Shik
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2008.10a
    • /
    • pp.410-414
    • /
    • 2008
  • 음악 연구에 따른 컴퓨터의 역할이 점차 중요한 비중을 차지함에 따라 보다 효과적인 악보 인식 방법이 요구된다. 기존의 악보 인식 방법에서는 특정 수정 프로그램에서 만든 악보만 그 프로그램에서 재수정과 재생이 가능하다는 단점이 있다. 본 논문에서는 이러한 단점을 보완하기 위하여 이미 작성 되어있는 악보들을 자동으로 인식하고 재생을 할 수 있는 방법을 제안한다. 제안된 악보 인식 방법은 수평 히스토그램을 이용하여 악보 이미지의 오선을 제거한 후, Grassfire 알고리즘을 적용하여 잡음을 제거하고 악보 구성 기호들을 추출한다. 추출된 악보 구성 기호들은 악보 구성 기호의 특징을 이용하여 음표와 쉼표, 그 외의 기호들로 분리한다. 분리된 음표 기호들은 박자마다 다른 음표 형태의 특징을 이용하여 다시 세밀하게 분리하고 쉼표와 그 외의 기호들은 퍼지 ART 알고리즘을 적용하여 인식한다. 인식된 악보 구성 기호들을 이용하여 각각 정보를 저장하고 향후에 악보 구성 기호에 해당하는 음의 재생을 용이하게 한다. 제안된 악보 인식 방법의 성능을 평가하기 위해 50장의 악보 영상을 대상으로 실험한 결과, 본 논문에서 제시한 악보 영상의 인식 방법이 실험을 통해서 효율적인 것을 확인하였다.

  • PDF

A Method of Color KANSEI Information Extraction in Video Data (비디오 데이터에서의 컬러 감성 정보 추출 방법)

  • Choi, Jun-Ho;Hwangi, Myung-Gwon;Choi, Chang;Kim, Pan-Koo
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2008.10a
    • /
    • pp.532-535
    • /
    • 2008
  • The requirement of Digital Culture Content(Movie, Music, Animation, Digital TV, Exhibition and etc.) is increasing so variety and quantity of content is also increasing. The Movie what majority of the digital Content is developing of technology and data. In the result, the efficient retrieval service has required and user want to use a recommendation engine and semantic retrieval methods through the recommendation system. Therefore, this paper will suggest analysing trait element of digital content data, building of retrieval technology, analysing and retrieval technology base on KANSEI vocabulary and etc. For the these, we made a extraction technology of trait element based on semantics and KANSEI processing algorithm based on color information.

  • PDF

Development of Audio Melody Extraction and Matching Engine for MIREX 2011 tasks

  • Song, Chai-Jong;Jang, Dalwon;Lee, Seok-Pil;Park, Hochong
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2012.07a
    • /
    • pp.164-166
    • /
    • 2012
  • In this paper, we proposed a method for extracting predominant melody of polyphonic music based on harmonic structure. Harmonic structure is an important feature parameter of monophonic signal that has spectral peaks at the integer multiples of its fundamental frequency. We extract all fundamental frequency candidates contained in the polyphonic signal by verifying the required condition of harmonic structure. Then, we combine those harmonic peaks corresponding to each extracted fundamental frequency and assign a rank to each after calculating its harmonic average energy. We run pitch tracking based on the rank of extracted fundamental frequency and continuity of fundamental frequency, and determine the predominant melody. For the query by singing/humming (QbSH) task, we proposed Dynamic Time Warping (DTW) based matching engine. Our system reduces false alarm by combining the distances of multiple DTW processes. To improve the performance, we introduced the asymmetric sense, pitch level compensation, and distance intransitiveness to DTW algorithm.

  • PDF

Angle-of-arrival Estimation fit for an Elliptical Scattering Channel in a Wireless Positioning (무선 위치 인식에서 타원형 산란 채널에 적합한 초광대역 신호 도착 방향 추정)

  • Lee, Yong-Up;Park, Joong-Hoo
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.33 no.11C
    • /
    • pp.949-954
    • /
    • 2008
  • An ultrawide band (UWB) signal model is proposed to estimate the angle-of-arrivals of the signals arrived in clusters at an UWB receiver for a short-range, high-speed, indoor wireless communication system in an elliptical scattering environment. And a new estimation technique is proposed by modifying the conventional MUSIC algorithm. By using this estimation technique, the estimates of the two unknown parameter sets, angle-of-arrivals and distribution parameters, are obtained with the proposed UWB signal model. The proposed UWB signal model and estimation technique are verified through computer simulations in an ultrawide band communication environment.

Signal-Space Jamming Scheme for Disturbing Target Localization of Bistatic MIMO Radar System (바이스태틱 MIMO 레이다 시스템의 위치탐지 무력화를 위한 신호공간 재밍 기법)

  • Yeo, Kwanggoo;Chung, Wonzoo
    • The Journal of Korean Institute of Electromagnetic Engineering and Science
    • /
    • v.29 no.11
    • /
    • pp.878-883
    • /
    • 2018
  • A jamming design scheme to disturb target position estimation of a bistatic multiple-input multiple-output(MIMO) radar system is presented. The proposed method exploits the received signals from distributed multiple electronic sensors and combines them to produce a jamming signal. The proposed algorithm can eliminate the target by transmitting the delayed sum or the weighted sum of the received senor signals. Simulation results confirm the performance of the proposed method.

Social Network-based Hybrid Collaborative Filtering using Genetic Algorithms (유전자 알고리즘을 활용한 소셜네트워크 기반 하이브리드 협업필터링)

  • Noh, Heeryong;Choi, Seulbi;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.2
    • /
    • pp.19-38
    • /
    • 2017
  • Collaborative filtering (CF) algorithm has been popularly used for implementing recommender systems. Until now, there have been many prior studies to improve the accuracy of CF. Among them, some recent studies adopt 'hybrid recommendation approach', which enhances the performance of conventional CF by using additional information. In this research, we propose a new hybrid recommender system which fuses CF and the results from the social network analysis on trust and distrust relationship networks among users to enhance prediction accuracy. The proposed algorithm of our study is based on memory-based CF. But, when calculating the similarity between users in CF, our proposed algorithm considers not only the correlation of the users' numeric rating patterns, but also the users' in-degree centrality values derived from trust and distrust relationship networks. In specific, it is designed to amplify the similarity between a target user and his or her neighbor when the neighbor has higher in-degree centrality in the trust relationship network. Also, it attenuates the similarity between a target user and his or her neighbor when the neighbor has higher in-degree centrality in the distrust relationship network. Our proposed algorithm considers four (4) types of user relationships - direct trust, indirect trust, direct distrust, and indirect distrust - in total. And, it uses four adjusting coefficients, which adjusts the level of amplification / attenuation for in-degree centrality values derived from direct / indirect trust and distrust relationship networks. To determine optimal adjusting coefficients, genetic algorithms (GA) has been adopted. Under this background, we named our proposed algorithm as SNACF-GA (Social Network Analysis - based CF using GA). To validate the performance of the SNACF-GA, we used a real-world data set which is called 'Extended Epinions dataset' provided by 'trustlet.org'. It is the data set contains user responses (rating scores and reviews) after purchasing specific items (e.g. car, movie, music, book) as well as trust / distrust relationship information indicating whom to trust or distrust between users. The experimental system was basically developed using Microsoft Visual Basic for Applications (VBA), but we also used UCINET 6 for calculating the in-degree centrality of trust / distrust relationship networks. In addition, we used Palisade Software's Evolver, which is a commercial software implements genetic algorithm. To examine the effectiveness of our proposed system more precisely, we adopted two comparison models. The first comparison model is conventional CF. It only uses users' explicit numeric ratings when calculating the similarities between users. That is, it does not consider trust / distrust relationship between users at all. The second comparison model is SNACF (Social Network Analysis - based CF). SNACF differs from the proposed algorithm SNACF-GA in that it considers only direct trust / distrust relationships. It also does not use GA optimization. The performances of the proposed algorithm and comparison models were evaluated by using average MAE (mean absolute error). Experimental result showed that the optimal adjusting coefficients for direct trust, indirect trust, direct distrust, indirect distrust were 0, 1.4287, 1.5, 0.4615 each. This implies that distrust relationships between users are more important than trust ones in recommender systems. From the perspective of recommendation accuracy, SNACF-GA (Avg. MAE = 0.111943), the proposed algorithm which reflects both direct and indirect trust / distrust relationships information, was found to greatly outperform a conventional CF (Avg. MAE = 0.112638). Also, the algorithm showed better recommendation accuracy than the SNACF (Avg. MAE = 0.112209). To confirm whether these differences are statistically significant or not, we applied paired samples t-test. The results from the paired samples t-test presented that the difference between SNACF-GA and conventional CF was statistical significant at the 1% significance level, and the difference between SNACF-GA and SNACF was statistical significant at the 5%. Our study found that the trust/distrust relationship can be important information for improving performance of recommendation algorithms. Especially, distrust relationship information was found to have a greater impact on the performance improvement of CF. This implies that we need to have more attention on distrust (negative) relationships rather than trust (positive) ones when tracking and managing social relationships between users.

Long Term Average Spectrum Characteristics of Speaking Voice of Western Operatic Singers (Long Term Average Spectrum을 이용한 성악가들의 Speaking Voice 분석)

  • Lee, Kyung-Chul;Hong, Seok-Jin;Jin, Sung-Min
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.15 no.2
    • /
    • pp.122-127
    • /
    • 2004
  • Background and Objectives : Many studies have described and analyzed singer's formant and it has been shown that the epilaryngeal tube in the human airway is responsible for vocal ring, or the singer's formant. A similar phenomenon produced by trained singers in their speech led some authors to examine the speaker's ring. This study was designed to analyze the speaking voice of the singers and speaker's ring. Baterials and Methods : Ten tenors, fifteen baritones, fifteen sopranos and ten mezzo sopranos attending the music college, department of vocal music were chosen for this study. Fifteen male and fifteen female untrained normal speakers were chosen for control group. Each subject was asked to produce a sample of a sustained spoken vowel /ah/ sound for at least five seconds and read sentence 'Kaeul'. The sound data was analyzed using the Fast Fourier Transform(FFT) - based power spectrum, Long term average(LTA) power spectrum using the FFT algorithm of the Computerized Speech Lab(CSL, Kay elemetrics, Model 4300B, USA). Statistical analysis was performed using the Mann-Whitney test of the Statistical Package for Social Sciences(SPSS). Results : For LTA Power spectrum of/ah/ sound, a significant increase was seen in the 2,500-3,500Hz region(p<0.01) in four trained singer group compared with untrained speaker group, and a significant increase in the 9,000-10,000Hz region(p<0.01) in soparano group. Similarly, in sentence 'Kaeul', there was a significant increase in energy in the tenor, baritone, mezzo soprano group compared with the untrained speaker group in the 2,500-3,500Hz region(p<0.01), and a significant increase in all frequency region(p<0.01) in the soprano group. Conclusions : The LTA power spectrum suggests that trained singers group show more energy concentration in the 'singer's formant' region in the speaking voice, and authors believe this region to be the 'speaker's ring'. Further research is needed on the effect of singing training on the resonance of the speaking voice.

  • PDF

Long Term Average Spectrum Characteristics of Head and Chest Register Sounds of Western Operatic Singers : Extended Study (성악다들의 목소리에 대한 Long Term Average Spectrum 분석 -$2^{nd}$ Singer's Formant의 존재 가능성에 대하여-)

  • Ban, Jae-Ho;Kwon, Young-Kyung;Jin, Sung-Min
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.15 no.1
    • /
    • pp.31-36
    • /
    • 2004
  • Background and Objectives : It has been shown that the epilaryngeal tube in the human airway is responsible for vocal ring, or the singer's formant. In previous study, authors showed that in trained tenors, besides the conventional singer's formant in the region of ,5500Hz, another energy peak was observed in the region of 8,000Hz. This peak was interpreted as the second resonance of the epilarynx tube. Singers in other voice categories who produce vocal ring are assumed to have the same peak, but no measurements have as yet been made. Materials and Methods : Fifteen tenors, fourteen baritones, seven sopranos and five mezzo sopranos attending the music college, department of vocal music who could reliably produce the head and chest registers were chosen for this study. Each subject was asked to produce an/ah/sound for at least three seconds for the head register sound(tenors ; G4, barions ; E4 sopranos ; F5 and mezzosopranos ; C5) and for the chest register sound (tenors ; C3, baritones ; D3, sopranos ; D4 and Mezzosoprano ; A3). The sound data was analyzed using the Fast Fourier Transform (FFT)-based power spectrum, Long term average(LTA) power spectrum using the FFT algorithm of the Computerized Speech Lab (CSL, Kay elemetrics, Model 4300B, USA). Statistical analysis was performed using the Mann-Whitney test of the Statistical Package for Social sciences(SPSS). Results : For head register sounds, a significant increase was seen in the 2,200-3,400Hz region(p<0.05) and the Similar to the head register sounds, there was a significant increase in energy in the four trained singer group compared with the untrained group in the 2,200-3,100Hz region(p<0.05), the 7,800-8,400Hz region(p<0.05) for the chest register sounds. Conclusions : When good vocal production was made for the head and chest registers, an energy peak was observed near 2,500Hz, a frequency already known as the "singer's formant', in all subjects in the study group. Another region of increased energy was observed around 8,000Hz that had not been noticed previously. The authors believe this region to be the second singer's formant.

  • PDF