• Title/Summary/Keyword: Music Algorithm


A Method of Color KANSEI Information Extraction in Video Data (비디오 데이터에서의 컬러 감성 정보 추출 방법)

  • Choi, Jun-Ho;Hwangi, Myung-Gwon;Choi, Chang;Kim, Pan-Koo
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference
    • /
    • 2008.10a
    • /
    • pp.532-535
    • /
    • 2008
  • Demand for digital culture content (movies, music, animation, digital TV, exhibitions, etc.) is increasing, and so are the variety and quantity of that content. Movies, which make up the majority of digital content, are growing in both technology and data volume. As a result, efficient retrieval services are required, and users want recommendation engines and semantic retrieval methods. This paper therefore proposes analysis of the trait elements of digital content data, construction of retrieval technology, and analysis and retrieval technology based on KANSEI vocabulary. To this end, we developed a semantics-based technique for extracting trait elements and a KANSEI processing algorithm based on color information.


Development of Audio Melody Extraction and Matching Engine for MIREX 2011 tasks

  • Song, Chai-Jong;Jang, Dalwon;Lee, Seok-Pil;Park, Hochong
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2012.07a
    • /
    • pp.164-166
    • /
    • 2012
  • In this paper, we propose a method for extracting the predominant melody of polyphonic music based on harmonic structure. Harmonic structure is an important feature of a monophonic signal, which has spectral peaks at integer multiples of its fundamental frequency. We extract all fundamental frequency candidates contained in the polyphonic signal by verifying the required condition of harmonic structure. We then combine the harmonic peaks corresponding to each extracted fundamental frequency and assign a rank to each after calculating its harmonic average energy. We run pitch tracking based on the rank and continuity of the extracted fundamental frequencies to determine the predominant melody. For the query by singing/humming (QbSH) task, we propose a Dynamic Time Warping (DTW) based matching engine. Our system reduces false alarms by combining the distances of multiple DTW processes. To improve performance, we introduce asymmetric sense, pitch level compensation, and distance intransitiveness into the DTW algorithm.
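As a rough illustration of the DTW matching idea mentioned in this abstract, the sketch below computes a classic DTW distance between two pitch sequences. It is not the authors' engine: the function names are hypothetical, the local cost is a plain absolute difference, and `level_compensated` (mean subtraction) is only an assumed, simple stand-in for the "pitch level compensation" the abstract names without defining.

```python
import numpy as np

def dtw_distance(query, reference):
    """Classic DTW distance between two 1-D sequences (absolute-difference local cost)."""
    n, m = len(query), len(reference)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(query[i - 1] - reference[j - 1])
            # Extend the cheapest of the three allowed predecessor paths.
            cost[i, j] = d + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    return float(cost[n, m])

def level_compensated(pitch):
    """ASSUMPTION: remove the mean pitch so a transposed query can still match."""
    pitch = np.asarray(pitch, dtype=float)
    return pitch - pitch.mean()

# Hypothetical data: a reference melody (MIDI note numbers) and a hummed query
# that is sung a few semitones higher with different timing.
melody = [60, 60, 62, 64, 64, 62, 60]
hummed = [65, 67, 67, 69, 69, 69, 67, 65]

raw = dtw_distance(hummed, melody)
compensated = dtw_distance(level_compensated(hummed), level_compensated(melody))
print(f"raw DTW distance: {raw:.2f}, after level compensation: {compensated:.2f}")
```

A real QbSH system would combine several such distances, as the abstract describes; the combination rule is not given there, so it is left out here.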


Angle-of-arrival Estimation fit for an Elliptical Scattering Channel in a Wireless Positioning (무선 위치 인식에서 타원형 산란 채널에 적합한 초광대역 신호 도착 방향 추정)

  • Lee, Yong-Up;Park, Joong-Hoo
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.33 no.11C
    • /
    • pp.949-954
    • /
    • 2008
  • An ultrawideband (UWB) signal model is proposed to estimate the angles of arrival of signals arriving in clusters at a UWB receiver for a short-range, high-speed, indoor wireless communication system in an elliptical scattering environment. A new estimation technique is also proposed by modifying the conventional MUSIC algorithm. With this technique, estimates of the two unknown parameter sets, the angles of arrival and the distribution parameters, are obtained from the proposed UWB signal model. The proposed signal model and estimation technique are verified through computer simulations in a UWB communication environment.
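For orientation, the conventional MUSIC algorithm that this abstract takes as its starting point can be sketched as follows. This is the textbook narrowband version for a uniform linear array, not the modified UWB estimator the paper proposes; the half-wavelength spacing, the simulated sources, and all function names are assumptions made for illustration.

```python
import numpy as np

def steering_vector(theta_deg, n_sensors, spacing=0.5):
    """Array response of a uniform linear array (spacing in wavelengths, assumed 0.5)."""
    k = np.arange(n_sensors)
    return np.exp(-2j * np.pi * spacing * k * np.sin(np.deg2rad(theta_deg)))

def music_spectrum(snapshots, n_sources, angles_deg, spacing=0.5):
    """MUSIC pseudospectrum from an (n_sensors x n_snapshots) data matrix."""
    n_sensors, n_snap = snapshots.shape
    R = snapshots @ snapshots.conj().T / n_snap       # sample covariance matrix
    _, eigvecs = np.linalg.eigh(R)                    # eigenvalues in ascending order
    noise = eigvecs[:, : n_sensors - n_sources]       # noise-subspace eigenvectors
    spectrum = np.empty(len(angles_deg))
    for i, theta in enumerate(angles_deg):
        a = steering_vector(theta, n_sensors, spacing)
        spectrum[i] = 1.0 / np.linalg.norm(noise.conj().T @ a) ** 2
    return spectrum

def pick_peaks(spectrum, angles_deg, n_sources):
    """Return the angles of the n_sources strongest local maxima."""
    angles_deg = np.asarray(angles_deg)
    idx = np.arange(1, len(spectrum) - 1)
    is_peak = (spectrum[idx] > spectrum[idx - 1]) & (spectrum[idx] > spectrum[idx + 1])
    peaks = idx[is_peak]
    strongest = peaks[np.argsort(spectrum[peaks])[-n_sources:]]
    return np.sort(angles_deg[strongest])

# Simulated example (hypothetical values): two uncorrelated sources plus noise.
rng = np.random.default_rng(0)
n_sensors, n_snap = 8, 200
true_angles = [-20.0, 30.0]
A = np.column_stack([steering_vector(t, n_sensors) for t in true_angles])
S = (rng.standard_normal((2, n_snap)) + 1j * rng.standard_normal((2, n_snap))) / np.sqrt(2)
N = 0.1 * (rng.standard_normal((n_sensors, n_snap)) + 1j * rng.standard_normal((n_sensors, n_snap)))
X = A @ S + N

grid = np.arange(-90.0, 90.5, 0.5)
estimated = pick_peaks(music_spectrum(X, 2, grid), grid, 2)
print("estimated angles of arrival (degrees):", estimated)
```

The clustered arrivals and distribution parameters described in the abstract would require a different signal model than the point sources assumed here.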

Signal-Space Jamming Scheme for Disturbing Target Localization of Bistatic MIMO Radar System (바이스태틱 MIMO 레이다 시스템의 위치탐지 무력화를 위한 신호공간 재밍 기법)

  • Yeo, Kwanggoo;Chung, Wonzoo
    • The Journal of Korean Institute of Electromagnetic Engineering and Science
    • /
    • v.29 no.11
    • /
    • pp.878-883
    • /
    • 2018
  • A jamming design scheme to disturb target position estimation in a bistatic multiple-input multiple-output (MIMO) radar system is presented. The proposed method exploits the signals received by multiple distributed electronic sensors and combines them to produce a jamming signal. The proposed algorithm can eliminate the target by transmitting the delayed sum or the weighted sum of the received sensor signals. Simulation results confirm the performance of the proposed method.

Social Network-based Hybrid Collaborative Filtering using Genetic Algorithms (유전자 알고리즘을 활용한 소셜네트워크 기반 하이브리드 협업필터링)

  • Noh, Heeryong;Choi, Seulbi;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.2
    • /
    • pp.19-38
    • /
    • 2017
  • Collaborative filtering (CF) algorithms have been widely used to implement recommender systems, and many prior studies have sought to improve the accuracy of CF. Among them, some recent studies adopt a 'hybrid recommendation approach', which enhances the performance of conventional CF by using additional information. In this research, we propose a new hybrid recommender system that fuses CF with the results of social network analysis on the trust and distrust relationship networks among users to enhance prediction accuracy. The proposed algorithm is based on memory-based CF. However, when calculating the similarity between users, it considers not only the correlation of the users' numeric rating patterns but also the users' in-degree centrality values derived from the trust and distrust relationship networks. Specifically, it amplifies the similarity between a target user and his or her neighbor when the neighbor has higher in-degree centrality in the trust relationship network, and it attenuates the similarity when the neighbor has higher in-degree centrality in the distrust relationship network. The proposed algorithm considers four types of user relationships in total: direct trust, indirect trust, direct distrust, and indirect distrust. It uses four adjusting coefficients, which adjust the level of amplification or attenuation for the in-degree centrality values derived from the direct and indirect trust and distrust relationship networks. To determine the optimal adjusting coefficients, a genetic algorithm (GA) was adopted. Accordingly, we named the proposed algorithm SNACF-GA (Social Network Analysis-based CF using GA). To validate the performance of SNACF-GA, we used a real-world data set called the 'Extended Epinions dataset', provided by 'trustlet.org'. 
This data set contains user responses (rating scores and reviews) after purchasing specific items (e.g. car, movie, music, book) as well as trust / distrust relationship information indicating whom each user trusts or distrusts. The experimental system was developed mainly in Microsoft Visual Basic for Applications (VBA), but we also used UCINET 6 to calculate the in-degree centrality of the trust / distrust relationship networks. In addition, we used Palisade Software's Evolver, a commercial software package that implements genetic algorithms. To examine the effectiveness of the proposed system more precisely, we adopted two comparison models. The first comparison model is conventional CF. It uses only users' explicit numeric ratings when calculating the similarities between users. That is, it does not consider trust / distrust relationships between users at all. The second comparison model is SNACF (Social Network Analysis-based CF). SNACF differs from the proposed SNACF-GA in that it considers only direct trust / distrust relationships. It also does not use GA optimization. The performance of the proposed algorithm and the comparison models was evaluated using the average MAE (mean absolute error). Experimental results showed that the optimal adjusting coefficients for direct trust, indirect trust, direct distrust, and indirect distrust were 0, 1.4287, 1.5, and 0.4615, respectively. This implies that distrust relationships between users are more important than trust ones in recommender systems. In terms of recommendation accuracy, SNACF-GA (Avg. MAE = 0.111943), the proposed algorithm that reflects both direct and indirect trust / distrust relationship information, was found to outperform conventional CF (Avg. MAE = 0.112638). The algorithm also showed better recommendation accuracy than SNACF (Avg. MAE = 0.112209). To confirm whether these differences are statistically significant, we applied a paired samples t-test. 
The paired samples t-test showed that the difference between SNACF-GA and conventional CF was statistically significant at the 1% significance level, and that the difference between SNACF-GA and SNACF was statistically significant at the 5% level. Our study found that trust/distrust relationships can be important information for improving the performance of recommendation algorithms. In particular, distrust relationship information was found to have a greater impact on the performance improvement of CF. This implies that we need to pay more attention to distrust (negative) relationships than to trust (positive) ones when tracking and managing social relationships between users.
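The similarity adjustment described in this abstract can be sketched roughly as follows. The abstract does not state the actual formula, so `adjusted_similarity` below is a hypothetical multiplicative form chosen only to show the amplify/attenuate behaviour; it also uses two coefficients (`alpha`, `beta`) rather than the paper's four, and assumes centralities normalized to [0, 1]. The Pearson similarity and MAE are the standard definitions.

```python
import numpy as np

def pearson_similarity(r_u, r_v):
    """Pearson correlation over items rated by both users (NaN marks 'not rated')."""
    both = ~np.isnan(r_u) & ~np.isnan(r_v)
    if both.sum() < 2:
        return 0.0
    a, b = r_u[both], r_v[both]
    a_c, b_c = a - a.mean(), b - b.mean()
    denom = np.sqrt((a_c ** 2).sum() * (b_c ** 2).sum())
    return float((a_c * b_c).sum() / denom) if denom > 0 else 0.0

def adjusted_similarity(sim, trust_indegree, distrust_indegree, alpha, beta):
    """HYPOTHETICAL adjustment: amplify by trust centrality, attenuate by distrust centrality.

    The source abstract does not give the formula; this form is an assumption.
    """
    return sim * (1.0 + alpha * trust_indegree) / (1.0 + beta * distrust_indegree)

def mean_absolute_error(actual, predicted):
    """MAE, the evaluation metric named in the abstract."""
    actual, predicted = np.asarray(actual, float), np.asarray(predicted, float)
    return float(np.abs(actual - predicted).mean())

# Toy ratings for a target user and one neighbor (made-up numbers).
target = np.array([5.0, 3.0, np.nan, 1.0, 4.0])
neighbor = np.array([4.0, 2.0, 5.0, 1.0, np.nan])

sim = pearson_similarity(target, neighbor)
trusted = adjusted_similarity(sim, trust_indegree=0.8, distrust_indegree=0.0, alpha=1.0, beta=1.0)
distrusted = adjusted_similarity(sim, trust_indegree=0.0, distrust_indegree=0.8, alpha=1.0, beta=1.0)
print(f"plain: {sim:.3f}, widely trusted neighbor: {trusted:.3f}, widely distrusted: {distrusted:.3f}")
```

In the paper the coefficients are tuned by a genetic algorithm to minimize MAE; any optimizer over `alpha` and `beta` could play that role in this sketch.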

Long Term Average Spectrum Characteristics of Speaking Voice of Western Operatic Singers (Long Term Average Spectrum을 이용한 성악가들의 Speaking Voice 분석)

  • Lee, Kyung-Chul;Hong, Seok-Jin;Jin, Sung-Min
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.15 no.2
    • /
    • pp.122-127
    • /
    • 2004
  • Background and Objectives : Many studies have described and analyzed the singer's formant, and it has been shown that the epilaryngeal tube in the human airway is responsible for vocal ring, or the singer's formant. A similar phenomenon produced by trained singers in their speech led some authors to examine the speaker's ring. This study was designed to analyze the speaking voice of singers and the speaker's ring. Materials and Methods : Ten tenors, fifteen baritones, fifteen sopranos and ten mezzo sopranos attending the department of vocal music of a music college were chosen for this study. Fifteen male and fifteen female untrained normal speakers were chosen as the control group. Each subject was asked to produce a sustained spoken vowel /ah/ for at least five seconds and to read the sentence 'Kaeul'. The sound data were analyzed using the Fast Fourier Transform (FFT)-based power spectrum and the long term average (LTA) power spectrum, computed with the FFT algorithm of the Computerized Speech Lab (CSL, Kay Elemetrics, Model 4300B, USA). Statistical analysis was performed using the Mann-Whitney test of the Statistical Package for the Social Sciences (SPSS). Results : For the LTA power spectrum of the /ah/ sound, a significant increase was seen in the 2,500-3,500Hz region (p<0.01) in the four trained singer groups compared with the untrained speaker group, and a significant increase was seen in the 9,000-10,000Hz region (p<0.01) in the soprano group. Similarly, for the sentence 'Kaeul', there was a significant increase in energy in the 2,500-3,500Hz region (p<0.01) in the tenor, baritone, and mezzo soprano groups compared with the untrained speaker group, and a significant increase in all frequency regions (p<0.01) in the soprano group. Conclusions : The LTA power spectrum suggests that the trained singer groups show more energy concentration in the 'singer's formant' region in the speaking voice, and the authors believe this region to be the 'speaker's ring'. 
Further research is needed on the effect of singing training on the resonance of the speaking voice.
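A long term average power spectrum of the kind used in this study can be approximated by averaging FFT power spectra over many short frames. The sketch below is a generic illustration, not the CSL software's procedure: the frame length, hop size, Hann window, sampling rate, and the synthetic test signal are all assumptions.

```python
import numpy as np

def long_term_average_spectrum(signal, sample_rate, frame_len=1024, hop=512):
    """Average the FFT power spectra of overlapping Hann-windowed frames; return (Hz, dB)."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    power = np.zeros(frame_len // 2 + 1)
    for i in range(n_frames):
        frame = signal[i * hop : i * hop + frame_len] * window
        power += np.abs(np.fft.rfft(frame)) ** 2
    power /= n_frames
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / sample_rate)
    return freqs, 10.0 * np.log10(power + 1e-12)  # small offset avoids log(0)

def band_level(freqs, ltas_db, lo_hz, hi_hz):
    """Mean LTAS level (dB) inside a frequency band."""
    band = (freqs >= lo_hz) & (freqs <= hi_hz)
    return float(ltas_db[band].mean())

# Synthetic stand-in for a sustained vowel: a 200 Hz tone, a weaker component
# near 3 kHz (imitating extra energy in the 2,500-3,500 Hz region), and noise.
rng = np.random.default_rng(0)
sr = 22050
t = np.arange(5 * sr) / sr
voice = (np.sin(2 * np.pi * 200 * t)
         + 0.1 * np.sin(2 * np.pi * 3000 * t)
         + 0.01 * rng.standard_normal(t.size))

freqs, ltas_db = long_term_average_spectrum(voice, sr)
ring = band_level(freqs, ltas_db, 2500, 3500)
reference = band_level(freqs, ltas_db, 5000, 6000)
print(f"2.5-3.5 kHz band: {ring:.1f} dB, 5-6 kHz band: {reference:.1f} dB")
```

Comparing band levels between groups, as the study does with the Mann-Whitney test, would then operate on one such band value per subject.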


Long Term Average Spectrum Characteristics of Head and Chest Register Sounds of Western Operatic Singers : Extended Study (성악가들의 목소리에 대한 Long Term Average Spectrum 분석 -$2^{nd}$ Singer's Formant의 존재 가능성에 대하여-)

  • Ban, Jae-Ho;Kwon, Young-Kyung;Jin, Sung-Min
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.15 no.1
    • /
    • pp.31-36
    • /
    • 2004
  • Background and Objectives : It has been shown that the epilaryngeal tube in the human airway is responsible for vocal ring, or the singer's formant. In a previous study, the authors showed that in trained tenors, besides the conventional singer's formant in the region of 2,500Hz, another energy peak was observed in the region of 8,000Hz. This peak was interpreted as the second resonance of the epilarynx tube. Singers in other voice categories who produce vocal ring are assumed to have the same peak, but no measurements have as yet been made. Materials and Methods : Fifteen tenors, fourteen baritones, seven sopranos and five mezzo sopranos attending the department of vocal music of a music college, who could reliably produce the head and chest registers, were chosen for this study. Each subject was asked to produce an /ah/ sound for at least three seconds for the head register sound (tenors: G4, baritones: E4, sopranos: F5, mezzo sopranos: C5) and for the chest register sound (tenors: C3, baritones: D3, sopranos: D4, mezzo sopranos: A3). The sound data were analyzed using the Fast Fourier Transform (FFT)-based power spectrum and the long term average (LTA) power spectrum, computed with the FFT algorithm of the Computerized Speech Lab (CSL, Kay Elemetrics, Model 4300B, USA). Statistical analysis was performed using the Mann-Whitney test of the Statistical Package for the Social Sciences (SPSS). Results : For head register sounds, a significant increase was seen in the 2,200-3,400Hz region (p<0.05) and the [...]. Similar to the head register sounds, for the chest register sounds there was a significant increase in energy in the four trained singer groups compared with the untrained group in the 2,200-3,100Hz region (p<0.05) and the 7,800-8,400Hz region (p<0.05). Conclusions : When good vocal production was made for the head and chest registers, an energy peak was observed near 2,500Hz, a frequency already known as the 'singer's formant', in all subjects in the study group. 
Another region of increased energy was observed around 8,000Hz that had not been noticed previously. The authors believe this region to be the second singer's formant.


System Design and Evaluation of Digital Retrodirective Array Antenna for High Speed Tracking Performance (고속 추적 특성을 위한 디지털 역지향성 배열 안테나 시스템 설계와 특성 평가)

  • Kim, So-Ra;Ryu, Heung-Gyun
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.38A no.8
    • /
    • pp.623-628
    • /
    • 2013
  • The retrodirective array antenna system operates faster than existing beamforming techniques because of its lower complexity, so it is effective for beam tracking in fast-vehicle environments. On the other hand, it has difficulty estimating the angle of arrival (AOA) in multipath environments or with multiuser signals. To improve the reliability of AOA estimation, this article proposes a hybrid digital retrodirective array (RDA) antenna system combined with the MUSIC algorithm. The digital retrodirective array antenna system is designed according to the number of array antennas, using only one digital PLL to find the delayed phase angle. We evaluate the performance of the digital retrodirective array antenna for high-speed tracking applications. Performance is studied in Simulink with a mobile speed of 300km/h and a transmitter-receiver distance of 100m, and the performance of the system is also checked in a multipath environment. As a result, the mean AOA error is $4.2^{\circ}$ when the SNR is 10dB and $1.3^{\circ}$ when the SNR is 20dB. Consequently, the digital RDA shows very good performance for high-speed tracking owing to its simple calculation and realization.

Optimized Mix Proportioning of Steel and Hybrid Reinforced Concrete Using Harmony Search Algorithm (화음탐색법을 이용한 강섬유 및 하이브리드 섬유보강 콘크리트의 최적배합 설계)

  • Lee, Chi-Hoon;Lee, Joo-Ha;Yoon, Young-Soo
    • Journal of the Korea Concrete Institute
    • /
    • v.18 no.2 s.92
    • /
    • pp.151-159
    • /
    • 2006
  • No guideline for the mix design of steel fiber reinforced concrete (SFRC) has been established, and its practical application on site is inconvenient. In this paper, therefore, a program that optimizes the mix proportion with respect to flexural strength and toughness was developed for applying SFRC in practice. The program can minimize the number of trial mixes and yield an economical and appropriate mixture. In addition, the theoretical background on which the program is based can serve as the basis of a concrete method for mixing SFRC. A new algorithm was used to develop the SFRC mix proportioning program: Harmony Search, a heuristic method that mimics the improvisation of music players. Musical performances seek a best state determined by aesthetic estimation, just as optimization algorithms seek a best state determined by the objective function value. The program was developed for single fiber reinforced concrete as well as for hybrid fiber reinforced concrete, which is reinforced with two kinds of steel fibers of different geometry. This work keeps pace with the worldwide research trend and offers a basis for future research on hybrid fiber reinforced concrete.
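The Harmony Search heuristic named in this abstract can be sketched in a few lines. The version below is a basic textbook form (harmony memory, memory-considering rate, pitch adjustment), not the authors' mix proportioning program: the parameter values, the function name, and the toy quadratic objective are all assumptions, and the objective has nothing to do with concrete strength or toughness models.

```python
import random

def harmony_search(objective, bounds, memory_size=20, hmcr=0.9, par=0.3,
                   bandwidth=0.05, iterations=5000, seed=0):
    """Minimize `objective` over box `bounds` with a basic Harmony Search."""
    rng = random.Random(seed)
    # Harmony memory: a pool of candidate solutions and their scores.
    memory = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(memory_size)]
    scores = [objective(h) for h in memory]
    for _ in range(iterations):
        new = []
        for d, (lo, hi) in enumerate(bounds):
            if rng.random() < hmcr:
                # Memory consideration: reuse a value some stored harmony already plays.
                value = rng.choice(memory)[d]
                if rng.random() < par:
                    # Pitch adjustment: nudge the reused value slightly.
                    value += rng.uniform(-1.0, 1.0) * bandwidth * (hi - lo)
            else:
                # Random improvisation within the allowed range.
                value = rng.uniform(lo, hi)
            new.append(min(max(value, lo), hi))
        score = objective(new)
        worst = max(range(memory_size), key=scores.__getitem__)
        if score < scores[worst]:
            memory[worst], scores[worst] = new, score  # replace the worst harmony
    best = min(range(memory_size), key=scores.__getitem__)
    return memory[best], scores[best]

# Toy objective (hypothetical): squared distance from an arbitrary target point.
def toy_objective(x):
    return (x[0] - 0.3) ** 2 + (x[1] - 1.2) ** 2

best, score = harmony_search(toy_objective, bounds=[(0.0, 1.0), (0.0, 2.0)])
print("best solution:", [round(v, 3) for v in best], "score:", score)
```

In a mix design setting the decision variables would be mix proportions and the objective would encode cost together with strength and toughness requirements; those models are not given in the abstract.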

Mean Square Projection Error Gradient-based Variable Forgetting Factor FAPI Algorithm (평균 제곱 투영 오차의 기울기에 기반한 가변 망각 인자 FAPI 알고리즘)

  • Seo, YoungKwang;Shin, Jong-Woo;Seo, Won-Gi;Kim, Hyoung-Nam
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.51 no.5
    • /
    • pp.177-187
    • /
    • 2014
  • This paper proposes a fast subspace tracking method, called GVFF FAPI, based on the FAPI (Fast Approximated Power Iteration) method and GVFF RLS (Gradient-based Variable Forgetting Factor Recursive Least Squares). Because the conventional FAPI uses a constant forgetting factor to estimate the covariance matrix of the source signals, it is difficult to apply in non-stationary environments such as those with continuously changing DOAs (directions of arrival) of the source signals. To overcome this drawback, GVFF FAPI uses a gradient-based variable forgetting factor derived from an improved mean square error (MSE) analysis of RLS. To reduce the subspace error in non-stationary environments, the GVFF FAPI algorithm uses an improved forgetting factor update equation that produces a fast-decreasing forgetting factor when the gradient is positive and a slowly increasing forgetting factor when the gradient is negative. Our numerical simulations show that the GVFF FAPI algorithm offers lower subspace error and lower RMSE (Root Mean Square Error) of the tracked DOAs of source signals than conventional FAPI-based MUSIC (MUltiple SIgnal Classification).
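The role of the forgetting factor can be illustrated with the exponentially weighted covariance update that subspace trackers of this kind build on. The sketch below is neither FAPI nor the GVFF update rule (the abstract does not give those equations); it only compares two constant forgetting factors after an abrupt DOA change, using a full eigendecomposition for clarity. The array geometry, the signal parameters, and all names are assumptions.

```python
import numpy as np

def steering_vector(theta_deg, n_sensors, spacing=0.5):
    """Uniform linear array response (spacing in wavelengths, assumed 0.5)."""
    k = np.arange(n_sensors)
    return np.exp(-2j * np.pi * spacing * k * np.sin(np.deg2rad(theta_deg)))

def track_covariance(snapshots, beta):
    """Exponentially weighted covariance: C <- beta * C + x x^H for each snapshot x."""
    n = snapshots.shape[0]
    C = np.zeros((n, n), dtype=complex)
    for x in snapshots.T:
        C = beta * C + np.outer(x, x.conj())
    return C

def principal_direction(C):
    """Unit-norm eigenvector of the largest eigenvalue (1-D signal subspace)."""
    _, vecs = np.linalg.eigh(C)
    return vecs[:, -1]

def subspace_error(u, a):
    """0 when unit vector u is aligned with steering vector a, 1 when orthogonal."""
    a = a / np.linalg.norm(a)
    return 1.0 - abs(np.vdot(a, u)) ** 2

rng = np.random.default_rng(1)
n_sensors = 8
a_old, a_new = steering_vector(10.0, n_sensors), steering_vector(40.0, n_sensors)

def simulate(a, n):
    """n noisy snapshots of a single random-amplitude source with steering vector a."""
    s = (rng.standard_normal(n) + 1j * rng.standard_normal(n)) / np.sqrt(2)
    noise = 0.05 * (rng.standard_normal((n_sensors, n)) + 1j * rng.standard_normal((n_sensors, n)))
    return np.outer(a, s) + noise

# The source sits at 10 degrees for 300 snapshots, then jumps to 40 degrees.
X = np.hstack([simulate(a_old, 300), simulate(a_new, 30)])

errors = {beta: subspace_error(principal_direction(track_covariance(X, beta)), a_new)
          for beta in (0.99, 0.8)}
for beta, err in errors.items():
    print(f"forgetting factor {beta}: subspace error vs. new DOA = {err:.3f}")
```

A smaller factor forgets old data faster and follows the jump, at the cost of noisier estimates when the scene is stationary, which is the trade-off a variable forgetting factor is meant to manage.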