• Title/Summary/Keyword: Music classification

Search Result 237, Processing Time 0.026 seconds

Analysis on performance of grid-free compressive beamforming based on experiment (실험 기반 무격자 압축 빔형성 성능 분석)

  • Shin, Myoungin;Cho, Youngbin;Choo, Youngmin;Lee, Keunhwa;Hong, Jungpyo;Kim, Seongil;Hong, Wooyoung
    • The Journal of the Acoustical Society of Korea
    • /
    • v.39 no.3
    • /
    • pp.179-190
    • /
    • 2020
  • In this paper, we estimated the Direction of Arrival (DOA) using Conventional BeamForming (CBF), adaptive beamforming and compressive beamforming. Minimum Variance Distortionless Response (MVDR) and Multiple Signal Classification (MUSIC) are used as the adaptive beamforming, and grid-free compressive sensing is applied for the compressive sensing beamforming. Theoretical background and limitations of each technique are introduced, and the performance of each technique is compared through simulation and real experiments. The real experiments are conducted in the presence of reflected signal, transmitting a sound using two speakers and receiving acoustic data through a linear array consisting of eight microphones. Simulation and experimental results show that the adaptive beamforming and the grid-free compressive beamforming have a higher resolution than conventional beamforming when there are uncorrelated signals. On the other hand, the performance of the adaptive beamforming is degraded by the reflected signals whereas the grid-free compressive beamforming still improves the conventional beamforming resolution regardless of reflected signal presence.

A Study on the Musical Theme Clustering for Searching Note Sequences (음렬 탐색을 위한 주제소절 자동분류에 관한 연구)

  • 심지영;김태수
    • Journal of the Korean Society for information Management
    • /
    • v.19 no.3
    • /
    • pp.5-30
    • /
    • 2002
  • In this paper, classification feature is selected with focus of musical content, note sequences pattern, and measures similarity between note sequences followed by constructing clusters by similar note sequences, which is easier for users to search by showing the similar note sequences with the search result in the CBMR system. Experimental document was $\ulcorner$A Dictionary of Musical Themes$\lrcorner$, the index of theme bar focused on classical music and obtained kern-type file. Humdrum Toolkit version 1.0 was used as note sequences treat tool. The hierarchical clustering method is by stages focused on four-type similarity matrices by whether the note sequences segmentation or not and where the starting point is. For the measurement of the result, WACS standard is used in the case of being manual classification and in the case of the note sequences starling from any point in the note sequences, there is used common feature pattern distribution in the cluster obtained from the clustering result. According to the result, clustering with segmented feature unconnected with the starting point Is higher with distinct difference compared with clustering with non-segmented feature.

Fast DOA Estimation Algorithm using Pseudo Covariance Matrix (근사 공분산 행렬을 이용한 빠른 입사각 추정 알고리듬)

  • 김정태;문성훈;한동석;조명제;김정구
    • Journal of the Institute of Electronics Engineers of Korea TC
    • /
    • v.40 no.1
    • /
    • pp.15-23
    • /
    • 2003
  • This paper proposes a fast direction of arrival (DOA) estimation algorithm that can rapidly estimate incidence angles of incoming signals using a pseudo covariance matrix. The conventional subspace DOA estimation methods such as MUSIC (multiple signal classification) algorithms need many sample signals to acquire covariance matrix of input signals. Thus, it is difficult to estimate the DOAs of signals because they cannot perform DOA estimation during receiving sample signals. Also if the D0As of signals are changing rapidly, conventional algorithms cannot estimate incidence angles of signals exactly. The proposed algorithm obtains bearing response and directional spectrum after acquiring pseudo covariance matrix of each snapshot. The incidence angles can be exactly estimated by using the bearing response and directional spectrum. The proposed DOA estimation algorithm uses only concurrent snapshot so as to obtain covariance matrix. Compared to conventional DOA estimation methods. The proposed algorithm has an advantage that can estimate DOA of signal rapidly.

Performance Evaluation of Satellite System Based on Transmission Beamformer (송신 빔형성기 기반의 위성 시스템 구조 성능평가)

  • Mun, Ji-Youn;Hwang, Myeong-Hwan;Hwang, Suk-Seung
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.13 no.4
    • /
    • pp.713-720
    • /
    • 2018
  • The Signal Intelligence (SIGINT) system based on Angle-of-Arrival(AOA) estimation, interference suppression, and transmission beamforming techniques is a cutting edge technology for efficiently collecting various signal information. In this paper, we present the efficient structure of a satellite system consisted of an AOA estimator, an adaptive beamformer, a signal processing and D/B unit, and a transmission beamformer, for collecting signal information. For accurately estimating AOAs of various signals, efficiently suppressing interference or jamming signals, and efficiently transmitting the collected information or data, we employ Multiple Signal Classification (MUSIC), Minimum Variance Distortionless Response (MVDR), and Minimum Mean Square Error (MMSE) algorithms, respectively. Also, we evaluate and analysis the performance of the presented satellite system through the computer simulation.

Automatic Video Genre Classification Method in MPEG compressed domain (MPEG 부호화 영역에서 Video Genre 자동 분류 방법)

  • Kim, Tae-Hee;Lee, Woong-Hee;Jeong, Dong-Seok
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.27 no.8A
    • /
    • pp.836-845
    • /
    • 2002
  • Video summary is one of the tools which can provide the fast and effective browsing for a lengthy video. Video summary consists of many key-frames that could be defined differently depending on the video genre it belongs to. Consequently, the video summary constructed by the uniform manner might lead into inadequate result. Therefore, identifying the video genre is the important first step in generating the meaningful video summary. We propose a new method that can classify the genre of the video data in MPEC compressed bit-stream domain. Since the proposed method operates directly on the compressed bit-stream without decoding the frame, it has merits such as simple calculation and short processing time. In the proposed method, only the visual information is utilized through the spatial-temporal analysis to classify the video genre. Experiments are done for 6 genres of video: Cartoon, commercial, Music Video, News, Sports, and Talk Show. Experimental result shows more than 90% of accuracy in genre classification for the well -structured video data such as Talk Show and Sports.

Performance Analysis of Adaptive Beamforming System Based on Planar Array Antenna (평면 배열 안테나 기반의 적응 빔형성 시스템 성능 분석)

  • Mun, Ji-Youn;Hwang, Suk-Seung
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.13 no.6
    • /
    • pp.1207-1212
    • /
    • 2018
  • The signal intelligence (SIGINT) technology is actively used for collecting various data, in a number of fields, including a military industry. In order to collect the signal information and data and to transmit/receive the collected data efficiently, the accurate angle-of-arrival (AOA) information is required and communication disturbance from the interference or jamming signal should be minimized. In this paper, we present the structure of an adaptive beam-forming satellite system based on the planar array antenna, for collecting and transmitting/receiving the signal information and data efficiently. The presented adaptive beam-forming system consists of an antenna in the form of a planar array, an AOA estimator based on the Multiple Signal Classification (MUSIC) algorithm, an adaptive Minimum Variance Distortionless Response (MVDR) interference canceler, a signal processing and D/B unit, and a transmission beamformer based on Minimum mean Square Error (MMSE). In addition, through the computer simulation, we evaluate and analyze the performance of the proposed system.

Adaptive Beamforming System Based on Combined Array Antenna (혼합 배열 안테나 기반의 적응 빔형성 시스템)

  • Kim, Tae-Yun;Hwang, Suk-Seung
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.16 no.1
    • /
    • pp.9-18
    • /
    • 2021
  • The 5G communication system employs the millimeter wave with the extremely high frequency. Since the high frequency signal has the strong straightness, the beamforming technology based on the multiple base stations is required for services covering wide range. The beamformer needs the angle-of-arrival(AOA) information of the signal incident to the antenna, and it is generally estimated through the high resolution AOA estimation algorithm such as Multiple Signal Classification (MUSIC) or Estimation of Signal Parameters via Rotational Invariacne Technique (ESPRIT). Although various antenna array shapes can be employed for the beamformer, a single shape (square, circle, or hexagonal) is typically utilized. In this paper, we introduce a transmitting/receiving beamforming system based on the combined array antenna with square and circular shapes, which is proper to various frequency signals, and evaluate its performance. For evaluating the performance of the proposed beamforming system based on the combined array antenna, we implement the computer simulation employing various scenarios.

Towards Low Complexity Model for Audio Event Detection

  • Saleem, Muhammad;Shah, Syed Muhammad Shehram;Saba, Erum;Pirzada, Nasrullah;Ahmed, Masood
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.9
    • /
    • pp.175-182
    • /
    • 2022
  • In our daily life, we come across different types of information, for example in the format of multimedia and text. We all need different types of information for our common routines as watching/reading the news, listening to the radio, and watching different types of videos. However, sometimes we could run into problems when a certain type of information is required. For example, someone is listening to the radio and wants to listen to jazz, and unfortunately, all the radio channels play pop music mixed with advertisements. The listener gets stuck with pop music and gives up searching for jazz. So, the above example can be solved with an automatic audio classification system. Deep Learning (DL) models could make human life easy by using audio classifications, but it is expensive and difficult to deploy such models at edge devices like nano BLE sense raspberry pi, because these models require huge computational power like graphics processing unit (G.P.U), to solve the problem, we proposed DL model. In our proposed work, we had gone for a low complexity model for Audio Event Detection (AED), we extracted Mel-spectrograms of dimension 128×431×1 from audio signals and applied normalization. A total of 3 data augmentation methods were applied as follows: frequency masking, time masking, and mixup. In addition, we designed Convolutional Neural Network (CNN) with spatial dropout, batch normalization, and separable 2D inspired by VGGnet [1]. In addition, we reduced the model size by using model quantization of float16 to the trained model. Experiments were conducted on the updated dataset provided by the Detection and Classification of Acoustic Events and Scenes (DCASE) 2020 challenge. We confirm that our model achieved a val_loss of 0.33 and an accuracy of 90.34% within the 132.50KB model size.

Detecting Prominent Content in Unstructured Audio using Intensity-based Attack/release Patterns (발생/소멸 패턴을 이용한 비정형 혼합 오디오의 주성분 검출)

  • Kim, Samuel
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.50 no.12
    • /
    • pp.224-231
    • /
    • 2013
  • Defining the concept of prominent audio content as the most informative audio content from the users' perspective within a given unstructured audio segment, we propose a simple but robust intensity-based attack/release pattern features to detect the prominent audio content. We also propose a web-based annotation procedure to retrieve users' subjective perception and annotated 18 hours of video clips across various genres, such as cartoon, movie, news, etc. The experiments with a linear classification method whose models are trained for speech, music, and sound effect demonstrate promising - but varying across the genres of programs - results (e.g., 86.7% weighted accuracy for speech-oriented talk shows and 49.3% weighted accuracy for {action movies}).

Animation OST Musical Element Analysis based on A Narrative Process Classification Model (내러티브 프로세스 분류 모델 기반 애니메이션 OST의 음악적 요소 분석)

  • Jang, Soeun;Sung, Bongsun;Lee, Jang Hoon;Kim, Jae Ho
    • Journal of Korea Multimedia Society
    • /
    • v.17 no.10
    • /
    • pp.1239-1252
    • /
    • 2014
  • The OST (Original Sound Track) in the film plays a vital role in increasing consensus and concentration to the storyline. The selected 4 animations are classified into 17 Narrative Processes (NP) by using NP Classification Model [1]. For the NPs each having OSTs, the authors have investigated 6 kinds of objective musical elements of the OST such as sound (speech, music, effect), tonality, tempo, range, intensity, and instrumentation. It is found that there are 33.3% common musical elements among all of them for the NPs with OSTs commonly. Among them, it is also found that there are 71.9% of common properties of the musical element. This research is meaningful by firstly showing that there are common properties of objective musical elements in each NP and the corresponding OST.