• Title/Summary/Keyword: 음원 분리

Search Result 88, Processing Time 0.032 seconds

Analysis of Performance of Focused Beamformer Using Water Pulley Model Array (수차 모형 배열을 이용한 표적추정 (Focused) 빔형성기 성능분석)

  • 최주평;이원철
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.5
    • /
    • pp.83-91
    • /
    • 2001
  • This paper proposes the Focused beamforming to estimate the location of target residing near to the observation platform in the underwater environment. The Focused beamforming technique provides the location of target by the coherent summation of a series of incident spherical waveforms considering distinct propagation delay times at the sensor array. But due to the movement of the observation platform and the variation of the underwater environment, the shape of the sensor array is no longer to be linear but it becomes distorted as the platform moves. Thus the Focused beamforming should be peformed regarding to the geometric shape variation at each time. To estimate the target location, the artificial image plane comprised of cells is constructed, and the delays are calculated from each cell where the target could be proximity to sensors for the coherent summation. After the coherent combining, the beam pattern can be obtained through the Focused beamforming on the image plane. Futhermore to compensate the variation of the shape of the sensor array, the paper utilizes the Nth-order polynomial approximation to estimate the shape of the sensor array obeying the water pulley modeling. Simulation results show the performance of the Focused beamforming for different frequency bands of the radiated signal.

  • PDF

Speech Basis Matrix Using Noise Data and NMF-Based Speech Enhancement Scheme (잡음 데이터를 활용한 음성 기저 행렬과 NMF 기반 음성 향상 기법)

  • Kwon, Kisoo;Kim, Hyung Young;Kim, Nam Soo
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.40 no.4
    • /
    • pp.619-627
    • /
    • 2015
  • This paper presents a speech enhancement method using non-negative matrix factorization (NMF). In the training phase, each basis matrix of source signal is obtained from a proper database, and these basis matrices are utilized for the source separation. In this case, the performance of speech enhancement relies heavily on the basis matrix. The proposed method for which speech basis matrix is made a high reconstruction error for noise signal shows a better performance than the standard NMF which basis matrix is trained independently. For comparison, we propose another method, and evaluate one of previous method. In the experiment result, the performance is evaluated by perceptual evaluation speech quality and signal to distortion ratio, and the proposed method outperformed the other methods.

A NMF-Based Speech Enhancement Method Using a Prior Time Varying Information and Gain Function (시간 변화에 따른 사전 정보와 이득 함수를 적용한 NMF 기반 음성 향상 기법)

  • Kwon, Kisoo;Jin, Yu Gwang;Bae, Soo Hyun;Kim, Nam Soo
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.38C no.6
    • /
    • pp.503-511
    • /
    • 2013
  • This paper presents a speech enhancement method using non-negative matrix factorization. In training phase, we can obtain each basis matrix from speech and specific noise database. After training phase, the noisy signal is separated from the speech and noise estimate using basis matrix in enhancement phase. In order to improve the performance, we model the change of encoding matrix from training phase to enhancement phase using independent Gaussian distribution models, and then use the constraint of the objective function almost same as that of the above Gaussian models. Also, we perform a smoothing operation to the encoding matrix by taking into account previous value. Last, we apply the Log-Spectral Amplitude type algorithm as gain function.

Simulation of acoustic waves horizontal refraction using a three-dimensional parabolic equation model (3차원 포물선방정식을 이용한 음파의 수평굴절 모의)

  • Na, Youngnam;Son, Su-Uk;Hahn, Jooyoung;Lee, Keunhwa
    • The Journal of the Acoustical Society of Korea
    • /
    • v.41 no.2
    • /
    • pp.131-142
    • /
    • 2022
  • In order to examine the possibility of horizontal simulations of acoustic waves on the environments of big water depth variations, this study introduces a 3-dimensional model based on the pababolic equation. The model gives approximated solutions by separating the cross- and non cross-terms in the equation. Assuming artificial bathymetry (25 km × 4 km) with a source frequency 75 Hz, the simulations give clear horizontal refractions on the transmission loss distributions. The degree of refractions shows non-linear increase along the propagating range and proportional increase with water depth along the cross range. Another simulations with the real bathymetry (25 km × 8 km) also give clear horizontal refractions. The horizontal distributions present little difference with the depth resolution variations of the same data source because the model gives interpolations over the depth data before simulations. Meanwhile, the horizontal distributions show big difference with those of different data sources.

The Optimization of Hybrid BCI Systems based on Blind Source Separation in Single Channel (단일 채널에서 블라인드 음원분리를 통한 하이브리드 BCI시스템 최적화)

  • Yang, Da-Lin;Nguyen, Trung-Hau;Kim, Jong-Jin;Chung, Wan-Young
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.19 no.1
    • /
    • pp.7-13
    • /
    • 2018
  • In the current study, we proposed an optimized brain-computer interface (BCI) which employed blind source separation (BBS) approach to remove noises. Thus motor imagery (MI) signal and steady state visual evoked potential (SSVEP) signal were easily to be detected due to enhancement in signal-to-noise ratio (SNR). Moreover, a combination between MI and SSVEP which is typically can increase the number of commands being generated in the current BCI. To reduce the computational time as well as to bring the BCI closer to real-world applications, the current system utilizes a single-channel EEG signal. In addition, a convolutional neural network (CNN) was used as the multi-class classification model. We evaluated the performance in term of accuracy between a non-BBS+BCI and BBS+BCI. Results show that the accuracy of the BBS+BCI is achieved $16.15{\pm}5.12%$ higher than that in the non-BBS+BCI by using BBS than non-used on. Overall, the proposed BCI system demonstrate a feasibility to be applied for multi-dimensional control applications with a comparable accuracy.

Development of a Listener Position Adaptive Real-Time Sound Reproduction System (청취자 위치 적응 실시간 사운드 재생 시스템의 개발)

  • Lee, Ki-Seung;Lee, Seok-Pil
    • The Journal of the Acoustical Society of Korea
    • /
    • v.29 no.7
    • /
    • pp.458-467
    • /
    • 2010
  • In this paper, a new audio reproduction system was developed in which the cross-talk signals would be reasonably cancelled at an arbitrary listener position. To adaptively remove the cross-talk signals according to the listener's position, a method of tracking the listener position was employed. This was achieved using the two microphones, where the listener direction was estimated using the time-delay between the two signals from the two microphones, respectively. Moreover, room reverberation effects were taken into consideration where linear prediction analysis was involved. To remove the cross-talk signals at the left-and right-ears, the paths between the sources and the ears were represented using the KEMAR head-related transfer functions (HRTFs) which were measured from the artificial dummy head. To evaluate the usefulness of the proposed listener tracking system, the performance of cross-talk cancellation was evaluated at the estimated listener positions. The performance was evaluated in terms of the channel separation ration (CSR), a -10 dB of CSR was experimentally achieved although the listener positions were more or less deviated. A real-time system was implemented using a floating-point digital signal processor (DSP). It was confirmed that the average errors of the listener direction was 5 degree and the subjects indicated that 80 % of the stimuli was perceived as the correct directions.

Seismic Imaging of a Tidal Flat: A Case Study for the Mineopo Area (조간대(갯벌)에서의 탄성파 탐사: 민어포 지역의 사례)

  • Jou, Hyeong-Tae;Kim, Han-Joon;Lee, Gwang-Hoon;Lee, Sang-Hoon;Jung, Baek-Hoon;Cho, Hyun-Moo;Jang, Nam-Do
    • Geophysics and Geophysical Exploration
    • /
    • v.11 no.3
    • /
    • pp.197-203
    • /
    • 2008
  • A shallow high-resolution seismic reflection survey was carried out at the Mineopo tidal flat on the western coast of Korea. The purpose of the survey was to investigate shallow sedimentary structure of the tidal flat associated with the recent sea level change. A total of 795 shots were generated at 1 m interval from a 5-kg hammer source and recorded on 48 channels of 100 Hz geophones along two mutually perpendicular profiles. The water-saturated ground condition resulted in suppressed ground rolls by significantly decreasing rigidity. In addition, seismic velocities over 1500 m/s provided easy segregation of reflected arrivals from lower velocity noise. As a consequence, seismic sections from the study area show significantly higher resolution and signal to noise ratio than conventional land seismic sections. The tidal flat consists of 5 sedimentary sequences above acoustic basement. The seismic sections reveal the continuous structure of the tidal flat formed in association with sea level rise during the Holocene.

Seismic reflection survey in a tidal flat: A case study for the Mineopo area (갯벌 지역에서의 탄성파 탐사: 민어포 조간대 지역의 사례)

  • Jou Hyeong-Tae;Kim Han-Joon;Lee Gwang-Hoon;Choi Dong-Lim;Kim Min-Ji;Cho Hyun-Moo
    • 한국지구물리탐사학회:학술대회논문집
    • /
    • 2002.09a
    • /
    • pp.67-84
    • /
    • 2002
  • A shallow high-resolution seismic reflection survey was carried out at the Mineopo tidal flat on the western coast of Korea. The purpose of the survey was to investigate shallow sedimentary structure of the tidal (fat associated with the recent sea level change. A total of 795 shots were generated at 1 m interval from a 5-kg hammer source and recorded on 48 channels of 100 Hz geophones along two mutually perpendicular profiles. The water-saturated ground condition resulted in suppressed ground rolls by significantly decreasing rigidity. In addition, seismic velocities over 1500 m/s provided easy segregation of reflected arrivals from lower velocity noise. As a consequence, seismic sections were created that are high in resolution and signal to noise ratio as well. The stack sections show that the tidal flat consists of 5 sedimentary sequences above acoustic basement. Although deposition is largely characterized by the transgressive sedimentary facies resulting from sea level rise, erosional surfaces are well-resolved within the sequences.

  • PDF