• Title/Summary/Keyword: 음원 복원

Search Result 30, Processing Time 0.023 seconds

Home monitoring system based on sound event detection for the hard-of-hearing (청각장애인을 위한 사운드 이벤트 검출 기반 홈 모니터링 시스템)

  • Kim, Gee Yeun;Shin, Seung-Su;Kim, Hyoung-Gook
    • The Journal of the Acoustical Society of Korea
    • /
    • v.38 no.4
    • /
    • pp.427-432
    • /
    • 2019
  • In this paper, we propose a home monitoring system using sound event detection based on a bidirectional gated recurrent neural network for the hard-of-hearing. First, in the proposed system, packet loss concealment is used to recover a lost signal captured through wireless sensor networks, and reliable channels are selected using multi-channel cross correlation coefficient for effective sound event detection. The detected sound event is converted into the text and haptic signal through a harmonic/percussive sound source separation method to be provided to hearing impaired people. Experimental results show that the performance of the proposed sound event detection method is superior to the conventional methods and the sound can be expressed into detailed haptic signal using the source separation.

2-D/3-D Seismic Data Acquisition and Quality Control for Gas Hydrate Exploration in the Ulleung Basin (울릉분지 가스하이드레이트 2/3차원 탄성파 탐사자료 취득 및 품질관리)

  • Koo, Nam-Hyung;Kim, Won-Sik;Kim, Byoung-Yeop;Cheong, Snons;Kim, Young-Jun;Yoo, Dong-Geun;Lee, Ho-Young;Park, Keun-Pil
    • Geophysics and Geophysical Exploration
    • /
    • v.11 no.2
    • /
    • pp.127-136
    • /
    • 2008
  • To identify the potential area of gas hydrate in the Ulleung Basin, 2-D and 3-D seismic surveys using R/V Tamhae II were conducted in 2005 and 2006. Seismic survey equipment consisted of navigation system, recording system, streamer cable and air-gun source. For reliable velocity analysis in a deep sea area where water depths are mostly greater than 1,000 m and the target depth is up to about 500 msec interval below the seafloor, 3-km-long streamer and 1,035 $in^3$ tuned air-gun array were used. During the survey, a suite of quality control operations including source signature analysis, 2-D brute stack, RMS noise analysis and FK analysis were performed. The source signature was calculated to verify its conformity to quality specification and the gun dropout test was carried out to examine signature changes due to a single air gun's failure. From the online quality analysis, we could conclude that the overall data quality was very good even though some seismic data were affected by swell noise, parity error, spike noise and current rip noise. Especially, by checking the result of data quality enhancement using FK filtering and missing trace restoration technique for the 3-D seismic data inevitably contaminated with current rip noises, the acquired data were accepted and the field survey could be conducted continuously. Even in survey areas where the acquired data would be unsuitable for quality specification, the marine seismic survey efficiency could be improved by showing the possibility of noise suppression through onboard data processing.

Broadband Processing of Conventional Marine Seismic Data Through Source and Receiver Deghosting in Frequency-Ray Parameter Domain (주파수-파선변수 영역에서 음원 및 수신기 고스트 제거를 통한 전통적인 해양 탄성파 자료의 광대역 자료처리)

  • Kim, Su-min;Koo, Nam-Hyung;Lee, Ho-Young
    • Geophysics and Geophysical Exploration
    • /
    • v.19 no.4
    • /
    • pp.220-227
    • /
    • 2016
  • Marine seismic data have not only primary signals from subsurface but also ghost signals reflected from the sea surface. The ghost decreases temporal resolution of seismic data because it attenuates specific frequency components. For eliminating the ghost signals effectively, the exact ghost delaytimes and reflection coefficients are required. Because of undulation of the sea surface and vertical movements of airguns and streamers, the ghost delaytime varies spatially and randomly while acquiring seismic data. The reflection coefficient is a function of frequency, incidence angle of plane-wave and the sea state. In order to estimate the proper ghost delaytimes considering these characteristics, we compared the ghost delaytimes estimated with L-1 norm, L-2 norm and kurtosis of the deghosted trace and its autocorrelation on synthetic data. L-1 norm of autocorrelation showed a minimal error and the reflection coefficient was calculated using Kirchhoff approximation equation which can handle the effect of wave height. We applied the estimated ghost delaytimes and the calculated reflection coefficients to remove the source and receiver ghost effects. By removing ghost signals, we reconstructed the frequency components attenuated near the notch frequency and produced the migrated stack section with enhanced temporal resolution.

An ACLMS-MPC Coding Method Integrated with ACFBD-MPC and LMS-MPC at 8kbps bit rate. (8kbps 비트율을 갖는 ACFBD-MPC와 LMS-MPC를 통합한 ACLMS-MPC 부호화 방식)

  • Lee, See-woo
    • Journal of Internet Computing and Services
    • /
    • v.19 no.6
    • /
    • pp.1-7
    • /
    • 2018
  • This paper present an 8kbps ACLMS-MPC(Amplitude Compensation and Least Mean Square - Multi Pulse Coding) coding method integrated with ACFBD-MPC(Amplitude Compensation Frequency Band Division - Multi Pulse Coding) and LMS-MPC(Least Mean Square - Multi Pulse Coding) used V/UV/S(Voiced / Unvoiced / Silence) switching, compensation in a multi-pulses each pitch interval and Unvoiced approximate-synthesis by using specific frequency in order to reduce distortion of synthesis waveform. In integrating several methods, it is important to adjust the bit rate of voiced and unvoiced sound source to 8kbps while reducing the distortion of the speech waveform. In adjusting the bit rate of voiced and unvoiced sound source to 8 kbps, the speech waveform can be synthesized efficiently by restoring the individual pitch intervals using multi pulse in the representative interval. I was implemented that the ACLMS-MPC method and evaluate the SNR of APC-LMS in coding condition in 8kbps. As a result, SNR of ACLMS-MPC was 15.0dB for female voice and 14.3dB for male voice respectively. Therefore, I found that ACLMS-MPC was improved by 0.3dB~1.8dB for male voice and 0.3dB~1.6dB for female voice compared to existing MPC, ACFBD-MPC and LMS-MPC. These methods are expected to be applied to a method of speech coding using sound source in a low bit rate such as a cellular phone or internet phone. In the future, I will study the evaluation of the sound quality of 6.9kbps speech coding method that simultaneously compensation the amplitude and position of multi-pulse source.

Speech Transition Detection and approximate-synthesis Method for Speech Signal Compression and Recovery (음성신호 압축 및 복원을 위한 음성 천이구간 검출과 근사합성 방식)

  • Lee, Kwang-Seok;Kim, Bong-Gi;Kang, Seong-Soo;Kim, Hyun-Deok
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2008.05a
    • /
    • pp.763-767
    • /
    • 2008
  • In a speech coding system using excitation source of voiced and unvoiced, it would be involved a distortion of speech qualify in case coexist with a voiced and an unvoiced consonants in a frame. So, We proposed TS(Transition Segment) including unvoiced consonant searching and extraction method in order to uncoexistent with a voiced and unvoiced consonants in a frame. This research present a new method of TS approximate-synthesis by using Least Mean Square and frequency band division. As a result, this method obtain a high quality approximation-synthesis waveforms within TS by using frequency information of 0.547kHz below and 2.813kHz above. The important thing is that the maximum error signal can be made with low distortion approximation-synthesis waveform within TS. This method has the capability of being applied to a new speech coding of Voiced/Silence/TS, speech analysis and speech synthesis.

  • PDF

Speech Signal Compression and Recovery Using Transition Detection and Approximate-Synthesis (천이구간 추출 및 근사합성에 의한 음성신호 압축과 복원)

  • Lee, Kwang-Seok;Lee, Byeong-Ro
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.13 no.2
    • /
    • pp.413-418
    • /
    • 2009
  • In a speech coding system using excitation source of voiced and unvoiced, it would be involved a distortion of speech qualify in case coexist with a voiced and an unvoiced consonants in a frame. So, We proposed TS(Transition Segment) including unvoiced consonant searching and extraction method in order to uncoexistent with a voiced and unvoiced consonants in a frame. This research present a new method of TS approximate-synthesis by using Least Mean Square and frequency band division. As a result, this method obtain a high qualify approximation-synthesis waveforms within TS by using frequency information of 0.547kHz below and 2.813kHz above. The important thing is that the maximum error signal can be made with low distortion approximation-synthesis waveform within TS. This method has the capability of being applied to a new speech coding of Voiced/Silence/TS, speech analysis and speech synthesis.

A Novel Digital Image Protection using Cellular Automata Transform (셀룰라 오토마타 변환을 이용한 정지영상 보호 방법)

  • Shin, Jin-Wook;Yoon, Sook;Yoo, Hyuck-Min;Park, Dong-Sun
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.35 no.8C
    • /
    • pp.689-696
    • /
    • 2010
  • The goal of this paper is to present a novel method for protecting digital image using 2-D cellular automata transform (CAT). A copyright and transform coefficients are used to generate a new content-based copyright and an original digital image is distributed without any hidden copyright. The parameter, which is called gateway value, for 2-D CAT is consisted of rule number, initial configuration, lattice length, number of neighbors, and etc. Since 2-D CAT has various gateway values, it is more secure than conventional methods. The proposed algorithm is verified using attacked images such as filtering, cropping, JPEG compression, and rotation for robustness.

Quality Improvement of Low-Bitrate HE-AAC Encoder (HE-AAC 부호화의 저비트율에서 음질향상 기법)

  • Kim, Jeong-Geun;Lee, Jae-Seong;Lee, Tae-Jin;Kang, Kyeong-Ok;Park, Young-Cheol
    • The Journal of the Acoustical Society of Korea
    • /
    • v.27 no.2
    • /
    • pp.66-74
    • /
    • 2008
  • In this paper, we propose new techniques that can improve the quality of AAC and SBR encoders comprised in low bitrate HE-AAC. To reduce the pre-echo artifacts often occurring for transient blocks in AAC, we propose an extended Temporal Noise Shaping (sTNS) in which the frequency range is selectively extended down to the low-frequency region. Also, for he high-frequency region being coded by SBR encoder, tones are identified through a sinusoidal modeling and their frequencies are adjusted within the QMF band in order to reduce the noise floor due to aliasing. Spectrograms of the decoded signals were compared and listening tests were conducted to evaluate the proposed algorithm. Results confirmed the effectiveness of the proposed algorithm.

Musical Analysis of Jindo Dasiraegi music for the Scene of Performing Arts Contents (연희현장에서의 올바른 활용을 위한 진도다시래기 음악분석)

  • Han, Seung Seok;Nam, Cho Long
    • (The) Research of the performance art and culture
    • /
    • no.25
    • /
    • pp.253-289
    • /
    • 2012
  • Dasiraegi is a traditional funeral rite performance of Jindo located in the South Jeolla Province of South Korea. With its unique stylistic structure including various dances, songs and witty dialogues, and a storyline depicting the birth of a new life in the wake of death, embodying the Buddhism belief that life and death is interconnected; it attracted great interest from performance organizers and performers who were desperately seeking new contents that can be put on stage as a performance. It is needless to say previous research on Dasiraegi had been most valuable in its recreation as it analyzed the performance from a wide range of perspectives. Despite its contributions, the previous researches were mainly academic focusing on: the symbolic meanings of the performance, basic introduction to the components of the performance such as script, lyrics, witty dialogue, appearance (costume and make-up), stage properties, rhythm, dance and etc., lacking accurate representation of the most crucial element of the performance which is sori (song). For this reason, the study analyzes the music of Dasiraegi and presents its musical characteristics along with its scores to provide practical support for performers who are active in the field. Out of all the numbers in Dasiraegi, this study analyzed all of Geosa-nori and Sadang-nori, the funeral dirge (mourning chant) sung as the performers come on stage and Gasangjae-nori, because among the five proceedings of the funeral rite they were the most commonly performed. There are a plethora of performance recordings to choose from, however, this study chose Jindo Dasiraegi, an album released by E&E Media. The album offers high quality recordings of performances, but more importantly, it is easy to obtain and utilize for performers who want to learn the Dasiraegi based on the script provided in this study. The musical analysis discovered a number of interesting findings. Firstly, most of the songs in Dasiraegi use a typical Yukjabaegi-tori which applies the Mi scale frequently containing cut-off (breaking) sounds. Although, Southern Kyoung-tori which applies the Sol scale was used, it was only in limited parts and was musically incomplete. Secondly, there was no musical affinity between Ssitgim-gut and Dasiraegi albeit both are for funeral rites. The fundamental difference in character and function of Ssitgim-gut and Dasiraegi may be the reason behind this lack of affinity, as Ssitgim-gut is sung to guide the deceased to heaven by comforting him/her, whereas, Dasiaregi is sung to reinvigorate the lives of the living. Lastly, traces of musical grammar found in Pansori are present in the earlier part of Dasiraegi. This may be attributed to the master artist (Designee of Important Intangible Cultural Heritage), who was instrumental in the restoration and hand-down of Dasiaregi, and his experience in a Changgeuk company. The performer's experience with Changgeuk may have induced the alterations in Dasiraegi, causing it to deviate from its original form. On the other hand, it expanded the performative bais by enhancing the performance aspect of Dasiraegi allowing it to be utilized as contents for Performing Arts. It would be meaningful to see this study utilized to benefit future performance artists, taking Dasiraegi as their inspiration, which overcomes the loss of death and invigorates the vibrancy of life.

Geoacoustic characteristics of Quaternary stratigraphic sequences in the mid-eastern Yellow Sea (황해 중동부 제4기 퇴적층의 지음향 특성)

  • Jin, Jae-Hwa;Jang, Seong-Hyeong;Kim, Seong-Pil;Kim, Hyeon-Tae;Lee, Chi-Won;Chang, Jeong-Hae;Choi, Jin-Hyeok;Ryang, Woo-Heon
    • The Sea:JOURNAL OF THE KOREAN SOCIETY OF OCEANOGRAPHY
    • /
    • v.6 no.2
    • /
    • pp.81-92
    • /
    • 2001
  • According to analyses of high-resolution seismic profiles (air gun, sparker, and SBP) and a deep-drill core(YSDP 105) in the mid-eastern Yellow Sea, stratigraphic and geoacoustic models have been established and seismo-acoustic modeling has been fulfilled using ray tracing of finite element method. Stratigraphic model reflects seismo-, litho-, and chrono-stratigraphic sequences formed under a significant influence of Quaternary glacio-eustatic sea-level fluctuations. Each sequence consists of terrestrial to very-shallow-marine coarse-grained lowstand systems tract and tidal fine-grained transgressive to highstand systems tract. Based on mean grain-size data (121 samples) of the drill core, bulk density and P-wave velocity of depositional units have been inferred and extrapolated down to a depth of the recovery using the Hamilton's regression equations. As goo-acoustic parameters, the 121 pairs of bulk density and P-wave velocity have been averaged on each unit of the stratigraphic model. As a result of computer ray-tracing simulation of the subsurface strata, we have found that there are complex ray paths and many acoustic-shadow zones owing to the presence of irregular layer boundaries and low-velocity layers.

  • PDF