통합 검색 | Korea Science

RawNet3 화자 표현을 활용한 임의의 화자 간 음성 변환을 위한 StarGAN의 확장 (Extending StarGAN-VC to Unseen Speakers Using RawNet3 Speaker Representation)

박보경;박소민;홍현기
- 정보처리학회논문지:소프트웨어 및 데이터공학
- /
- 제12권7호
- /
- pp.303-314
- /
- 2023
음성 변환(Voice Conversion)은 개인의 음성 데이터를 다른 사람의 음향적 특성(음조, 리듬, 성별 등)으로 재생성할 수 있는 기술로, 교육, 의사소통, 엔터테인먼트 등 다양한 분야에서 활용되고 있다. 본 논문은 StarGAN-VC 모델을 기반으로 한 접근 방식을 제안하여, 병렬 발화(Utterance) 없이도 현실적인 음성을 생성할 수 있다. 고정된 원본(source) 및 목표(target)화자 정보의 원핫 벡터(One-hot vector)를 이용하는 기존 StarGAN-VC 모델의 제약을 극복하기 위해, 본 논문에서는 사전 훈련된 Rawnet3를 사용하여 목표화자의 특징 벡터를 추출한다. 이를 통해 음성 변환은 직접적인 화자 간 매핑 없이 잠재 공간(latent space)에서 이루어져 many-to-many를 넘어서 any-to-any 구조가 가능하다. 기존 StarGAN-VC 모델에서 사용된 손실함수 외에도, Wasserstein-1 거리를 사용하여 생성된 음성 세그먼트가 목표 음성의 음향적 특성과 일치하도록 보장했다. 또한, 안정적인 훈련을 위해 Two Time-Scale Update Rule (TTUR)을 사용한다. 본 논문에서 제시한 평가 지표들을 적용한 실험 결과에 따르면, 제한된 목소리 변환만이 가능한 기존 StarGAN-VC 기법 대비, 본 논문의 제안 방법을 통해 다양한 발화자에 대한 성능이 개선된 음성 변환을 제공할 수 있음을 정량적으로 확인하였다.
https://doi.org/10.3745/KTSDE.2023.12.7.303 인용 PDF

Duct ANC 시스템에서 2차음원 방향별 소음감소효과 (An attenuation effect of noise according to the direction of secondary sound source in duct ANC system)

이형석;이응석
- 한국소음진동공학회:학술대회논문집
- /
- 한국소음진동공학회 2008년도 추계학술대회논문집
- /
- pp.497-502
- /
- 2008
In this paper, we studied on an attenuation effect of automobile exhaust noise according to the direction of secondary sound source in duct ANC system. Automobile exhaust noise was recorded at 800rpm. 3500rpm and 5000rpm of a diesel engine. Directions of loudspeaker(second sound source) can be exchanged to $30^{\circ}$, $90^{\circ}$ and $150^{\circ}$ against the primary noise flow by acrylic ducts to be made for experimentation. DSP board with TMS320C6416 chip of Texas Instrument Co used to control adaptive ANC system. This ANC system is based on the single-channel FxLMS algorithm. In experiment result, when the loud speaker direction was $150^{\circ}$, the attenuation effect showed largely. In case of $90^{\circ}$ duct, the noise was a little increased. In case of $30^{\circ}$ duct, the noise was a little increased or decreased according to the frequency range and the sound pressure(dB) of exhaust noise to comply with engine rpm.
PDF

압력 섭동 장치 설계/제작 및 검증시험 (A Development of A Gas Mechanical Pulsator)

김태완;황오식;고영성;정세용
- 한국추진공학회지
- /
- 제13권3호
- /
- pp.50-57
- /
- 2009
본 연구에서는 액체로켓 및 가스터빈 등의 각종 연소기의 연소불안정 특성 연구에 활용하기 위하여, 공급 기체에 인위적인 섭동을 유발할 수 있는 압력 섭동 장치의 설계/개발을 수행하였다. 이를 위하여 디스크 형태의 교란 발생 장치를 설계/제작하고, 디스크 회전속도를 제어하면서 압력 진폭, 주파수와 질량 유량을 측정하였다. 먼저 이 장치를 기존의 연소불안정 연구를 위한 모델 연소기의 스피커를 대신하여 장착한 후 음향공 감쇠 효과 특성 실험을 수행한 결과, 기존의 스피커를 이용한 실험 결과와 거의 유사함을 확인하였다. 또한 일정한 장치 상류 압력 하에서 회전 주파수의 변화는 공급 유량에 영향을 미치지 않고, 가압 압력에 따라 공급 유량을 조절할 수 있음을 확인하였다. 따라서 이 장치는 향후 가진 크기에 제한이 없으며 유동이 있는 상태에서의 연소불안정 특성을 위한 가진 장치와 기체 공기를 이용하는 각종 연소기에서의 기체 교란에 따른 연소 특성 연구에 활용할 수 있음을 확인하였다.
PDF KSCI

가변부하를 갖는 선형 증폭기를 구동하기 위한 전압적응 변환기용 전력공급기 개발 (Development of Power Supply for Voltage-Adaptable Converter to Drive Linear Amplifiers with Variable Loads)

엄기홍
- 한국인터넷방송통신학회논문지
- /
- 제14권6호
- /
- pp.251-257
- /
- 2014
모터의 일종으로서 엑츄에이터는 전기 에너지를 운동 에너지로 변환하기 위하여 전류를 이용하여 동작하는 메커니즘을 제어하는 시스템이다. 전압을 가청 신호로 변환하는 기능을 갖는 오디오 액츄 에이터로서는 스피커와 증폭기가 흔히 사용된다. 산업 현장에서는 고출력 고양질의 전력 시스템이 필요하다. 이러한 시스템들이 품질이 좋은 출력을 생성하기 위하여 오디오 시스템의 출력 임피던스를 제어해야 만 한다. 우리는 이 논문에서 가변 부하가 연결되어 있는 능동 증폭기 시스템을 구동하기 위한 적응 특성을 전력 공급기를 제시한다. 전기 신호를 오디오 신호로 변환하는 스피커의 저항값이 변동함에 따라 능동 증폭기에 대한 전력 공급 장치는 부하값의 변동에 적응하여 스피커에 최대 전력을 공급하며, 피크전류의 급격한 변동과 과잉전류의 흐름으로부터 증폭기를 보호하게 된다.
https://doi.org/10.7236/JIIBC.2014.14.6.251 인용 PDF KSCI

직렬연결 다채널 스피커의 PC 기반 제어 시스템 (PC-based Control System of Serially Connected Multi-channel Speakers)

이선용;김태완;변지성;송문빈;정연모
- 정보처리학회논문지A
- /
- 제15A권6호
- /
- pp.317-324
- /
- 2008
본 논문에서는 최근에 연구되어 발표된 하나의 선을 사용하여 여러 채널의 음향 신호를 전송하는 기술인 다채널 직렬연결 스피커 시스템에 USB 인터페이스를 사용하여 PC 환경에서 보다 많은 채널의 음향 신호를 제어할 수 있는 시스템을 제시하였다. USB 호스트에서 음원 파일을 분석하고 처리한 후 전송 알고리즘에 맞게 패킷을 생성하여 오디오 데이터를 실시간으로 전송한다. 각 스피커에서는 해당하는 디지털 신호만을 검출하여 처리한 후 DAC를 통해 음향을 재생한다. 사용자는 PC에서 시스템을 GUI 환경을 통해서 쉽게 제어할 수 있다.
https://doi.org/10.3745/KIPSTA.2008.15-A.6.317 인용 PDF KSCI

Adaptive Multi-Rate(AMR) 음성부호화 알고리즘 (Adaptive Multi-Rate(AMR) Speech Coding Algorithm)

서정욱;배건성
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 2000년도 하계종합학술대회 논문집(4)
- /
- pp.92-97
- /
- 2000
An AMR(Adaptive Multi-Rate) speech coding algorithm has been adopted as a standard speech codec for IMT-2000. It is based on the algebraic CELP, and consists of eight speech coding modes having the bit rate from 4.75 kbit/s to 12.2 kbit/s. It also contains the VAD(Voice Activity Detector), SCR (Source Controlled Rate) operation, and error concealment scheme for robustness in a radio channel. The bit rate of AMR is changed on a frame basis depending on the channel condition. In this paper, we introduced AMR speech coding algorithm and performed the real-time implementation using TMS320C6201, i.e., a Texas Instrument's fixed-point DSP. With the ANSI C source code released from ETSI and 3GPP, we convert and optimize the program to make it run in real time using the C compiler and assembly language. It is verified that the decoded result of the implemented speech codec on the DSP is identical with the PC simulation result using ANSI C code for test sequences. Also, actual sound input/output test using microphone and speaker demonstrates its proper real-time operation without distortions or delays.
PDF

소음계 교정 자동화 시스템 개발 및 성능평가 (Development and Performance of Automated Calibration System of Sound Level Meters)

김용태;조문재;이용봉;서재갑;서상준
- 한국소음진동공학회:학술대회논문집
- /
- 한국소음진동공학회 1998년도 춘계학술대회논문집; 용평리조트 타워콘도, 21-22 May 1998
- /
- pp.704-709
- /
- 1998
An automated calibration system of sound level meters was developed and tested. As a standard sound source, the speaker unit(Forstex FE208) cabineted by 440 * 390 * 490 mm$^{3}$(LHW) volume wood box was adopted. Including this source, the driving part was found out to have a good linearity of sound pressure output vs AC input. We use the Hybrid-Bisect, /Newton-Raphson method modified by the linearity as searching algorithm. Personal computer and program do the control, measurements, and calculations and finally do the accumulation of useful data and results. Several trials of automatic calibration using this developed system give reliable results.
PDF

소음계 교정 자동화 시스템 개발 및 성능평가 (Development and Performance of Automated Calibration System of Sound Level Meters)

김용태;조문재;이용봉;서재갑
- 소음진동
- /
- 제8권5호
- /
- pp.879-886
- /
- 1998
An automated calibration system of sound level meters was developed and tested. As a standard sound source, the speaker unit(Forstex FE208) cabineted by 440$\times$390$\times$490 $\textrm{mm}^3$(LHW) volume wood box was adopted. Including this source, the driving part was found out to have a good linearity of sound pressure output vs AC voltage input. The Hybrid-Bisect/Newton-Raphson method modified by the linearity was adopted as a searching algorithm. Uisng GPIB interface, the console PC make the control, measurements, and calculations and finally make the accumulation of useful data and results automatically by the instructon in the program coded by C languate. Several trials of automatic calibration using this developed system give the reliable results.
PDF

Application of Block On-Line Blind Source Separation to Acoustic Echo Cancellation

Ngoc, Duong Q.K.;Park, Chul;Nam, Seung-Hyon
- The Journal of the Acoustical Society of Korea
- /
- 제27권1E호
- /
- pp.17-24
- /
- 2008
Blind speech separation (BSS) is well-known as a powerful technique for speech enhancement in many real world environments. In this paper, we propose a new application of BSS - acoustic echo cancellation (AEC) in a car environment. For this purpose, we develop a block-online BSS algorithm which provides robust separation than a batch version in changing environments with moving speakers. Simulation results using real world recordings show that the block-online BSS algorithm is very robust to speaker movement. When combined with AEC, simulation results using real audio recording in a car confirm the expectation that BSS improves double talk detection and echo suppression.
PDF KSCI

단일 센서 방식의 적응 능동 소음제어 (Adaptive Active Noise Control of Single Sensor Method)

김영달;장석구
- 소음진동
- /
- 제10권6호
- /
- pp.941-948
- /
- 2000
Active noise control is an approach to reduce the noise by utilizing a secondary noise source that destructively interferes with the unwanted noise. In general, active noise control systems rely on multiple sensors to measure the unwanted noise field and the effect of the cancellation. This paper develops an approach that utilizes a single sensor. The noise field is modeled as a stochastic process, and an adaptive algorithm is used to adaptively estimate the parameters of the process. Based on these parameter estimates, a canceling signal is generated. Oppenheim assumed that transfer function characteristics from the canceling source to the error sensor is only a propagation delay. This paper proposes a modified Oppenheim algorithm by considering transfer characteristics of speaker-path-sensor This transfer characteristics is adaptively cancelled by the proposed adaptive modeling technique. Feasibility of the proposed method is proved by computer simulations with artificially generated random noises and sine wave noise. The details of the proposed architecture. and theoretical simulation of the noise cancellation system for three dimension enclosure are presented in the Paper.
PDF

검색결과 104건 처리시간 0.029초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)