[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.5391/JKIIS.2004.14.6.759

Real-time Voice Change System using Pitch Change

Kim, Weon-Goo (군산대학교 전자정보공학부)

Publication Information

Journal of the Korean Institute of Intelligent Systems / v.14, no.6, 2004 , pp. 759-763 More about this Journal

Abstract

In this paper, real-time voice change method using pitch change technique is proposed to change one's voice to the other voice. For this purpose, sampling rate change method using DFT (Discrete Fourier Transform) method and time scale modification method using SOLA (Synchronized Overlap and Add) method is combined to change pitch. In order to evaluate the performance of the proposed method, voice transformation experiments were conducted. Experimental results showed that original speech signal is changed to the other speech signal in which original speaker's identity is difficult to find. The system is implemented using TI TMS320C6711DSK board to verify the system runs in real time.

Keywords

음성신호;피치;음성변환;표본화율 변환;시간축 변환;

Citations & Related Records

Reference

1	T. F. Quatieri and R. J. Mcaulay, "Shape invariance time-scale & pitch modification of speech," IEEE Trans. on Acoustic Speech and Signal Processing, vol. 40, No. 3, pp. 497-510, March, 1992. DOI ScienceOn
2	T. Takgi and E. Miyasaka, "A speech prosody conversion system with a high quality speech analysis-synthesis method," proc. of EUROSPEECH '93, Berlin, pp. 995-998, 1993.
3	J. Laroche, Y. Stylianou and E. Moulines, "HNS ; speech modification based on a harmonic + noise model," proc. of ICASSP, vol. 2, pp. 550-553, 1993. DOI
4	M. A. Richards, "Helium speech enhancement using the short-time fourier transform," IEEE Trans. on Acoustic Speech and Signal Processing, vol. ASSP-30, No. 6, pp. 841-853, December, 1982. DOI
5	P. J. Bloom, "High-quality digital audio in the entertainment industry: an overview to achievements and challenges," IEEE ASSP Magazine, pp. 2-25, October, 1985.
6	Il Hyun Nam, "Voice personality transformation," Ph. D Thesis, Electrical Engineering Rensselaer Polytechnic Institute, Troy, NY, 1991.
7	H. Valbret, E. Moulines, and J. P. Tubach, "Voice transformation using PSOLA technique," Speech Communication, vol. 11, pp. 175-187, 1992. DOI ScienceOn
8	S. Roucos and A. M. Wilgus, "High quality time-scale modification for speech," proc. of ICASSP, vol. 1, pp. 493-469, 1985 DOI
9	J. Makhoul and A. E. Jaroudi, "Time-scale modification in medium to low rate speech coding," proc. of ICASSP, vol. 1, pp. 1705-1708, 1986 DOI
10	E. Hardam, "High-quality time scale modification of speech signals using fast synchronized-overlap-add algorithm," proc. of ICASSP, vol. 1, pp. 409-412, 1990 DOI
11	E. Moulines and F. Charpentier, "Pitch Synchronous Waveform Processing Techniques for Text-to-speech Synthesis using Diphones," Speech Communication, vol. 9 (5/6), pp. 453-467, 1990 DOI ScienceOn
12	E. Moulines and J. Laroche, "Non-parametric techniques for pitch-scale and time-scale modification of speech," Speech Communication, vol. 16, pp. 175-205, 1995 DOI ScienceOn
13	R. J. Mcaulay and T. F. Quatieri, "Speech transformations based on a sinusoidal representation," IEEE Trans. on Acoustic Speech and Signal Processing, vol. 34, No. 1, pp. 1449-1464, December, 1986 DOI
14	M. Narendranath, H. A. Murthy, S. Rajendran and B. Yegnanarayana, "Transformation of formants of voice conversion using artificial neural networks," Speech Communication, vol. 16, No. 2, pp. 207-216, 1995. DOI ScienceOn
15	K. S. Lee, D. H. Youn, and I. W. Cha, "Voice personality transformation using an orthogonal vector space conversion," proc. of EUROSPEECH '95, Madrid, pp. 427-430, 1995.
16	N. Iwahashi and Y. Sagisaka, "Speech spectrum conversion based on speaker interpolation and multi-functional representation with weighting by radial basis function networks," Speech Communication, vol. 16, No. 2, pp. 139-152, 1995. DOI ScienceOn
17	H. Mizuno and M. Abe, "Voice conversion algorithm based on piecewise linear conversion rules of formant frequency and spectrum tilt," Speech Communication, vol. 16, No. 2, pp. 153-164, 1995. DOI ScienceOn
18	M. Abe, S. Nakamura, K. Shikano and H. Kuwabara, "Voice conversion through vector quantization," proc. of ICASSP, vol. 1, pp. 565-568, 1988.
19	M. Abe, "A segment-based approach to voice conversion," proc. of ICASSP, vol. 1, pp. 765-768, 1991.
20	Y. Stylianou O. Cappe and E. Moulines, "Statistical methods for voice quality transformation," proc. of EUROSPEECH '95, Madrid, pp. 447-450, 1995.
21	L. R. Rabiner and R. W. Schafer, "Digital Processing of Speech Signal", Prentice- Hall Inc., 1978.
22	D. W. Griffin and J. S. Lim, "Signal estimation from the modified short -time fourier transform," IEEE Trans. on Acoustic Speech and Signal Processing, vol. ASSP-32, pp. 236-243, April, 1984 DOI

KSCI

Real-time Voice Change System using Pitch Change 피치 변환을 사용한 실시간 음성 변환 시스템

Real-time Voice Change System using Pitch Change