Browse > Article
http://dx.doi.org/10.5391/JKIIS.2004.14.6.759

Real-time Voice Change System using Pitch Change  

Kim, Weon-Goo (군산대학교 전자정보공학부)
Publication Information
Journal of the Korean Institute of Intelligent Systems / v.14, no.6, 2004 , pp. 759-763 More about this Journal
Abstract
In this paper, real-time voice change method using pitch change technique is proposed to change one's voice to the other voice. For this purpose, sampling rate change method using DFT (Discrete Fourier Transform) method and time scale modification method using SOLA (Synchronized Overlap and Add) method is combined to change pitch. In order to evaluate the performance of the proposed method, voice transformation experiments were conducted. Experimental results showed that original speech signal is changed to the other speech signal in which original speaker's identity is difficult to find. The system is implemented using TI TMS320C6711DSK board to verify the system runs in real time.
Keywords
음성신호;피치;음성변환;표본화율 변환;시간축 변환;
Citations & Related Records
연도 인용수 순위
  • Reference
1 T. F. Quatieri and R. J. Mcaulay, "Shape invariance time-scale & pitch modification of speech," IEEE Trans. on Acoustic Speech and Signal Processing, vol. 40, No. 3, pp. 497-510, March, 1992.   DOI   ScienceOn
2 T. Takgi and E. Miyasaka, "A speech prosody conversion system with a high quality speech analysis-synthesis method," proc. of EUROSPEECH '93, Berlin, pp. 995-998, 1993.
3 J. Laroche, Y. Stylianou and E. Moulines, "HNS ; speech modification based on a harmonic + noise model," proc. of ICASSP, vol. 2, pp. 550-553, 1993.   DOI
4 M. A. Richards, "Helium speech enhancement using the short-time fourier transform," IEEE Trans. on Acoustic Speech and Signal Processing, vol. ASSP-30, No. 6, pp. 841-853, December, 1982.   DOI
5 P. J. Bloom, "High-quality digital audio in the entertainment industry: an overview to achievements and challenges," IEEE ASSP Magazine, pp. 2-25, October, 1985.
6 Il Hyun Nam, "Voice personality transformation," Ph. D Thesis, Electrical Engineering Rensselaer Polytechnic Institute, Troy, NY, 1991.
7 H. Valbret, E. Moulines, and J. P. Tubach, "Voice transformation using PSOLA technique," Speech Communication, vol. 11, pp. 175-187, 1992.   DOI   ScienceOn
8 S. Roucos and A. M. Wilgus, "High quality time-scale modification for speech," proc. of ICASSP, vol. 1, pp. 493-469, 1985   DOI
9 J. Makhoul and A. E. Jaroudi, "Time-scale modification in medium to low rate speech coding," proc. of ICASSP, vol. 1, pp. 1705-1708, 1986   DOI
10 E. Hardam, "High-quality time scale modification of speech signals using fast synchronized-overlap-add algorithm," proc. of ICASSP, vol. 1, pp. 409-412, 1990   DOI
11 E. Moulines and F. Charpentier, "Pitch Synchronous Waveform Processing Techniques for Text-to-speech Synthesis using Diphones," Speech Communication, vol. 9 (5/6), pp. 453-467, 1990   DOI   ScienceOn
12 E. Moulines and J. Laroche, "Non-parametric techniques for pitch-scale and time-scale modification of speech," Speech Communication, vol. 16, pp. 175-205, 1995   DOI   ScienceOn
13 R. J. Mcaulay and T. F. Quatieri, "Speech transformations based on a sinusoidal representation," IEEE Trans. on Acoustic Speech and Signal Processing, vol. 34, No. 1, pp. 1449-1464, December, 1986   DOI
14 M. Narendranath, H. A. Murthy, S. Rajendran and B. Yegnanarayana, "Transformation of formants of voice conversion using artificial neural networks," Speech Communication, vol. 16, No. 2, pp. 207-216, 1995.   DOI   ScienceOn
15 K. S. Lee, D. H. Youn, and I. W. Cha, "Voice personality transformation using an orthogonal vector space conversion," proc. of EUROSPEECH '95, Madrid, pp. 427-430, 1995.
16 N. Iwahashi and Y. Sagisaka, "Speech spectrum conversion based on speaker interpolation and multi-functional representation with weighting by radial basis function networks," Speech Communication, vol. 16, No. 2, pp. 139-152, 1995.   DOI   ScienceOn
17 H. Mizuno and M. Abe, "Voice conversion algorithm based on piecewise linear conversion rules of formant frequency and spectrum tilt," Speech Communication, vol. 16, No. 2, pp. 153-164, 1995.   DOI   ScienceOn
18 M. Abe, S. Nakamura, K. Shikano and H. Kuwabara, "Voice conversion through vector quantization," proc. of ICASSP, vol. 1, pp. 565-568, 1988.
19 M. Abe, "A segment-based approach to voice conversion," proc. of ICASSP, vol. 1, pp. 765-768, 1991.
20 Y. Stylianou O. Cappe and E. Moulines, "Statistical methods for voice quality transformation," proc. of EUROSPEECH '95, Madrid, pp. 447-450, 1995.
21 L. R. Rabiner and R. W. Schafer, "Digital Processing of Speech Signal", Prentice- Hall Inc., 1978.
22 D. W. Griffin and J. S. Lim, "Signal estimation from the modified short -time fourier transform," IEEE Trans. on Acoustic Speech and Signal Processing, vol. ASSP-32, pp. 236-243, April, 1984   DOI