References
- S. Roucos and A. M. Wilgus, "High quality time-scale modification for speech," proc. of ICASSP, vol. 1, pp. 493-496, 1985 https://doi.org/10.1109/ICASSP.1985.1168381
- J. Makhoul and A. El-Jaroudi, "Time-scale modification in medium to low rate speech coding," proc. of ICASSP, vol. 1, pp. 1705-1708, 1986 https://doi.org/10.1109/ICASSP.1986.1169252
- E. Hardam, "High-quality time scale modification of speech signals using fast synchronized-overlap-add algorithm," proc. of ICASSP, vol. 1, pp. 409-412, 1990 https://doi.org/10.1109/ICASSP.1990.115715
- E. Moulines and F. Charpentier, "Pitch Synchronous Waveform Processing Techniques for Text-to-speech Synthesis using Diphones," Speech Communication, vol. 9 (5/6), pp. 453-467, 1990 https://doi.org/10.1016/0167-6393(90)90021-Z
- E. Moulines and J. Laroche, "Non-parametric techniques for pitch-scale and time-scale modification of speech," Speech Communication, vol. 16, pp. 175-205, 1995 https://doi.org/10.1016/0167-6393(94)00054-E
- R. J. McAulay and T. F. Quatieri, "Speech transformations based on a sinusoidal representation," IEEE Trans. on Acoustics, Speech, and Signal Processing, vol. 34, No. 6, pp. 1449-1464, December, 1986 https://doi.org/10.1109/TASSP.1986.1164985
- T. F. Quatieri and R. J. McAulay, "Shape invariant time-scale and pitch modification of speech," IEEE Trans. on Signal Processing, vol. 40, No. 3, pp. 497-510, March, 1992. https://doi.org/10.1109/78.120793
- T. Takagi and E. Miyasaka, "A speech prosody conversion system with a high quality speech analysis-synthesis method," proc. of EUROSPEECH '93, Berlin, pp. 995-998, 1993.
- J. Laroche, Y. Stylianou, and E. Moulines, "HNS: Speech modification based on a harmonic + noise model," proc. of ICASSP, vol. 2, pp. 550-553, 1993. https://doi.org/10.1109/ICASSP.1993.319365
- M. A. Richards, "Helium speech enhancement using the short-time Fourier transform," IEEE Trans. on Acoustics, Speech, and Signal Processing, vol. ASSP-30, No. 6, pp. 841-853, December, 1982. https://doi.org/10.1109/TASSP.1982.1163973
- P. J. Bloom, "High-quality digital audio in the entertainment industry: an overview of achievements and challenges," IEEE ASSP Magazine, pp. 2-25, October, 1985.
- Il Hyun Nam, "Voice personality transformation," Ph.D. thesis, Electrical Engineering, Rensselaer Polytechnic Institute, Troy, NY, 1991.
- H. Valbret, E. Moulines, and J. P. Tubach, "Voice transformation using PSOLA technique," Speech Communication, vol. 11, pp. 175-187, 1992. https://doi.org/10.1016/0167-6393(92)90012-V
- K. S. Lee, D. H. Youn, and I. W. Cha, "Voice personality transformation using an orthogonal vector space conversion," proc. of EUROSPEECH '95, Madrid, pp. 427-430, 1995.
- N. Iwahashi and Y. Sagisaka, "Speech spectrum conversion based on speaker interpolation and multi-functional representation with weighting by radial basis function networks," Speech Communication, vol. 16, No. 2, pp. 139-152, 1995. https://doi.org/10.1016/0167-6393(94)00051-B
- H. Mizuno and M. Abe, "Voice conversion algorithm based on piecewise linear conversion rules of formant frequency and spectrum tilt," Speech Communication, vol. 16, No. 2, pp. 153-164, 1995. https://doi.org/10.1016/0167-6393(94)00052-C
- M. Narendranath, H. A. Murthy, S. Rajendran, and B. Yegnanarayana, "Transformation of formants for voice conversion using artificial neural networks," Speech Communication, vol. 16, No. 2, pp. 207-216, 1995. https://doi.org/10.1016/0167-6393(94)00058-I
- M. Abe, S. Nakamura, K. Shikano and H. Kuwabara, "Voice conversion through vector quantization," proc. of ICASSP, vol. 1, pp. 565-568, 1988.
- M. Abe, "A segment-based approach to voice conversion," proc. of ICASSP, vol. 1, pp. 765-768, 1991.
- Y. Stylianou, O. Cappe, and E. Moulines, "Statistical methods for voice quality transformation," proc. of EUROSPEECH '95, Madrid, pp. 447-450, 1995.
- L. R. Rabiner and R. W. Schafer, "Digital Processing of Speech Signals," Prentice-Hall Inc., 1978.
- D. W. Griffin and J. S. Lim, "Signal estimation from the modified short-time Fourier transform," IEEE Trans. on Acoustics, Speech, and Signal Processing, vol. ASSP-32, pp. 236-243, April, 1984 https://doi.org/10.1109/TASSP.1984.1164317