http://dx.doi.org/10.9728/dcs.2015.16.2.291

A Study about the Users' Preferred Playing Speeds on Categorized Video Content using WSOLA method

Kim, I-Gil (KT Institute of Convergence Technology)
Publication Information
Journal of Digital Contents Society, v.16, no.2, 2015, pp. 291-298
Abstract
In a fast-paced information technology environment, consumption of video content is shifting from one-way television viewing to VOD (Video on Demand) playback anywhere, anytime, on any device. This trend makes fine speed control increasingly important, alongside the existing strengths of digital video. Many video players already provide a fine-speed-control function that can speed up playback to skip a boring part or slow it down to focus on an exciting scene. Because audio is just as important as the picture for understanding speed-controlled video, a number of audio-processing algorithms have been proposed to avoid the pitch distortion that a change of playback speed would otherwise cause. In this study, WSOLA (Waveform-Similarity-Based Overlap-Add), a well-known technique for prosodic modification of speech signals, is applied to analyze users' needs for fine-speed-control video playback. Based on a survey of users' preferred speeds for categorized video content and an analysis of the results, the paper argues that a range of fine speed adjustments is needed to accommodate users' preferred ways of consuming video.
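The abstract names WSOLA but gives no implementation details. As a rough illustration of the general idea only, and not the author's implementation, the sketch below time-scales a mono signal in pure Python: it steps through the input at a hop scaled by the playback speed, and for each frame searches a small range for the segment that best continues the previously copied one before overlap-adding it. The function name `wsola`, the parameters `frame` and `tol`, and their default values are assumptions made for this example.

```python
import math

def wsola(x, speed, frame=256, tol=64):
    """Time-scale `x` by `speed` (>1 faster, <1 slower) while keeping pitch.

    `frame` (window length) and `tol` (search range, in samples) are
    illustrative defaults, not values taken from the paper.
    """
    hs = frame // 2                      # synthesis hop: 50% overlap in the output
    ha = max(1, round(hs * speed))       # analysis hop: step size through the input
    # Periodic Hann window; copies shifted by frame/2 sum to 1.
    win = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame) for n in range(frame)]

    y, wsum = [], []                     # output samples and accumulated window weight
    prev = 0                             # input position of the previously copied segment
    k = 0
    while k * ha + frame <= len(x):
        if k == 0:
            start = 0
        else:
            t = prev + hs                # natural continuation of the previous segment
            if t + frame > len(x):
                break
            lo = max(0, k * ha - tol)
            hi = min(len(x) - frame, k * ha + tol)
            # Pick the candidate whose waveform best matches the continuation
            # (maximum cross-correlation).
            start = max(
                range(lo, hi + 1),
                key=lambda s: sum(x[s + i] * x[t + i] for i in range(frame)),
            )
        pos = k * hs
        y.extend([0.0] * (pos + frame - len(y)))
        wsum.extend([0.0] * (pos + frame - len(wsum)))
        for i in range(frame):
            y[pos + i] += win[i] * x[start + i]
            wsum[pos + i] += win[i]
        prev = start
        k += 1
    # Normalize by the accumulated window weight (matters at the edges).
    return [v / w if w > 1e-8 else 0.0 for v, w in zip(y, wsum)]

# Usage: a 0.5 s, 440 Hz test tone at an assumed 8 kHz sample rate.
sr = 8000
tone = [math.sin(2 * math.pi * 440 * n / sr + 0.3) for n in range(sr // 2)]
fast = wsola(tone, 1.5)   # shorter output, pitch unchanged
slow = wsola(tone, 0.5)   # longer output, pitch unchanged
```

Plain resampling would change the duration by the same factor but shift the pitch with it; the similarity search is what keeps successive segments in phase so that the pitch is preserved.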
Keywords
Multimedia Video Service; Video Trick Play Service; Video Fine Speed Control; Preferred Content Playing Speed; WSOLA