DOI QR코드

DOI QR Code

An Optimized e-Lecture Video Search and Indexing framework

  • Received : 2021.08.05
  • Published : 2021.08.30

Abstract

The demand for e-learning through video lectures is rapidly increasing due to its diverse advantages over the traditional learning methods. This led to massive volumes of web-based lecture videos. Indexing and retrieval of a lecture video or a lecture video topic has thus proved to be an exceptionally challenging problem. Many techniques listed by literature were either visual or audio based, but not both. Since the effects of both the visual and audio components are equally important for the content-based indexing and retrieval, the current work is focused on both these components. A framework for automatic topic-based indexing and search depending on the innate content of the lecture videos is presented. The text from the slides is extracted using the proposed Merged Bounding Box (MBB) text detector. The audio component text extraction is done using Google Speech Recognition (GSR) technology. This hybrid approach generates the indexing keywords from the merged transcripts of both the video and audio component extractors. The search within the indexed documents is optimized based on the Naïve Bayes (NB) Classification and K-Means Clustering models. This optimized search retrieves results by searching only the relevant document cluster in the predefined categories and not the whole lecture video corpus. The work is carried out on the dataset generated by assigning categories to the lecture video transcripts gathered from e-learning portals. The performance of search is assessed based on the accuracy and time taken. Further the improved accuracy of the proposed indexing technique is compared with the accepted chain indexing technique.

Keywords

Acknowledgement

This work is supported by University Grants Commission (UGC) under Minor Research Project titled "Fast Content Based Search, Navigation and Retrieval system for E-Learning". Project Id: F.No:4-4/2015(MRP/UGC-SERO).

References

  1. Aasif Ansari and Muzammil H Mohammed, "Content based Video Retrieval Systems - Methods, Techniques, Trends and Challenges", International Journal of Computer Applications, Volume 112, No. 7, Feb 2015.
  2. Jose M. Perea-Ortega, M. Teresa Martin-Valdivia, Arturo Montejo-Raez and L. Alfonso Urena-Lopez, "A Content-based Information Retrieval System for Video Searching", IEEE Eighth International Symposium on Natural Language Processing, 2009.
  3. M. Ikonomakis, S. Kotsiantis and V. Tampakas, "Text Classification Using Machine Learning Techniques", WSEAS Transactions on Computers, Volume 4, Issue 8, August 2005, pp. 966-974.
  4. V. S. Reddy, P. Kinnicutt and R. Lee, "Text Document Clustering: The Application of Cluster Analysis to Textual Document", International Conference on Computational Science and Computational Intelligence (CSCI), Las Vegas, NV, 2016, pp. 1174-1179. doi: 10.1109/CSCI.2016.0222
  5. H. Yang, M. Siebert, P. Luhne, H. Sack, and C. Meinel, "Automatic lecture video indexing using video ocr technology", in Multimedia (ISM), 2011 IEEE International Symposium on. IEEE, 2011, pp. 111-116.
  6. Shuangbao Paul Wang, Xiaolong Cheng, Carolyn Maher and William Kelly,"InVideo: An Automatic Video Index and Search Engine for Large Video Collections", in The Second International Conference on Advances in Signal, Image and Video Processing (SIGNAL), 2017, pp. 27-32.
  7. Yang Haojin, Franka Grunewald and Christoph Meinel, "Automated Extraction of Lecture Outlines from Lecture Videos - A Hybrid Solution for Lecture Video Indexing", in 4th International Conference on Computer Supported Education (CSEDU), 2012, pp. 13-22.
  8. Manish Kanadje, Zachary Miller, Anurag Agarwal, Roger Gaborski, Richard Zanibbi and Stephanie Ludi, "Assisted keyword indexing for lecture videos using unsupervised keyword spotting", Pattern Recognition Letters, Volume 71, Feb 2016, pp. 8-15. https://doi.org/10.1016/j.patrec.2015.11.012
  9. Yang H., Oehlke C. and Meinel C, "An Automated Analysis and Indexing Framework for Lecture Video Portal", In: Popescu E., Li Q., Klamma R., Leung H., Specht M. (eds) Advances in Web-Based Learning - ICWL 2012. ICWL 2012. Lecture Notes in Computer Science, vol 7558. Springer, Berlin, Heidelberg.
  10. V.K. Kamabathula and S. Iyer, "Automated tagging to enable fine-grained browsing of lecture videos", in IEEE International Conference on Technology of Education, 2011.
  11. Stephan Repp, Andreas Gross and Christoph Meinel, "Browsing within Lecture Videos Based on the Chain Index of Speech Transcription", IEEE Transactions On Learning Technologies, Vol. 1, No. 3, July-September 2008, pp. 145-156. https://doi.org/10.1109/TLT.2008.22
  12. MIT opencourseware. [Online]. Available: https://ocw.mit.edu/index.htm
  13. Khan Academy. [Online]. Available: https://www.khanacademy.org/
  14. NPTEL. [Online]. Available: http://nptel.ac.in/
  15. I. Mani, M. T. Maybury, "Advances in Automatic Text Summarization," Volume 293, Mit press, 1999.
  16. I. Rish, "An empirical study of the naive bayes classifier," in IJCAI 2001 workshop on empirical methods in artificial intelligence, pp. 41-46, 2001.
  17. M. L. Haritha, K. Ramani,"An Optimized E-Lecture Video Retrieval based on Machine Learning Classification", International Journal of Engineering and Advanced Technology,Volume8, Issue6,August 2019, pp.4820-4827. doi: 10.35940/ijeat.F9114.088619
  18. Zhang, Harry. "The Optimality of Naive Bayes." FLAIRS Conference, 2004.
  19. T. Kanungo, D. M. Mount, N. S. Netanyahu, C. D. Piatko, R. Silverman and A. Y. Wu, "An efficient k-means clustering algorithm: analysis and implementation", in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 24, no. 7, July 2002,pp. 881-892.doi: 10.1109/TPAMI.2002.1017616
  20. Bing Zhou, Yiyu Yao, "Evaluating information retrieval system performance based on user preference", Journal Of Intelligent Information Systems, Volume 34, Issue 3, pp 227-248, June 2010. https://doi.org/10.1007/s10844-009-0096-5