Context Sharing Framework Based on Time Dependent Metadata for Social News Service

  • Ga, Myung-Hyun (Department of Computer and Information Engineering, Inha University) ;
  • Oh, Kyeong-Jin (Department of Computer and Information Engineering, Inha University) ;
  • Hong, Myung-Duk (Department of Computer and Information Engineering, Inha University) ;
  • Jo, Geun-Sik (School of Computer and Information Engineering, Inha University)
  • Received : 2013.10.25
  • Accepted : 2013.10.31
  • Published : 2013.12.31

Abstract

The emergence of Internet technology and social network services (SNS) has increased the flow of information and changed communication from one-way to two-way. Users no longer only consume information; they also create it and share it with friends across social network services. This shift has made social media, including Social TV, one of the most important communication tools. Social TV lets people watch a TV program and, at the same time, share information about it or its content with friends through social media. Social News, also known as participatory social media, is growing in popularity. It shapes users' interest in social issues through the Internet and builds news credibility on users' reputations. Conventional news service platforms, however, focus only on news recommendation. Recent developments in SNS allow users to share and disseminate news, yet conventional platforms offer no dedicated way to share it. Current Social News services give users access only to an entire news item, not to the part that matches their interest. A user who is interested in only part of a news item therefore finds it hard to share just that part, and in the worst case the recipient may understand the news in a different context. To solve this, a Social News service must provide additional information along with the news. Yovisto, an academic video search service, is one example: it provides time dependent metadata for videos, so users can search for and watch part of a video according to that metadata and share it with friends on social media. Yovisto divides and synchronizes a video at each point where the presentation slide changes.
However, this method cannot be applied to news video, because news video is not accompanied by presentation slides. A segmentation method is therefore required to divide the news video and create time dependent metadata. In this paper, we propose a time dependent metadata-based framework that segments news content and provides time dependent metadata so that users can use context information when communicating with their friends. The transcript of the news is divided using the proposed story segmentation method. We provide a tag that represents the entire content of the news and sub tags that represent each news segment, together with the segment's starting time. The time dependent metadata helps users track the news information and lets them leave a comment on each segment. Users can also share the news, either as a segment or as a whole, based on the time metadata, which helps the recipient understand the shared news. To demonstrate performance, we evaluate the accuracy of both story segmentation and tag generation. We measured the accuracy of the story segmentation, which is based on semantic similarity, and compared it with a benchmark algorithm. Experimental results show that the proposed method outperforms benchmark algorithms in story segmentation accuracy. Sub tag accuracy matters most in the proposed framework, because sub tags are what allow a specific news context to be shared with others. To extract more accurate sub tags, we created a stop word list of terms unrelated to the news content, such as the names of the anchor and reporter, and applied it to the framework. We then analyzed the accuracy of the tags and sub tags that represent the context of the news. The analysis suggests that the proposed framework helps users share their opinions with context information in social media and Social News.
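The tag, sub tag, and starting-time metadata described in the abstract can be pictured with a short Python sketch. This is an illustration under stated assumptions, not the authors' implementation: the names `Segment`, `NewsMetadata`, `top_terms`, `build_metadata`, and `share_link`, the frequency-based tag extraction, the contents of the stop word list, and the `#t=` time fragment in the shared link are all hypothetical choices. The segments are assumed to come from a separate story segmentation step.

```python
import re
from collections import Counter
from dataclasses import dataclass, field

# Hypothetical stop words: common function words plus anchor/reporter names
# and sign-off phrases that say nothing about the news content.
STOP_WORDS = {"the", "a", "an", "of", "in", "to", "and", "is", "are",
              "jane", "doe", "reporting"}


def top_terms(text, k=3, stop_words=STOP_WORDS):
    """Return the k most frequent terms that are not stop words."""
    words = [w for w in re.findall(r"[a-z]+", text.lower())
             if w not in stop_words]
    return [word for word, _ in Counter(words).most_common(k)]


@dataclass
class Segment:
    start_seconds: int          # where the segment starts in the video
    text: str                   # transcript text of the segment
    sub_tags: list = field(default_factory=list)


@dataclass
class NewsMetadata:
    video_url: str
    tags: list                  # tags for the news item as a whole
    segments: list              # one Segment per story segment


def build_metadata(video_url, timed_segments, k=3):
    """Build metadata from (start_seconds, transcript_text) pairs."""
    segments = [Segment(start, text, top_terms(text, k))
                for start, text in timed_segments]
    full_text = " ".join(text for _, text in timed_segments)
    return NewsMetadata(video_url, top_terms(full_text, k), segments)


def share_link(meta, index):
    """Return a link that points at the start of one segment."""
    return f"{meta.video_url}#t={meta.segments[index].start_seconds}"
```

Calling `build_metadata` with a list of `(start_seconds, transcript_text)` pairs yields one set of tags for the whole item and sub tags per segment, and `share_link` turns a segment's starting time into a link a friend could open at that point. Counting word frequency is only the simplest possible extractor; it stands in for whatever tag generation the framework actually uses.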

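The development of the Internet and the emergence of SNS have greatly changed how information flows. With this change, social media has risen rapidly, and the importance of Social TV and Social News, which combine social media with video content, is being emphasized. In this environment, users do not simply browse content; they share information and experiences about it with friends and acquaintances who use the same content, and even create new content. Existing Social News services, however, do not reflect these user characteristics. In particular, because they consider only user participation, services are hard to differentiate from one another, and context is hard to share when users share information or experiences about news content. To address this, this paper proposes a framework that segments news by content and provides time dependent metadata extracted from the segmented news. The proposed framework divides the news transcript by content using a story segmentation method. It also provides a tag representing the whole news content, sub tags representing each news segment, and the position in the video at which each segment starts, that is, time dependent metadata. Given time dependent metadata, Social News users can browse only the part of the news they want and leave their views on that part. When forwarding news or sharing opinions, sending the metadata along gives direct access to the intended content. The performance of the framework is determined by how well the extracted sub tags represent the actual content of the news. The sub tags extracted depend in turn on the accuracy of story segmentation and on the sub tag extraction method. With this in mind, we applied a semantic similarity-based story segmentation method to the framework, ran an experiment comparing its performance with a benchmark algorithm, and analyzed sub tag accuracy by comparing the sub tags extracted from the segmented news with the actual news content. As a result, the story segmentation method that considers semantic similarity performed better, and the extracted sub tags were words related to the context.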
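The idea of dividing a transcript where the content changes can be sketched in a few lines of Python. This is a simplified illustration, not the proposed method: it places a boundary wherever the similarity between two adjacent sentences falls below a threshold, and it uses cosine similarity over word counts as a purely lexical stand-in for the semantic similarity measure the framework relies on. The function names and the threshold value are assumptions made for the example.

```python
import math
import re
from collections import Counter


def bag_of_words(sentence):
    """Count the lowercase alphabetic words in a sentence."""
    return Counter(re.findall(r"[a-z]+", sentence.lower()))


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def segment_stories(sentences, threshold=0.1):
    """Group consecutive sentences into story segments.

    A new segment starts whenever a sentence's similarity to the
    previous sentence is below the threshold.
    """
    if not sentences:
        return []
    segments = [[sentences[0]]]
    for prev, cur in zip(sentences, sentences[1:]):
        if cosine(bag_of_words(prev), bag_of_words(cur)) < threshold:
            segments.append([cur])      # similarity dropped: new story
        else:
            segments[-1].append(cur)    # same story continues
    return segments
```

Replacing `cosine` with a measure that scores related words (for example, synonyms) as similar would move this sketch closer to a semantic approach, since two sentences about the same story often share few exact words.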
