• Title/Summary/Keyword: 댓글 분류 시스템

Search Result 26, Processing Time 0.023 seconds

A Location Information System using Automatically Web-Crawled Data (자동으로 웹크롤링된 데이터를 이용한 위치정보 시스템)

  • Lim, Seung Hwan;Jo, hyeong Seok;Lee, Songwook
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2017.11a
    • /
    • pp.391-394
    • /
    • 2017
  • 우리는 웹상에 존재하는 여러 포털사이트의 위치정보와 그에 대한 데이터들을 수집하고 수집된 데이터들로부터 각각의 정보를 분류하여 웹상에 표시하는 페이지를 만들었다. 그리고 사람들이 자유롭게 위치정보에 대한 내용을 추가하고 공유할 수 있도록 하였고, 검색기능을 활용하여 해당위치를 빠르게 찾을 수 있도록 하였다. 그리고 각각의 데이터들에 대해 자유롭게 댓글을 달 수 있게 하여 사람들이 해당 장소에 대해 자유롭게 의견을 나눌 수 있도록 하였다.

Preprocessing technique for natural language processing considering the form of characters used in malicious comments (악성 댓글에 사용된 문자의 형태를 고려한 한국어 자연어처리를 위한 전처리 기법)

  • Kim, Hae-Soo;Kim, Mi-hui
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2022.05a
    • /
    • pp.543-545
    • /
    • 2022
  • 최근 악플에 대한 논란이 끊이지 않고 있어 이것을 해결하기위한 방법으로 자연어 처리를 이용하고 있다. 특히 소셜 미디어, 온라인 커뮤니티에서 많이 발생하고 있고 해당 매체에서는 한글을 그대로 사용하지 않고 그들의 은어를 섞어서 사용하며 그중에서 한글이 아닌 문자를 섞어서 만들어낸 문장도 있다. 이러한 문장은 기존의 모델에 학습된 데이터의 형태와 다르며 한글이 아닌 문장이 많을수록 모델의 예측이 부정확해진다는 단점이 있어 본 논문에서는 인공지능을 이용한 이미지 분류와 띄어쓰기, 오타 교정을 이용한 전처리 기법을 제안한다.

An Emotion Scanning System on Text Documents (텍스트 문서 기반의 감성 인식 시스템)

  • Kim, Myung-Kyu;Kim, Jung-Ho;Cha, Myung-Hoon;Chae, Soo-Hoan
    • Science of Emotion and Sensibility
    • /
    • v.12 no.4
    • /
    • pp.433-442
    • /
    • 2009
  • People are tending to buy products through the Internet rather than purchasing them from the store. Some of the consumers give their feedback on line such as reviews, replies, comments, and blogs after they purchased the products. People are also likely to get some information through the Internet. Therefore, companies and public institutes have been facing this situation where they need to collect and analyze reviews or public opinions for them because many consumers are interested in other's opinions when they are about to make a purchase. However, most of the people's reviews on web site are too numerous, short and redundant. Under these circumstances, the emotion scanning system of text documents on the web is rising to the surface. Extracting writer's opinions or subjective ideas from text exists labeled words like GI(General Inquirer) and LKB(Lexical Knowledge base of near synonym difference) in English, however Korean language is not provided yet. In this paper, we labeled positive, negative, and neutral attribute at 4 POS(part of speech) which are noun, adjective, verb, and adverb in Korean dictionary. We extract construction patterns of emotional words and relationships among words in sentences from a large training set, and learned them. Based on this knowledge, comments and reviews regarding products are classified into two classes polarities with positive and negative using SO-PMI, which found the optimal condition from a combination of 4 POS. Lastly, in the design of the system, a flexible user interface is designed to add or edit the emotional words, the construction patterns related to emotions, and relationships among the words.

  • PDF

The Differential Impacts of Positive and Negative Emotions on Travel-Related YouTube Video Engagement (유튜브 여행 동영상의 긍정적 감정과 부정적 감정이 사용자 참여에 미치는 영향)

  • Heejin Kim;Hayeon Song;Jinyoung Yoo;Sungchul Choi
    • Journal of Service Research and Studies
    • /
    • v.13 no.3
    • /
    • pp.1-19
    • /
    • 2023
  • Despite the growing importance of video-based social media content, such as vlogs, as a marketing tool in the travel industry, there is limited research on the characteristics that enhance engagement among potential travelers. This study explores the influence of emotional valence in YouTube travel content on viewer engagement, specifically likes and comments. We analyzed 4,619 travel-related YouTube videos from eight popular tourist cities. Using negative binomial regression analysis, we found that both positive and negative emotions significantly influence the number of likes received. Videos with higher positive emotions as well as negative emotions receive more likes. However, when it comes to the number of comments, only negative emotions showed a significant positive influence, while positive emotions had no significant impact. These findings offer valuable insights for marketers seeking to optimize engagement strategies on YouTube, considering the unique nature of travel products. Further research into the effects of specific emotions on engagement is warranted to improve marketing strategies. This study highlights the powerful impact of emotions on viewer engagement in the context of social media, particularly on YouTube.

A Study on the Perception of Predatory Journals among Members of the Korea Researcher Communities (국내 연구자 커뮤니티 구성원의 부실 학술지 인식에 대한 연구)

  • Myoung-A Hong;Wonsik Shim
    • Journal of the Korean Society for information Management
    • /
    • v.41 no.2
    • /
    • pp.97-130
    • /
    • 2024
  • The current debate in the academic community is on the criteria for predatory journals. Researchers are perplexed about what constitutes a predatory journal. The purpose of this study is to investigate how South Korean researchers discover and evaluate predatory journals. In order to achieve this, we collected 2,484 statements, comprising posts and comments, from Korean researcher communities, namely the Biological Research Information Center (BRIC), Hibrain.net, Phdkim.net, and the Scholarly Ecosystem Against Fake Publication Environment (SAFE). We divided the data into three primary categories-journals, publishers, and researchers-for the topic analysis. For each statement, we assigned 11 in-depth subtopic tags based on these categories. Six main points of contention emerged from the combinations of these sub-topic tags: (1) researchers' confusion about predatory journals and discussions about research performance; (2)(3) researchers' positive and negative perceptions of predatory journals; (4) researchers' evaluation criteria for journal quality and problems associated with the quality of Korean journals; (5) changes in publishing brought about by the introduction of open access (OA) and associated issues; and (6) discussions on broader issues within the academic ecosystem. By using a qualitative approach to examine how South Korean researchers view predatory journals, this study aims to advance basic knowledge of the discourse around them in the communities of domestic researchers.

Building a Korean Sentiment Lexicon Using Collective Intelligence (집단지성을 이용한 한글 감성어 사전 구축)

  • An, Jungkook;Kim, Hee-Woong
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.2
    • /
    • pp.49-67
    • /
    • 2015
  • Recently, emerging the notion of big data and social media has led us to enter data's big bang. Social networking services are widely used by people around the world, and they have become a part of major communication tools for all ages. Over the last decade, as online social networking sites become increasingly popular, companies tend to focus on advanced social media analysis for their marketing strategies. In addition to social media analysis, companies are mainly concerned about propagating of negative opinions on social networking sites such as Facebook and Twitter, as well as e-commerce sites. The effect of online word of mouth (WOM) such as product rating, product review, and product recommendations is very influential, and negative opinions have significant impact on product sales. This trend has increased researchers' attention to a natural language processing, such as a sentiment analysis. A sentiment analysis, also refers to as an opinion mining, is a process of identifying the polarity of subjective information and has been applied to various research and practical fields. However, there are obstacles lies when Korean language (Hangul) is used in a natural language processing because it is an agglutinative language with rich morphology pose problems. Therefore, there is a lack of Korean natural language processing resources such as a sentiment lexicon, and this has resulted in significant limitations for researchers and practitioners who are considering sentiment analysis. Our study builds a Korean sentiment lexicon with collective intelligence, and provides API (Application Programming Interface) service to open and share a sentiment lexicon data with the public (www.openhangul.com). For the pre-processing, we have created a Korean lexicon database with over 517,178 words and classified them into sentiment and non-sentiment words. In order to classify them, we first identified stop words which often quite likely to play a negative role in sentiment analysis and excluded them from our sentiment scoring. In general, sentiment words are nouns, adjectives, verbs, adverbs as they have sentimental expressions such as positive, neutral, and negative. On the other hands, non-sentiment words are interjection, determiner, numeral, postposition, etc. as they generally have no sentimental expressions. To build a reliable sentiment lexicon, we have adopted a concept of collective intelligence as a model for crowdsourcing. In addition, a concept of folksonomy has been implemented in the process of taxonomy to help collective intelligence. In order to make up for an inherent weakness of folksonomy, we have adopted a majority rule by building a voting system. Participants, as voters were offered three voting options to choose from positivity, negativity, and neutrality, and the voting have been conducted on one of the largest social networking sites for college students in Korea. More than 35,000 votes have been made by college students in Korea, and we keep this voting system open by maintaining the project as a perpetual study. Besides, any change in the sentiment score of words can be an important observation because it enables us to keep track of temporal changes in Korean language as a natural language. Lastly, our study offers a RESTful, JSON based API service through a web platform to make easier support for users such as researchers, companies, and developers. Finally, our study makes important contributions to both research and practice. In terms of research, our Korean sentiment lexicon plays an important role as a resource for Korean natural language processing. In terms of practice, practitioners such as managers and marketers can implement sentiment analysis effectively by using Korean sentiment lexicon we built. Moreover, our study sheds new light on the value of folksonomy by combining collective intelligence, and we also expect to give a new direction and a new start to the development of Korean natural language processing.