• Title/Summary/Keyword: social information processing model

Search Result 177, Processing Time 0.029 seconds

Building a Korean Sentiment Lexicon Using Collective Intelligence (집단지성을 이용한 한글 감성어 사전 구축)

  • An, Jungkook;Kim, Hee-Woong
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.2
    • /
    • pp.49-67
    • /
    • 2015
  • Recently, emerging the notion of big data and social media has led us to enter data's big bang. Social networking services are widely used by people around the world, and they have become a part of major communication tools for all ages. Over the last decade, as online social networking sites become increasingly popular, companies tend to focus on advanced social media analysis for their marketing strategies. In addition to social media analysis, companies are mainly concerned about propagating of negative opinions on social networking sites such as Facebook and Twitter, as well as e-commerce sites. The effect of online word of mouth (WOM) such as product rating, product review, and product recommendations is very influential, and negative opinions have significant impact on product sales. This trend has increased researchers' attention to a natural language processing, such as a sentiment analysis. A sentiment analysis, also refers to as an opinion mining, is a process of identifying the polarity of subjective information and has been applied to various research and practical fields. However, there are obstacles lies when Korean language (Hangul) is used in a natural language processing because it is an agglutinative language with rich morphology pose problems. Therefore, there is a lack of Korean natural language processing resources such as a sentiment lexicon, and this has resulted in significant limitations for researchers and practitioners who are considering sentiment analysis. Our study builds a Korean sentiment lexicon with collective intelligence, and provides API (Application Programming Interface) service to open and share a sentiment lexicon data with the public (www.openhangul.com). For the pre-processing, we have created a Korean lexicon database with over 517,178 words and classified them into sentiment and non-sentiment words. In order to classify them, we first identified stop words which often quite likely to play a negative role in sentiment analysis and excluded them from our sentiment scoring. In general, sentiment words are nouns, adjectives, verbs, adverbs as they have sentimental expressions such as positive, neutral, and negative. On the other hands, non-sentiment words are interjection, determiner, numeral, postposition, etc. as they generally have no sentimental expressions. To build a reliable sentiment lexicon, we have adopted a concept of collective intelligence as a model for crowdsourcing. In addition, a concept of folksonomy has been implemented in the process of taxonomy to help collective intelligence. In order to make up for an inherent weakness of folksonomy, we have adopted a majority rule by building a voting system. Participants, as voters were offered three voting options to choose from positivity, negativity, and neutrality, and the voting have been conducted on one of the largest social networking sites for college students in Korea. More than 35,000 votes have been made by college students in Korea, and we keep this voting system open by maintaining the project as a perpetual study. Besides, any change in the sentiment score of words can be an important observation because it enables us to keep track of temporal changes in Korean language as a natural language. Lastly, our study offers a RESTful, JSON based API service through a web platform to make easier support for users such as researchers, companies, and developers. Finally, our study makes important contributions to both research and practice. In terms of research, our Korean sentiment lexicon plays an important role as a resource for Korean natural language processing. In terms of practice, practitioners such as managers and marketers can implement sentiment analysis effectively by using Korean sentiment lexicon we built. Moreover, our study sheds new light on the value of folksonomy by combining collective intelligence, and we also expect to give a new direction and a new start to the development of Korean natural language processing.

Local Information-based Betweenness Centrality to Identify Important Nodes in Social Networks (사회관계망에서 중요 노드 식별을 위한 지역정보 기반 매개 중심도)

  • Shon, Jin Gon;Kim, Yong-Hwan;Han, Youn-Hee
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.2 no.5
    • /
    • pp.209-216
    • /
    • 2013
  • In traditional social network analysis, the betweenness centrality measure has been heavily used to identify the relative importance of nodes in terms of message delivery. Since the time complexity to calculate the betweenness centrality is very high, however, it is difficult to get it of each node in large-scale social network where there are so many nodes and edges. In this paper, we define a new type of network, called the expanded ego network, which is built only with each node's local information, i.e., neighbor information of the node's neighbor nodes, and also define a new measure, called the expended ego betweenness centrality. Through the intensive experiment with Barab$\acute{a}$si-Albert network model to generate the scale-free networks which most social networks have as their embedded feature, we also show that the nodes' importance rank based on the expanded ego betweenness centrality has high similarity with that based on the traditional betweenness centrality.

Formulating Analytical Solution of Network ODE Systems Based on Input Excitations

  • Bagchi, Susmit
    • Journal of Information Processing Systems
    • /
    • v.14 no.2
    • /
    • pp.455-468
    • /
    • 2018
  • The concepts of graph theory are applied to model and analyze dynamics of computer networks, biochemical networks and, semantics of social networks. The analysis of dynamics of complex networks is important in order to determine the stability and performance of networked systems. The analysis of non-stationary and nonlinear complex networks requires the applications of ordinary differential equations (ODE). However, the process of resolving input excitation to the dynamic non-stationary networks is difficult without involving external functions. This paper proposes an analytical formulation for generating solutions of nonlinear network ODE systems with functional decomposition. Furthermore, the input excitations are analytically resolved in linearized dynamic networks. The stability condition of dynamic networks is determined. The proposed analytical framework is generalized in nature and does not require any domain or range constraints.

Design and Implementation of Motivation based Behavior Decision Model for Social Robot (소셜 로봇을 위한 동기부여 기반 행동결정모델 설계 및 구현)

  • Yu, Soo-jeong;Park, Jung-Min
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2015.10a
    • /
    • pp.1796-1799
    • /
    • 2015
  • 최근 소셜 로봇은 단순히 인간의 사회적 행동을 모방하는 것에 그치지 않고 인간에게 유용한 서비스를 제공하는 데에 초점을 맞추고 있다. 소셜 로봇이 효과적으로 서비스를 제공하기 위해서는 자발적으로 행동을 실행할 수 있도록 지원하는 메커니즘이 필요하나 이에 대한 연구는 미비하다. 본 연구는 소셜 로봇과 상호작용하는 사람에게 효과적인 서비스를 제공하기 위하여 소셜 로봇의 행동을 결정하는 동기 시스템을 제안하고 구현하였다. 제안된 행동 결정 동기 시스템은 장노년층의 스마트폰 활용 도우미 서비스를 제공하는 소셜 로봇에 적용하였으며, 이를 위해 스마트폰 기반의 소셜 로봇을 설계하고 구현하였다.

Personalized Movie Recommendation System Combining Data Mining with the k-Clique Method

  • Vilakone, Phonexay;Xinchang, Khamphaphone;Park, Doo-Soon
    • Journal of Information Processing Systems
    • /
    • v.15 no.5
    • /
    • pp.1141-1155
    • /
    • 2019
  • Today, most approaches used in the recommendation system provide correct data prediction similar to the data that users need. The method that researchers are paying attention and apply as a model in the recommendation system is the communities' detection in the big social network. The outputted result of this approach is effective in improving the exactness. Therefore, in this paper, the personalized movie recommendation system that combines data mining for the k-clique method is proposed as the best exactness data to the users. The proposed approach was compared with the existing approaches like k-clique, collaborative filtering, and collaborative filtering using k-nearest neighbor. The outputted result guarantees that the proposed method gives significant exactness data compared to the existing approach. In the experiment, the MovieLens data were used as practice and test data.

A Model for Blog Rank based on User Behavior and Social Relationship (사용자 행동과 사회적 관계 기반의 블로그 랭크 모델)

  • Hwang, Jae-Seon;Kim, Jangwon
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2009.04a
    • /
    • pp.547-550
    • /
    • 2009
  • 블로그는 누구나 쉽게 이용할 수 있는 도구이며, 블로그를 통한 콘텐츠의 생산과 소비는 빠른 속도로 증가하고 있다. 이런 블로그의 글은 단순히 정보를 전달하는 웹 페이지 이상의 사회적 관계를 포함하고 있다. 하지만 지금까지 웹 페이지 및 블로그에 대한 검색은 이러한 사회적 관계를 고려하지 않고 있다. 따라서 본 논문에서는 사용자 행동과 사회적 관계에 기반한 블로그 랭크 모델을 제안한다. 이를 기반으로 국내의 서로 다른 서비스에서 제공한 블로그 랭킹을 새롭게 제안한 블로그 모델과 비교하였고, 이를 통해 제안하는 블로그 모델의 타당성을 제시하였다.

A Proposal of a Model for Measuring Influences of Social Networks (소셜 네트웍 영향력 측정 모델 제안)

  • Lee, Seung-Hee;Park, Young-Ho
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2011.04a
    • /
    • pp.1497-1500
    • /
    • 2011
  • 최근, 스마트폰 가입자 수가 급격히 증가하는 가운데, 사람과 사람의 관계에 중심을 두는 서비스인 SNS가 급격히 성장하고 있다. 모바일 디바이스를 사용한 소셜 네트워크가 접목되어 사용자들 사이에 자연스러운 의사 교류 관계가 형성되며, 장소나 이벤트를 기반하여 빠르고 끊임없이 정보들이 발생하게 되었다. 그러나 너무 많이 발생하는 불필요한 데이터들로 인해 의미 있는 정보, 필요한 정보를 파악하는 데 어려움이 생긴다. 이러한 문제점을 해결하기 위해, 소셜 네트워크 서비스를 통해 제공되는 정보들 가운데 영향력 있는 정보가 무엇인지 판단하는 연구가 중요시 되고 있다. 본 논문에서는 소셜 네트워크 영향력 측정을 위해 장소를 기반으로 한 동적 소셜 커뮤니티에서의 소셜 네트워크 액티비티를 중심으로 이벤트에 대한 영향력을 측정하는 방법에 대해 고찰하고, 새로운 영향력 측정 방안에 대해 제안한다.

Educational contents creation model extension designed based on Social Resource (소셜자원기반 교수-학습 콘텐츠 생성모델 확장 설계)

  • Kim, Kyung-Rog;Moon, NamMee
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2011.11a
    • /
    • pp.1505-1506
    • /
    • 2011
  • 소셜 서비스의 확산에 따라 이러닝 분야에서도 소셜러닝이 확산되고 있다. 소셜러닝이 기존 교육과 구별되는 가장 큰 특징은 콘텐츠의 생산과 소비 방법으로, 네트워크를 통해 가치를 전달하고, 다른 사람으로부터 배운다는 것이다. 따라서 소셜미디어 콘텐츠와 소셜네트워크 활동 콘텐츠를 학습객체화하여 함께 이용할 수 있어야 한다고 본다. 이를 위해 본 논문에서는 소셜미디어 콘텐츠를 학습객체화 할 수 있도록 콘텐츠 생성모델 확장 방안을 제안하고자 한다. 소셜자원기반 콘텐츠 생성모델은, 학습객체 정의와 메타데이터 생성모델로 구성된다.

A Network Packet Analysis Method to Discover Malicious Activities

  • Kwon, Taewoong;Myung, Joonwoo;Lee, Jun;Kim, Kyu-il;Song, Jungsuk
    • Journal of Information Science Theory and Practice
    • /
    • v.10 no.spc
    • /
    • pp.143-153
    • /
    • 2022
  • With the development of networks and the increase in the number of network devices, the number of cyber attacks targeting them is also increasing. Since these cyber-attacks aim to steal important information and destroy systems, it is necessary to minimize social and economic damage through early detection and rapid response. Many studies using machine learning (ML) and artificial intelligence (AI) have been conducted, among which payload learning is one of the most intuitive and effective methods to detect malicious behavior. In this study, we propose a preprocessing method to maximize the performance of the model when learning the payload in term units. The proposed method constructs a high-quality learning data set by eliminating unnecessary noise (stopwords) and preserving important features in consideration of the machine language and natural language characteristics of the packet payload. Our method consists of three steps: Preserving significant special characters, Generating a stopword list, and Class label refinement. By processing packets of various and complex structures based on these three processes, it is possible to make high-quality training data that can be helpful to build high-performance ML/AI models for security monitoring. We prove the effectiveness of the proposed method by comparing the performance of the AI model to which the proposed method is applied and not. Forthermore, by evaluating the performance of the AI model applied proposed method in the real-world Security Operating Center (SOC) environment with live network traffic, we demonstrate the applicability of the our method to the real environment.

Building Sentence Meaning Identification Dataset Based on Social Problem-Solving R&D Reports (사회문제 해결 연구보고서 기반 문장 의미 식별 데이터셋 구축)

  • Hyeonho Shin;Seonki Jeong;Hong-Woo Chun;Lee-Nam Kwon;Jae-Min Lee;Kanghee Park;Sung-Pil Choi
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.4
    • /
    • pp.159-172
    • /
    • 2023
  • In general, social problem-solving research aims to create important social value by offering meaningful answers to various social pending issues using scientific technologies. Not surprisingly, however, although numerous and extensive research attempts have been made to alleviate the social problems and issues in nation-wide, we still have many important social challenges and works to be done. In order to facilitate the entire process of the social problem-solving research and maximize its efficacy, it is vital to clearly identify and grasp the important and pressing problems to be focused upon. It is understandable for the problem discovery step to be drastically improved if current social issues can be automatically identified from existing R&D resources such as technical reports and articles. This paper introduces a comprehensive dataset which is essential to build a machine learning model for automatically detecting the social problems and solutions in various national research reports. Initially, we collected a total of 700 research reports regarding social problems and issues. Through intensive annotation process, we built totally 24,022 sentences each of which possesses its own category or label closely related to social problem-solving such as problems, purposes, solutions, effects and so on. Furthermore, we implemented four sentence classification models based on various neural language models and conducted a series of performance experiments using our dataset. As a result of the experiment, the model fine-tuned to the KLUE-BERT pre-trained language model showed the best performance with an accuracy of 75.853% and an F1 score of 63.503%.