• Title/Summary/Keyword: 소셜 데이터 분석

Search Result 735, Processing Time 0.025 seconds

Analysis of Major COVID-19 Issues Using Unstructured Big Data (비정형 빅데이터를 이용한 COVID-19 주요 이슈 분석)

  • Kim, Jinsol;Shin, Donghoon;Kim, Heewoong
    • Knowledge Management Research
    • /
    • v.22 no.2
    • /
    • pp.145-165
    • /
    • 2021
  • As of late December 2019, the spread of COVID-19 pandemic began which put the entire world in panic. In order to overcome the crisis and minimize any subsequent damage, the government as well as its affiliated institutions must maximize effects of pre-existing policy support and introduce a holistic response plan that can reflect this changing situation- which is why it is crucial to analyze social topics and people's interests. This study investigates people's major thoughts, attitudes and topics surrounding COVID-19 pandemic through the use of social media and big data. In order to collect public opinion, this study segmented time period according to government countermeasures. All data were collected through NAVER blog from 31 December 2019 to 12 December 2020. This research applied TF-IDF keyword extraction and LDA topic modeling as text-mining techniques. As a result, eight major issues related to COVID-19 have been derived, and based on these keywords, this research presented policy strategies. The significance of this study is that it provides a baseline data for Korean government authorities in providing appropriate countermeasures that can satisfy needs of people in the midst of COVID-19 pandemic.

Outdoor Healing Places Perception Analysis Using Named Entity Recognition of Social Media Big Data (소셜미디어 빅데이터의 개체명 인식을 활용한 옥외 힐링 장소 인식 분석)

  • Sung, Junghan;Lee, Kyungjin
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.50 no.5
    • /
    • pp.90-102
    • /
    • 2022
  • In recent years, as interest in healing increases, outdoor spaces with the concept of healing have been created. For more professional and in-depth planning and design, the perception and characteristics of outdoor healing places through social media posts were analyzed using NER. Text mining was conducted using 88,155 blog posts, and frequency analysis and clique cohesion analysis were conducted. Six elements were derived through a literature review, and two elements were added to analyze the perception and the characteristics of healing places. As a result, visitors considered place elements, date and time, social elements, and activity elements more important than personnel, psychological elements, plants and color, and form and shape when visiting healing places. The analysis allowed the derivation of perceptions and characteristics of healing places through keywords. From the results of the Clique, keywords, such as places, date and time, and relationship, were clustered, so it was possible to know where, when, what time, and with whom people were visiting places for healing. Through the study, the perception and characteristics of healing places were derived by analyzing large-scale data written by visitors. It was confirmed that specific elements could be used in planning and marketing.

A Study on WT-Algorithm for Effective Reduction of Association Rules (효율적인 연관규칙 감축을 위한 WT-알고리즘에 관한 연구)

  • Park, Jin-Hee;Pi, Su-Young
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.20 no.5
    • /
    • pp.61-69
    • /
    • 2015
  • We are in overload status of information not just in a flood of information due to the data pouring from various kinds of mobile devices, online and Social Network Service(SNS) every day. While there are many existing information already created, lots of new information has been created from moment to moment. Linkage analysis has the shortcoming in that it is difficult to find the information we want since the number of rules increases geometrically as the number of item increases with the method of finding out frequent item set where the frequency of item is bigger than minimum support in this information. In this regard, this thesis proposes WT-algorithm that represents the transaction data set as Boolean variable item and grants weight to each item by making algorithm with Quine-McKluskey used to simplify the logical function. The proposed algorithm can improve efficiency of data mining by reducing the unnecessary rules due to the advantage of simplification regardless of number of items.

Subgroup Analysis of Global Communication Network on Twitter (트위터에 나타난 국제 커뮤니케이션 네트워크의 하위집단 분석)

  • Seo, Il-Jung;Cho, Jaehee
    • The Journal of the Korea Contents Association
    • /
    • v.16 no.6
    • /
    • pp.671-679
    • /
    • 2016
  • We investigated subgroups within a global communication network to improve the empirical understanding of global communication phenomenon from the social network perspective. We collected global communication data from Twitter and constructed a global communication network. We also added countries' geographic and economic properties used in the United Nations and the World Economic Forum. We analyzed the subgroups' structure within the global communication network using centrality analysis, core-peripheral analysis, and cohesion analysis. We also detected communities embedded within the global communication network with modularity-based community detection methods. We found that the core countries occupy central positions in the global communication network and there is a hierarchical communication structure among the economic subgroups. Futhermore, we discovered some communities within the global communication network and found that countries within the communities can have homophily such as economy, geography, history, culture, and religion.

SWOSpark : Spatial Web Object Retrieval System based on Distributed Processing (SWOSpark : 분산 처리 기반 공간 웹 객체 검색 시스템)

  • Yang, Pyoung Woo;Nam, Kwang Woo
    • Journal of KIISE
    • /
    • v.45 no.1
    • /
    • pp.53-60
    • /
    • 2018
  • This study describes a spatial web object retrieval system using Spark, an in - memory based distributed processing system. Development of social networks has created massive amounts of spatial web objects, and retrieval and analysis of data is difficult by using exist spatial web object retrieval systems. Recently, development of distributed processing systems supports the ability to analyze and retrieve large amounts of data quickly. Therefore, a method is promoted to search a large-capacity spatial web object by using the distributed processing system. Data is processed in block units, and one of these blocks is converted to RDD and processed in Spark. Regarding the discussed method, we propose a system in which each RDD consists of spatial web object index for the included data, dividing the entire spatial region into non-overlapping spatial regions, and allocating one divided region to one RDD. We propose a system that can efficiently use the distributed processing system by dividing space and increasing efficiency of searching the divided space. Additionally by comparing QP-tree with R-tree, we confirm that the proposed system is better for searching the spatial web objects; QP-tree builds index with both spatial and words information while R-tree build index only with spatial information.

Research on Success & Failure of Platform business in perspective of multi-method research (결합형 방법론 관점에서의 플랫폼 비즈니스의 성공과 실패에 대한 연구)

  • Jin, Dong-Su
    • International Commerce and Information Review
    • /
    • v.15 no.2
    • /
    • pp.387-410
    • /
    • 2013
  • The competition aspect of business has been transformed from competition among companies to competition among ecosystem, and has been grown to platform based business, which is defined as ecosystem among business. Coming to the spotlight with the advantages of platform business combined software and hardware like Apple, platform business have been emerging in many fields. In this research, we define platform and platform based business, and then review related researches. After this, we review four representative research methodologies which are Yin(2011)' s case analysis research, Eisenhardt(2007)' s case analysis research, Romano etc' s web based qualitative data analysis method(2003), and Creswll(2010)' s open coding technique. And then, we suggest this research' s natural methodology combined with the advantages of four research methodologies. Based on our research methodology, we choose three high commercialized categories, which are smartphone platform business, social platform business, and search engine platform business. And then, we choose seven companies in three categories with success cases & failure cases, and analysis each case in perspective of our research methodology. And then, we suggest critical success & failure elements. Based on our findings, we suggest three strategic elements for the longevity of platform based business. Finally, we suggest the limitations of our research and further research issues.

  • PDF

User Perception of Personal Information Security: An Analytic Hierarch Process (AHP) Approach and Cross-Industry Analysis (기업의 개인정보 보호에 대한 사용자 인식 연구: 다차원 접근법(Analytic Hierarch Process)을 활용한 정보보안 속성 평가 및 업종별 비교)

  • Jonghwa Park;Seoungmin Han;Yoonhyuk Jung
    • Information Systems Review
    • /
    • v.25 no.4
    • /
    • pp.233-248
    • /
    • 2023
  • The increasing integration of intelligent information technologies within organizational systems has amplified the risk to personal information security. This escalation, in turn, has fueled growing apprehension about an organization's capabilities in safeguarding user data. While Internet users adopt a multifaceted approach in assessing a company's information security, existing research on the multiple dimensions of information security is decidedly sparse. Moreover, there is a conspicuous gap in investigations exploring whether users' evaluations of organizational information security differ across industry types. With an aim to bridge these gaps, our study strives to identify which information security attributes users perceive as most critical and to delve deeper into potential variations in these attributes across different industry sectors. To this end, we conducted a structured survey involving 498 users and utilized the analytic hierarchy process (AHP) to determine the relative significance of various information security attributes. Our results indicate that users place the greatest importance on the technological dimension of information security, followed closely by transparency. In the technological arena, banks and domestic portal providers earned high ratings, while for transparency, banks and governmental agencies stood out. Contrarily, social media providers received the lowest evaluations in both domains. By introducing a multidimensional model of information security attributes and highlighting the relative importance of each in the realm of information security research, this study provides a significant theoretical contribution. Moreover, the practical implications are noteworthy: our findings serve as a foundational resource for Internet service companies to discern the security attributes that demand their attention, thereby facilitating an enhancement of their information security measures.

Visual Analytics using Topic Composition for Predicting Event Flow (토픽의 조합으로 이벤트 흐름을 예측하기 위한 시각적 분석 시스템)

  • Yeon, Hanbyul;Kim, Seokyeon;Jang, Yun
    • KIISE Transactions on Computing Practices
    • /
    • v.21 no.12
    • /
    • pp.768-773
    • /
    • 2015
  • Emergence events are the cause of much economic damage. In order to minimize the damage that these events cause, it must be possible to predict what will happen in the future. Accordingly, many researchers have focused on real-time monitoring, detecting events, and investigating events. In addition, there have also been many studies on predictive analysis for forecasting of future trends. However, most studies provide future tendency per event without contextual compositive analysis. In this paper, we present a predictive visual analytics system using topic composition to provide future trends per event. We first extract abnormal topics from social media data to find interesting and unexpected events. We then search for similar emergence patterns in the past. Relevant topics in the past are provided by news media data. Finally, the user combines the relevant topics and a new context is created for contextual prediction. In a case study, we demonstrate our visual analytics system with two different cases and validate our system with possible predictive story lines.

Measuring Similarity Between Movies Based on Sentiment of Tweets (트위터를 활용한 감성 기반의 영화 유사도 측정)

  • Kim, Kyoungmin;Kim, Dong-Yun;Lee, Jee-Hyong
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.24 no.3
    • /
    • pp.292-297
    • /
    • 2014
  • As a Social Network Service (SNS) has become an integral part of our everyday lives, millions of users can express their opinion and share information regardless of time and place. Hence sentiment analysis using micro-blogs has been studied in various field to know people's opinion on particular topics. Most of previous researches on movie reviews consider only positive and negative sentiment and use it to predict movie rating. As people feel not only positive and negative but also various emotion, the sentiment that people feel while watching a movie need to be classified in more detail to extract more information than personal preference. We measure sentiment distributions of each movie from tweets according to the Thayer's model. Then, we find similar movies by calculating similarity between each sentiment distributions. Through the experiments, we verify that our method using micro-blogs performs better than using only genre information of movies.

Analysis of National R&D Patent Performance Network in Bio-Healthcare Sector (바이오 헬스케어 분야 국가연구개발 특허성과 네트워크 분석)

  • Kwon, Young-Eun;Kim, Jaesoo
    • Journal of the Korea Convergence Society
    • /
    • v.9 no.12
    • /
    • pp.17-24
    • /
    • 2018
  • This study attempted to analyze technology convergence structure and key technology research sectors in bio-health. For this, network analysis was performed based on the patent outcomes achieved through national R&Ds. Then, a patent network was analyzed to derive problems and collect data from the National Science & Technology Information Service. With the five groups obtained through the analysis of IPC network and national R&D patents in bio-health based on a research frame network, topics were chosen based on the bio-healthcare technology system. Then, the technology with the greatest ripple effects was derived and compared to other sectors, suggesting a direction for national R&D investments. It is anticipated that this study would make a contribution to a search for R&D investment direction by additionally analyzing overseas patent data and improving correlation analysis between technology convergence and government-led R&D expenses.