• Title/Summary/Keyword: social data analysis (소셜 데이터 분석)

Search Result 739, Processing Time 0.024 seconds

Implementation on Online Storage with Hadoop (하둡을 이용한 온라인 대용량 저장소 구현)

  • Eom, Se-Jin;Lim, Seung-Ho
    • Proceedings of the Korea Information Processing Society Conference / 2013.05a / pp.56-58 / 2013
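  • Processing and analyzing large-scale big data, including data from social network services such as Facebook and Twitter, has recently become an important issue. Because users generate data continuously, how to handle this data, and how to analyze it and turn it into something meaningful and valuable, are regarded as important questions. Among big data management tools, Hadoop is regarded as the one that comes closest to solving the problems of big data processing and analysis. This paper designs, builds, and implements an online data store, the most basic element of an online large-capacity storage system, based on HDFS (Hadoop Distributed File System), a core component of Hadoop, and Java, and in doing so addresses issues in how large-capacity storage is implemented.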

Sensitivity Identification Method for New Words of Social Media based on Naive Bayes Classification (나이브 베이즈 기반 소셜 미디어 상의 신조어 감성 판별 기법)

  • Kim, Jeong In;Park, Sang Jin;Kim, Hyoung Ju;Choi, Jun Ho;Kim, Han Il;Kim, Pan Koo
    • Smart Media Journal / v.9 no.1 / pp.51-59 / 2020
  • From the era of PC communication through the development of the internet, new terms have continually been coined on social media. With the spread of smartphones a social media culture has formed, and newly coined words are becoming part of that culture. With the advent of social networking sites, and with smartphones serving as a bridge, the volume of data has increased in real time. New words have many advantages, including allowing short sentences that work around the character limits of various messengers and reduce data. However, new words have no dictionary meaning, which limits and degrades algorithms such as those used in data mining. Therefore, in this paper, data is collected through web crawling, the new words contained in the text data are extracted, and an emotional classification is established in order to determine the opinion of a document. The experiment proceeds in three stages. First, new words collected from social media are learned as positive or negative. Next, to derive and verify emotional values using standard-language documents, TF-IDF is used to score the sentiment of nouns and assign emotional values to the data. As with the new words, the classified emotional values are applied to verify that emotions are classified correctly in standard-language documents. Finally, the newly coined words and the standard emotional values are combined to perform a comparative analysis of the technique.
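The abstract above does not give implementation details, so the following is only a minimal sketch of how a multinomial Naive Bayes classifier with add-one (Laplace) smoothing could label short texts as positive or negative. The function names, the labels `"pos"` and `"neg"`, and the toy tokens are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(labeled_docs):
    """Count documents per class and words per class.

    labeled_docs is a list of (tokens, label) pairs (hypothetical format).
    """
    class_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for tokens, label in labeled_docs:
        class_counts[label] += 1
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return class_counts, word_counts, vocab


def classify(tokens, model):
    """Return the label with the highest log-posterior score."""
    class_counts, word_counts, vocab = model
    total_docs = sum(class_counts.values())
    best_label, best_score = None, float("-inf")
    for label in class_counts:
        # Log prior: share of training documents with this label.
        score = math.log(class_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for tok in tokens:
            if tok not in vocab:
                continue  # ignore tokens never seen in training
            # Add-one smoothing keeps unseen (word, label) pairs non-zero.
            likelihood = (word_counts[label][tok] + 1) / (total_words + len(vocab))
            score += math.log(likelihood)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Toy training data (made up for illustration only).
training = [
    (["lit", "great"], "pos"),
    (["lit", "fun"], "pos"),
    (["cringe", "bad"], "neg"),
    (["cringe", "boring"], "neg"),
]
model = train_naive_bayes(training)
print(classify(["lit"], model))
```

In a real pipeline the tokens would come from a crawler and a tokenizer suited to the language, and the labeled examples would be far more numerous than in this toy set.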

Implementation of a Travel Route Recommendation System Utilizing Daily Scheduling Templates

  • Kim, Hyeon Gyu
    • Journal of the Korea Society of Computer and Information / v.27 no.10 / pp.137-146 / 2022
  • For travel itinerary recommendation services, which have recently come into high demand, our previous work introduced a method that quantifies the popularity of places, including tour spots, restaurants, and accommodations, through social big data analysis and creates a travel schedule based on the analysis results. However, the generated schedule consisted mainly of travel routes connecting tour spots by the shortest distance, and it did not provide detailed schedule information, such as restaurants and accommodations, for each travel date. This paper presents an algorithm that uses a scenario template to construct a detailed travel route within a travel schedule created from social big data, and introduces a prototype system that implements it. The proposed system consists of modules for place information collection, place-specific popularity score estimation, shortest travel route generation, daily schedule organization, and UI visualization. Experiments based on social reviews collected for 63,000 places in Gyeongnam province demonstrated the effectiveness of the proposed system.

A Study on Customer Satisfaction for Courier Companies based on SNS Big data (소셜 네트워크 빅데이터 기반 택배업체 고객만족도에 관한 연구)

  • Lee, DongJun;Won, JongUn;Kwon, YongJang;Kim, MiRye
    • The Journal of Society for e-Business Studies / v.21 no.4 / pp.55-67 / 2016
  • Global courier companies have been striving to win more customers and profits through differentiated services, because price competition has eroded their profits. Improving customer satisfaction by improving courier service quality is therefore more important than ever. However, the conventional way of measuring courier service quality is limited in that offline surveys cost a great deal of time and money. This limitation could be overcome with less effort and cost by using online social big data analysis, which would help improve the competitiveness of courier companies. In this research, we therefore collected comments about domestic and international courier companies from big data on social network services, analyzed customer satisfaction using R, and verified the results by comparing them with the American Customer Satisfaction Index (ACSI) and the Korean National Customer Satisfaction Index (NCSI). The results show a clear correlation between the SNS analysis and customer satisfaction. This study can serve as a foundation for easily predicting customer satisfaction from real-time SNS information.

A study of MapReduce Algorithm for Bigdata (빅데이터 처리를 위한 맵리듀스 연구)

  • Kim, Man-Yun;Youn, Hee-Yong
    • Proceedings of the Korean Society of Computer Information Conference / 2014.07a / pp.341-342 / 2014
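  • With the explosive growth of data over the past ten years, we have entered the big data era. In particular, as the development of social networks in recent years increased the amount of data being generated, Hadoop emerged as a system for processing it. Thanks to open-source Hadoop, anyone can now operate a system that processes large volumes of data that previously could not be stored or processed. Hadoop, a software framework for large-scale processing and analysis, is widely used as a representative cloud computing technology. Hadoop is broadly divided into HDFS (Hadoop Distributed File System), which handles data storage, and MapReduce, which processes the data. This paper compares and analyzes the existing MapReduce with YARN, which is referred to as next-generation MapReduce, and proposes uses for MapReduce and ways to utilize it efficiently.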
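As a rough illustration of the MapReduce processing model mentioned in the abstract above, the sketch below simulates the map, shuffle, and reduce phases of a word count in a single Python process. It does not use Hadoop or YARN; the function names are hypothetical and only show how data flows between the phases.

```python
from collections import defaultdict


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one input record."""
    for word in document.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle: group all emitted values by key, as the framework would."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: combine the values for one key into a single result."""
    return key, sum(values)


def word_count(documents):
    """Run the three phases in sequence over a list of text records."""
    pairs = (pair for doc in documents for pair in map_phase(doc))
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


print(word_count(["big data", "big social data"]))
```

In an actual cluster the map and reduce functions run in parallel on different nodes and the shuffle moves data across the network; here everything runs sequentially to keep the flow visible.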


Airline Customer Satisfaction Analysis using Social Media Sentiment Evaluation: Full Service Carriers vs. Low Cost Carriers (소셜 미디어 감성평가를 활용한 항공사 고객만족도 분석 - 대형항공사와 저비용항공사 비교연구)

  • Lee, Ju-Yang;Jang, Phil-Sik
    • Journal of Digital Convergence / v.15 no.6 / pp.189-196 / 2017
  • This study investigates customer satisfaction with full service carriers (FSC) and low cost carriers (LCC) using social media sentiment evaluation. A total of 77,591 tweets about two FSC and six LCC, posted from 2008 to 2016, were aggregated and classified according to airline choice factors. Three appraisers performed sentiment evaluation to assess customer satisfaction. The results showed that customer satisfaction with LCC was significantly higher (p<0.001) than with FSC. Furthermore, overall customer satisfaction with both FSC and LCC has shown a consistent downward trend over the last seven years. The results also highlighted low customer satisfaction with respect to booking and flight operation factors, and a steep decline in customer satisfaction across booking, onboard services, and marketing factors for FSC. The results of this study have practical implications for the airline industry, which can use this quantitative data to improve customer satisfaction with FSC and LCC.

A Critical Review on Social Media Campaign Studies: Trends and Issues (소셜미디어 선거캠페인 연구 동향과 쟁점)

  • Chang, Woo-young
    • Informatization Policy / v.26 no.1 / pp.3-24 / 2019
  • This study examined the trends and issues of social media campaign studies from three aspects: campaign strategy, the institutional environment regulating social media, and political effect. It then performed an empirical analysis of the 20th general election in order to discuss the political effect, which has been analyzed the least. Specifically, this study empirically examined the trends in candidates' participation in the Twitter campaign, the partial mobilization and voter response, and the platform effect on the election results. The study examined all of the candidates' Twitter accounts and traffic and found the following results. First, the number of participants in the Twitter campaign increased significantly compared to the 19th general election, and the campaign was dominated by only two political parties that had more power to mobilize resources. Second, it was clearly identified that Twitter is a partisan medium; specifically, those in the mainstream of the Democratic Party mobilized many more supporters. Lastly, the Twitter campaign had a positive impact on the share of votes and the chances of winning the election. In particular, the number of followers and the duration of activities were found to be statistically meaningful, showing that promoting networking and social capital is more important in election campaigns.

National Awareness of the 2019 World Swimming Championships using Big Data from Social Network Analysis (소셜네트워크 분석의 빅데이터를 활용한 2019세계수영선수권 대회의 국내 인식조사)

  • Kim, Gi-Tak
    • Journal of Korea Entertainment Industry Association / v.13 no.4 / pp.173-184 / 2019
  • The data for this study were gathered by keyword search in social media through Textom and analyzed as big data. Three areas (2019 Gwangju World Swimming Championships, 2019 Gwangju World Swimming Masters Competition, and 2019 World Swimming Championships Problem) were handled consistently through data collection and refinement in the web environment. We loaded the collected words into UCINET 6, visualized them, and conducted a CONCOR analysis to grasp the similarity relationships among words and to identify clusters of common factors. The analysis showed that the clusters related to the 2019 Gwangju World Swimming Championships consisted of four major areas of recognition and perception, with searches mainly concerning operational aspects of the championships. The cluster related to the 2019 Gwangju World Swimming Masters Competition was divided into two areas, major recognition and peripheral recognition, with searches mainly concerning the promotion of the Masters Competition and aspects of the competition. The cluster related to the problems of the 2019 Gwangju World Swimming Championships was divided into five areas, with searches mainly concerning the place, operation, institutions, and events associated with the problems of the championships.

A Co-Occuring HashTag Analysis Technique In SNS EnvironMents (SNS 환경에서 동시출현 해시태그 분석 기법)

  • Kim, Se-Jin;Lee, Sang-Don
    • Proceedings of the Korea Contents Association Conference / 2014.11a / pp.223-224 / 2014
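  • As we enter the big data era and social network services (SNS) develop into an important means of sharing information, predictive analysis, trend analysis, issue detection, and similar work are increasing accordingly, and use cases of big data techniques in the content field are on the rise. As mobile devices spread rapidly and SNS use grows, large volumes of data are accumulating. On SNS services that support hashtags, such as Instagram, the co-occurrence of hashtags indicates an association among those hashtags. To analyze co-occurring hashtags on a target SNS, this paper presents a method that analyzes the generated data in line with current trends and provides the resulting information.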
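The abstract above does not describe the analysis procedure in detail. One simple way to quantify hashtag co-occurrence, shown here only as a sketch under that caveat, is to count how often each pair of hashtags appears in the same post. The regular expression, the function name, and the sample posts are hypothetical.

```python
import re
from collections import Counter
from itertools import combinations

# Hypothetical pattern: '#' followed by word characters (Unicode-aware for str).
HASHTAG = re.compile(r"#(\w+)")


def cooccurrence_counts(posts):
    """Count, for each unordered hashtag pair, the posts containing both tags."""
    counts = Counter()
    for post in posts:
        # De-duplicate and sort so each pair has one canonical ordering.
        tags = sorted({tag.lower() for tag in HASHTAG.findall(post)})
        counts.update(combinations(tags, 2))
    return counts


# Made-up posts for illustration only.
posts = [
    "#travel #food lunch by the sea",
    "#food #travel #busan",
    "#food only",
]
for pair, n in cooccurrence_counts(posts).most_common():
    print(pair, n)
```

Pairs with high counts are candidates for strongly associated hashtags; a fuller analysis might normalize by how often each tag appears on its own.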


A Study on the Estimation of Character Value in Media Works: Based on Network Centralities and Web-Search Data (미디어 작품 캐릭터 가치 측정 연구: 네트워크 중심성 척도와 검색 데이터를 활용하여)

  • Cho, Seonghyun;Lee, Minhyung;Choi, HanByeol Stella;Lee, Heeseok
    • Knowledge Management Research / v.22 no.4 / pp.1-26 / 2021
  • The measurement of intangible assets has been studied vigorously because of its importance. In particular, the value of characters in the media industry is difficult to evaluate quantitatively despite the industry's rapid growth. Recently, Social Network Analysis (SNA) has been actively applied to understand human usage patterns in the media field. Using SNA methodology, this study investigates how the character network characteristics of media works are linked to human search behaviors. Our analysis reveals a positive correlation and causality between character network centralities and character search data. This result implies that the character network can be used as a clue for valuing character assets.
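The abstract above does not say which centrality measures were used. As one possible illustration, the sketch below computes normalized degree centrality for a character network given as a list of undirected edges. The function name and the example character names are hypothetical.

```python
from collections import defaultdict


def degree_centrality(edges):
    """Return each node's degree divided by (n - 1), where n is the node count.

    edges is a list of (character_a, character_b) pairs; self-loops are skipped.
    """
    neighbors = defaultdict(set)
    for a, b in edges:
        if a == b:
            continue
        neighbors[a].add(b)
        neighbors[b].add(a)
    n = len(neighbors)
    if n < 2:
        return {node: 0.0 for node in neighbors}
    return {node: len(adjacent) / (n - 1) for node, adjacent in neighbors.items()}


# Made-up network: the lead character shares scenes with three others.
edges = [("Lead", "Friend"), ("Lead", "Rival"), ("Lead", "Mentor")]
for name, score in sorted(degree_centrality(edges).items()):
    print(name, round(score, 3))
```

Scores like these could then be compared against per-character search volumes, for example with a correlation coefficient, which is the kind of relationship the abstract reports.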