• Title/Summary/Keyword: SNS Crawling

Search Result 27, Processing Time 0.027 seconds

A Development of Optimal Travel Course Recommendation System based on Altered TSP and Elasticsearch Algorithm (변형된 TSP 및 엘라스틱서치 알고리즘 기반의 최적 여행지 코스 추천 시스템 개발)

  • Kim, Jun-Yeong;Jo, Kyeong-Ho;Park, Jun;Jung, Se-Hoon;Sim, Chun-Bo
    • Journal of Korea Multimedia Society
    • /
    • v.22 no.9
    • /
    • pp.1108-1121
    • /
    • 2019
  • As the quality and level of life rise, many people are doing search for various pieces of information about tourism. In addition, users prefer the search methods reflecting individual opinions such as SNS and blogs to the official websites of tourist destination. Many of previous studies focused on a recommendation system for tourist courses based on the GPS information and past travel records of users, but such a system was not capable of recommending the latest tourist trends. This study thus set out to collect and analyze the latest SNS data to recommend tourist destination of high interest among users. It also aimed to propose an altered TSP algorithm to recommend the optimal routes to the recommended destination within an area and a system to recommend the optimal tourist courses by applying the Elasticsearch engine. The altered TSP algorithm proposed in the study used the location information of users instead of Dijkstra's algorithm technique used in previous studies to select a certain tourist destination and allowed users to check the recommended courses for the entire tourist destination within an area, thus offering more diverse tourist destination recommendations than previous studies.

Research on Designing Korean Emotional Dictionary using Intelligent Natural Language Crawling System in SNS (SNS대상의 지능형 자연어 수집, 처리 시스템 구현을 통한 한국형 감성사전 구축에 관한 연구)

  • Lee, Jong-Hwa
    • The Journal of Information Systems
    • /
    • v.29 no.3
    • /
    • pp.237-251
    • /
    • 2020
  • Purpose The research was studied the hierarchical Hangul emotion index by organizing all the emotions which SNS users are thinking. As a preliminary study by the researcher, the English-based Plutchick (1980)'s emotional standard was reinterpreted in Korean, and a hashtag with implicit meaning on SNS was studied. To build a multidimensional emotion dictionary and classify three-dimensional emotions, an emotion seed was selected for the composition of seven emotion sets, and an emotion word dictionary was constructed by collecting SNS hashtags derived from each emotion seed. We also want to explore the priority of each Hangul emotion index. Design/methodology/approach In the process of transforming the matrix through the vector process of words constituting the sentence, weights were extracted using TF-IDF (Term Frequency Inverse Document Frequency), and the dimension reduction technique of the matrix in the emotion set was NMF (Nonnegative Matrix Factorization) algorithm. The emotional dimension was solved by using the characteristic value of the emotional word. The cosine distance algorithm was used to measure the distance between vectors by measuring the similarity of emotion words in the emotion set. Findings Customer needs analysis is a force to read changes in emotions, and Korean emotion word research is the customer's needs. In addition, the ranking of the emotion words within the emotion set will be a special criterion for reading the depth of the emotion. The sentiment index study of this research believes that by providing companies with effective information for emotional marketing, new business opportunities will be expanded and valued. In addition, if the emotion dictionary is eventually connected to the emotional DNA of the product, it will be possible to define the "emotional DNA", which is a set of emotions that the product should have.

An Exploratory Analysis on the User Response Pattern and Quality Characteristics of Marketing Contents in the SNS of Regional Government (지역마케팅 콘텐츠의 사용자 반응패턴과 품질특성에 관한 탐색적 분석: 지방자치단체가 운영하는 SNS를 중심으로)

  • Jeong, Yeon-Su;Jeong, Dae-Yul
    • The Journal of Information Systems
    • /
    • v.26 no.4
    • /
    • pp.419-442
    • /
    • 2017
  • Purpose The purpose of this study is to explore the pattern of user response and it's duration time through social media content response analysis. We also analyze the characteristics of content quality factors which are associate with the user response pattern. The analysis results will provide some implications to develop strategies and schematic plans for the operator of regional marketing on the SNS. Design/methodology/approach This study used mixed methods to verify the effects and responses of social media contents on the users who have concerns about regional events such as local festival, cultural events, and city tours etc. Big data analysis was conducted with the quantitative data from regional government SNSs. The data was collected through web crawling in order to analyze the social media contents. We especially analyzed the contents duration time and peak level time. This study also analyzed the characteristics of contents quality factors using expert evaluation data on the social media contents. Finally, we verify the relationship between the contents quality factors and user response types by cross correlation analysis. Findings According to the big data analysis, we could find some content life cycle which can be explained through empirical distribution with peak time pattern and left skewed long tail. The user response patterns are dependent on time and contents quality. In addition, this study confirms that the level of quality of social media content is closely relate to user interaction and response pattern. As a result of the contents response pattern analysis, it is necessary to develop high quality contents design strategy and content posting and propagation tactics. The SNS operators need to develop high quality contents using rich-media technology and active response contents that induce opinion leader on the SNS.

Web crawling process of each social network service for recognizing water quality accidents in the water supply networks (물공급네트워크 수질사고인지를 위한 소셜네트워크 서비스 별 웹크롤링 방법론 개발)

  • Yoo, Do Guen;Hong, Seunghyeok;Moon, Gihoon
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2022.05a
    • /
    • pp.398-398
    • /
    • 2022
  • 최근 수돗물 공급과정에 있어 적수, 유충 발생 등 지역 단위의 수질문제로 국민의 직간접적인 피해가 발생된 바 있다. 수질문제 발생 시, 소셜네트워크서비스(SNS)에 게시되는 피해 관련 의견은 시공간적으로 빠르게 확산되며, 궁극적으로는 물공급과정 전체의 부정적 인식증가와 신뢰도 저하를 초래한다. 따라서, 물공급시스템에서의 수질사고 발생을 빠르게 인지하는 다양한 방법론의 적용을 통한 피해 최소화를 위한 노력이 반드시 필요하다. 일반적으로 수질사고는 다양한 항목의 실시간 계측기에서 획득되는 시계열자료의 변화양상을 통해 판단할 수 있으나, 이와 같은 방법론의 효율적 적용을 위해서는 선진계측인프라의 도입이 선행되어야 한다. 본 연구에서는 국내의 발달된 정보통신기술환경을 활용하여, 물공급네트워크 내 수질사고인지를 위한 SNS 별 웹크롤링 방법론을 제안하고, 적용결과를 분석하였다. 방법론의 구현에 앞서, 각종 SNS 별(트위터, 인스타그램, 블로그, 네이버 카페 등) 프로그래밍을 통한 웹크롤링 가능여부, 정보획득 기간 등을 확인하였으며, 과거 유사 수질사고 발생 시 영향력과 관련 게시글이 크게 나타난 네이버 카페와 트위터를 중심으로 웹 크롤링 절차를 제시하였다. 네이버 카페의 경우 대상급수구역 내의 시민들이 다수 참여하는 카페를 목록화하고, 지자체명과 핵심 키워드(수돗물, 유충, 적수) 조합을 활용한 웹크롤링을 수행하여, 관련 게시물 건수와 의미를 실시간으로 분석하는 절차를 마련하였다. 개발된 SNS 별 웹크롤링 방법론에 따라 과거 수질사고가 발생된 바 있는 2개 이상의 지자체에 대한 분석을 실시하였으며, SNS 별 결과에 있어 차이점을 확인하여 제시하였다. 향후 제안된 방법을 적용하여 시공간적 수질사고 정보의 전파 및 확산양상을 추가적으로 분석할수 있을 것으로 기대된다.

  • PDF

A Study on the Vitalization Strategy Based on Current Status Analysis of National Archives (국내외 국립기록관의 트위터 운용 현황 분석 및 활성화 방안)

  • Gang, JuYeon;Kim, TaeYoung;Choi, JungWon;Oh, Hyo-Jung
    • Journal of the Korean Society for information Management
    • /
    • v.33 no.3
    • /
    • pp.263-285
    • /
    • 2016
  • Nowadays, Social Network Service (SNS), which has been in the spotlight as a way of communication, has become a most effective tool to improve easy of information use and accessibility for users. In this paper, we chose Twitter as the most representative SNS services because of automatic crawling and investigated tweet data gathered from domestic and foreign National Archives - NARA of U.S.A., TNA of U.K.. NAA of Australia, and National Archives of Korea. We also conducted information genres analysis and trend analysis by timeline. Information genres analysis shows how archives satisfied users' information needs as well as trends analysis of tweets helps to understand how users' interestedness was changed. Based on comparison results, we distilled four characteristics of National Archives and suggested vitalization ways for National Archives of Korea.

Analysis and Recognition of Depressive Emotion through NLP and Machine Learning (자연어처리와 기계학습을 통한 우울 감정 분석과 인식)

  • Kim, Kyuri;Moon, Jihyun;Oh, Uran
    • The Journal of the Convergence on Culture Technology
    • /
    • v.6 no.2
    • /
    • pp.449-454
    • /
    • 2020
  • This paper proposes a machine learning-based emotion analysis system that detects a user's depression through their SNS posts. We first made a list of keywords related to depression in Korean, then used these to create a training data by crawling Twitter data - 1,297 positive and 1,032 negative tweets in total. Lastly, to identify the best machine learning model for text-based depression detection purposes, we compared RNN, LSTM, and GRU in terms of performance. Our experiment results verified that the GRU model had the accuracy of 92.2%, which is 2~4% higher than other models. We expect that the finding of this paper can be used to prevent depression by analyzing the users' SNS posts.

Is BTS Different? Shared Episodes on SNS as a Good Indicator for Celebrity Endorsed Ad Effects

  • Bu, Kyunghee;Kim, Whoe Whun
    • Asia Marketing Journal
    • /
    • v.22 no.4
    • /
    • pp.27-45
    • /
    • 2021
  • This study examines the effects of celebrity endorsed advertising from a new perspective of the prior research that emphasizes the matchup between the brand and the celebrity. Due to the recent sharing experiences of the celebrity and their fans on SNS, it is hypothesized that the shared stories would impact viewers' responses that are often expressed in their likes, dislikes, shares and comments on SNS. In this study, the episodic type of advertising is hypothesized to have more favorable and active responses from viewers than the typical celebrity image-focused ads would have. By crawling and analyzing viewers' responses on YouTube toward 12 BTS endorsed ads, the hypotheses are confirmed as higher ratio of likes, lower ratio of dislikes and significantly higher ratio of comments over both total views and total likes were found. For the rationale behind, total 1800 comments were categorized into 4 major content types such as attached, experiential, empathic and self-related ones that are all considered as important factors influencing the strong ad effect. The results showed that the episodic ads have marginally more emotional comments than the celeb image ads. The difference was only found in experiential and empathic responses but not in self-related responses. Contrary to the hypothesis, the comments expressing attachment were found more for the celebrity image-focused ads than the episodic ones. It does not seem to suggest that the celebrity image focused ads are better to capture viewers' attachment towards the celebrity and the ad endorsed, but that the episodic ads draw viewers into relatively deeper level of attachment such as empathy by perceiving the authenticity of the celebrity and the brand. In conclusion, the shared stories on SNS can be a factor in the match-up theory on celebrity endorsed ad effects.

Gift-giving Behaviors via SNS Mobile App: An Exploratory Study of Fashion Products

  • Ji Yoon Kim;Jiyeon Lee;Kyu-Hye Lee
    • Journal of Fashion Business
    • /
    • v.27 no.6
    • /
    • pp.110-123
    • /
    • 2023
  • As social distancing strengthened after the COVID-19 incident, people looked for things they could do alone. Additionally, as people have more financial resources, they purchase products they had previously considered purchasing, and the phenomenon of giving gifts to oneself has also appeared. Accordingly, this study analyzed fashion product reviews of KakaoTalk Gift, the service to exchange gift via SNS mobile app, to discover the phenomenon of self-gifting and the differences from interpersonal-gifting. For post-hoc data, in collected 18,354 pieces after excluding unnecessary data using a Python-based web crawling technique. The self-gifting behavior of KakaoTalk Gift different from the previous study for self-gift. Regardless of the gift-giving contexts, it determines that most self-gift products are material items. There are differences in product types and price levels when choosing gifts for others and oneself. As a self-gift, people typically buy luxury jewelry and branded bags/wallets to wear and show off. As interpersonal, among fashion products, people usually buy beauty products that reflect less personal tastes. When gift-giving to others, people buy products to appropriate prices to reduce the burden on both. When gift-giving to oneself, people buy wanted products regardless of the price. This study is significant because it suggests a new direction in self-gift research by limited online places to give gifts.

Building an SNS Crawling System Using Python (Python을 이용한 SNS 크롤링 시스템 구축)

  • Lee, Jong-Hwa
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.23 no.5
    • /
    • pp.61-76
    • /
    • 2018
  • Everything is coming into the world of network where modern people are living. The Internet of Things that attach sensors to objects allows real-time data transfer to and from the network. Mobile devices, essential for modern humans, play an important role in keeping all traces of everyday life in real time. Through the social network services, information acquisition activities and communication activities are left in a huge network in real time. From the business point of view, customer needs analysis begins with SNS data. In this research, we want to build an automatic collection system of SNS contents of web environment in real time using Python. We want to help customers' needs analysis through the typical data collection system of Instagram, Twitter, and YouTube, which has a large number of users worldwide. It is stored in database through the exploitation process and NLP process by using the virtual web browser in the Python web server environment. According to the results of this study, we want to conduct service through the site, the desired data is automatically collected by the search function and the netizen's response can be confirmed in real time. Through time series data analysis. Also, since the search was performed within 5 seconds of the execution result, the advantage of the proposed algorithm is confirmed.

Formulating Strategies from Consumer Opinion Analysis on AI Kids Phone using Text Mining (AI 키즈폰의 소비자리뷰 분석을 통한 제품개선 전략에 대한 연구)

  • Kim, Dohun;Cha, Kyungjin
    • The Journal of Society for e-Business Studies
    • /
    • v.24 no.2
    • /
    • pp.71-89
    • /
    • 2019
  • In order to come up with satisfying product and improvement, firms use traditional marketing research methods to obtain consumers' opinions and further try to reflect them. Recently, gathering data from consumer communication platforms like internet and SNS has become popular methods. Meanwhile, with the development of information technology, mobile companies are launching new digital products for children to protect them from harmful content and provide them with necessary functions and information. Among these digital products, Kids Phone, which is a wearable device with safe functions that enable parents to learn childern's location. Kids phone is relatively cheaper and simpler than smartphone but it is noted that there are several problems such as some useless functions and frequent breakdowns. This study analyzes the reviews of Kids phones from domestic mobile companies, identifies the characteristics, strengths and weaknesses of the products, proposes improvement methods strategies for devices and services through SNS consumer analysis. In order to do that customer review data from online shopping malls was gathered and was further analyzed through text mining methods such as TF/IDF, Sentiment Analysis, and network analysis. Customer review data was gathered through crawling Online shopping Mall and Naver Blog/$Caf\acute{e}$. Data analysis and visualization was done using 'R', 'Textom', and 'Python'. Such analysis allowed us to figure out main issues and recent trends regarding kids phones and to suggest possible service improvement strategies based on sentiment analysis.