• Title/Summary/Keyword: 뉴스빅데이터

Search Result 206, Processing Time 0.029 seconds

Implementation of smart chungbuk tourism based on SNS data analysis (SNS 데이터 분석을 통한 스마트 충북관광 구축)

  • Cho, Wan-Sup;Cho, Ah;Kwon, Kaaen;Yoo, Kwan-Hee
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.2
    • /
    • pp.409-418
    • /
    • 2015
  • With the development of mobile devices and Internet, information exchange has actively been made through SNS and Blogs. Blogs are widely used as a space where people share their experience after their visit to tourist attractions. We propose a method of recommending associated tourist attractions based on tourists' opinions using issue analysis, association analysis, and sentimental analysis for various online reviews including news in order to help to develop tour products and policies. The result shows that north area of Chungbuk province has been selected as issue attractions, and associated attractions/keywards have been identified for given well-known attraction. Positive/negative opinion for review texts has been analyzed and user can grasp the reason for the sentiments. Multidimensional analysis technique has been integrated to derive additional sophisticated insights and various policy proposal for smart tourism.

Social Perception of the Invention Education Center as seen in Big Data (빅데이터 분석을 통한 발명 교육 센터에 대한 사회적 인식)

  • Lee, Eun-Sang
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.1
    • /
    • pp.71-80
    • /
    • 2022
  • The purpose of this study is to analyze the social perception of invention education center using big data analysis method. For this purpose, data from January 2014 to September 2021 were collected using the Textom website as a keyword searched for 'invention+education+center' in blogs, cafes, and news channels of NAVER and DAUM website. The collected data was refined using the Textom website, and text mining analysis and semantic network analysis were performed by the Textom website, Ucinet 6, and Netdraw programs. The collected data were subjected to a primary and secondary refinement process and 60 keywords were selected based on the word frequency. The selected key words were converted into matrix data and analyzed by semantic network analysis. As a result of text mining analysis, it was confirmed that 'student', 'operation', 'Korea Invention Promotion Association', and 'Korean Intellectual Property Office' were the meaningful keywords. As a result of semantic network analysis, five clusters could be identified: 'educational operation', 'invention contest', 'education process and progress', 'recruitment and support for business', and 'supervision and selection institution'. Through this study, it was possible to confirm various meaningful social perceptions of the general public in relation to invention education center on the internet. The results of this study will be used as basic data that provides meaningful implications for researchers and policy makers studying for invention education.

Web Content Loading Speed Enhancement Method using Service Walker-based Caching System (서비스워커 기반의 캐싱 시스템을 이용한 웹 콘텐츠 로딩 속도 향상 기법)

  • Kim, Hyun-gook;Park, Jin-tae;Choi, Moon-Hyuk;Moon, Il-young
    • Journal of Advanced Navigation Technology
    • /
    • v.23 no.1
    • /
    • pp.55-60
    • /
    • 2019
  • The web is one of the most intimate technologies in people's daily lives, and most of the time, people are sharing data on the web. Simple messenger, news, video, as well as various data are now spreading through the web. In addition, with the emergence of Web assembly technology, the programs that run in the existing native environment start to enter the domain of the Web, and the data shared by the Web is now getting wider and larger in terms of VR / AR contents and big data. Therefore, in this paper, we have studied how to effectively deliver web contentsto users who use Web service by using service worker that can operate independently without being dependent on browser and cache API that can effectively store data in web browser.

Online news-based stock price forecasting considering homogeneity in the industrial sector (산업군 내 동질성을 고려한 온라인 뉴스 기반 주가예측)

  • Seong, Nohyoon;Nam, Kihwan
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.2
    • /
    • pp.1-19
    • /
    • 2018
  • Since stock movements forecasting is an important issue both academically and practically, studies related to stock price prediction have been actively conducted. The stock price forecasting research is classified into structured data and unstructured data, and it is divided into technical analysis, fundamental analysis and media effect analysis in detail. In the big data era, research on stock price prediction combining big data is actively underway. Based on a large number of data, stock prediction research mainly focuses on machine learning techniques. Especially, research methods that combine the effects of media are attracting attention recently, among which researches that analyze online news and utilize online news to forecast stock prices are becoming main. Previous studies predicting stock prices through online news are mostly sentiment analysis of news, making different corpus for each company, and making a dictionary that predicts stock prices by recording responses according to the past stock price. Therefore, existing studies have examined the impact of online news on individual companies. For example, stock movements of Samsung Electronics are predicted with only online news of Samsung Electronics. In addition, a method of considering influences among highly relevant companies has also been studied recently. For example, stock movements of Samsung Electronics are predicted with news of Samsung Electronics and a highly related company like LG Electronics.These previous studies examine the effects of news of industrial sector with homogeneity on the individual company. In the previous studies, homogeneous industries are classified according to the Global Industrial Classification Standard. In other words, the existing studies were analyzed under the assumption that industries divided into Global Industrial Classification Standard have homogeneity. However, existing studies have limitations in that they do not take into account influential companies with high relevance or reflect the existence of heterogeneity within the same Global Industrial Classification Standard sectors. As a result of our examining the various sectors, it can be seen that there are sectors that show the industrial sectors are not a homogeneous group. To overcome these limitations of existing studies that do not reflect heterogeneity, our study suggests a methodology that reflects the heterogeneous effects of the industrial sector that affect the stock price by applying k-means clustering. Multiple Kernel Learning is mainly used to integrate data with various characteristics. Multiple Kernel Learning has several kernels, each of which receives and predicts different data. To incorporate effects of target firm and its relevant firms simultaneously, we used Multiple Kernel Learning. Each kernel was assigned to predict stock prices with variables of financial news of the industrial group divided by the target firm, K-means cluster analysis. In order to prove that the suggested methodology is appropriate, experiments were conducted through three years of online news and stock prices. The results of this study are as follows. (1) We confirmed that the information of the industrial sectors related to target company also contains meaningful information to predict stock movements of target company and confirmed that machine learning algorithm has better predictive power when considering the news of the relevant companies and target company's news together. (2) It is important to predict stock movements with varying number of clusters according to the level of homogeneity in the industrial sector. In other words, when stock prices are homogeneous in industrial sectors, it is important to use relational effect at the level of industry group without analyzing clusters or to use it in small number of clusters. When the stock price is heterogeneous in industry group, it is important to cluster them into groups. This study has a contribution that we testified firms classified as Global Industrial Classification Standard have heterogeneity and suggested it is necessary to define the relevance through machine learning and statistical analysis methodology rather than simply defining it in the Global Industrial Classification Standard. It has also contribution that we proved the efficiency of the prediction model reflecting heterogeneity.

Analysis entrepreneurship trends using keyword analysis of news article Big Data :2013~2022 (뉴스기사 빅데이터의 키워드분석을 활용한 창업 트렌드 분석:2013~2022 )

  • Jaeeog Kim;Byunghoon Jeon
    • Journal of Platform Technology
    • /
    • v.11 no.3
    • /
    • pp.83-97
    • /
    • 2023
  • This research aims to identify startup trends by analyzing a large number of news articles through semantic network analysis. Using the BIGKinds article analysis service provided by the Korea Press Foundation, 330,628 news articles from 19 newspapers from January 2013 to December 2022 were comprehensively analyzed. The study focused on exploring the changes in key issues over the past decade, considering the impact of the social environment and global economic trends on entrepreneurship. We compared the number of news articles and changes in issues before and after the COVID-19 pandemic, and visualized entrepreneurship trends through frequency analysis, relationship analysis, and correlation analysis. The results of the study showed that the top keywords for entrepreneurship-related words are startup activation and commercialization, and the correlation between COVID-19 and entrepreneurship keywords is almost negligible in a linear sense, but the number of news articles decreased during the pandemic, which has an impact. In particular, the most frequently mentioned keywords are Ministry of SMEs and Startups, place is the United States, and person is limited. The agency was the SBA, and the entrepreneurship sector is more affected by social issues than any other sector, with the important characteristics of increased frequency of prompt access. This study supplies essential basic data for understanding and exploring issues and events related to entrepreneurship and suggests future research topics in the field.

  • PDF

Improving Performance of Recommendation Systems Using Topic Modeling (사용자 관심 이슈 분석을 통한 추천시스템 성능 향상 방안)

  • Choi, Seongi;Hyun, Yoonjin;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.3
    • /
    • pp.101-116
    • /
    • 2015
  • Recently, due to the development of smart devices and social media, vast amounts of information with the various forms were accumulated. Particularly, considerable research efforts are being directed towards analyzing unstructured big data to resolve various social problems. Accordingly, focus of data-driven decision-making is being moved from structured data analysis to unstructured one. Also, in the field of recommendation system, which is the typical area of data-driven decision-making, the need of using unstructured data has been steadily increased to improve system performance. Approaches to improve the performance of recommendation systems can be found in two aspects- improving algorithms and acquiring useful data with high quality. Traditionally, most efforts to improve the performance of recommendation system were made by the former approach, while the latter approach has not attracted much attention relatively. In this sense, efforts to utilize unstructured data from variable sources are very timely and necessary. Particularly, as the interests of users are directly connected with their needs, identifying the interests of the user through unstructured big data analysis can be a crew for improving performance of recommendation systems. In this sense, this study proposes the methodology of improving recommendation system by measuring interests of the user. Specially, this study proposes the method to quantify interests of the user by analyzing user's internet usage patterns, and to predict user's repurchase based upon the discovered preferences. There are two important modules in this study. The first module predicts repurchase probability of each category through analyzing users' purchase history. We include the first module to our research scope for comparing the accuracy of traditional purchase-based prediction model to our new model presented in the second module. This procedure extracts purchase history of users. The core part of our methodology is in the second module. This module extracts users' interests by analyzing news articles the users have read. The second module constructs a correspondence matrix between topics and news articles by performing topic modeling on real world news articles. And then, the module analyzes users' news access patterns and then constructs a correspondence matrix between articles and users. After that, by merging the results of the previous processes in the second module, we can obtain a correspondence matrix between users and topics. This matrix describes users' interests in a structured manner. Finally, by using the matrix, the second module builds a model for predicting repurchase probability of each category. In this paper, we also provide experimental results of our performance evaluation. The outline of data used our experiments is as follows. We acquired web transaction data of 5,000 panels from a company that is specialized to analyzing ranks of internet sites. At first we extracted 15,000 URLs of news articles published from July 2012 to June 2013 from the original data and we crawled main contents of the news articles. After that we selected 2,615 users who have read at least one of the extracted news articles. Among the 2,615 users, we discovered that the number of target users who purchase at least one items from our target shopping mall 'G' is 359. In the experiments, we analyzed purchase history and news access records of the 359 internet users. From the performance evaluation, we found that our prediction model using both users' interests and purchase history outperforms a prediction model using only users' purchase history from a view point of misclassification ratio. In detail, our model outperformed the traditional one in appliance, beauty, computer, culture, digital, fashion, and sports categories when artificial neural network based models were used. Similarly, our model outperformed the traditional one in beauty, computer, digital, fashion, food, and furniture categories when decision tree based models were used although the improvement is very small.

A study on the Domestic Consumer's Perception of "Hansik" with Big Data Analysis : Using Text Mining and Semantic Network Analysis (빅데이터를 통한 내국인의 '한식' 인식 연구 : 텍스트마이닝과 의미연결망 중심으로)

  • Park, Kyeong-Won;Yun, Hee-Kyoung
    • Journal of the Korea Convergence Society
    • /
    • v.11 no.6
    • /
    • pp.145-151
    • /
    • 2020
  • 'Hansik', or Korean cuisine is one of Korea national brands. To understand the domestic consumer awareness of Korean cuisine, data was gathered under the keyword search, 'Hansik.' Textom 3.5 was used to gather data from blogs, news media found on Naver from November 1, 2018, to October 31, 2019. The results from frequency and TF-IDF analysis indicate that the 'buffet' had the largest proportion in terms of consumer awareness to Hansik. Also, broadcasting contents starring star chefs had a great influence. The Hansik awareness did not remain in the domains of its traditionality, but also branched into extents into areas such as fusional and gourmet cuisine. UCINET6 and NetDraw were used to conduct CONCOR analysis. Four cluster formations have been found; various food cultural cluster, high-end restaurant cluster referring to aired restaurants on media, Hansik brand cluster, and Hansik buffet cluster. This study proposes presenting a various menu of Hansik which use a multiple number of ingredients. Also, a promotion that introduces fine Hansik and a development of marketing views and media contents about the convenient HMRs make the associated imagery of Hansik to be strengthen.

Examining the Disparity between Court's Assessment of Cognitive Impairment and Online Public Perception through Natural Language Processing (NLP): An Empirical Investigation (Natural Language Processing(NLP)를 활용한 법원의 판결과 온라인상 대중 인식간 괴리에 관한 실증 연구)

  • Seungkook Roh
    • The Journal of Bigdata
    • /
    • v.8 no.1
    • /
    • pp.11-22
    • /
    • 2023
  • This research aimed to examine the public's perception of the "rate of sentence reduction for reasons of mental and physical weakness" and investigate if it aligns with the actual practice. Various sources, such as the Supreme Court's Courtnet search system, the number of mental evaluation requests, and the number of articles and comments related to "mental weakness" on Naver News were utilized for the analysis. The findings indicate that the public has a negative opinion on reducing sentences due to mental and physical weakness, and they are dissatisfied with the vagueness of the standards. However, this study also confirms that the court strictly applies the reduction of responsibility for individuals with mental disabilities specified in Article 10 of the Criminal Act based on the analysis of actual judgments and the number of requests for psychiatric evaluation. In other words, even though the recognition of perpetrators' mental disorders is declining, the public does not seem to recognize this trend. This creates a negative impact on the public's trust in state institutions. Therefore, law enforcement agencies, such as the police and prosecutors, need to enforce the law according to clear standards to gain public trust. The judiciary also needs to make a firm decision on commuting sentences for mentally and physically infirm individuals and inform the public of the outcomes of its application.

Study on Perceptions through Big data Analysis on Gambling related News in Korea (한국 사행산업 관련 뉴스의 빅데이터 분석을 통한 인식 연구)

  • Moon, HyeJung;Kim, SungKyung
    • Journal of Broadcast Engineering
    • /
    • v.22 no.4
    • /
    • pp.438-447
    • /
    • 2017
  • The purpose of this study is to understand the recognition of gambling industry through the semantic analysis of news data on lottery, sports betting, horse racing and casino that was reported between 1990 to 2015 in South Korea. This paper revealed the difference between journalists' intention and public's perception about news by analyzing the frequency and connectivity of news with framing and public's interest through semantic network analysis and explored the policy characteristics and innovation task. The result of analysis, news on lottery game mainly has been reported social issue related with win such as 'winning number', 'prize money', 'suspicion of manipulation' and etc. News on sports betting has been reported mandatory information related with business project and illegal site such as 'bidding', 'illegal site', 'sales target' and etc. News about horse racing has been reported the information about the business advertisement such as 'online race track' and 'promotion'. Lastly, casino related news has been reported 'major information' such as illegality', 'gambling place' and 'foreigner'. As a result of times series analysis, news about casino in the 1990s, news about lottery in the 2000s and news about horse racing in 2010s have been increased. Public's interest also has been moved to 'business scandal', 'winning game', 'citizens' campaign' and etc. Gambling related news has been classified by four types, 1. advertising publicity(horse racing), 2. mandatory information(sports betting), 3. social issue(public agenda, lottery), 4. major information(casino). We could get the insight that news can be formed a public agenda, when news is reported as a social issue with high frequency and public's interest like lottery related news.

A Study on Establishing a Market Entry Strategy for the Satellite Industry Using Future Signal Detection Techniques (미래신호 탐지 기법을 활용한 위성산업 시장의 진입 전략 수립 연구)

  • Sehyoung Kim;Jaehyeong Park;Hansol Lee;Juyoung Kang
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.3
    • /
    • pp.249-265
    • /
    • 2023
  • Recently, the satellite industry has been paying attention to the private-led 'New Space' paradigm, which is a departure from the traditional government-led industry. The space industry, which is considered to be the next food industry, is still receiving relatively little attention in Korea compared to the global market. Therefore, the purpose of this study is to explore future signals that can help determine the market entry strategies of private companies in the domestic satellite industry. To this end, this study utilizes the theoretical background of future signal theory and the Keyword Portfolio Map method to analyze keyword potential in patent document data based on keyword growth rate and keyword occurrence frequency. In addition, news data was collected to categorize future signals into first symptom and early information, respectively. This is utilized as an interpretive indicator of how the keywords reveal their actual potential outside of patent documents. This study describes the process of data collection and analysis to explore future signals and traces the evolution of each keyword in the collected documents from a weak signal to a strong signal by specifically visualizing how it can be used through the visualization of keyword maps. The process of this research can contribute to the methodological contribution and expansion of the scope of existing research on future signals, and the results can contribute to the establishment of new industry planning and research directions in the satellite industry.