• Title/Summary/Keyword: Text mining analysis

Search Result 1,200, Processing Time 0.027 seconds

Customer Attitude to Artificial Intelligence Features: Exploratory Study on Customer Reviews of AI Speakers (인공지능 속성에 대한 고객 태도 변화: AI 스피커 고객 리뷰 분석을 통한 탐색적 연구)

  • Lee, Hong Joo
    • Knowledge Management Research
    • /
    • v.20 no.2
    • /
    • pp.25-42
    • /
    • 2019
  • AI speakers which are wireless speakers with smart features have released from many manufacturers and adopted by many customers. Though smart features including voice recognition, controlling connected devices and providing information are embedded in many mobile phones, AI speakers are sitting in home and has a role of the central en-tertainment and information provider. Many surveys have investigated the important factors to adopt AI speakers and influ-encing factors on satisfaction. Though most surveys on AI speakers are cross sectional, we can track customer attitude toward AI speakers longitudinally by analyzing customer reviews on AI speakers. However, there is not much research on the change of customer attitude toward AI speaker. Therefore, in this study, we try to grasp how the attitude of AI speaker changes with time by applying text mining-based analysis. We collected the customer reviews on Amazon Echo which has the highest share of AI speakers in the global market from Amazon.com. Since Amazon Echo already have two generations, we can analyze the characteristics of reviews and compare the attitude ac-cording to the adoption time. We identified all sub topics of customer reviews and specified the topics for smart features. And we analyzed how the share of topics varied with time and analyzed diverse meta data for comparisons. The proportions of the topics for general satisfaction and satisfaction on music were increasing while the proportions of the topics for music quality, speakers and wireless speakers were decreasing over time. Though the proportions of topics for smart fea-tures were similar according to time, the share of the topics in positive reviews and importance metrics were reduced in the 2nd generation of Amazon Echo. Even though smart features were mentioned similarly in the reviews, the influential effect on satisfac-tion were reduced over time and especially in the 2nd generation of Amazon Echo.

A Validation of Effectiveness for Intrusion Detection Events Using TF-IDF (TF-IDF를 이용한 침입탐지이벤트 유효성 검증 기법)

  • Kim, Hyoseok;Kim, Yong-Min
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.28 no.6
    • /
    • pp.1489-1497
    • /
    • 2018
  • Web application services have diversified. At the same time, research on intrusion detection is continuing due to the surge of cyber threats. Also, As a single-defense system evolves into multi-level security, we are responding to specific intrusions by correlating security events that have become vast. However, it is difficult to check the OS, service, web application type and version of the target system in real time, and intrusion detection events occurring in network-based security devices can not confirm vulnerability of the target system and success of the attack A blind spot can occur for threats that are not analyzed for problems and associativity. In this paper, we propose the validation of effectiveness for intrusion detection events using TF-IDF. The proposed scheme extracts the response traffics by mapping the response of the target system corresponding to the attack. Then, Response traffics are divided into lines and weights each line with an TF-IDF weight. we checked the valid intrusion detection events by sequentially examining the lines with high weights.

The Stream of Uncertainty in Scientific Knowledge using Topic Modeling (토픽 모델링 기반 과학적 지식의 불확실성의 흐름에 관한 연구)

  • Heo, Go Eun
    • Journal of the Korean Society for information Management
    • /
    • v.36 no.1
    • /
    • pp.191-213
    • /
    • 2019
  • The process of obtaining scientific knowledge is conducted through research. Researchers deal with the uncertainty of science and establish certainty of scientific knowledge. In other words, in order to obtain scientific knowledge, uncertainty is an essential step that must be performed. The existing studies were predominantly performed through a hedging study of linguistic approaches and constructed corpus with uncertainty word manually in computational linguistics. They have only been able to identify characteristics of uncertainty in a particular research field based on the simple frequency. Therefore, in this study, we examine pattern of scientific knowledge based on uncertainty word according to the passage of time in biomedical literature where biomedical claims in sentences play an important role. For this purpose, biomedical propositions are analyzed based on semantic predications provided by UMLS and DMR topic modeling which is useful method to identify patterns in disciplines is applied to understand the trend of entity based topic with uncertainty. As time goes by, the development of research has been confirmed that uncertainty in scientific knowledge is moving toward a decreasing pattern.

Research trends in statistics for domestic and international journal using paper abstract data (초록데이터를 활용한 국내외 통계학 분야 연구동향)

  • Yang, Jong-Hoon;Kwak, Il-Youp
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.2
    • /
    • pp.267-278
    • /
    • 2021
  • As time goes by, the amount of data is increasing regardless of government, business, domestic or overseas. Accordingly, research on big data is increasing in academia. Statistics is one of the major disciplines of big data research, and it will be interesting to understand the research trend of statistics through big data in the growing number of papers in statistics. In this study, we analyzed what studies are being conducted through abstract data of statistical papers in Korea and abroad. Research trends in domestic and international were analyzed through the frequency of keyword data of the papers, and the relationship between the keywords was visualized through the Word Embedding method. In addition to the keywords selected by the authors, words that are importantly used in statistical papers selected through Textrank were also visualized. Lastly, 10 topics were investigated by applying the LDA technique to the abstract data. Through the analysis of each topic, we investigated which research topics are frequently studied and which words are used importantly.

Analysis of Research Topics and Trends on COVID-19 in Korea Using Latent Dirichlet Allocation (LDA)

  • Heo, Seong-Min;Yang, Ji-Yeon
    • Journal of the Korea Society of Computer and Information
    • /
    • v.25 no.12
    • /
    • pp.83-91
    • /
    • 2020
  • This study aims to identify research topics and examine the trend of Covid19-related papers on DBpia. Applying latent Dirichlet allocation (LDA), we have extracted seven research topics, each of which concerns "International Dynamics", "Technology & Security", "Psychological Impact", "Biomedical-Related", "Economic Impact", "Online Education", and "Religion-Related". In addition, we used the multinomial logistic model to examine the trend of research topics. We found that the papers mainly cover topics related to "International Dynamics" and "Biomedical-Related" before June 2020, but the topics have become diverse since then. In particular, topics regarding "Economic Impact", "Online Education" and "Psychological Impact" has drawn increased attention of researchers. The findings would provide a guideline for collaboration in Covid19-related research, and could serve as a reference work for active research.

A Study on Stock Trend Determination in Stock Trend Prediction

  • Lim, Chungsoo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.25 no.12
    • /
    • pp.35-44
    • /
    • 2020
  • In this study, we analyze how stock trend determination affects trend prediction accuracy. In stock markets, successful investment requires accurate stock price trend prediction. Therefore, a volume of research has been conducted to improve the trend prediction accuracy. For example, information extracted from SNS (social networking service) and news articles by text mining algorithms is used to enhance the prediction accuracy. Moreover, various machine learning algorithms have been utilized. However, stock trend determination has not been properly analyzed, and conventionally used methods have been employed repeatedly. For this reason, we formulate the trend determination as a moving average-based procedure and analyze its impact on stock trend prediction accuracy. The analysis reveals that trend determination makes prediction accuracy vary as much as 47% and that prediction accuracy is proportional to and inversely proportional to reference window size and target window size, respectively.

Topic modeling and topic change trend analysis for advanced construction technologies (건설신기술에 대한 토픽 모델링 및 토픽 변화추이 분석)

  • Jeong, Seong Yun;Kim, Nam Gon
    • Smart Media Journal
    • /
    • v.10 no.4
    • /
    • pp.102-110
    • /
    • 2021
  • Currently, the advanced construction technology endorsement system is being operated to promote the development of domestic construction technology. We tried to examine the implicit meanings inherent in advanced construction technologies by analyzing the relationship between emerging vocabularies with high importance in relation to the advanced construction technologies endorsed through this system. For this purpose, 918 cases of advanced construction technology information were collected. Based on the endorsed year and summary of the advanced construction technologies, the importance of the emerging vocabularies was measured for each advanced construction technology. And, based on the LDA model, the degree of influence between related vocabularies was evaluated for each of the four topic areas. Topics according to the technical application fields were analyzed. From 1990 to 2021, the trend of changes in highly influential vocabularies by each topic was inferred. In the future, changes in the degree of influence of the topics of environment, machinery, facilities, and maintenance and reinforcement of structures and related technology fields were predicted.

Study on Tendency of Cloud Computing Using R and LDA Technique : Focusing on Tendency of Overseas Studies (R과 LDA 기법을 활용한 클라우드 컴퓨팅 동향에 관한 연구: 해외 연구 동향을 중심으로)

  • Kang, Tae-Gu
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.5
    • /
    • pp.261-266
    • /
    • 2022
  • The full-fledged digital age derived from the fourth industrial revolution and the impact of COVID-19 lead to changes in various fields, including companies. In other words, the importance of cloud computing is being emphasized in the rapidly changing digital environment due to the rapid growth of the cloud market due to the rapid increase in digital services. The cloud may be one of the representative strategies for sustainable growth and survival in various fields as well as related industries. Although there have been a variety of studies on the cloud, the tendency of them has been not been adequately examined. This paper, therefore, analyzed the tendency of studies on the cloud computing. by using SCOPUS, the database of overseas academic journals using both R and LAD technique. The findings showed that many studies with high interest in the cloud computing have been conducted, the cloud computing were most often drawn from an analysis on key words. Moreover, various key words, including cloud, cloud and computing, data and computing were drawn, except for the theme of cloud computing. It is expected that could be used as a basic data, in that they provide the foundation for activating the related industries in terms of practice of the cloud computing.

Selection of Effective Herbal Medicines for Parkinson's Disease Based on the Text Mining of the Classical Korean Medical Literature Donguibogam

  • Bae, Hyo Won;Lee, Tae Wook;Choi, Byung Tae;Shin, Hwa Kyoung;Yun, Young Ju
    • The Journal of Korean Medicine
    • /
    • v.42 no.4
    • /
    • pp.120-132
    • /
    • 2021
  • Objectives: The prevalence of Parkinson's disease is on an upward trend along with an increase in the aging population but there is no available treatment that halts the progression of neurodegeneration. This study reports a numerical analysis on Donguibogam and suggests novel herbal drugs, which have never been researched before but found to be deemed effective in this study. Methods: Referring to 71 Korean medicine symptom terms that represent the symptoms of Parkinson's disease, 4170 prescriptions described in Donguibogam were classified into two groups based on whether their main effects were effective for Parkinson's disease or not. Comparing the two groups, the chi-square test was performed to select statistically significant herbs, while the t-test, Wilcoxon test, and descriptive statistics were performed to determine the appropriate dose. Results: One hundred and twenty-seven prescriptions effective for Parkinson's disease were identified. The chi-square test determined 17 herbs that are effective for symptomatic treatment. Among the medicinal herbs, the authors suggest Osterici seu Notopterygii Radix et Rhizoma, Ephedrae Herba, Aconiti Tuber, Myrrha, Sinomeni Caulis et Rhizoma, and Aconiti Kusnezoffii Tuber as herbal candidates that have never been studied for Parkinson's disease. Through the statistical tests, it was judged that the mean value of the dose of the entire prescription was the appropriate dose for each herb. Conclusions: Seventeen herbs were selected for Parkinson's disease and the appropriate daily dose were calculated. Furthermore, this study presented a new process that applies a statistical method to traditional medical literature and preselecting herbs deemed effective for specific diseases.

Development of Social Data Collection and Loading Engine-based Reliability analysis System Against Infectious Disease Pandemic (감염병 위기 대응을 위한 소셜 데이터 수집 및 적재 엔진 기반 신뢰도 분석 시스템 개발)

  • Doo Young Jung;Sang-Jun Lee;MIN KYUNG IL;Seogsong Jeong;HyunWook Han
    • The Journal of Bigdata
    • /
    • v.7 no.2
    • /
    • pp.103-111
    • /
    • 2022
  • There are many institutions, organizations, and sites related to responding to infectious diseases, but as the pandemic situation such as COVID-19 continues for years, there are many changes in the initial and current aspects, and accordingly, policies and response systems are evolving. As a result, regional gaps arise, and various problems are scattered due to trust, distrust, and implementation of policies. Therefore, in the process of analyzing social data including information transmission, Twitter data, one of the major social media platforms containing inaccurate information from unknown sources, was developed to prevent facts in advance. Based on social data, which is unstructured data, an algorithm that can automatically detect infectious disease threats is developed to create an objective basis for responding to the infectious disease crisis to solidify international competitiveness in related fields.