• Title/Summary/Keyword: Review data mining

Search Result 273, Processing Time 0.026 seconds

Development of Hybrid Recommender System Using Review Data Mining: Kindle Store Data Analysis Case (리뷰 데이터 마이닝을 이용한 하이브리드 추천시스템 개발: Amazon Kindle Store 데이터 분석사례)

  • Yihua Zhang;Qinglong Li;Ilyoung Choi;Jaekyeong Kim
    • Information Systems Review
    • /
    • v.23 no.1
    • /
    • pp.155-172
    • /
    • 2021
  • With the recent increase in online product purchases, a recommender system that recommends products considering users' preferences has still been studied. The recommender system provides personalized product recommendation services to users. Collaborative Filtering (CF) using user ratings on products is one of the most widely used recommendation algorithms. During CF, the item-based method identifies the user's product by using ratings left on the product purchased by the user and obtains the similarity between the purchased product and the unpurchased product. CF takes a lot of time to calculate the similarity between products. In particular, it takes more time when using text-based big data such as review data of Amazon store. This paper suggests a hybrid recommendation system using a 2-phase methodology and text data mining to calculate the similarity between products easily and quickly. To this end, we collected about 980,000 online consumer ratings and review data from the online commerce store, Amazon Kinder Store. As a result of several experiments, it was confirmed that the suggested hybrid recommendation system reflecting the user's rating and review data has resulted in similar recommendation time, but higher accuracy compared to the CF-based benchmark recommender systems. Therefore, the suggested system is expected to increase the user's satisfaction and increase its sales.

Dectection of Insurance Fraud using Visualization Data Mining Tool (Visualization Data Mining Tool을 활용한 보험사기 적발)

  • Sung, Tae-Kyung
    • Information Systems Review
    • /
    • v.5 no.1
    • /
    • pp.49-60
    • /
    • 2003
  • The purpose of this study is to empirically and practically verify the applicability of visualization data mining tool in detecting real-word insurance frauds that are now emerged as one of the most serious problems socially and economically. For the verification, Analyst's Notebook by i2, which has been known as the most effective visualization data mining tool, was adopted. With Analyst's Notebook, fraud-probable insurance transactions from a very large insurance claims are selected and then substantiation for insurance frauds are attempted. The results show that Analyst's Notebook not only detects insurance fraud transactions from a vast number of insurance claims, but is also able to pinpoint organized crime group by associating one fraud transaction to another fraud transaction. Therefore, it is safe to conclude that visualization data mining is very effective in detecting false transactions and crime behaviors including insurance fraud.

Analysis of VR Game Trends using Text Mining and Word Cloud -Focusing on STEAM review data- (텍스트마이닝과 워드 클라우드를 활용한 VR 게임 트렌드 분석 -스팀(steam) 리뷰 데이터를 중심으로-)

  • Na, Ji Young
    • Journal of Korea Game Society
    • /
    • v.22 no.1
    • /
    • pp.87-98
    • /
    • 2022
  • With the development of fourth industrial revolution-related technology and increased demands for non-face-to-face services, VR games attract attention. This study collected VR game review data from an online game platform STEAM and analyzed chronical trends using text mining and word cloud analysis. According to the results, experience and perceived cost were major trends from 2016 to 2017, increased demands for FPS and rhythm games were from 2018 to 2019, and story and immersion were from 2020 to 2021. It aims to contribute to expanding the base of VR games by identifying the keywords VR users take interest in by period.

Use of Information Component (IC) and Relative Risk (RR) for Signal Detection of Drug Interactions of Clopidogrel : Data-mining Study Using Health Insurance Review & Assessment Service (HIRA) Claims Database (정보 성분과 상대위험도를 이용한 clopidogrel의 약물상호작용 시그널 검색 : 건강보험데이터베이스를 대상으로 한 데이터마이닝 연구)

  • Kim, Jin-Hyung;Choi, Chung-Am;Oh, Jung-Mi;Son, Sung-Ho;Shin, Wan-Gyoon
    • Korean Journal of Clinical Pharmacy
    • /
    • v.21 no.2
    • /
    • pp.90-99
    • /
    • 2011
  • Health Insurance Review & Assessment Service (HIRA) claims database has a high potential to detect signals of new drug interactions. The aim of this study was to evaluate the usefulness of information component (IC) and relative risk (RR) as a tool for signal detection, and to analyze the possible drug interactions caused by clopidogrel using HIRA claims database. This study was performed in elderly patients over 65 years of age who administered clopidogrel from January 2005 to June 2006 in South Korea. Serious Adverse Events (SAEs) as drug interactions of clopidogrel were defined as any ambulatory hospitalization for ischemic diseases within comcomitant medication period of clopidogrel. Information Component (IC) and Relative Risk (RR) were calculated to compare the proportion of drug-SAE pairs in order to select drug specific SAEs. IC and RR signals of clopidogrel drug interaction were screened when IC's 95% confidence interval was greater than 0 and RR's 95% confidence interval was greater than 1 respectively. All detected signals were compared to references such as $Micromedex^{(R)}$ and 2010 Drug Interaction $Facts^{TM}$. Sensitivity, specificity, positive predicted value and negative predicted value were used to evaluate usefulness of this method. Among 13,252,930 cases of elderly patients who co-administered clopidogrel and other drugs, 47,485 cases were detected as SAE. Of these, one-hundred nine cases were detected by the IC-based data-mining approach and ninety one cases were detected by the RR-based data-mining approach. Total One-hundred sixty three unrecognized signals were detected by IC or RR. Twelve signals from IC-based data-mining (57.1%) were corresponded with drug interactions from references and eight signals from RR-based data-mining (38.1%) were corresponded with drug interactions from references. These signals include proton pump inhibitors, calcium channel blockers and HMG CoA reductase Inhibitors, which were known to affect CYP450 metabolism. Further studies using HIRA claims database are necessary to develop appropriate data-mining measure.

Study on the Application of Big Data Mining to Activate Physical Distribution Cooperation : Focusing AHP Technique (물류공동화 활성화를 위한 빅데이터 마이닝 적용 연구 : AHP 기법을 중심으로)

  • Young-Hyun Pak;Jae-Ho Lee;Kyeong-Woo Kim
    • Korea Trade Review
    • /
    • v.46 no.5
    • /
    • pp.65-81
    • /
    • 2021
  • The technological development in the era of the 4th industrial revolution is changing the paradigm of various industries. Various technologies such as big data, cloud, artificial intelligence, virtual reality, and the Internet of Things are used, creating synergy effects with existing industries, creating radical development and value creation. Among them, the logistics sector has been greatly influenced by quantitative data from the past and has been continuously accumulating and managing data, so it is highly likely to be linked with big data analysis and has a high utilization effect. The modern advanced technology has developed together with the data mining technology to discover hidden patterns and new correlations in such big data, and through this, meaningful results are being derived. Therefore, data mining occupies an important part in big data analysis, and this study tried to analyze data mining techniques that can contribute to the logistics field and common logistics using these data mining technologies. Therefore, by using the AHP technique, it was attempted to derive priorities for each type of efficient data mining for logisticalization, and R program and R Studio were used as tools to analyze this. Criteria of AHP method set association analysis, cluster analysis, decision tree method, artificial neural network method, web mining, and opinion mining. For the alternatives, common transport and delivery, common logistics center, common logistics information system, and common logistics partnership were set as factors.

Text Mining of Online News, Social Media, and Consumer Review on Artificial Intelligence Service (인공지능 서비스에 대한 온라인뉴스, 소셜미디어, 소비자리뷰 텍스트마이닝)

  • Li, Xu;Lim, Hyewon;Yeo, Harim;Hwang, Hyesun
    • Human Ecology Research
    • /
    • v.59 no.1
    • /
    • pp.23-43
    • /
    • 2021
  • This study looked through the text mining analysis to check the status of the virtual assistant service, and explore the needs of consumers, and present consumer-oriented directions. Trendup 4.0 was used to analyze the keywords of AI services in Online News and social media from 2016 to 2020. The R program was used to collect consumer comment data and implement Topic Modeling analysis. According to the analysis, the number of mentions of AI services in mass media and social media has steadily increased. The Sentimental Analysis showed consumers were feeling positive about AI services in terms of useful and convenient functional and emotional aspects such as pleasure and interest. However, consumers were also experiencing complexity and difficulty with AI services and had concerns and fears about the use of AI services in the early stages of their introduction. The results of the consumer review analysis showed that there were topics(Technical Requirements) related to technology and the access process for the AI services to be provided, and topics (Consumer Request) expressed negative feelings about AI services, and topics(Consumer Life Support Area) about specific functions in the use of AI services. Text mining analysis enable this study to confirm consumer expectations or concerns about AI service, and to examine areas of service support that consumers experienced. The review data on each platform also revealed that the potential needs of consumers could be met by expanding the scope of support services and applying platform-specific strengths to provide differentiated services.

Text Mining and Visualization of Papers Reviews Using R Language

  • Li, Jiapei;Shin, Seong Yoon;Lee, Hyun Chang
    • Journal of information and communication convergence engineering
    • /
    • v.15 no.3
    • /
    • pp.170-174
    • /
    • 2017
  • Nowadays, people share and discuss scientific papers on social media such as the Web 2.0, big data, online forums, blogs, Twitter, Facebook and scholar community, etc. In addition to a variety of metrics such as numbers of citation, download, recommendation, etc., paper review text is also one of the effective resources for the study of scientific impact. The social media tools improve the research process: recording a series online scholarly behaviors. This paper aims to research the huge amount of paper reviews which have generated in the social media platforms to explore the implicit information about research papers. We implemented and shown the result of text mining on review texts using R language. And we found that Zika virus was the research hotspot and association research methods were widely used in 2016. We also mined the news review about one paper and derived the public opinion.

Unstructured Data Quantification Scheme Based on Text Mining for User Feedback Extraction (사용자 의견 추출을 위한 텍스트 마이닝 기반 비정형 데이터 정량화 방안)

  • Jo, Jung-Heum;Chung, Yong-Taek;Choi, Seong-Wook;Ok, Changsoo
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.41 no.4
    • /
    • pp.131-137
    • /
    • 2018
  • People write reviews of numerous products or services on the Internet, in their blogs or community bulletin boards. These unstructured data contain important emotions and opinions about the author's product or service, which can provide important information for future product design or marketing. However, this text-based information cannot be evaluated quantitatively, and thus they are difficult to apply to mathematical models or optimization problems for product design and improvement. Therefore, this study proposes a method to quantitatively extract user's opinion or preference about a specific product or service by utilizing a lot of text-based information existing on the Internet or online. The extracted unstructured text information is decomposed into basic unit words, and positive rate is evaluated by using existing emotional dictionaries and additional lists proposed in this study. This can be a way to effectively utilize unstructured text data, which is being generated and stored in vast quantities, in product or service design. Finally, to verify the effectiveness of the proposed method, a case study was conducted using movie review data retrieved from a portal website. By comparing the positive rates calculated by the proposed framework with user ratings for movies, a guideline on text mining based evaluation of unstructured data is provided.

Applications of Data Mining Techniques to Operations Planning for Real Time Order Confirmation (실시간 주문 확답을 위한 데이터 마이닝 기반 운용 계획 모델)

  • Han Hyun-Soo;Oh Dong-Ha
    • Korean Management Science Review
    • /
    • v.21 no.3
    • /
    • pp.101-113
    • /
    • 2004
  • In the rapidly propagating Internet based electronic transaction environment. the importance of real time order confirmation has been more emphasized, In this paper, using data mining techniques, we develop intelligent operations decision model to allow real time order confirmation at the time the customer places an order with required delivery terms. Among various operation plannings used for order fulfillment. mill routing is the first interface decision point to link the order receiving at the marketing with the production planning for order fulfillment. Though linear programming based mathematical optimization techniques are mostly used for mill routing problems, some early orders should wait until sufficient orders are gathered for optimization. And that could effect longer order fulfillment lead-time, and prevent instant order confirmation of delivery terms. To cope with this problem, we provide the intelligent decision model to allow instant order based mill routing decisions. Data mining techniques of decision trees and neural networks. which are more popular in marketing and financial applications, are used to develop the model. Through diverse computational trials with the industrial data from the steel company. we have reported that the performance of the proposed approach is effective compared to the present heuristic only mill routing results. Various issues of data mining techniques application to the mill routing problems having linear programming characteristics are also discussed.

A study on the Elements of Interest for VR Game Users Using Text Mining and Text Network Analysis - Focused on STEAM User Review Data - (텍스트마이닝과 네트워크 분석을 적용한 VR 게임 사용자의 관심 요소 연구 - STEAM 사용자 리뷰 데이터를 중심으로 -)

  • Wui, Min-Young;Na, Ji Young;Park, Young Il
    • Journal of Korea Game Society
    • /
    • v.18 no.6
    • /
    • pp.69-82
    • /
    • 2018
  • The need of high quality VR contents has been steadily raised in recent years. Therefore, this study investigated the user's interest factors of VR game which is receiving the most attention among VR contents. We used STEAM review data and applied Text mining and Network analysis to perform this research. As a result, it was possible to confirm 4 word clusters related VR game users. Each cluster is named by 'presence', 'first person view game', 'auditory factor' and 'interaction'. This study has its meaning. First, user related research would be very helpful to develop high quality VR game. Second, it confirms that review data of VR game users can be structured, analyzed and used.