• Title/Summary/Keyword: Predictive analytics

Search Result 52, Processing Time 0.024 seconds

Using Predictive Analytics to Profile Potential Adopters of Autonomous Vehicles

  • Lee, Eun-Ju;Zafarzon, Nordirov;Zhang, Jing
    • Asia Marketing Journal
    • /
    • v.20 no.2
    • /
    • pp.65-83
    • /
    • 2018
  • Technological advances are bringing autonomous vehicles to the ever-evolving transportation system. Anticipating adoption of these technologies by users is essential to vehicle manufacturers for making more precise production and marketing strategies. The research investigates regulatory focus and consumer innovativeness with consumers' adoption of autonomous vehicles (AVs) and to consumers' subsequent willingness to pay for AVs. An online questionnaire was fielded to confirm predictions, and regression analysis was conducted to verify the model's validity. The results show that a promotion focus does not have a significantly positive effect on the automation level at which consumers will adopt AVs, but a prevention focus has a significantly positive effect on conditional AV adoption. Consumer innovativeness, consumers' novelty-seeking have a significantly positive relationship with high and full AV adoption, and consumers' independent decision-making has a significantly positive effect on full AV adoption. The higher the level of automation at which a consumer adopts AVs, the higher the willingness to pay for them. Finally, using a neural network and decision tree analyses, we show methods with which to describe three categories for potential adopters of AVs.

Prediction Model for Gas-Energy Consumption using Ontology-based Breakdown Structure of Multi-Family Housing Complex (온톨로지 기반 공동주택 분류체계를 활용한 가스에너지 사용량 예측 모델)

  • Hong, Tae-Hoon;Park, Sung-Ki;Koo, Choong-Wan;Kim, Hyun-Joong;Kim, Chun-Hag
    • Korean Journal of Construction Engineering and Management
    • /
    • v.12 no.6
    • /
    • pp.110-119
    • /
    • 2011
  • Global warming caused by excessive greenhouse gas emission is causing climate change all over the world. In Korea, greenhouse gas emission from residential buildings accounts for about 10% of gross domestic emission. Also, the number of deteriorated multi-family housing complexes is increasing. Therefore, the goal of this research is to establish the bases to manage energy consumption continuously and methodically during MR&R period of multi-family housings. The research process and methodologies are as follows. First, research team collected the data on project characteristics and energy consumption of multi-family housing complexes in Seoul. Second, an ontology-based breakdown structure was established with some primary characteristics affecting the energy consumption, which were selected by statistical analysis. Finally, a predictive model of energy consumption was developed based on the ontology-based breakdown structure, with application of CBR, ANN, MRA and GA. In this research, PASW (Predictive Analytics SoftWare) Statistics 18, Microsoft EXCEL, Protege 4.1 were utilized for data analysis and prediction. In future research, the model will be more continuous and methodical by developing the web-base system. And it has facility manager of government or local government, or multi-family housing complex make a decision with definite references regarding moderate energy consumption.

Prediction of Customer Satisfaction Using RFE-SHAP Feature Selection Method (RFE-SHAP을 활용한 온라인 리뷰를 통한 고객 만족도 예측)

  • Olga Chernyaeva;Taeho Hong
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.4
    • /
    • pp.325-345
    • /
    • 2023
  • In the rapidly evolving domain of e-commerce, our study presents a cohesive approach to enhance customer satisfaction prediction from online reviews, aligning methodological innovation with practical insights. We integrate the RFE-SHAP feature selection with LDA topic modeling to streamline predictive analytics in e-commerce. This integration facilitates the identification of key features-specifically, narrowing down from an initial set of 28 to an optimal subset of 14 features for the Random Forest algorithm. Our approach strategically mitigates the common issue of overfitting in models with an excess of features, leading to an improved accuracy rate of 84% in our Random Forest model. Central to our analysis is the understanding that certain aspects in review content, such as quality, fit, and durability, play a pivotal role in influencing customer satisfaction, especially in the clothing sector. We delve into explaining how each of these selected features impacts customer satisfaction, providing a comprehensive view of the elements most appreciated by customers. Our research makes significant contributions in two key areas. First, it enhances predictive modeling within the realm of e-commerce analytics by introducing a streamlined, feature-centric approach. This refinement in methodology not only bolsters the accuracy of customer satisfaction predictions but also sets a new standard for handling feature selection in predictive models. Second, the study provides actionable insights for e-commerce platforms, especially those in the clothing sector. By highlighting which aspects of customer reviews-like quality, fit, and durability-most influence satisfaction, we offer a strategic direction for businesses to tailor their products and services.

Care Cost Prediction Model for Orphanage Organizations in Saudi Arabia

  • Alhazmi, Huda N;Alghamdi, Alshymaa;Alajlani, Fatimah;Abuayied, Samah;Aldosari, Fahd M
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.4
    • /
    • pp.84-92
    • /
    • 2021
  • Care services are a significant asset in human life. Care in its overall nature focuses on human needs and covers several aspects such as health care, homes, personal care, and education. In fact, care deals with many dimensions: physical, psychological, and social interconnections. Very little information is available on estimating the cost of care services that provided to orphans and abandoned children. Prediction of the cost of the care system delivered by governmental or non-governmental organizations to support orphans and abandoned children is increasingly needed. The purpose of this study is to analyze the care cost for orphanage organizations in Saudi Arabia to forecast the cost as well as explore the most influence factor on the cost. By using business analytic process that applied statistical and machine learning techniques, we proposed a model includes simple linear regression, Naive Bayes classifier, and Random Forest algorithms. The finding of our predictive model shows that Naive Bayes has addressed the highest accuracy equals to 87% in predicting the total care cost. Our model offers predictive approach in the perspective of business analytics.

Bankruptcy Prediction Modeling Using Qualitative Information Based on Big Data Analytics (빅데이터 기반의 정성 정보를 활용한 부도 예측 모형 구축)

  • Jo, Nam-ok;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.2
    • /
    • pp.33-56
    • /
    • 2016
  • Many researchers have focused on developing bankruptcy prediction models using modeling techniques, such as statistical methods including multiple discriminant analysis (MDA) and logit analysis or artificial intelligence techniques containing artificial neural networks (ANN), decision trees, and support vector machines (SVM), to secure enhanced performance. Most of the bankruptcy prediction models in academic studies have used financial ratios as main input variables. The bankruptcy of firms is associated with firm's financial states and the external economic situation. However, the inclusion of qualitative information, such as the economic atmosphere, has not been actively discussed despite the fact that exploiting only financial ratios has some drawbacks. Accounting information, such as financial ratios, is based on past data, and it is usually determined one year before bankruptcy. Thus, a time lag exists between the point of closing financial statements and the point of credit evaluation. In addition, financial ratios do not contain environmental factors, such as external economic situations. Therefore, using only financial ratios may be insufficient in constructing a bankruptcy prediction model, because they essentially reflect past corporate internal accounting information while neglecting recent information. Thus, qualitative information must be added to the conventional bankruptcy prediction model to supplement accounting information. Due to the lack of an analytic mechanism for obtaining and processing qualitative information from various information sources, previous studies have only used qualitative information. However, recently, big data analytics, such as text mining techniques, have been drawing much attention in academia and industry, with an increasing amount of unstructured text data available on the web. A few previous studies have sought to adopt big data analytics in business prediction modeling. Nevertheless, the use of qualitative information on the web for business prediction modeling is still deemed to be in the primary stage, restricted to limited applications, such as stock prediction and movie revenue prediction applications. Thus, it is necessary to apply big data analytics techniques, such as text mining, to various business prediction problems, including credit risk evaluation. Analytic methods are required for processing qualitative information represented in unstructured text form due to the complexity of managing and processing unstructured text data. This study proposes a bankruptcy prediction model for Korean small- and medium-sized construction firms using both quantitative information, such as financial ratios, and qualitative information acquired from economic news articles. The performance of the proposed method depends on how well information types are transformed from qualitative into quantitative information that is suitable for incorporating into the bankruptcy prediction model. We employ big data analytics techniques, especially text mining, as a mechanism for processing qualitative information. The sentiment index is provided at the industry level by extracting from a large amount of text data to quantify the external economic atmosphere represented in the media. The proposed method involves keyword-based sentiment analysis using a domain-specific sentiment lexicon to extract sentiment from economic news articles. The generated sentiment lexicon is designed to represent sentiment for the construction business by considering the relationship between the occurring term and the actual situation with respect to the economic condition of the industry rather than the inherent semantics of the term. The experimental results proved that incorporating qualitative information based on big data analytics into the traditional bankruptcy prediction model based on accounting information is effective for enhancing the predictive performance. The sentiment variable extracted from economic news articles had an impact on corporate bankruptcy. In particular, a negative sentiment variable improved the accuracy of corporate bankruptcy prediction because the corporate bankruptcy of construction firms is sensitive to poor economic conditions. The bankruptcy prediction model using qualitative information based on big data analytics contributes to the field, in that it reflects not only relatively recent information but also environmental factors, such as external economic conditions.

Machine Learning-Based Prediction of COVID-19 Severity and Progression to Critical Illness Using CT Imaging and Clinical Data

  • Subhanik Purkayastha;Yanhe Xiao;Zhicheng Jiao;Rujapa Thepumnoeysuk;Kasey Halsey;Jing Wu;Thi My Linh Tran;Ben Hsieh;Ji Whae Choi;Dongcui Wang;Martin Vallieres;Robin Wang;Scott Collins;Xue Feng;Michael Feldman;Paul J. Zhang;Michael Atalay;Ronnie Sebro;Li Yang;Yong Fan;Wei-hua Liao;Harrison X. Bai
    • Korean Journal of Radiology
    • /
    • v.22 no.7
    • /
    • pp.1213-1224
    • /
    • 2021
  • Objective: To develop a machine learning (ML) pipeline based on radiomics to predict Coronavirus Disease 2019 (COVID-19) severity and the future deterioration to critical illness using CT and clinical variables. Materials and Methods: Clinical data were collected from 981 patients from a multi-institutional international cohort with real-time polymerase chain reaction-confirmed COVID-19. Radiomics features were extracted from chest CT of the patients. The data of the cohort were randomly divided into training, validation, and test sets using a 7:1:2 ratio. A ML pipeline consisting of a model to predict severity and time-to-event model to predict progression to critical illness were trained on radiomics features and clinical variables. The receiver operating characteristic area under the curve (ROC-AUC), concordance index (C-index), and time-dependent ROC-AUC were calculated to determine model performance, which was compared with consensus CT severity scores obtained by visual interpretation by radiologists. Results: Among 981 patients with confirmed COVID-19, 274 patients developed critical illness. Radiomics features and clinical variables resulted in the best performance for the prediction of disease severity with a highest test ROC-AUC of 0.76 compared with 0.70 (0.76 vs. 0.70, p = 0.023) for visual CT severity score and clinical variables. The progression prediction model achieved a test C-index of 0.868 when it was based on the combination of CT radiomics and clinical variables compared with 0.767 when based on CT radiomics features alone (p < 0.001), 0.847 when based on clinical variables alone (p = 0.110), and 0.860 when based on the combination of visual CT severity scores and clinical variables (p = 0.549). Furthermore, the model based on the combination of CT radiomics and clinical variables achieved time-dependent ROC-AUCs of 0.897, 0.933, and 0.927 for the prediction of progression risks at 3, 5 and 7 days, respectively. Conclusion: CT radiomics features combined with clinical variables were predictive of COVID-19 severity and progression to critical illness with fairly high accuracy.

Data economy in Korea: Cases of finance, real estate, and medical care sectors (한국의 데이터경제 현황 및 평가: 금융, 부동산, 의료 부문을 중심으로)

  • Cho, Man;Moon, Seongwuk;Rhee, Inbok;Choi, Seongyun
    • Journal of Technology Innovation
    • /
    • v.31 no.1
    • /
    • pp.65-103
    • /
    • 2023
  • With the recent surge in the share of data-based economic activities, there have been vibrant discussions on the data economy. Yet, few extant works provide a framework for systematically analyzing the transition to the data economy by major industries in Korea. By reviewing the existing literature, we first summarize the main characteristics of the data economy as building platforms, the greater importance of predictive power, and the increased use of new analytics. Next, based on such understanding, we provide a comparative analysis regarding the degree of data-based activities in Korea's financial, real estate, and medical sectors. We find that the speed at which, and the content of the data economy characteristics being realized were different for the different sectors. These findings suggest that differentiated policy approaches by major industrial sectors such as finance, real estate, and medical care are needed to improve economic productivity and increase welfare through the spread of the data economy.

A Study on analyzing brand character of myth material, relevant keyword and relevance with big data of portal site and SNS (포털사이트, SNS의 빅데이터를 이용한 신화소재의 브랜드 캐릭터와 연관어, 연관도 분석)

  • Oh, Sejong;Doo, Illchul
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.11 no.1
    • /
    • pp.157-169
    • /
    • 2015
  • In digital marketing, means of public relations and marketing of enterprises are changing into marketing techniques of predictive analytics. A significant study can be carried out by an analysis of 'the patterns of customers' uses' using big data on major portal sites and SNSs and their correlation with related keywords. This study analyzes the origins of mythological characters in major brands such as Nike, Hermes, Versace, Canon and Starbucks. Also, it extracts related keywords and relevance using big data on portal sites and SNS and their correlation. Nike marketing that reminds people of 'the goddess of victory, Nike' formed a good combination of the brand with relevance. Most of them are based on Greek mythology and have rich materials for storytelling and artistic values in common. Hopefully, this case analysis of foreign brands would become a starting point of discovering the materials of the domestic mythological characters.

Enhance Health Risks Prediction Mechanism in the Cloud Using RT-TKRIBC Technique

  • Konduru, Venkateswara Raju;Bharamgoudra, Manjula R
    • Journal of information and communication convergence engineering
    • /
    • v.19 no.3
    • /
    • pp.166-174
    • /
    • 2021
  • A large volume of patient data is generated from various devices used in healthcare applications. With increase in the volume of data generated in the healthcare industry, more wellness monitoring is required. A cloud-enabled analysis of healthcare data that predicts patient risk factors is required. Machine learning techniques have been developed to address these medical care problems. A novel technique called the radix-trie-based Tanimoto kernel regressive infomax boost classification (RT-TKRIBC) technique is introduced to analyze the heterogeneous health data in the cloud to predict the health risks and send alerts. The infomax boost ensemble technique improves the prediction accuracy by finding the maximum mutual information, thereby minimizing the mean square error. The performance evaluation of the proposed RT-TKRIBC technique is realized through extensive simulations in the cloud environment, which provides better prediction accuracy and less prediction time than those provided by the state-of-the-art methods.

A study on the MD&A Disclosure Quality in real-time calculated and provided By Programming Technology

  • Shin, YeounOuk
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.11 no.3
    • /
    • pp.41-48
    • /
    • 2019
  • The Management Discussion and Analysis(MD&A) provides investors with an opportunity to gain insight into the company from a manager's perspective and enables short-term and long-term analysis of the business. And MD&A is an important channel through which companies and investors can communicate, providing a useful source of information for analyzing financialstatements. MD&A is measured by the quality of disclosure and there are many previous studies on the usefulness of disclosure information. Therefore, it is very important for the financial analyst who is the representative information user group in the capital market that MD&A Disclosure Quality is measured in real-time in combination with IT information technology and provided timely to financial analyst. In this study, we propose a method that real-time data is converted to digitalized data by combining MD&A disclosure with IT information technology and provided to financial analyst's information environment in real-time. The real-time information provided by MD&A can help the financial analysts' activities and reduce information asymmetry.