• Title/Summary/Keyword: machine learning classification models

Search Result 364, Processing Time 0.027 seconds

Machine Learning Algorithm Accuracy for Code-Switching Analytics in Detecting Mood

  • Latib, Latifah Abd;Subramaniam, Hema;Ramli, Siti Khadijah;Ali, Affezah;Yulia, Astri;Shahdan, Tengku Shahrom Tengku;Zulkefly, Nor Sheereen
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.9
    • /
    • pp.334-342
    • /
    • 2022
  • Nowadays, as we can notice on social media, most users choose to use more than one language in their online postings. Thus, social media analytics needs reviewing as code-switching analytics instead of traditional analytics. This paper aims to present evidence comparable to the accuracy of code-switching analytics techniques in analysing the mood state of social media users. We conducted a systematic literature review (SLR) to study the social media analytics that examined the effectiveness of code-switching analytics techniques. One primary question and three sub-questions have been raised for this purpose. The study investigates the computational models used to detect and measures emotional well-being. The study primarily focuses on online postings text, including the extended text analysis, analysing and predicting using past experiences, and classifying the mood upon analysis. We used thirty-two (32) papers for our evidence synthesis and identified four main task classifications that can be used potentially in code-switching analytics. The tasks include determining analytics algorithms, classification techniques, mood classes, and analytics flow. Results showed that CNN-BiLSTM was the machine learning algorithm that affected code-switching analytics accuracy the most with 83.21%. In addition, the analytics accuracy when using the code-mixing emotion corpus could enhance by about 20% compared to when performing with one language. Our meta-analyses showed that code-mixing emotion corpus was effective in improving the mood analytics accuracy level. This SLR result has pointed to two apparent gaps in the research field: i) lack of studies that focus on Malay-English code-mixing analytics and ii) lack of studies investigating various mood classes via the code-mixing approach.

Inhalation Configuration Detection for COVID-19 Patient Secluded Observing using Wearable IoTs Platform

  • Sulaiman Sulmi Almutairi;Rehmat Ullah;Qazi Zia Ullah;Habib Shah
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.18 no.6
    • /
    • pp.1478-1499
    • /
    • 2024
  • Coronavirus disease (COVID-19) is an infectious disease caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus. COVID-19 become an active epidemic disease due to its spread around the globe. The main causes of the spread are through interaction and transmission of the droplets through coughing and sneezing. The spread can be minimized by isolating the susceptible patients. However, it necessitates remote monitoring to check the breathing issues of the patient remotely to minimize the interactions for spread minimization. Thus, in this article, we offer a wearable-IoTs-centered framework for remote monitoring and recognition of the breathing pattern and abnormal breath detection for timely providing the proper oxygen level required. We propose wearable sensors accelerometer and gyroscope-based breathing time-series data acquisition, temporal features extraction, and machine learning algorithms for pattern detection and abnormality identification. The sensors provide the data through Bluetooth and receive it at the server for further processing and recognition. We collect the six breathing patterns from the twenty subjects and each pattern is recorded for about five minutes. We match prediction accuracies of all machine learning models under study (i.e. Random forest, Gradient boosting tree, Decision tree, and K-nearest neighbor. Our results show that normal breathing and Bradypnea are the most correctly recognized breathing patterns. However, in some cases, algorithm recognizes kussmaul well also. Collectively, the classification outcomes of Random Forest and Gradient Boost Trees are better than the other two algorithms.

Identifying Personal Values Influencing the Lifestyle of Older Adults: Insights From Relative Importance Analysis Using Machine Learning (중고령 노인의 개인적 가치에 따른 라이프스타일 분류: 머신러닝을 활용한 상대적 중요도 분석 )

  • Lim, Seungju;Park, Ji-Hyuk
    • Therapeutic Science for Rehabilitation
    • /
    • v.13 no.2
    • /
    • pp.69-84
    • /
    • 2024
  • Objective : This study aimed to categorize the lifestyles of older adults into two types - healthy and unhealthy, and use machine learning to identify the personal values that influence these lifestyles. Methods : This cross-sectional study targeting middle-aged and older adults (55 years and above) living in local communities in South Korea. Data were collected from 300 participants through online surveys. Lifestyle types were dichotomized by the Yonsei Lifestyle Profile (YLP)-Active, Balanced, Connected, and Diverse (ABCD) responses using latent profile analysis. Personal value information was collected using YLP-Values (YLP-V) and analyzed using machine learning to identify the relative importance of personal values on lifestyle types. Results : The lifestyle of older adults was categorized into healthy (48.87%) and unhealthy (51.13%). These two types showed the most significant difference in social relationship characteristics. Among the machine learning models used in this study, the support vector machine showed the highest classification performance, achieving 96% accuracy and 95% area under the receiver operating characteristic (ROC) curve. The model indicated that individuals who prioritized a healthy diet, sought health information, and engaged in hobbies or cultural activities were more likely to have a healthy lifestyle. Conclusion : This study suggests the need to encourage the expansion of social networks among older adults. Furthermore, it highlights the necessity to comprehensively intervene in individuals' perceptions and values that primarily influence lifestyle adherence.

Evaluation of Interpretability for Generated Rules from ANFIS (ANFIS에서 생성된 규칙의 해석용이성 평가)

  • Song, Hee-Seok;Kim, Jae-Kyeong
    • Journal of Intelligence and Information Systems
    • /
    • v.15 no.4
    • /
    • pp.123-140
    • /
    • 2009
  • Fuzzy neural network is an integrated model of artificial neural network and fuzzy system and it has been successfully applied in control and forecasting area. Recently ANFIS(Adaptive Network-based Fuzzy Inference System) has been noticed widely among various fuzzy neural network models because of outstanding performance of control and forecasting accuracy. ANFIS has capability to refine its fuzzy rules interactively with human expert. In particular, when we use initial rule structure for machine learning which is generated from human expert, it is highly probable to reach global optimum solution as well as shorten time to convergence. We propose metrics to evaluate interpretability of generated rules as a means of acquiring domain knowledge and compare level of interpretability of ANFIS fuzzy rules to those of C5.0 classification rules. The proposed metrics also can be used to evaluate capability of rule generation for the various machine learning methods.

  • PDF

Deep Learning for Classification of High-End Fashion Brand Sensibility (딥러닝을 통한 하이엔드 패션 브랜드 감성 학습)

  • Jang, Seyoon;Kim, Ha Youn;Lee, Yuri;Seol, Jinseok;Kim, Seongjae;Lee, Sang-goo
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • v.46 no.1
    • /
    • pp.165-181
    • /
    • 2022
  • The fashion industry is creating innovative business models using artificial intelligence. To efficiently utilize artificial intelligence (AI), fashion data must be classified. Until now, such data have been classified focusing only on the objective properties of fashion products. Their subjective attributes, such as fashion brand sensibilities, are holistic and heuristic intuitions created by a combination of design elements. This study aims to improve the performance of collaborative filtering in the fashion industry by extracting fashion brand sensibility using computer vision technology. The image data set of fashion brand sensibility consists of high-end fashion brand photos that share sensibilities and communicate well in fashion. About 26,000 fashion photos of 11 high-end fashion brand sensibility labels have been collected from the 16FW to 21SS runway and 50 years of US Vogue magazines beginning from 1971. We use EfficientNet-B1 to establish the main architecture and fine-tune the network with ImageNet-ILSVRC. After training fashion brand sensibilities through deep learning, the proposed model achieved an F-1 score of 74% on accuracy tests. Furthermore, as a result of comparing AI machine and human experts, the proposed model is expected to be expanded to mass fashion brands.

Predicting Game Results using Machine Learning and Deriving Strategic Direction from Variable Importance (기계학습을 활용한 게임승패 예측 및 변수중요도 산출을 통한 전략방향 도출)

  • Kim, Yongwoo;Kim, Young‐Min
    • Journal of Korea Game Society
    • /
    • v.21 no.4
    • /
    • pp.3-12
    • /
    • 2021
  • In this study, models for predicting the final result of League of Legends game were constructed for each rank using data from the first 10 minutes of the game. Variable importance was extracted from the prediction models to derive strategic direction in early phase of the game. As a result, it was possible to predict final results with over 70% accuracy in all ranks. It was found that early game advantage tends to lead to the final win and this tendency appeared stronger as it goes to challenger ranks. Kill(death) was found to be the most influential factor for win, however, there were also variables whose importance rank changed according to rank. This indicates there is a difference in the strategic direction in the early stage of the game depending on the rank.

A Practical Feature Extraction for Improving Accuracy and Speed of IDS Alerts Classification Models Based on Machine Learning (기계학습 기반 IDS 보안이벤트 분류 모델의 정확도 및 신속도 향상을 위한 실용적 feature 추출 연구)

  • Shin, Iksoo;Song, Jungsuk;Choi, Jangwon;Kwon, Taewoong
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.28 no.2
    • /
    • pp.385-395
    • /
    • 2018
  • With the development of Internet, cyber attack has become a major threat. To detect cyber attacks, intrusion detection system(IDS) has been widely deployed. But IDS has a critical weakness which is that it generates a large number of false alarms. One of the promising techniques that reduce the false alarms in real time is machine learning. However, there are problems that must be solved to use machine learning. So, many machine learning approaches have been applied to this field. But so far, researchers have not focused on features. Despite the features of IDS alerts are important for performance of model, the approach to feature is ignored. In this paper, we propose new feature set which can improve the performance of model and can be extracted from a single alarm. New features are motivated from security analyst's know-how. We trained and tested the proposed model applied new feature set with real IDS alerts. Experimental results indicate the proposed model can achieve better accuracy and false positive rate than SVM model with ordinary features.

Pedestrian Classification using CNN's Deep Features and Transfer Learning (CNN의 깊은 특징과 전이학습을 사용한 보행자 분류)

  • Chung, Soyoung;Chung, Min Gyo
    • Journal of Internet Computing and Services
    • /
    • v.20 no.4
    • /
    • pp.91-102
    • /
    • 2019
  • In autonomous driving systems, the ability to classify pedestrians in images captured by cameras is very important for pedestrian safety. In the past, after extracting features of pedestrians with HOG(Histogram of Oriented Gradients) or SIFT(Scale-Invariant Feature Transform), people classified them using SVM(Support Vector Machine). However, extracting pedestrian characteristics in such a handcrafted manner has many limitations. Therefore, this paper proposes a method to classify pedestrians reliably and effectively using CNN's(Convolutional Neural Network) deep features and transfer learning. We have experimented with both the fixed feature extractor and the fine-tuning methods, which are two representative transfer learning techniques. Particularly, in the fine-tuning method, we have added a new scheme, called M-Fine(Modified Fine-tuning), which divideslayers into transferred parts and non-transferred parts in three different sizes, and adjusts weights only for layers belonging to non-transferred parts. Experiments on INRIA Person data set with five CNN models(VGGNet, DenseNet, Inception V3, Xception, and MobileNet) showed that CNN's deep features perform better than handcrafted features such as HOG and SIFT, and that the accuracy of Xception (threshold = 0.5) isthe highest at 99.61%. MobileNet, which achieved similar performance to Xception and learned 80% fewer parameters, was the best in terms of efficiency. Among the three transfer learning schemes tested above, the performance of the fine-tuning method was the best. The performance of the M-Fine method was comparable to or slightly lower than that of the fine-tuningmethod, but higher than that of the fixed feature extractor method.

Predicting fetal toxicity of drugs through attention algorithm (Attention 알고리즘 기반 약물의 태아 독성 예측 연구)

  • Jeong, Myeong-hyeon;Yoo, Sun-yong
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.05a
    • /
    • pp.273-275
    • /
    • 2022
  • The use of drugs by pregnant women poses a potential risk to the fetus. Therefore, it is essential to classify drugs that pregnant women should prohibit. However, the fetal toxicity of most drugs has not been identified. This takes a lot of time and cost. In silico approaches, such as virtual screening, can identify compounds that may present a high risk to the fetus for a wide range of compounds at the low cost and time. We collected class information of each drug from the hazard classification lists for prescribing drugs in pregnancy by the government of Korea and Australia. Using the structural and chemical features of each drug, various machine learning models were constructed to predict fetal toxicity of drugs. For all models, the quantitative performance evaluation was performed. Based on the attention algorithm, important molecular substructures of compounds were identified in the process of predicting the fetal toxicity of the drug by the proposed model. From the results, we confirmed that drugs with a high risk of fetal toxicity can be predicted for a wide range of compounds by machine learning. This study can be used as a pre-screening tool for fetal toxicity predictions, as it provides key molecular substructures associated with the fetal toxicity of compounds.

  • PDF

Classification of 18F-Florbetaben Amyloid Brain PET Image using PCA-SVM

  • Cho, Kook;Kim, Woong-Gon;Kang, Hyeon;Yang, Gyung-Seung;Kim, Hyun-Woo;Jeong, Ji-Eun;Yoon, Hyun-Jin;Jeong, Young-Jin;Kang, Do-Young
    • Biomedical Science Letters
    • /
    • v.25 no.1
    • /
    • pp.99-106
    • /
    • 2019
  • Amyloid positron emission tomography (PET) allows early and accurate diagnosis in suspected cases of Alzheimer's disease (AD) and contributes to future treatment plans. In the present study, a method of implementing a diagnostic system to distinguish ${\beta}$-Amyloid ($A{\beta}$) positive from $A{\beta}$ negative with objectiveness and accuracy was proposed using a machine learning approach, such as the Principal Component Analysis (PCA) and Support Vector Machine (SVM). $^{18}F$-Florbetaben (FBB) brain PET images were arranged in control and patients (total n = 176) with mild cognitive impairment and AD. An SVM was used to classify the slices of registered PET image using PET template, and a system was created to diagnose patients comprehensively from the output of the trained model. To compare the per-slice classification, the PCA-SVM model observing the whole brain (WB) region showed the highest performance (accuracy 92.38, specificity 92.87, sensitivity 92.87), followed by SVM with gray matter masking (GMM) (accuracy 92.22, specificity 92.13, sensitivity 92.28) for $A{\beta}$ positivity. To compare according to per-subject classification, the PCA-SVM with WB also showed the highest performance (accuracy 89.21, specificity 71.67, sensitivity 98.28), followed by PCA-SVM with GMM (accuracy 85.80, specificity 61.67, sensitivity 98.28) for $A{\beta}$ positivity. When comparing the area under curve (AUC), PCA-SVM with WB was the highest for per-slice classifiers (0.992), and the models except for SVM with WM were highest for the per-subject classifier (1.000). We can classify $^{18}F$-Florbetaben amyloid brain PET image for $A{\beta}$ positivity using PCA-SVM model, with no additional effects on GMM.