• Title/Summary/Keyword: class imbalance

Using Machine Learning Technique for Analytical Customer Loyalty

  • Mohamed M. Abbassy
    • International Journal of Computer Science & Network Security / v.23 no.8 / pp.190-198 / 2023
  • To raise customer satisfaction and profit, an e-commerce business can maintain continuous relationships with existing customers while acquiring new ones. Machine-learning models that analyse customers' behavioural data can give the platform a competitive advantage by helping to improve overall satisfaction. These models forecast which customers will churn and why, and the forecasts are used to build tailored business strategies and service offers. This work aims to develop a machine-learning model that accurately forecasts retainable customers across the entire e-commerce customer data. Building predictive models that classify imbalanced data effectively is a major challenge for both the collected data and the learning algorithms, so the model is built to handle class imbalance while forecasting customers. Accuracy of the satisfaction prediction is the evaluation metric. The paper evaluates different machine-learning models for forecasting satisfaction; three analytical methods drawn from different families of learning algorithms were selected. For classifier selection, the efficiency of Random Forest, Logistic Regression, SVM, and Gradient Boosting classifiers is compared. The models were applied to a dataset of 8,000 records from e-commerce websites and apps. The results indicate that the gradient boosting algorithm gives the best accuracy in determining the satisfaction class, ahead of the Support Vector Machine, Random Forest, and Logistic Regression algorithms. The best model developed in this paper forecasts customer satisfaction with an accuracy of 88%.

Energy Use Coordinator for Multiple Personal Sensor Devices

  • Rhee, Yunseok
    • Journal of the Korea Society of Computer and Information / v.22 no.2 / pp.9-19 / 2017
  • Useful continuous sensing applications are increasingly emerging as a new class of mobile applications. Meanwhile, open, multi-use sensor devices are newly adopted beyond smartphones, and provide huge opportunities to expand potential application categories. In this upcoming environment, uncoordinated use of sensor devices would cause severe imbalance in power consumption of devices, and thus result in early shutdown of some sensing applications depending on power-hungry devices. In this paper, we propose EnergyCordy, a novel inter-device energy use coordination system; with a system-wide holistic view, it coordinates the energy use of concurrent sensing applications over multiple sensor devices. As its key approach, we propose a relaxed sensor association; it decouples the energy use of an application from specific sensor devices leveraging multiple context inference alternatives, allowing flexible energy coordination at runtime. We demonstrated the effectiveness of EnergyCordy by developing multiple example applications over custom-designed wearable sensor devices. We show that EnergyCordy effectively coordinates the power usage of concurrent sensing applications over multiple devices and prevents undesired early shutdown of applications.

The Comparison of Vibration and Power according to Operation Method of 100W IPM Type Motor

  • Lee, Gyeong-Deuk;Jo, Eul-Gyu;Kim, Gyu-Tak
    • Journal of international Conference on Electrical Machines and Systems / v.3 no.4 / pp.383-388 / 2014
  • In this paper, the output characteristics and vibrations of a 100 W class motor were compared and analyzed according to the operation method. In the BLDC drive system, the voltage source is applied to only two phases, so commutation torque ripple and an imbalance of the rotating magnetic field (RMF) occur. As a result, efficiency was significantly degraded because mechanical loss increased, and considerable vibration and noise were generated. The vibration and output characteristics were compared and analyzed for the three-phase and BLDC drive systems.

Adversarial Training Method for Handling Class Imbalance Problems in Dialog Datasets (대화 데이터셋의 클래스 불균형 문제 보정을 위한 적대적 학습 기법)

  • Cho, Su-Phil;Choi, Yong Suk
    • Annual Conference on Human and Language Technology / 2019.10a / pp.434-439 / 2019
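  • In deep-learning-based classification models, class imbalance in the data greatly degrades classification performance on minority classes. This paper proposes an adversarial training method as a way to compensate for this class imbalance problem. To check whether adversarial training improves performance, we defined four deep-learning-based classification models and compared their classification performance. The experiments confirmed that, when training models on a dialog dataset, applying adversarial training greatly improves classification performance on minority classes while maintaining performance on the majority classes.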

Classification Performance Improvement of UNSW-NB15 Dataset Based on Feature Selection (특징선택 기법에 기반한 UNSW-NB15 데이터셋의 분류 성능 개선)

  • Lee, Dae-Bum;Seo, Jae-Hyun
    • Journal of the Korea Convergence Society / v.10 no.5 / pp.35-42 / 2019
  • Recently, with the appearance of the Internet and various wearable devices, Internet technology has made it more convenient to obtain information and do business. However, as the Internet is used in more areas, the attack surface exposed to attacks is growing, and attempts to invade networks to gain unfair advantage, such as cyber terrorism, are also increasing. In this paper, we propose a feature selection method to improve classification performance for the classes that identify abnormal behavior in network traffic. The UNSW-NB15 dataset has a class imbalance problem in which rare classes have relatively few instances compared with the other classes, and an undersampling method is used to address it. We use the SVM, k-NN, and decision tree algorithms and extract feature subsets with superior detection accuracy and RMSE through training and verification. In the wrapper-based experiments, the subsets achieve recall values above 98%, and DT_PSO shows the best performance.
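
The abstract above mentions undersampling but does not say which variant the authors used. As a rough illustration only, the sketch below shows plain random undersampling, which trims every class down to the size of the rarest one. The function name `random_undersample` and the toy labels are hypothetical and are not taken from the paper or from the UNSW-NB15 dataset.

```python
import random
from collections import Counter, defaultdict

def random_undersample(samples, labels, seed=0):
    """Randomly drop samples so that every class keeps only as many
    samples as the rarest class (plain random undersampling)."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for x, y in zip(samples, labels):
        by_class[y].append(x)
    # Size of the rarest class sets the per-class budget.
    n_min = min(len(xs) for xs in by_class.values())
    out_x, out_y = [], []
    for y, xs in by_class.items():
        for x in rng.sample(xs, n_min):
            out_x.append(x)
            out_y.append(y)
    return out_x, out_y

# Toy data (made up): 90 "normal" records and 10 "attack" records.
X = list(range(100))
y = ["normal"] * 90 + ["attack"] * 10
X_bal, y_bal = random_undersample(X, y)
print(Counter(y_bal))
```

Random undersampling discards majority-class data, so it trades information for balance; whether that trade is acceptable depends on how much majority data is available.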

Load Balancing in Cloud Computing Using Meta-Heuristic Algorithm

  • Fahim, Youssef;Rahhali, Hamza;Hanine, Mohamed;Benlahmar, El-Habib;Labriji, El-Houssine;Hanoune, Mostafa;Eddaoui, Ahmed
    • Journal of Information Processing Systems / v.14 no.3 / pp.569-589 / 2018
  • Cloud computing, also known as "country as you go", is used to turn any computer into a dematerialized architecture in which users can access different services. Alongside the daily growth in the number of stakeholders and beneficiaries, the imbalance between the virtual machines of data centers in a cloud environment degrades performance, as it decreases the hardware resources and the software's profitability. Our research focuses on load balancing between a data center's virtual machines, which reduces the degree of load imbalance between those machines in order to solve the problems caused by this technological evolution and to ensure a higher quality of service. Our article focuses on two main phases: the pre-classification of tasks according to the requested resources, and the classification of tasks into levels ('odd levels' or 'even levels') in ascending order based on the meta-heuristic "Bat-algorithm". Task allocation is based on the levels provided by the bat-algorithm and on our mathematical functions, and we divide our system into a number of virtual machines with nearly equal performance. Otherwise, we suggest different classes of virtual machines, with the condition that each class should contain machines with similar characteristics compared to the existing binary search scheme.

Dynamically weighted loss based domain adversarial training for children's speech recognition (어린이 음성인식을 위한 동적 가중 손실 기반 도메인 적대적 훈련)

  • Seunghee, Ma
    • The Journal of the Acoustical Society of Korea / v.41 no.6 / pp.647-654 / 2022
  • Although children's speech recognition is being used in a growing number of fields, the lack of quality data is an obstacle to improving its performance. This paper proposes a new method for improving children's speech recognition performance by additionally using adult speech data. The proposed method is transformer-based domain adversarial training that uses a dynamically weighted loss to effectively address the data imbalance between age groups, which grows as the amount of adult training data increases. Specifically, the degree of class imbalance in each mini-batch during training is quantified, and the loss function is defined so that classes with less data receive greater weight. Experiments validate the utility of the proposed domain adversarial training under asymmetry between adult and child training data, and show that the proposed method achieves higher children's speech recognition performance than the traditional domain adversarial training method under all conditions in which the training data are asymmetric between age groups.
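
The abstract does not give the exact weighting formula, so the following is only a minimal sketch of the general idea of a loss whose class weights are recomputed from each mini-batch. It assumes inverse-frequency weights and a plain cross-entropy over two domain labels; the names `batch_class_weights` and `weighted_cross_entropy`, and the toy batch, are hypothetical.

```python
import math
from collections import Counter

def batch_class_weights(batch_labels):
    """Inverse-frequency weights computed from the current mini-batch
    (an assumed scheme): weight[c] = batch_size / (classes_present * count[c])."""
    counts = Counter(batch_labels)
    n, k = len(batch_labels), len(counts)
    return {c: n / (k * cnt) for c, cnt in counts.items()}

def weighted_cross_entropy(probs, batch_labels):
    """Mean of -weight[y] * log p(y) over the batch.
    probs[i] maps each class to its predicted probability for sample i."""
    w = batch_class_weights(batch_labels)
    losses = [-w[y] * math.log(p[y]) for p, y in zip(probs, batch_labels)]
    return sum(losses) / len(losses)

# Toy mini-batch (made up): 6 adult utterances and 2 child utterances.
labels = ["adult"] * 6 + ["child"] * 2
weights = batch_class_weights(labels)

# An uninformative classifier that predicts 0.5 for both domains.
probs = [{"adult": 0.5, "child": 0.5} for _ in labels]
loss = weighted_cross_entropy(probs, labels)
print(weights, loss)
```

Because the weights are derived from the batch itself, they change from step to step as the adult/child mix changes, which is one way to read "dynamically weighted".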

Geometric and Semantic Improvement for Unbiased Scene Graph Generation

  • Ruhui Zhang;Pengcheng Xu;Kang Kang;You Yang
    • KSII Transactions on Internet and Information Systems (TIIS) / v.17 no.10 / pp.2643-2657 / 2023
  • Scene graphs are structured representations that can clearly convey objects and the relationships between them, but are often heavily biased due to the highly skewed, long-tailed relational labeling in the dataset. Indeed, the visual world itself and its descriptions are biased. Therefore, Unbiased Scene Graph Generation (USGG) prefers to train models to eliminate long-tail effects as much as possible, rather than altering the dataset directly. To this end, we propose Geometric and Semantic Improvement (GSI) for USGG to mitigate this issue. First, to fully exploit the feature information in the images, geometric dimension and semantic dimension enhancement modules are designed. The geometric module is designed from the perspective that the position information between neighboring object pairs will affect each other, which can improve the recall rate of the overall relationship in the dataset. The semantic module further processes the embedded word vector, which can enhance the acquisition of semantic information. Then, to improve the recall rate of the tail data, the Class Balanced Seesaw Loss (CBSLoss) is designed for the tail data. The recall rate of the prediction is improved by penalizing the body or tail relations that are judged incorrectly in the dataset. The experimental findings demonstrate that the GSI method performs better than mainstream models in terms of the mean Recall@K (mR@K) metric in three tasks. The long-tailed imbalance in the Visual Genome 150 (VG150) dataset is addressed better using the GSI method than by most of the existing methods.

Wild Bird Sound Classification Scheme using Focal Loss and Ensemble Learning (Focal Loss와 앙상블 학습을 이용한 야생조류 소리 분류 기법)

  • Jaeseung Lee;Jehyeok Rew
    • Journal of Korea Society of Industrial Information Systems / v.29 no.2 / pp.15-25 / 2024
  • For effective analysis of animal ecosystems, technology that can automatically identify the current status of animal habitats is crucial. Specifically, animal sound classification, which identifies species based on their sounds, is gaining great attention where video-based discrimination is impractical. Traditional studies have relied on a single deep learning model to classify animal sounds. However, sounds collected in outdoor settings often include substantial background noise, complicating the task for a single model. In addition, data imbalance among species may lead to biased model training. To address these challenges, in this paper, we propose an animal sound classification scheme that combines predictions from multiple models using Focal Loss, which adjusts penalties based on class data volume. Experiments on public datasets have demonstrated that our scheme can improve recall by up to 22.6% compared to an average of single models.
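
For readers unfamiliar with Focal Loss, here is a small sketch of its standard per-sample form, -alpha * (1 - p_t)^gamma * log(p_t), where p_t is the predicted probability of the true class. The values of `gamma` and `alpha` below are illustrative defaults, not the settings used in the paper, and the ensemble step is not shown.

```python
import math

def cross_entropy(p_true):
    """Ordinary cross-entropy for one sample: -log(p_t)."""
    return -math.log(p_true)

def focal_loss(p_true, gamma=2.0, alpha=1.0):
    """Focal loss for one sample: -alpha * (1 - p_t)**gamma * log(p_t).
    gamma down-weights well-classified samples; alpha is an optional
    per-class weight that can reflect how much data a class has."""
    return -alpha * (1.0 - p_true) ** gamma * math.log(p_true)

# Compare a confidently correct prediction with a poorly classified one.
for p in (0.9, 0.1):
    print(p, cross_entropy(p), focal_loss(p))
```

With `gamma=0` and `alpha=1` the expression reduces to ordinary cross-entropy; larger `gamma` shifts the training signal toward hard, often minority-class, samples.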

The Effect of Daytime Jobs on the Dietary Behaviors and Nutrient Intakes among Evening Class University Students (직장의 유무가 야간대학생의 식행동 및 영양섭취 상태에 미치는 영향)

  • 최현주;전예숙;박민경;최미경
    • Korean Journal of Community Nutrition / v.4 no.2 / pp.175-185 / 1999
  • This study was conducted among university students to investigate the effects of working in the daytime and studying in an evening class on dietary behaviors and nutrient intakes. A 24-hour recall method was used along with food frequency questionnaires. The subjects comprised three groups: 172 daytime class regular students (RS), 178 evening class students without a job (ESWOJ), and 140 evening class students with a job (ESWJ). The average age of the ESWJ was 31 years, which was 9 years older than the RS and 7 years older than the ESWOJ. The types of residence differed significantly among the groups: 74% of the ESWJ stayed at home, while 47% of the RS and 61% of the ESWOJ were self-boarding. The survey shows that while a great majority of the RS and ESWOJ had breakfast and supper at home, about 10% and 17% of the ESWJ ate breakfast and supper, respectively, at the cafeteria in their workplace. The most frequently skipped meal was breakfast in all groups. Approximately 9% of the ESWJ tended to skip breakfast, while 27% of the RS and 33% of the ESWOJ did. The most common reason the ESWJ skipped meals was a lack of time. About 71% of the ESWJ ate out more than once a week, compared to 57% and 62% of the RS and ESWOJ, respectively. Approximately 30% of the ESWJ spent less than 10 minutes at meal time, showing a hurried eating habit. The results show that the ESWJ tended to eat a variety of foods and eat fewer instant foods, but drank coffee more often than the other two groups. The results also show that in all groups, the intakes of energy, Ca, and vitamin A were lower than the Korean Recommended Daily Allowances (RDA). In addition to these nutrients, protein, vitamin B1, vitamin B2, and niacin were also lower than the RDA in all male students, and Fe was lower than the RDA in all female students. Among the deficient nutrients, vitamin A showed the lowest % RDA: 48% in the male and 64% in the female students. The intakes of Ca, Fe, and niacin in the ESWJ were significantly greater than those in the RS and ESWOJ groups, and the vitamin C intake in the ESWJ was greater than that in the ESWOJ (p<0.01). In conclusion, university students in all three groups have a serious imbalance in nutrient intakes, and the evening class students with a daytime job have unhealthy dietary behaviors, namely a high frequency of eating out, heavy coffee drinking, and a rushed eating pattern, indicating a strong need for proper nutritional education for them.
