• Title/Summary/Keyword: support vector machine(SVM)

Search Result 1,254, Processing Time 0.024 seconds

Performance Evaluation of Machine Learning Algorithms for Cloud Removal of Optical Imagery: A Case Study in Cropland (광학 영상의 구름 제거를 위한 기계학습 알고리즘의 예측 성능 평가: 농경지 사례 연구)

  • Soyeon Park;Geun-Ho Kwak;Ho-Yong Ahn;No-Wook Park
    • Korean Journal of Remote Sensing
    • /
    • v.39 no.5_1
    • /
    • pp.507-519
    • /
    • 2023
  • Multi-temporal optical images have been utilized for time-series monitoring of croplands. However, the presence of clouds imposes limitations on image availability, often requiring a cloud removal procedure. This study assesses the applicability of various machine learning algorithms for effective cloud removal in optical imagery. We conducted comparative experiments by focusing on two key variables that significantly influence the predictive performance of machine learning algorithms: (1) land-cover types of training data and (2) temporal variability of land-cover types. Three machine learning algorithms, including Gaussian process regression (GPR), support vector machine (SVM), and random forest (RF), were employed for the experiments using simulated cloudy images in paddy fields of Gunsan. GPR and SVM exhibited superior prediction accuracy when the training data had the same land-cover types as the cloud region, and GPR showed the best stability with respect to sampling fluctuations. In addition, RF was the least affected by the land-cover types and temporal variations of training data. These results indicate that GPR is recommended when the land-cover type and spectral characteristics of the training data are the same as those of the cloud region. On the other hand, RF should be applied when it is difficult to obtain training data with the same land-cover types as the cloud region. Therefore, the land-cover types in cloud areas should be taken into account for extracting informative training data along with selecting the optimal machine learning algorithm.

Resume Classification System using Natural Language Processing & Machine Learning Techniques

  • Irfan Ali;Nimra;Ghulam Mujtaba;Zahid Hussain Khand;Zafar Ali;Sajid Khan
    • International Journal of Computer Science & Network Security
    • /
    • v.24 no.7
    • /
    • pp.108-117
    • /
    • 2024
  • The selection and recommendation of a suitable job applicant from the pool of thousands of applications are often daunting jobs for an employer. The recommendation and selection process significantly increases the workload of the concerned department of an employer. Thus, Resume Classification System using the Natural Language Processing (NLP) and Machine Learning (ML) techniques could automate this tedious process and ease the job of an employer. Moreover, the automation of this process can significantly expedite and transparent the applicants' selection process with mere human involvement. Nevertheless, various Machine Learning approaches have been proposed to develop Resume Classification Systems. However, this study presents an automated NLP and ML-based system that classifies the Resumes according to job categories with performance guarantees. This study employs various ML algorithms and NLP techniques to measure the accuracy of Resume Classification Systems and proposes a solution with better accuracy and reliability in different settings. To demonstrate the significance of NLP & ML techniques for processing & classification of Resumes, the extracted features were tested on nine machine learning models Support Vector Machine - SVM (Linear, SGD, SVC & NuSVC), Naïve Bayes (Bernoulli, Multinomial & Gaussian), K-Nearest Neighbor (KNN) and Logistic Regression (LR). The Term-Frequency Inverse Document (TF-IDF) feature representation scheme proven suitable for Resume Classification Task. The developed models were evaluated using F-ScoreM, RecallM, PrecissionM, and overall Accuracy. The experimental results indicate that using the One-Vs-Rest-Classification strategy for this multi-class Resume Classification task, the SVM class of Machine Learning algorithms performed better on the study dataset with over 96% overall accuracy. The promising results suggest that NLP & ML techniques employed in this study could be used for the Resume Classification task.

Using Machine Learning Technique for Analytical Customer Loyalty

  • Mohamed M. Abbassy
    • International Journal of Computer Science & Network Security
    • /
    • v.23 no.8
    • /
    • pp.190-198
    • /
    • 2023
  • To enhance customer satisfaction for higher profits, an e-commerce sector can establish a continuous relationship and acquire new customers. Utilize machine-learning models to analyse their customer's behavioural evidence to produce their competitive advantage to the e-commerce platform by helping to improve overall satisfaction. These models will forecast customers who will churn and churn causes. Forecasts are used to build unique business strategies and services offers. This work is intended to develop a machine-learning model that can accurately forecast retainable customers of the entire e-commerce customer data. Developing predictive models classifying different imbalanced data effectively is a major challenge in collected data and machine learning algorithms. Build a machine learning model for solving class imbalance and forecast customers. The satisfaction accuracy is used for this research as evaluation metrics. This paper aims to enable to evaluate the use of different machine learning models utilized to forecast satisfaction. For this research paper are selected three analytical methods come from various classifications of learning. Classifier Selection, the efficiency of various classifiers like Random Forest, Logistic Regression, SVM, and Gradient Boosting Algorithm. Models have been used for a dataset of 8000 records of e-commerce websites and apps. Results indicate the best accuracy in determining satisfaction class with both gradient-boosting algorithm classifications. The results showed maximum accuracy compared to other algorithms, including Gradient Boosting Algorithm, Support Vector Machine Algorithm, Random Forest Algorithm, and logistic regression Algorithm. The best model developed for this paper to forecast satisfaction customers and accuracy achieve 88 %.

Adult Image Classification using Adaptive Skin Detection and Edge Information (적응적 피부색 검출과 에지 정보를 이용한 유해 영상분류방법)

  • Park, Chan-Woo;Park, Ki-Tae;Moon, Young-Shik
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.48 no.1
    • /
    • pp.127-132
    • /
    • 2011
  • In this paper, we propose a novel method of adult image classification by combining skin color regions and edges in an input image. The proposed method consists of four steps. In the first step, initial skin color regions are detected by logical AND operation of all skin color regions detected by the existing methods of skin color detection. In the second step, a skin color probability map is created by modeling the distribution of skin color in the initial regions. Then, a binary image is generated by using threshold value from the skin color probability map. In the third step, after using the binary image and edge information, we detect final skin color regions using a region growing method. In the final step, adult image classification is performed by support vector machine(SVM). To this end, a feature vector is extracted by combining the final skin color regions and neighboring edges of them. As experimental results, the proposed method improves performance of the adult image classification by 9.6%, compared to the existing method.

Traffic Classification based on Adjustable Convex-hull Support Vector Machines (조절할 수 있는 볼록한 덮개 서포트 벡터 머신에 기반을 둔 트래픽 분류 방법)

  • Yu, Zhibin;Choi, Yong-Do;Kil, Gi-Beom;Kim, Sung-Ho
    • Journal of the Korea Society of Computer and Information
    • /
    • v.17 no.3
    • /
    • pp.67-76
    • /
    • 2012
  • Traffic classification plays an important role in traffic management. To traditional methods, P2P and encryption traffic may become a problem. Support Vector Machine (SVM) is a useful classification tool which is able to overcome the traditional bottleneck. The main disadvantage of SVM algorithms is that it's time-consuming to train large data set because of the quadratic programming (QP) problem. However, the useful support vectors are only a small part of the whole data. If we can discard the useless vectors before training, we are able to save time and keep accuracy. In this article, we discussed the feasibility to remove the useless vectors through a sequential method to accelerate training speed when dealing with large scale data.

Variation for Mental Health of Children of Marginalized Classes through Exercise Therapy using Deep Learning (딥러닝을 이용한 소외계층 아동의 스포츠 재활치료를 통한 정신 건강에 대한 변화)

  • Kim, Myung-Mi
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.15 no.4
    • /
    • pp.725-732
    • /
    • 2020
  • This paper uses variables following as : to follow me well(0-9), it takes a lot of time to make a decision (0-9), lethargy(0-9) during physical activity in the exercise learning program of the children in the marginalized class. This paper classifies 'gender', 'physical education classroom', and 'upper, middle and lower' of age, and observe changes in ego-resiliency and self-control through sports rehabilitation therapy to find out changes in mental health. To achieve this, the data acquired was merged and the characteristics of large and small numbers were removed using the Label encoder and One-hot encoding. Then, to evaluate the performance by applying each algorithm of MLP, SVM, Dicesion tree, RNN, and LSTM, the train and test data were divided by 75% and 25%, and then the algorithm was learned with train data and the accuracy of the algorithm was measured with the Test data. As a result of the measurement, LSTM was the most effective in sex, MLP and LSTM in physical education classroom, and SVM was the most effective in age.

River Flow Forecasting using Satellite-based Products and Machine Learning Technique over the Ungauged River Flow in Korean Peninsula, Imjin River: Using MODIS, ASCAT, and SDS dataset (위성 데이터 및 기계 학습 기법을 활용한 한반도 임진강 미계측 지역 유출량 예측: MODIS, ASCAT, SDS 데이터를 활용하여)

  • Choi, Min Ha;Kim, Hyung Lok;Li, Li;Jun, Kyung Soo
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2016.05a
    • /
    • pp.159-159
    • /
    • 2016
  • 북한 지역에서 시작되어 한반도의 금문댐까지 연결되는 임진강은 북한지역의 유출량 미계측으로 인해 유출량 산출에 많은 어려움이 있어왔다. 본 연구에서는 위성 데이터를 활용하여 미계측 유역의 유출량을 추정 할 수 있는 기법을 제시하였다. Satellite-derived Flow Signal (SDF)는 위성 기반 특정 지역의 유출 정보를 제공하며, JAXA의 GCOM-W1 위성에 탑재된 Advanced Microwave Scanning Radiometer 2(AMSR2) 센서에서 산출된다. 본 연구에서는 SDS 뿐 아니라 유출에 크게 관련이 있는 지표 토양수분 데이터와 식생인자를 임진강 유출 값을 예측하기 위한 입력 값으로 활용하였다. 토양수분 데이터는 Metop-A 위성에 탑재된 Advanced Scatterometer(ASCAT) 센서에서 산출되는 데이터를 활용하였으며, 식생데이터는 Aqua 위성에 탑재된 Moderate Resolution Imaging Spectroradiometer(MODIS) 센서에서 측정되는 Normalized Difference Vegetation Index(NDVI) 데이터를 활용하였다. 추가적으로 SDS, 토양수분, NDVI 데이터는 다양한 lag time으로 약 150여개의 입력데이터로 세분화되었다. 150개의 방대한 입력인자는 Partial Mutual Information(PMI) 방법을 통해 소수 중요 인자들로 간추려져 기계 학습 입력인자로 활용되었다. 기계학습에 있어서는 Support Vector Machine(SVM), Artificial Neural Network (ANN) 기법을 활용하였다. SVM, ANN을 통해 모델화된 유출데이터는 금문댐 유출데이터와 비교/분석되었다. SVM 기법 기반의 유출량은 실제 유출량과 0.73의 상관계수를 보여주었고, ANN 기법 기반의 유출량은 0.66의 상관계수를 결과를 나타내었다. 하지만 SVM 기반 유출데이터는 과소 산정 되는 경향을 보였으며, ANN 기법 기반의 유출량은 과대산정되는 결과가 산출되는 한계점이 있음을 파악할 수 있었다.

  • PDF

SVM-based Energy-Efficient scheduling on Heterogeneous Multi-Core Mobile Devices (비대칭 멀티코어 모바일 단말에서 SVM 기반 저전력 스케줄링 기법)

  • Min-Ho, Han;Young-Bae, Ko;Sung-Hwa, Lim
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.27 no.6
    • /
    • pp.69-75
    • /
    • 2022
  • We propose energy-efficient scheduling considering real-time constraints and energy efficiency in smart mobile with heterogeneous multi-core structure. Recently, high-performance applications such as VR, AR, and 3D game require real-time and high-level processings. The big.LITTLE architecture is applied to smart mobiles devices for high performance and high energy efficiency. However, there is a problem that the energy saving effect is reduced because LITTLE cores are not properly utilized. This paper proposes a heterogeneous multi-core assignment technique that improves real-time performance and high energy efficiency with big.LITTLE architecture. Our proposed method optimizes the energy consumption and the execution time by predicting the actual task execution time using SVM (Support Vector Machine). Experiments on an off-the-shelf smartphone show that the proposed method reduces energy consumption while ensuring the similar execution time to legacy schemes.

A Study on Automatic Vehicle Extraction within Drone Image Bounding Box Using Unsupervised SVM Classification Technique (무감독 SVM 분류 기법을 통한 드론 영상 경계 박스 내 차량 자동 추출 연구)

  • Junho Yeom
    • Land and Housing Review
    • /
    • v.14 no.4
    • /
    • pp.95-102
    • /
    • 2023
  • Numerous investigations have explored the integration of machine leaning algorithms with high-resolution drone image for object detection in urban settings. However, a prevalent limitation in vehicle extraction studies involves the reliance on bounding boxes rather than instance segmentation. This limitation hinders the precise determination of vehicle direction and exact boundaries. Instance segmentation, while providing detailed object boundaries, necessitates labour intensive labelling for individual objects, prompting the need for research on automating unsupervised instance segmentation in vehicle extraction. In this study, a novel approach was proposed for vehicle extraction utilizing unsupervised SVM classification applied to vehicle bounding boxes in drone images. The method aims to address the challenges associated with bounding box-based approaches and provide a more accurate representation of vehicle boundaries. The study showed promising results, demonstrating an 89% accuracy in vehicle extraction. Notably, the proposed technique proved effective even when dealing with significant variations in spectral characteristics within the vehicles. This research contributes to advancing the field by offering a viable solution for automatic and unsupervised instance segmentation in the context of vehicle extraction from image.

A Baseline Correction for Effective Analysis of Alzheimer’s Disease based on Raman Spectra from Platelet (혈소판 라만 스펙트럼의 효율적인 분석을 위한 기준선 보정 방법)

  • Park, Aa-Ron;Baek, Sung-June
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.49 no.1
    • /
    • pp.16-22
    • /
    • 2012
  • In this paper, we proposed a method of baseline correction for analysis of Raman spectra of platelets from Alzheimer's disease (AD) transgenic mice. Measured Raman spectra include the meaningful information and unnecessary noise which is composed of baseline and additive noise. The Raman spectrum is divided into the local region including several peaks and the spectrum of the region is modeled by curve fitting using Gaussian model. The additive noise is clearly removed from the process of replacing the original spectrum with the fitted model. The baseline correction after interpolating the local minima of the fitted model with linear, piecewise cubic Hermite and cubic spline algorithm. The baseline corrected models extract the feature with principal component analysis (PCA). The classification result of support vector machine (SVM) and maximum $a$ posteriori probability (MAP) using linear interpolation method showed the good performance about overall number of principal components, especially SVM gave the best performance which is about 97.3% true classification average rate in case of piecewise cubic Hermite algorithm and 5 principal components. In addition, it confirmed that the proposed baseline correction method compared with the previous research result could be effectively applied in the analysis of the Raman spectra of platelet.