• Title/Summary/Keyword: support vector machine(SVM)

The Hardware Implementation of Speaker Verification System Using Support Vector Machine (SVM을 이용한 화자인증 시스템 하드웨어 구현)

  • Hwang, Byung-Hee;Choi, Woo-Yong;Moon, Dae-Sung;Pan, Sung-Bum;Chung, Yong-Wha;Chung, Sang-Hwa
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2003.05c
    • /
    • pp.1933-1936
    • /
    • 2003
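  • Interest in speaker verification, which authenticates users by their voice, has been growing recently, and among the various speaker verification methods, those applying SVM outperform other algorithms. However, SVM-based speaker verification is difficult to run in real time on portable devices such as mobile phones because of its computational complexity. This paper proposes a hardware architecture for processing an SVM-based speaker verification algorithm in real time, analyzes experimental results obtained after modeling it in VHDL, and describes the overall system configuration.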

SOM-based Combination Method of OVA SVMs for Effective Fingerprint Classification (효과적인 지문분류를 위한 SOM기반 OVA SVM의 결합 기법)

  • Hong Jin-Hyuk;Min Jun-Ki;Cho Sung-Bae
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2005.07b
    • /
    • pp.622-624
    • /
    • 2005
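  • In large-scale fingerprint recognition systems, fingerprint classification is an essential step for reducing the number of fingerprints that must be compared. Fingerprint classification techniques using the support vector machine (SVM), a binary classifier, have recently been studied widely. This paper proposes a fingerprint classification technique that combines OVA (One-Vs-All) SVMs using a self-organizing map (SOM) so that it suits multi-class fingerprint classification. The SOM is used to generate a combination map for dynamically combining the OVA SVMs, which improves classification performance. The proposed method was applied to the FingerCode database built by Jain on NIST-4, a representative fingerprint database for classification, and achieved a classification rate of $90.5\%$ at a rejection rate of $1.8\%$, outperforming the conventional combination methods of winner-takes-all and majority vote.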
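
The winner-takes-all baseline mentioned in this abstract can be sketched in a few lines. This is only an illustration, not the paper's SOM-based method: the decision values are made up, and the `reject_margin` threshold is a hypothetical way to model rejection that the abstract does not specify.

```python
def winner_takes_all(scores, reject_margin=0.0):
    """Pick the class whose one-vs-all SVM gives the largest decision value.

    scores: dict mapping class label -> decision value of that class's OVA SVM.
    reject_margin: hypothetical threshold (an assumption); if the two largest
    values are closer than this, the sample is rejected and None is returned.
    """
    ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
    (best_label, best), (_, second) = ranked[0], ranked[1]
    if best - second < reject_margin:
        return None  # too ambiguous to classify
    return best_label


# Made-up decision values for the five NIST-4 fingerprint classes.
scores = {
    "arch": -1.2,
    "tented_arch": -0.4,
    "left_loop": 0.9,
    "right_loop": 0.7,
    "whorl": -2.0,
}

print(winner_takes_all(scores))                     # left_loop
print(winner_takes_all(scores, reject_margin=0.5))  # None
```

Raising the margin trades a higher rejection rate for fewer ambiguous decisions, which is the kind of trade-off behind reporting a classification rate at a given rejection rate.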

Evolutionary Nonlinear Compensation and Support Vector Machine Based Prediction of Windstorm Advisory (진화적 비선형 보정 및 SVM 분류에 의한 강풍 특보 예측 기법)

  • Seo, Kisung
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.66 no.12
    • /
    • pp.1799-1803
    • /
    • 2017
  • This paper introduces methods for predicting windstorm advisories using GP nonlinear compensation and SVM. The existing special report prediction is not specialized for strong winds such as windstorms, because it is based on predicted wind speeds across a wide range from low to high. To improve the prediction of strong wind reports, a method that can efficiently classify the boundaries of strong wind is necessary. First, an evolutionary nonlinear regression based compensation technique is applied to UM data to obtain more accurate wind speed predictions, and the windstorm advisory is determined from the predicted wind speed. Second, SVM is applied to classify directly from the UM predictors and the windstorm advisory data. The two methods are compared on windstorm data for Jeju Island, South Korea. Data from 2007-2009 and 2011 are used for training, and data from 2012 are used for testing.

Negative example generation methods of SVM for predicting protein-protein interactions (단백질 상호 작용 예측을 위한 SVM의 부정예제 생성방법론)

  • 김철환;정유진
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2004.04b
    • /
    • pp.265-267
    • /
    • 2004
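  • Proteins, which are produced from the DNA that stores the basic information of living organisms, play important functional roles in life phenomena, so a wide range of protein-related research is under way. In this paper, a Support Vector Machine (SVM), a statistical model, is used to build a system for predicting protein-protein interactions. The SVM system takes interacting data (positive examples) and non-interacting data (negative examples) as input for model building and testing. Interacting data can be obtained from the interaction list in DIP, but non-interacting data do not currently exist, so a method for generating them is needed. This paper presents three methods (shuffling, non-interaction list, and non-interaction list + shuffling, which complements the first two) and proposes a negative example generation method that surpasses previous experimental results.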

Initial Analysis of Positive/Negative Opinion Classification of Twitter Data Using Naïve Bayes and SVM (Naïve Bayes와 SVM을 이용한 트위터 데이터의 긍정/부정 의견 자동분류 결과 분석)

  • Cho, Heeryon;Kim, Songkuk
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2012.04a
    • /
    • pp.406-409
    • /
    • 2012
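  • Twitter data containing positive (supportive) and negative (critical) opinions on the 'Naggomsu (나꼼수) bikini protest' were automatically classified with Naïve Bayes (NB) and Support Vector Machine (SVM), focusing on word occurrence. NB achieved 75.98%, slightly better than SVM at 73.65%. Based on this experiment, we examine the considerations involved in putting a machine-learning-based automatic public opinion classification system into practical use.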

Cyberbullying Detection by Sentiment Analysis of Tweets' Contents Written in Arabic in Saudi Arabia Society

  • Almutairi, Amjad Rasmi;Al-Hagery, Muhammad Abdullah
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.3
    • /
    • pp.112-119
    • /
    • 2021
  • Social media has become a global means of communication in people's lives. Many people use Twitter to communicate, and its inappropriate use has negative effects on people's lives. One of the most common misuses of Twitter is cyberbullying. Resources for dialectal Arabic are scarce, yet most cyberbullying is written in dialectal Arabic. For this reason, the ultimate goal of this study is to detect and classify cyberbullying on Twitter in the Arabic context in Saudi Arabia. To help detect and classify tweets, Pointwise Mutual Information (PMI), used to generate a lexicon, and Support Vector Machine (SVM) algorithms are applied. Both methods are evaluated in terms of the F1-score. The F1-score after applying PMI is 50%, while after applying SVM to the resampled data it is 82%. The analysis of the results shows that the SVM algorithm performs better.
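
Since both methods here are compared by F1-score, a minimal sketch of how that metric is computed may help. The labels below are invented toy data, not results from the study; `1` is assumed to mark a cyberbullying tweet.

```python
def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the `positive` class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy labels: 3 true positives, 1 false positive, 1 false negative.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 0, 1, 0, 1, 0, 0, 1]
print(f1_score(y_true, y_pred))  # 0.75
```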

Neuronal Spike Train Decoding Methods for the Brain-Machine Interface Using Nonlinear Mapping (비선형매핑 기반 뇌-기계 인터페이스를 위한 신경신호 spike train 디코딩 방법)

  • Kim, Kyunn-Hwan;Kim, Sung-Shin;Kim, Sung-June
    • The Transactions of the Korean Institute of Electrical Engineers D
    • /
    • v.54 no.7
    • /
    • pp.468-474
    • /
    • 2005
  • A brain-machine interface (BMI) based on neuronal spike trains is regarded as one of the most promising means of restoring basic body functions in severely paralyzed patients. The spike train decoding algorithm, which extracts the underlying information from neuronal signals, is essential for the BMI. Previous studies report that a linear filter is effective for this purpose and that there is no noteworthy gain from nonlinear mapping algorithms, even though the neuronal encoding process is obviously nonlinear. We designed several decoding algorithms based on the linear filter, as well as two nonlinear mapping algorithms using a multilayer perceptron (MLP) and support vector machine regression (SVR), and show that the nonlinear algorithms are superior in general. The MLP often showed unsatisfactory performance, especially when it was carelessly trained. The nonlinear SVR showed the highest performance, which may be due to the superiority of the SVR in training and generalization. The advantage of the nonlinear algorithms was more pronounced when there were false-positive or false-negative errors in the spike trains.

Runoff Prediction from Machine Learning Models Coupled with Empirical Mode Decomposition: A Case Study of the Grand River Basin in Canada

  • Parisouj, Peiman;Jun, Changhyun;Nezhad, Somayeh Moghimi;Narimani, Roya
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2022.05a
    • /
    • pp.136-136
    • /
    • 2022
  • This study investigates the possibility of coupling empirical mode decomposition (EMD) with machine learning (ML) models for runoff prediction. Support vector regression (SVR) and a convolutional neural network (CNN) were considered as the ML algorithms. Precipitation (P), minimum temperature (Tmin), maximum temperature (Tmax), and their intrinsic mode function (IMF) values were used as input variables at a monthly scale from Jan. 1973 to Dec. 2020 in the Grand River basin, Canada. The support vector machine-recursive feature elimination (SVM-RFE) technique was applied to find the best combination of predictors among the input variables. The results show that the proposed method outperformed the individual SVR and CNN models during the training and testing periods in the study area. According to the correlation coefficient (R), the EMD-SVR model outperformed the EMD-CNN model in both training and testing, even though the CNN performed better than the SVR before the IMF values were used. The EMD-SVR model showed a larger improvement in R (38.7%) than the EMD-CNN model (7.1%). The coupled EMD-SVR and EMD-CNN models achieved much higher accuracy in runoff prediction with respect to the evaluation indicators considered, including root mean square error (RMSE) and R.
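
The two evaluation indicators named in this abstract, RMSE and R, measure different things, and a small sketch can make the difference concrete. The runoff values below are invented for illustration, and R is assumed here to be the Pearson correlation coefficient.

```python
import math


def rmse(observed, predicted):
    """Root mean square error between two equal-length sequences."""
    n = len(observed)
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)


def pearson_r(observed, predicted):
    """Pearson correlation coefficient (assumed to be the abstract's R)."""
    n = len(observed)
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov / math.sqrt(var_o * var_p)


# Invented monthly runoff values: the prediction is biased by a constant +1.
observed = [1.0, 2.0, 3.0, 4.0]
predicted = [2.0, 3.0, 4.0, 5.0]

print(rmse(observed, predicted))       # 1.0
print(pearson_r(observed, predicted))  # 1.0 up to floating-point rounding
```

A constant bias leaves R at 1 while RMSE is nonzero, which is one reason to report both indicators rather than either alone.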

Real-time Recognition of Car Licence Plate on a Moving Car (이동 차량에서의 실시간 자동차 번호판 인식)

  • 박창석;김병만;서병훈;김준우;이광호
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.9 no.2
    • /
    • pp.32-43
    • /
    • 2004
  • This paper proposes a system that can effectively recognize plate images captured by a camera mounted on a moving car. To extract car licence plates from moving vehicles, multiple candidate regions are maintained based on the strong vertical edges found in the licence plate region. One candidate region is then selected based on the ratio of background to characters. We also compare the recognition performance of support vector machines and modular neural networks. The experimental results lead to the conclusion that the former is superior to the latter. For a better recognition rate, a simple hybrid method is suggested and evaluated, in which the output of the modular neural network is used as the input of the support vector machine. As expected, the hybrid method shows the best result among the three methods.

Ensemble Learning with Support Vector Machines for Bond Rating (회사채 신용등급 예측을 위한 SVM 앙상블학습)

  • Kim, Myoung-Jong
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.2
    • /
    • pp.29-45
    • /
    • 2012
  • Bond rating is regarded as an important event for measuring the financial risk of companies and for determining the investment returns of investors. As a result, predicting companies' credit ratings with statistical and machine learning techniques has been a popular research topic. Statistical techniques, including multiple regression, multiple discriminant analysis (MDA), logistic models (LOGIT), and probit analysis, have traditionally been used in bond rating. However, one major drawback is that they rest on strict assumptions, including linearity, normality, independence among predictor variables, and pre-existing functional forms relating the criterion variables and the predictor variables. These strict assumptions have limited the application of traditional statistics to the real world. Machine learning techniques used in bond rating prediction models include decision trees (DT), neural networks (NN), and Support Vector Machine (SVM). In particular, SVM is recognized as a new and promising classification and regression analysis method. SVM learns a separating hyperplane that maximizes the margin between two categories. SVM is simple enough to be analyzed mathematically, and leads to high performance in practical applications. SVM implements the structural risk minimization principle and seeks to minimize an upper bound of the generalization error. In addition, the solution of SVM may be a global optimum, and thus overfitting is unlikely to occur with SVM. Furthermore, SVM does not require many data samples for training, since it builds prediction models using only some representative samples near the boundaries, called support vectors. A number of experimental studies have indicated that SVM has been successfully applied in a variety of pattern recognition fields. However, there are three major drawbacks that can potentially degrade SVM's performance.
First, SVM was originally proposed for solving binary-class classification problems. Methods for combining SVMs for multi-class classification, such as One-Against-One and One-Against-All, have been proposed, but they do not improve performance in multi-class classification problems as much as SVM does for binary-class classification. Second, approximation algorithms (e.g., decomposition methods and the sequential minimal optimization algorithm) can be used to reduce computation time in multi-class computation, but they can deteriorate classification performance. Third, a difficulty in multi-class prediction problems is the data imbalance problem, which occurs when the number of instances in one class greatly outnumbers the number of instances in another class. Such data sets often cause a default classifier to be built because of a skewed boundary, which reduces the classification accuracy of the classifier. SVM ensemble learning is one of the machine learning methods for coping with the above drawbacks. Ensemble learning is a method for improving the performance of classification and prediction algorithms. AdaBoost is one of the most widely used ensemble learning techniques. It constructs a composite classifier by sequentially training classifiers while increasing the weight on misclassified observations through the iterations. Observations that are incorrectly predicted by previous classifiers are chosen more often than examples that are correctly predicted. Thus, boosting attempts to produce new classifiers that are better able to predict examples for which the current ensemble's performance is poor. In this way, it can reinforce the training of the misclassified observations of the minority class. This paper proposes a multiclass Geometric Mean-based Boosting (MGM-Boost) to resolve the multiclass prediction problem.
Since MGM-Boost introduces the notion of the geometric mean into AdaBoost, its learning process can take into account the geometric mean-based accuracy and errors across multiple classes. This study applies MGM-Boost to a real-world bond rating case for Korean companies to examine its feasibility. 10-fold cross validation is performed three times with different random seeds to ensure that the comparison among the three classifiers does not happen by chance. For each 10-fold cross validation, the entire data set is first partitioned into ten equal-sized sets, and then each set is in turn used as the test set while the classifier trains on the other nine sets. That is, cross-validated folds have been tested independently of each algorithm. Through these steps, we obtained results for the classifiers on each of the 30 experiments. In the comparison of arithmetic mean-based prediction accuracy between individual classifiers, MGM-Boost (52.95%) shows higher prediction accuracy than both AdaBoost (51.69%) and SVM (49.47%). MGM-Boost (28.12%) also shows higher prediction accuracy than AdaBoost (24.65%) and SVM (15.42%) in terms of geometric mean-based prediction accuracy. A t-test is used to examine whether the performance of each classifier over the 30 folds is significantly different. The results indicate that the performance of MGM-Boost is significantly different from that of the AdaBoost and SVM classifiers at the 1% level. These results mean that MGM-Boost can provide robust and stable solutions to multi-class problems such as bond rating.
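
The gap between the arithmetic and geometric mean-based accuracies reported above is easier to see with a small example. The sketch below assumes that geometric mean-based accuracy means the geometric mean of per-class recalls, which is a common definition but is not spelled out in the abstract; the labels are invented and are not the paper's data.

```python
import math
from collections import Counter


def per_class_recall(y_true, y_pred):
    """Fraction of each class's instances that were predicted correctly."""
    total = Counter(y_true)
    correct = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {label: correct[label] / total[label] for label in total}


def arithmetic_accuracy(y_true, y_pred):
    """Plain accuracy: share of all instances predicted correctly."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def geometric_mean_accuracy(y_true, y_pred):
    """Geometric mean of per-class recalls (assumed definition)."""
    recalls = list(per_class_recall(y_true, y_pred).values())
    return math.prod(recalls) ** (1 / len(recalls))


# Imbalanced toy data: 8 instances of class "A", 2 of class "B".
y_true = ["A"] * 8 + ["B"] * 2
always_a = ["A"] * 10                          # default classifier
balanced = ["A"] * 6 + ["B"] * 2 + ["B", "A"]  # trades some "A" for "B"

print(arithmetic_accuracy(y_true, always_a))      # 0.8
print(geometric_mean_accuracy(y_true, always_a))  # 0.0
print(arithmetic_accuracy(y_true, balanced))      # 0.7
print(geometric_mean_accuracy(y_true, balanced))  # about 0.612
```

A classifier that always predicts the majority class scores well on plain accuracy but zero on the geometric mean, which is why a geometric mean-based criterion penalizes the skewed boundaries described above.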