Search | Korea Science

LS-SVM for large data sets

Park, Hongrak;Hwang, Hyungtae;Kim, Byungju
- Journal of the Korean Data and Information Science Society / v.27 no.2 / pp.549-557 / 2016
In this paper we propose a multiclassification method for large data sets that ensembles least squares support vector machines (LS-SVM) trained on principal components instead of the raw input vectors. For multiclassification we use the revised one-vs-all method, a voting scheme that combines several binary classifications. The revised one-vs-all method uses the hat matrix of the LS-SVM ensemble, which is obtained by ensembling LS-SVMs each trained on a random sample drawn from the whole large training data set. The leave-one-out cross validation (CV) function is used to select the optimal values of the hyper-parameters, which affect the performance of the multiclass LS-SVM ensemble. We present the generalized cross validation function to reduce the computational burden of the leave-one-out CV function. Experimental results from real data sets illustrate the performance of the proposed multiclass LS-SVM ensemble.
https://doi.org/10.7465/jkdi.2016.27.2.549
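The abstract above relies on a hat matrix and on leave-one-out and generalized cross validation functions without stating them. As a rough orientation only, the block below writes out the textbook forms for a single LS-SVM treated as a linear smoother. The symbols $K$ (kernel matrix), $\gamma$ (regularization parameter), $H$ (hat matrix) and $h_{ii}$ (its diagonal entries) are assumptions introduced here; the paper's ensemble hat matrix and its exact CV expressions may differ.

```latex
% LS-SVM linear system (function-estimation form), n training points
\begin{bmatrix} 0 & \mathbf{1}^{\top} \\ \mathbf{1} & K + \gamma^{-1} I \end{bmatrix}
\begin{bmatrix} b \\ \boldsymbol{\alpha} \end{bmatrix}
=
\begin{bmatrix} 0 \\ \mathbf{y} \end{bmatrix},
\qquad
\hat{\mathbf{y}} = K\boldsymbol{\alpha} + b\,\mathbf{1} = H\mathbf{y}

% Leave-one-out CV computed from a single fit
\mathrm{CV} = \frac{1}{n}\sum_{i=1}^{n}\left(\frac{y_i - \hat{y}_i}{1 - h_{ii}}\right)^{2}

% Generalized CV: replace each h_{ii} by the average \operatorname{tr}(H)/n
\mathrm{GCV} = \frac{\frac{1}{n}\sum_{i=1}^{n}\left(y_i - \hat{y}_i\right)^{2}}{\left(1 - \operatorname{tr}(H)/n\right)^{2}}
```

In this form GCV needs only the trace of $H$ rather than every diagonal entry, which is the usual sense in which it reduces the computational burden of leave-one-out CV.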

Confidence Interval Estimation Using SV in LS-SVM

Seok, Kyung-Ha
- Journal of the Korean Data and Information Science Society / v.14 no.3 / pp.451-459 / 2003
This paper suggests a method for estimating confidence intervals using only the support vectors (SV) of a least-squares support vector machine (LS-SVM). The method relies on the fact that the Hessian matrix computed from the SV does not differ significantly from the one computed from the full data set. Because the suggested method uses only the SV, a subset of the full data, it saves computing time and memory. A simulation study supports the proposed method.

Comparison of Partial Least Squares and Support Vector Machine for the Flash Point Prediction of Organic Compounds (유기물의 인화점 예측을 위한 부분최소자승법과 SVM의 비교)

Lee, Chang Jun;Ko, Jae Wook;Lee, Gibaek
- Korean Chemical Engineering Research / v.48 no.6 / pp.717-724 / 2010
The flash point is one of the most important physical properties used to determine the fire and explosion hazards of flammable liquids. Although experimental flash point data are needed for the design and construction of chemical plants, there is often a significant gap between the demand for such data and their availability. This study built and compared two models, partial least squares (PLS) and support vector machine (SVM), to predict the experimental flash points of 893 organic compounds from DIPPR 801. As independent variables, 65 functional groups were chosen based on the group contribution method, which assumes that each fragment of a molecule contributes a certain amount to the value of its physical property, and the logarithm of molecular weight was added. The prediction errors calculated from cross-validation were used to determine the optimal parameters of the two models. Because the SVM model has three parameters to determine, an optimization technique is required; this work adopted particle swarm optimization, a heuristic optimization method. As the selection of training data can affect prediction performance, 100 data sets of randomly selected data were generated and tested. The average absolute errors over the whole data ranged from 13.86 K to 14.55 K for PLS and from 7.44 K to 10.26 K for SVM, indicating that the predictive ability of the SVM is far superior to that of PLS.

Comparative Study of Estimation Methods of the Endpoint Temperature in Basic Oxygen Furnace Steelmaking Process with Selection of Input Parameters

Park, Tae Chang;Kim, Beom Seok;Kim, Tae Young;Jin, Il Bong;Yeo, Yeong Koo
- Korean Journal of Metals and Materials / v.56 no.11 / pp.813-821 / 2018
The basic oxygen furnace (BOF) steelmaking process is highly complicated and subject to variations in raw material composition. During the process, it is essential to maintain the carbon content and the endpoint temperature of the liquid steel at their set points. This paper presents intelligent models for estimating the endpoint temperature in the BOF steelmaking process. An artificial neural network (ANN) model and a least-squares support vector machine (LSSVM) model are proposed and their estimation performance is compared. The classical partial least-squares (PLS) method is also included in the comparison. Estimates from the ANN, LSSVM and PLS models were compared with the operation data, and the root-mean-square error (RMSE) of each model was calculated to evaluate estimation performance. The RMSE of the LSSVM model was 15.91, the best of the three. The RMSE values for the ANN and PLS models were 17.24 and 21.31, respectively. The essential input parameters used in the models can be selected by sensitivity analysis. The RMSE of each model was calculated again after a sequential input selection process removed insignificant input parameters. The RMSE of the LSSVM was then 13.21, which is better than the previous RMSE with all 16 parameters. The results show that the LSSVM model using 13 input parameters can be used to calculate the oxygen volume and coolant required to optimally adjust the steel target temperature.
https://doi.org/10.3365/KJMM.2018.56.11.813

Model-based Fault Diagnosis Applied to Vibration Data (진동데이터 적용 모델기반 이상진단)

Yang, Ji-Hyuk;Kwon, Oh-Kyu
- Journal of Institute of Control, Robotics and Systems / v.18 no.12 / pp.1090-1095 / 2012
In this paper, we propose a model-based fault diagnosis method applied to vibration data. Fault detection is performed by comparing estimated parameters with normal parameters and deciding whether the observed changes can be explained satisfactorily in terms of noise or undermodelling. The key feature of this method is that it accounts for the effects of noise and model mismatch. We also design a classifier for fault isolation by applying a multiclass SVM (Support Vector Machine) to the estimated parameters. The proposed fault detection and isolation methods are applied to engine vibration data and show good performance. The proposed fault detection method is compared with a signal-based fault detection method through a performance analysis.
https://doi.org/10.5302/J.ICROS.2012.18.12.1090

Soft Sensor Development for Predicting the Relative Humidity of a Membrane Humidifier for PEM Fuel Cells (고분자 전해질 연료전지용 막가습기의 상대습도 추정을 위한 소프트센서 개발)

Han, In Su;Shin, Hyun Khil
- Transactions of the Korean Hydrogen and New Energy Society / v.25 no.5 / pp.491-499 / 2014
It is important to accurately measure and control the relative humidity of humidified gas entering a PEM (polymer electrolyte membrane) fuel cell stack because the level of humidification strongly affects the performance and durability of the stack. Humidity measurement devices can measure the relative humidity directly, but they are costly to install and take up space in a fuel cell system. We present soft sensors for predicting the relative humidity without actual humidity measuring devices. By combining an FIR (finite impulse response) model with PLS (partial least squares) and SVM (support vector machine) regression models, DPLS (dynamic PLS) and DSVM (dynamic SVM) soft sensors were developed to estimate the relative humidity of humidified gases exiting a planar-type membrane humidifier. The DSVM soft sensor showed better prediction performance than the DPLS one because it can capture nonlinear correlations between the relative humidity and the input data of the soft sensors. Without actual humidity sensors, the soft sensors presented in this work can be used to monitor and control the humidity during operation of PEM fuel cell systems.
https://doi.org/10.7316/KHNES.2014.25.5.491
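The abstract above combines an FIR model with a static regressor. One common way to do that, assumed here rather than taken from the paper, is to feed the regressor a window of current and past input samples. The sketch below builds such lagged rows for a single scalar input; the function name `fir_features` and the parameter `n_lags` are hypothetical.

```python
def fir_features(inputs, n_lags):
    """Build FIR-style regressor rows [u(t), u(t-1), ..., u(t-n_lags)].

    `inputs` is a sequence of scalar input samples u(0), u(1), ...
    One row is produced for every t that has a full window of past samples.
    """
    if n_lags < 0:
        raise ValueError("n_lags must be non-negative")
    rows = []
    for t in range(n_lags, len(inputs)):
        # Most recent sample first, oldest sample last.
        rows.append([inputs[t - k] for k in range(n_lags + 1)])
    return rows


if __name__ == "__main__":
    # Four samples with two lags give two complete windows.
    print(fir_features([1, 2, 3, 4], 2))
```

Each row would then be paired with the measured output at time t and passed to a PLS or SVM regressor; how the authors chose the window length is not stated in the abstract.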

Support Vector Machine for Interval Regression

Hong Dug Hun;Hwang Changha
- Proceedings of the Korean Statistical Society Conference / 2004.11a / pp.67-72 / 2004
The support vector machine (SVM) has been very successful in pattern recognition and function estimation problems for crisp data. This paper proposes a new method for evaluating interval linear and nonlinear regression models that combines the possibility and necessity estimation formulation with the principle of SVM. For data sets with crisp inputs and interval outputs, possibility and necessity models have recently been used; these are based on a quadratic programming approach, which gives more diverse spread coefficients than a linear programming one. SVM also uses a quadratic programming approach, which has the further advantage in interval regression analysis of integrating both the central-tendency property of least squares and the possibilistic property of fuzzy regression. However, this approach is not computationally expensive. SVM allows us to perform interval nonlinear regression analysis by constructing an interval linear regression function in a high-dimensional feature space. In particular, SVM is a very attractive approach to modeling nonlinear interval data. The proposed algorithm is model-free in the sense that we do not have to assume the underlying model function for an interval nonlinear regression model with crisp inputs and interval outputs. Experimental results are presented that indicate the performance of this algorithm.

Quantile regression using asymmetric Laplace distribution (비대칭 라플라스 분포를 이용한 분위수 회귀)

Park, Hye-Jung
- Journal of the Korean Data and Information Science Society / v.20 no.6 / pp.1093-1101 / 2009
Quantile regression has become a widely used technique for describing the distribution of a response variable given a set of explanatory variables. This paper proposes a novel model for quantile regression using a doubly penalized kernel machine with support vector machine iteratively reweighted least squares (SVM-IRWLS). For inference about the shape of a population distribution, the widely popular regression would be inadequate if the distribution is not approximately Gaussian. We present a likelihood-based approach to the estimation of the regression quantiles that uses the asymmetric Laplace density.
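The likelihood-based approach mentioned in the abstract above rests on a standard fact: with the scale held fixed, the negative log-likelihood of the asymmetric Laplace density equals the summed check (pinball) loss up to constants, so maximizing one minimizes the other. The sketch below illustrates this for the simplest case of a constant model, not for the paper's doubly penalized kernel machine; the function names are hypothetical.

```python
def check_loss(u, tau):
    """Check (pinball) loss: rho_tau(u) = u * (tau - 1[u < 0])."""
    return u * (tau - (1.0 if u < 0 else 0.0))


def constant_quantile_fit(ys, tau):
    """Return the data value q that minimizes sum_i rho_tau(y_i - q).

    For a constant model a minimizer can always be found among the
    observed values, so a search over the data is sufficient.
    """
    return min(sorted(ys), key=lambda q: sum(check_loss(y - q, tau) for y in ys))


if __name__ == "__main__":
    data = [1, 2, 3, 4, 100]
    print(constant_quantile_fit(data, 0.5))  # median-like fit
    print(constant_quantile_fit(data, 0.9))  # upper-quantile fit
```

With tau = 0.5 the loss is half the absolute error, so the fit is the sample median and is unaffected by the outlier at 100.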

Endpoint Detection Using Hybrid Algorithm of PLS and SVM (PLS와 SVM복합 알고리즘을 이용한 식각 종료점 검출)

Lee, Yun-Keun;Han, Yi-Seul;Hong, Sang-Jeen;Han, Seung-Soo
- Journal of the Korean Institute of Electrical and Electronic Material Engineers / v.24 no.9 / pp.701-709 / 2011
In semiconductor wafer fabrication, etching is one of the most critical processes, by which a material layer is selectively removed. Because mistakes caused by over-etching are difficult to correct, it is critical that the etch be performed correctly. This paper proposes a new approach to etch endpoint detection for small open area wafers. The traditional endpoint detection technique uses a few manually selected wavelengths, which are adequate for large open areas. As integrated circuit devices continue to shrink in geometry and increase in density, detecting the endpoint for small open areas presents a serious challenge to process engineers. In this work, a high-resolution optical emission spectroscopy (OES) sensor is used to provide the sensitivity needed to detect the subtle endpoint signal. The partial least squares (PLS) method is used to analyze the OES data; it reduces the dimension of the data and increases the gap between classes. A support vector machine (SVM) is then applied to the PLS output to detect the endpoint by classifying the normal etching state and the after-endpoint state. Two data sets from OES are used to train the PLS and SVM, and the other data sets are used to test the performance of the model. The results show that the trained PLS and SVM hybrid model detects the endpoint accurately.
https://doi.org/10.4313/JKEM.2011.24.9.701

Training for Huge Data set with On Line Pruning Regression by LS-SVM

Kim, Dae-Hak;Shim, Joo-Yong;Oh, Kwang-Sik
- Proceedings of the Korean Statistical Society Conference / 2003.10a / pp.137-141 / 2003
LS-SVM (least squares support vector machine) is a widely applicable and useful machine learning technique for classification and regression analysis. LS-SVM can be a good substitute for statistical methods, but computational difficulties remain in inverting the matrix that arises from a huge data set. In the modern information society, huge data sets are easily obtained in online or batch mode. For such huge data sets, we suggest an online pruning regression method based on LS-SVM. With a relatively small number of pruned support vectors, we can achieve almost the same performance as regression with the full data set.