• Title/Summary/Keyword: least squares-support vector machine

Search Result 67, Processing Time 0.026 seconds

A comparison of ATR-FTIR and Raman spectroscopy for the non-destructive examination of terpenoids in medicinal plants essential oils

  • Rahul Joshi;Sushma Kholiya;Himanshu Pandey;Ritu Joshi;Omia Emmanuel;Ameeta Tewari;Taehyun Kim;Byoung-Kwan Cho
    • Korean Journal of Agricultural Science
    • /
    • v.50 no.4
    • /
    • pp.675-696
    • /
    • 2023
  • Terpenoids, also referred to as terpenes, are a large family of naturally occurring chemical compounds present in the essential oils extracted from medicinal plants. In this study, a nondestructive methodology was created by combining ATR-FT-IR (attenuated total reflectance-Fourier transform infrared), and Raman spectroscopy for the terpenoids assessment in medicinal plants essential oils from ten different geographical locations. Partial least squares regression (PLSR) and support vector regression (SVR) were used as machine learning methodologies. However, a deep learning based model called as one-dimensional convolutional neural network (1D CNN) were also developed for models comparison. With a correlation coefficient (R2) of 0.999 and a lowest RMSEP (root mean squared error of prediction) of 0.006% for the prediction datasets, the SVR model created for FT-IR spectral data outperformed both the PLSR and 1 D CNN models. On the other hand, for the classification of essential oils derived from plants collected from various geographical regions, the created SVM (support vector machine) classification model for Raman spectroscopic data obtained an overall classification accuracy of 0.997% which was superior than the FT-IR (0.986%) data. Based on the results we propose that FT-IR spectroscopy, when coupled with the SVR model, has a significant potential for the non-destructive identification of terpenoids in essential oils compared with destructive chemical analysis methods.

Evaluation of soil-concrete interface shear strength based on LS-SVM

  • Zhang, Chunshun;Ji, Jian;Gui, Yilin;Kodikara, Jayantha;Yang, Sheng-Qi;He, Lei
    • Geomechanics and Engineering
    • /
    • v.11 no.3
    • /
    • pp.361-372
    • /
    • 2016
  • The soil-concrete interface shear strength, although has been extensively studied, is still difficult to predict as a result of the dependence on many factors such as normal stresses, surface roughness, particle sizes, moisture contents, dilation angles of soils, etc. In this study, a well-known rigorous statistical learning approach, namely the least squares support vector machine (LS-SVM) realized in a ubiquitous spreadsheet platform is firstly used in estimating the soil-structure interface shear strength. Instead of studying the complicated mechanism, LS-SVM enables to explore the possible link between the fundamental factors and the interface shear strengths, via a sophisticated statistic approach. As a preliminary investigation, the authors study the expansive soils that are found extensively in most countries. To reduce the complexity, three major influential factors, e.g., initial moisture contents, initial dry densities and normal stresses of soils are taken into account in developing the LS-SVM models for the soil-concrete interface shear strengths. The predicted results by LS-SVM show reasonably good agreement with experimental data from direct shear tests.

An Optimization Algorithm with Novel Flexible Grid: Applications to Parameter Decision in LS-SVM

  • Gao, Weishang;Shao, Cheng;Gao, Qin
    • Journal of Computing Science and Engineering
    • /
    • v.9 no.2
    • /
    • pp.39-50
    • /
    • 2015
  • Genetic algorithm (GA) and particle swarm optimization (PSO) are two excellent approaches to multimodal optimization problems. However, slow convergence or premature convergence readily occurs because of inappropriate and inflexible evolution. In this paper, a novel optimization algorithm with a flexible grid optimization (FGO) is suggested to provide adaptive trade-off between exploration and exploitation according to the specific objective function. Meanwhile, a uniform agents array with adaptive scale is distributed on the gird to speed up the calculation. In addition, a dominance centroid and a fitness center are proposed to efficiently determine the potential guides when the population size varies dynamically. Two types of subregion division strategies are designed to enhance evolutionary diversity and convergence, respectively. By examining the performance on four benchmark functions, FGO is found to be competitive with or even superior to several other popular algorithms in terms of both effectiveness and efficiency, tending to reach the global optimum earlier. Moreover, FGO is evaluated by applying it to a parameter decision in a least squares support vector machine (LS-SVM) to verify its practical competence.

A study on semi-supervised kernel ridge regression estimation (준지도 커널능형회귀모형에 관한 연구)

  • Seok, Kyungha
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.2
    • /
    • pp.341-353
    • /
    • 2013
  • In many practical machine learning and data mining applications, unlabeled data are inexpensive and easy to obtain. Semi-supervised learning try to use such data to improve prediction performance. In this paper, a semi-supervised regression method, semi-supervised kernel ridge regression estimation, is proposed on the basis of kernel ridge regression model. The proposed method does not require a pilot estimation of the label of the unlabeled data. This means that the proposed method has good advantages including less number of parameters, easy computing and good generalization ability. Experiments show that the proposed method can effectively utilize unlabeled data to improve regression estimation.

A Method of Feature Extraction on Motor Imagery EEG Using FLD and PCA Based on Sub-Band CSP (서브 밴드 CSP기반 FLD 및 PCA를 이용한 동작 상상 EEG 특징 추출 방법 연구)

  • Park, Sang-Hoon;Lee, Sang-Goog
    • Journal of KIISE
    • /
    • v.42 no.12
    • /
    • pp.1535-1543
    • /
    • 2015
  • The brain-computer interface obtains a user's electroencephalogram as a replacement communication unit for the disabled such that the user is able to control machines by simply thinking instead of using hands or feet. In this paper, we propose a feature extraction method based on a non-selected filter by SBCSP to classify motor imagery EEG. First, we divide frequencies (4~40 Hz) into 4-Hz units and apply CSP to each Unit. Second, we obtain the FLD score vector by combining FLD results. Finally, the FLD score vector is projected onto the optimal plane for classification using PCA. We use BCI Competition III dataset IVa, and Extracted features are used as input for LS-SVM. The classification accuracy of the proposed method was evaluated using $10{\times}10$ fold cross-validation. For subjects 'aa', 'al', 'av', 'aw', and 'ay', results were $85.29{\pm}0.93%$, $95.43{\pm}0.57%$, $72.57{\pm}2.37%$, $91.82{\pm}1.38%$, and $93.50{\pm}0.69%$, respectively.

Object Tracking Based on Exactly Reweighted Online Total-Error-Rate Minimization (정확히 재가중되는 온라인 전체 에러율 최소화 기반의 객체 추적)

  • JANG, Se-In;PARK, Choong-Shik
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.4
    • /
    • pp.53-65
    • /
    • 2019
  • Object tracking is one of important steps to achieve video-based surveillance systems. Object tracking is considered as an essential task similar to object detection and recognition. In order to perform object tracking, various machine learning methods (e.g., least-squares, perceptron and support vector machine) can be applied for different designs of tracking systems. In general, generative methods (e.g., principal component analysis) were utilized due to its simplicity and effectiveness. However, the generative methods were only focused on modeling the target object. Due to this limitation, discriminative methods (e.g., binary classification) were adopted to distinguish the target object and the background. Among the machine learning methods for binary classification, total error rate minimization can be used as one of successful machine learning methods for binary classification. The total error rate minimization can achieve a global minimum due to a quadratic approximation to a step function while other methods (e.g., support vector machine) seek local minima using nonlinear functions (e.g., hinge loss function). Due to this quadratic approximation, the total error rate minimization could obtain appropriate properties in solving optimization problems for binary classification. However, this total error rate minimization was based on a batch mode setting. The batch mode setting can be limited to several applications under offline learning. Due to limited computing resources, offline learning could not handle large scale data sets. Compared to offline learning, online learning can update its solution without storing all training samples in learning process. Due to increment of large scale data sets, online learning becomes one of essential properties for various applications. Since object tracking needs to handle data samples in real time, online learning based total error rate minimization methods are necessary to efficiently address object tracking problems. Due to the need of the online learning, an online learning based total error rate minimization method was developed. However, an approximately reweighted technique was developed. Although the approximation technique is utilized, this online version of the total error rate minimization could achieve good performances in biometric applications. However, this method is assumed that the total error rate minimization can be asymptotically achieved when only the number of training samples is infinite. Although there is the assumption to achieve the total error rate minimization, the approximation issue can continuously accumulate learning errors according to increment of training samples. Due to this reason, the approximated online learning solution can then lead a wrong solution. The wrong solution can make significant errors when it is applied to surveillance systems. In this paper, we propose an exactly reweighted technique to recursively update the solution of the total error rate minimization in online learning manner. Compared to the approximately reweighted online total error rate minimization, an exactly reweighted online total error rate minimization is achieved. The proposed exact online learning method based on the total error rate minimization is then applied to object tracking problems. In our object tracking system, particle filtering is adopted. In particle filtering, our observation model is consisted of both generative and discriminative methods to leverage the advantages between generative and discriminative properties. In our experiments, our proposed object tracking system achieves promising performances on 8 public video sequences over competing object tracking systems. The paired t-test is also reported to evaluate its quality of the results. Our proposed online learning method can be extended under the deep learning architecture which can cover the shallow and deep networks. Moreover, online learning methods, that need the exact reweighting process, can use our proposed reweighting technique. In addition to object tracking, the proposed online learning method can be easily applied to object detection and recognition. Therefore, our proposed methods can contribute to online learning community and object tracking, detection and recognition communities.

Application of Terahertz Spectroscopy and Imaging in the Diagnosis of Prostate Cancer

  • Zhang, Ping;Zhong, Shuncong;Zhang, Junxi;Ding, Jian;Liu, Zhenxiang;Huang, Yi;Zhou, Ning;Nsengiyumva, Walter;Zhang, Tianfu
    • Current Optics and Photonics
    • /
    • v.4 no.1
    • /
    • pp.31-43
    • /
    • 2020
  • The feasibility of the application of terahertz electromagnetic waves in the diagnosis of prostate cancer was examined. Four samples of incomplete cancerous prostatic paraffin-embedded tissues were examined using terahertz spectral imaging (TPI) system and the results obtained by comparing the absorption coefficient and refractive index of prostate tumor, normal prostate tissue and smooth muscle from one of the paraffin tissue masses examined were reported. Three hundred and sixty cases of absorption coefficients from one of the paraffin tissues examined were used as raw data to classify these three tissues using the Principal Component Analysis (PCA) and Least Squares Support Vector Machine (LS-SVM). An excellent classification with an accuracy of 92.22% in the prediction set was achieved. Using the distribution information of THz reflection signal intensity from sample surface and absorption coefficient of the sample, an attempt was made to use the TPI system to identify the boundaries of the different tissues involved (prostate tumors, normal and smooth muscles). The location of three identified regions in the terahertz images (frequency domain slice absorption coefficient imaging, 1.2 THz) were compared with those obtained from the histopathologic examination. The tissue tumor region had a distinctively visible color and could well be distinguished from other tissue regions in terahertz images. Results indicate that a THz spectroscopy imaging system can be efficiently used in conjunction with the proposed advanced computer-based mathematical analysis method to identify tumor regions in the paraffin tissue mass of prostate cancer.