• Title/Summary/Keyword: Selection of input parameter

Search Result 68, Processing Time 0.025 seconds

Comparing Classification Accuracy of Ensemble and Clustering Algorithms Based on Taguchi Design (다구찌 디자인을 이용한 앙상블 및 군집분석 분류 성능 비교)

  • Shin, Hyung-Won;Sohn, So-Young
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.27 no.1
    • /
    • pp.47-53
    • /
    • 2001
  • In this paper, we compare the classification performances of both ensemble and clustering algorithms (Data Bagging, Variable Selection Bagging, Parameter Combining, Clustering) to logistic regression in consideration of various characteristics of input data. Four factors used to simulate the logistic model are (1) correlation among input variables (2) variance of observation (3) training data size and (4) input-output function. In view of the unknown relationship between input and output function, we use a Taguchi design to improve the practicality of our study results by letting it as a noise factor. Experimental study results indicate the following: When the level of the variance is medium, Bagging & Parameter Combining performs worse than Logistic Regression, Variable Selection Bagging and Clustering. However, classification performances of Logistic Regression, Variable Selection Bagging, Bagging and Clustering are not significantly different when the variance of input data is either small or large. When there is strong correlation in input variables, Variable Selection Bagging outperforms both Logistic Regression and Parameter combining. In general, Parameter Combining algorithm appears to be the worst at our disappointment.

  • PDF

Survey on Nucleotide Encoding Techniques and SVM Kernel Design for Human Splice Site Prediction

  • Bari, A.T.M. Golam;Reaz, Mst. Rokeya;Choi, Ho-Jin;Jeong, Byeong-Soo
    • Interdisciplinary Bio Central
    • /
    • v.4 no.4
    • /
    • pp.14.1-14.6
    • /
    • 2012
  • Splice site prediction in DNA sequence is a basic search problem for finding exon/intron and intron/exon boundaries. Removing introns and then joining the exons together forms the mRNA sequence. These sequences are the input of the translation process. It is a necessary step in the central dogma of molecular biology. The main task of splice site prediction is to find out the exact GT and AG ended sequences. Then it identifies the true and false GT and AG ended sequences among those candidate sequences. In this paper, we survey research works on splice site prediction based on support vector machine (SVM). The basic difference between these research works is nucleotide encoding technique and SVM kernel selection. Some methods encode the DNA sequence in a sparse way whereas others encode in a probabilistic manner. The encoded sequences serve as input of SVM. The task of SVM is to classify them using its learning model. The accuracy of classification largely depends on the proper kernel selection for sequence data as well as a selection of kernel parameter. We observe each encoding technique and classify them according to their similarity. Then we discuss about kernel and their parameter selection. Our survey paper provides a basic understanding of encoding approaches and proper kernel selection of SVM for splice site prediction.

The Development of the Web Based Cutting Parameter Selection System Using Group Technology (GT를 이용한 Web 기반 절삭변수 검색시스템의 개발)

  • Lee, Sung-Youl;Kwak, Kyu-Sup
    • IE interfaces
    • /
    • v.15 no.3
    • /
    • pp.308-315
    • /
    • 2002
  • This study presents the web based cutting parameter selection system using Group Technology (GT). The GT is basically applied to classify and code the work material and cutting process which are main factors to affect cutting parameter selection. The proposed system has been designed to electronically select proper cutting conditions based on the stored GT database. The existing approaches used in most small and medium sized companies are basically to use manufacturing engineer's experience or to find the recommended values from the manufacturing engineers handbook. These processes are often time consuming and inconsistent, especially when a new engineer is involved. Consequently, the proposed system could automatically and consistently generate the proper cutting conditions (feed, depth of cut, and cutting speed) as soon as relatively simple data input is given thanks to the classified GT database.

Parameter Estimation of Three-Phase Induction Motor by Using Genetic Algorithm

  • Jangjit, Seesak;Laohachai, Panthep
    • Journal of Electrical Engineering and Technology
    • /
    • v.4 no.3
    • /
    • pp.360-364
    • /
    • 2009
  • This paper suggests the techniques in determining the values of the steady-state equivalent circuit parameters of a three-phase induction machine using genetic algorithm. The parameter estimation procedure is based on the steady-state phase current versus slip and input power versus slip characteristics. The propose estimation algorithm is of non-linear kind based on selection in genetic algorithm. The machine parameters are obtained as the solution of a minimization of objective function by genetic algorithm. Simulation shows good performance of the propose procedures.

Analysis on prediction models of TBM performance: A review (TBM 굴진성능 예측모델 분석: 리뷰)

  • Lee, Hang-Lo;Song, Ki-Il;Cho, Gye-Chun
    • Journal of Korean Tunnelling and Underground Space Association
    • /
    • v.18 no.2
    • /
    • pp.245-256
    • /
    • 2016
  • Prediction of TBM performance is very important for machine selection, and for reliable estimation of construction cost and period. The purpose of this research is to analyze the evaluation process of various prediction models for TBM performance and applied methodology. Based on the solid literature review since 2000, a classification system of TBM performance prediction model is proposed in this study. Classification system suggested in this study can be divided into two stages: selection of input parameter and application of prediction techniques. We also analyzed input and output parameters for prediction model and frequency of use. Lastly, the future research and development trend of TBM performance prediction is suggested.

Comparison Study for Data Fusion and Clustering Classification Performances (다구찌 디자인을 이용한 데이터 퓨전 및 군집분석 분류 성능 비교)

  • 신형원;손소영
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 2000.04a
    • /
    • pp.601-604
    • /
    • 2000
  • In this paper, we compare the classification performance of both data fusion and clustering algorithms (Data Bagging, Variable Selection Bagging, Parameter Combining, Clustering) to logistic regression in consideration of various characteristics of input data. Four factors used to simulate the logistic model are (1) correlation among input variables (2) variance of observation (3) training data size and (4) input-output function. Since the relationship between input & output is not typically known, we use Taguchi design to improve the practicality of our study results by letting it as a noise factor. Experimental study results indicate the following: Clustering based logistic regression turns out to provide the highest classification accuracy when input variables are weakly correlated and the variance of data is high. When there is high correlation among input variables, variable bagging performs better than logistic regression. When there is strong correlation among input variables and high variance between observations, bagging appears to be marginally better than logistic regression but was not significant.

  • PDF

Unbalance Control Strategy of Boost Type Three-Phase to Single-Phase Matrix Converters Based on Lyapunov Function

  • Xu, Yu-xiang;Ge, Hong-juan;Guo, Hai
    • Journal of Power Electronics
    • /
    • v.19 no.1
    • /
    • pp.89-98
    • /
    • 2019
  • This paper analyzes the input side performance of a conventional three-phase to single-phase matrix converter (3-1MC). It also presents the input-side waveform quality under this topology. The suppression of low-frequency input current harmonics is studied using the 3-1MC plus capacitance compensation unit. The constraint between the modulation function of the output and compensation sides is analyzed, and the relations among the voltage utilization ratio and the output compensation capacitance, filter capacitors and other system parameters are deduced. For a 3-1MC without large-capacity energy storage, the system performance is susceptible to input voltage imbalance. This paper decouples the inner current of the 3-1MC using a Lyapunov function in the input positive and negative sequence bi-coordinate axes. Meanwhile, the outer loop adopts a voltage-weighted synthesis of the output and compensation sides as a cascade of control objects. Experiments show that this strategy suppresses the low-frequency input current harmonics caused by input voltage imbalance, and ensures that the system maintains good static and dynamic performances under input-unbalanced conditions. At the same time, the parameter selection and debugging methods are simple.

A Hybrid PSO-BPSO Based Kernel Extreme Learning Machine Model for Intrusion Detection

  • Shen, Yanping;Zheng, Kangfeng;Wu, Chunhua
    • Journal of Information Processing Systems
    • /
    • v.18 no.1
    • /
    • pp.146-158
    • /
    • 2022
  • With the success of the digital economy and the rapid development of its technology, network security has received increasing attention. Intrusion detection technology has always been a focus and hotspot of research. A hybrid model that combines particle swarm optimization (PSO) and kernel extreme learning machine (KELM) is presented in this work. Continuous-valued PSO and binary PSO (BPSO) are adopted together to determine the parameter combination and the feature subset. A fitness function based on the detection rate and the number of selected features is proposed. The results show that the method can simultaneously determine the parameter values and select features. Furthermore, competitive or better accuracy can be obtained using approximately one quarter of the raw input features. Experiments proved that our method is slightly better than the genetic algorithm-based KELM model.

Electronic Catalogue Based Cutting Parameter Selection (전자 카탈로그식 절삭변수 선정의 자동화)

  • 이성열
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.6 no.4
    • /
    • pp.1-5
    • /
    • 2001
  • This study presents an electronic catalogue based cutting parameter selection system using MS Access software. The proposed system has been designed to electronically select proper cutting conditions based on the stored data base. The existing approaches used in most small and medium sized companies are basically to use manufacturing engineer's experience or to find the recommended values from the manufacturing engineer handbook. These processes are often time consuming and inconsistent, especially when a new engineer is involved. Therefore, this study proposes a simple, yet quick and consistent electronic catalogue based cutting parameter selection method which uses MS Access in terms of programming and database implementation. Consequently, the proposed system could automatically generate the proper cutting conditions (feed, depth of cut, and cutting speed) as soon as the input data (proper information about the tool and work material) is given. Thanks to the simple structure and popularity of the MS Access, the engineer could be quickly accustomed to the system and easily modify/insert/delete the database if necessary.

  • PDF

Design of Multiple Model Fuzzy Predictors using Data Preprocessing and its Application (데이터 전처리를 이용한 다중 모델 퍼지 예측기의 설계 및 응용)

  • Bang, Young-Keun;Lee, Chul-Heui
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.58 no.1
    • /
    • pp.173-180
    • /
    • 2009
  • It is difficult to predict non-stationary or chaotic time series which includes the drift and/or the non-linearity as well as uncertainty. To solve it, we propose an effective prediction method which adopts data preprocessing and multiple model TS fuzzy predictors combined with model selection mechanism. In data preprocessing procedure, the candidates of the optimal difference interval are determined based on the correlation analysis, and corresponding difference data sets are generated in order to use them as predictor input instead of the original ones because the difference data can stabilize the statistical characteristics of those time series and better reveals their implicit properties. Then, TS fuzzy predictors are constructed for multiple model bank, where k-means clustering algorithm is used for fuzzy partition of input space, and the least squares method is applied to parameter identification of fuzzy rules. Among the predictors in the model bank, the one which best minimizes the performance index is selected, and it is used for prediction thereafter. Finally, the error compensation procedure based on correlation analysis is added to improve the prediction accuracy. Some computer simulations are performed to verify the effectiveness of the proposed method.