• 제목/요약/키워드: Selection of input parameter

검색결과 68건 처리시간 0.021초

다구찌 디자인을 이용한 앙상블 및 군집분석 분류 성능 비교 (Comparing Classification Accuracy of Ensemble and Clustering Algorithms Based on Taguchi Design)

  • 신형원;손소영
    • 대한산업공학회지
    • /
    • 제27권1호
    • /
    • pp.47-53
    • /
    • 2001
  • In this paper, we compare the classification performances of both ensemble and clustering algorithms (Data Bagging, Variable Selection Bagging, Parameter Combining, Clustering) to logistic regression in consideration of various characteristics of input data. Four factors used to simulate the logistic model are (1) correlation among input variables (2) variance of observation (3) training data size and (4) input-output function. In view of the unknown relationship between input and output function, we use a Taguchi design to improve the practicality of our study results by letting it as a noise factor. Experimental study results indicate the following: When the level of the variance is medium, Bagging & Parameter Combining performs worse than Logistic Regression, Variable Selection Bagging and Clustering. However, classification performances of Logistic Regression, Variable Selection Bagging, Bagging and Clustering are not significantly different when the variance of input data is either small or large. When there is strong correlation in input variables, Variable Selection Bagging outperforms both Logistic Regression and Parameter combining. In general, Parameter Combining algorithm appears to be the worst at our disappointment.

  • PDF

Survey on Nucleotide Encoding Techniques and SVM Kernel Design for Human Splice Site Prediction

  • Bari, A.T.M. Golam;Reaz, Mst. Rokeya;Choi, Ho-Jin;Jeong, Byeong-Soo
    • Interdisciplinary Bio Central
    • /
    • 제4권4호
    • /
    • pp.14.1-14.6
    • /
    • 2012
  • Splice site prediction in DNA sequence is a basic search problem for finding exon/intron and intron/exon boundaries. Removing introns and then joining the exons together forms the mRNA sequence. These sequences are the input of the translation process. It is a necessary step in the central dogma of molecular biology. The main task of splice site prediction is to find out the exact GT and AG ended sequences. Then it identifies the true and false GT and AG ended sequences among those candidate sequences. In this paper, we survey research works on splice site prediction based on support vector machine (SVM). The basic difference between these research works is nucleotide encoding technique and SVM kernel selection. Some methods encode the DNA sequence in a sparse way whereas others encode in a probabilistic manner. The encoded sequences serve as input of SVM. The task of SVM is to classify them using its learning model. The accuracy of classification largely depends on the proper kernel selection for sequence data as well as a selection of kernel parameter. We observe each encoding technique and classify them according to their similarity. Then we discuss about kernel and their parameter selection. Our survey paper provides a basic understanding of encoding approaches and proper kernel selection of SVM for splice site prediction.

GT를 이용한 Web 기반 절삭변수 검색시스템의 개발 (The Development of the Web Based Cutting Parameter Selection System Using Group Technology)

  • 이성열;곽규섭
    • 산업공학
    • /
    • 제15권3호
    • /
    • pp.308-315
    • /
    • 2002
  • This study presents the web based cutting parameter selection system using Group Technology (GT). The GT is basically applied to classify and code the work material and cutting process which are main factors to affect cutting parameter selection. The proposed system has been designed to electronically select proper cutting conditions based on the stored GT database. The existing approaches used in most small and medium sized companies are basically to use manufacturing engineer's experience or to find the recommended values from the manufacturing engineers handbook. These processes are often time consuming and inconsistent, especially when a new engineer is involved. Consequently, the proposed system could automatically and consistently generate the proper cutting conditions (feed, depth of cut, and cutting speed) as soon as relatively simple data input is given thanks to the classified GT database.

Parameter Estimation of Three-Phase Induction Motor by Using Genetic Algorithm

  • Jangjit, Seesak;Laohachai, Panthep
    • Journal of Electrical Engineering and Technology
    • /
    • 제4권3호
    • /
    • pp.360-364
    • /
    • 2009
  • This paper suggests the techniques in determining the values of the steady-state equivalent circuit parameters of a three-phase induction machine using genetic algorithm. The parameter estimation procedure is based on the steady-state phase current versus slip and input power versus slip characteristics. The propose estimation algorithm is of non-linear kind based on selection in genetic algorithm. The machine parameters are obtained as the solution of a minimization of objective function by genetic algorithm. Simulation shows good performance of the propose procedures.

TBM 굴진성능 예측모델 분석: 리뷰 (Analysis on prediction models of TBM performance: A review)

  • 이항로;송기일;조계춘
    • 한국터널지하공간학회 논문집
    • /
    • 제18권2호
    • /
    • pp.245-256
    • /
    • 2016
  • TBM을 적용하는 현장에서 장비 선택, 공사기간 및 공사비용의 합리적인 산정을 위하여 TBM의 굴진성능을 정확하게 예측하는 것은 매우 중요한 사안이다. 본 연구에서는 최신 자료들을 바탕으로 기존의 TBM 굴진성능 예측모델들의 평가과정과 방법론에 대한 분석을 수행하였다. 2000년 이후에 발표된 문헌들에 대한 조사를 토대로 TBM 굴진성능 예측모델의 분류체계를 제시하였다. 본 연구에서 제시한 분류체계에서는 TBM 굴진성능 예측모델에 필요한 입력인자 선정단계와 예측기법 적용단계로 크게 구분하였다. 또한 각 예측모델에서 사용된 입력인자, 출력인자 그리고 예측모델에서 사용된 인자의 적용빈도를 분석하였다. 마지막으로 TBM 굴진성능 예측모델의 현황과 향후 연구방향에 대하여 제언하였다.

다구찌 디자인을 이용한 데이터 퓨전 및 군집분석 분류 성능 비교 (Comparison Study for Data Fusion and Clustering Classification Performances)

  • 신형원;손소영
    • 한국경영과학회:학술대회논문집
    • /
    • 대한산업공학회/한국경영과학회 2000년도 춘계공동학술대회 논문집
    • /
    • pp.601-604
    • /
    • 2000
  • In this paper, we compare the classification performance of both data fusion and clustering algorithms (Data Bagging, Variable Selection Bagging, Parameter Combining, Clustering) to logistic regression in consideration of various characteristics of input data. Four factors used to simulate the logistic model are (1) correlation among input variables (2) variance of observation (3) training data size and (4) input-output function. Since the relationship between input & output is not typically known, we use Taguchi design to improve the practicality of our study results by letting it as a noise factor. Experimental study results indicate the following: Clustering based logistic regression turns out to provide the highest classification accuracy when input variables are weakly correlated and the variance of data is high. When there is high correlation among input variables, variable bagging performs better than logistic regression. When there is strong correlation among input variables and high variance between observations, bagging appears to be marginally better than logistic regression but was not significant.

  • PDF

Unbalance Control Strategy of Boost Type Three-Phase to Single-Phase Matrix Converters Based on Lyapunov Function

  • Xu, Yu-xiang;Ge, Hong-juan;Guo, Hai
    • Journal of Power Electronics
    • /
    • 제19권1호
    • /
    • pp.89-98
    • /
    • 2019
  • This paper analyzes the input side performance of a conventional three-phase to single-phase matrix converter (3-1MC). It also presents the input-side waveform quality under this topology. The suppression of low-frequency input current harmonics is studied using the 3-1MC plus capacitance compensation unit. The constraint between the modulation function of the output and compensation sides is analyzed, and the relations among the voltage utilization ratio and the output compensation capacitance, filter capacitors and other system parameters are deduced. For a 3-1MC without large-capacity energy storage, the system performance is susceptible to input voltage imbalance. This paper decouples the inner current of the 3-1MC using a Lyapunov function in the input positive and negative sequence bi-coordinate axes. Meanwhile, the outer loop adopts a voltage-weighted synthesis of the output and compensation sides as a cascade of control objects. Experiments show that this strategy suppresses the low-frequency input current harmonics caused by input voltage imbalance, and ensures that the system maintains good static and dynamic performances under input-unbalanced conditions. At the same time, the parameter selection and debugging methods are simple.

A Hybrid PSO-BPSO Based Kernel Extreme Learning Machine Model for Intrusion Detection

  • Shen, Yanping;Zheng, Kangfeng;Wu, Chunhua
    • Journal of Information Processing Systems
    • /
    • 제18권1호
    • /
    • pp.146-158
    • /
    • 2022
  • With the success of the digital economy and the rapid development of its technology, network security has received increasing attention. Intrusion detection technology has always been a focus and hotspot of research. A hybrid model that combines particle swarm optimization (PSO) and kernel extreme learning machine (KELM) is presented in this work. Continuous-valued PSO and binary PSO (BPSO) are adopted together to determine the parameter combination and the feature subset. A fitness function based on the detection rate and the number of selected features is proposed. The results show that the method can simultaneously determine the parameter values and select features. Furthermore, competitive or better accuracy can be obtained using approximately one quarter of the raw input features. Experiments proved that our method is slightly better than the genetic algorithm-based KELM model.

전자 카탈로그식 절삭변수 선정의 자동화 (Electronic Catalogue Based Cutting Parameter Selection)

  • 이성열
    • 한국산업정보학회논문지
    • /
    • 제6권4호
    • /
    • pp.1-5
    • /
    • 2001
  • 기존의 대부분 중소기업에서 사용하고 있는 공작기계의 절삭조건 선정방법은 공작기계, 절삭공구, 피삭재의 종류에 따라 절삭기사의 경험 또는 공구 메뉴얼의 권장값을 수작업으로 찾아서 이용하고 있는 실정이어서 절삭가공 비용의 증가가 따르게 되며, 특히 미숙련자가 현장에 배치될 경우는 이 문제는 더욱 일관성에서 멀어지게 된다. 이것은 결과적으로 가격 경쟁력을 약화시키는 요인이 되고 있다. 특히 CNC 공작기계가 주종을 이루는 경우는 NC 프로그래머가 공정계획에 따라 절삭작업공정을 프로그램 할 때 적정한 절삭변수들을 선정해주어야 하는데, 이 부분이 위에서 설명한 것처럼 수작업에 의해 탐색되어야 함으로 변수 선정의 부정확성은 물론 프로그래머에 따라 다르게 선택할 수 있게 되어 일관성이 결여된 선정을 초래하게 된다. 그러므로, 본 연구에서는 NC 프로그래머 또는 공작기계 조작자가 공구 및 피삭재에 대한 전문지식 없이도 적절한 절삭조건을 손쉽고 일관성 있게 찾을 수 있도록 MS 액세스 소프트웨어를 이용한 전자 카탈로그식 절삭변수 선정 시스템을 개발하였다.

  • PDF

데이터 전처리를 이용한 다중 모델 퍼지 예측기의 설계 및 응용 (Design of Multiple Model Fuzzy Predictors using Data Preprocessing and its Application)

  • 방영근;이철희
    • 전기학회논문지
    • /
    • 제58권1호
    • /
    • pp.173-180
    • /
    • 2009
  • It is difficult to predict non-stationary or chaotic time series which includes the drift and/or the non-linearity as well as uncertainty. To solve it, we propose an effective prediction method which adopts data preprocessing and multiple model TS fuzzy predictors combined with model selection mechanism. In data preprocessing procedure, the candidates of the optimal difference interval are determined based on the correlation analysis, and corresponding difference data sets are generated in order to use them as predictor input instead of the original ones because the difference data can stabilize the statistical characteristics of those time series and better reveals their implicit properties. Then, TS fuzzy predictors are constructed for multiple model bank, where k-means clustering algorithm is used for fuzzy partition of input space, and the least squares method is applied to parameter identification of fuzzy rules. Among the predictors in the model bank, the one which best minimizes the performance index is selected, and it is used for prediction thereafter. Finally, the error compensation procedure based on correlation analysis is added to improve the prediction accuracy. Some computer simulations are performed to verify the effectiveness of the proposed method.