• 제목/요약/키워드: Feature Variables

검색결과 366건 처리시간 0.024초

An Input Feature Selection Method Applied to Fuzzy Neural Networks for Signal Estimation

  • Na, Man-Gyun;Sim, Young-Rok
    • Nuclear Engineering and Technology
    • /
    • 제33권5호
    • /
    • pp.457-467
    • /
    • 2001
  • It is well known that the performance of a fuzzy neural network strongly depends on the input features selected for its training. In its applications to sensor signal estimation, there are a large number of input variables related with an output As the number of input variables increases, the training time of fuzzy neural networks required increases exponentially. Thus, it is essential to reduce the number of inputs to a fuzzy neural network and to select the optimum number of mutually independent inputs that are able to clearly define the input-output mapping. In this work, principal component analysis (PCA), genetic algorithms (CA) and probability theory are combined to select new important input features. A proposed feature selection method is applied to the signal estimation of the steam generator water level, the hot-leg flowrate, the pressurizer water level and the pressurizer pressure sensors in pressurized water reactors and compared with other input feature selection methods.

  • PDF

반도체 제조공정에서의 이상수율 검출 방법론 (A New Abnormal Yields Detection Methodology in the Semiconductor Manufacturing Process)

  • 이장희
    • Journal of Information Technology Applications and Management
    • /
    • 제15권1호
    • /
    • pp.243-260
    • /
    • 2008
  • To prevent low yields in the semiconductor industry is crucial to the success of that industry. However, to prevent low yields is difficult because of too many factors to affect yield variation and their complex relation in the semiconductor manufacturing process. This study presents a new efficient detection methodology for detecting abnormal yields including high and low yields, which can forecast the yield level of a production unit (namely a lot) based on yield-related feature variables' behaviors. In the methodology, we use C5.0 to identify the yield-related feature variables that are the combination of correlated process variables associated with yield, use SOM (Self-Organizing Map) neural networks to extract and classify significant patterns of past abnormal yield lots and finally use C5.0 to generate classification rules for detecting abnormal yield lot. We illustrate the effectiveness of our methodology using a semiconductor manufacturing company's field data.

  • PDF

Feature Selection Algorithm for Intrusions Detection System using Sequential Forward Search and Random Forest Classifier

  • Lee, Jinlee;Park, Dooho;Lee, Changhoon
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제11권10호
    • /
    • pp.5132-5148
    • /
    • 2017
  • Cyber attacks are evolving commensurate with recent developments in information security technology. Intrusion detection systems collect various types of data from computers and networks to detect security threats and analyze the attack information. The large amount of data examined make the large number of computations and low detection rates problematic. Feature selection is expected to improve the classification performance and provide faster and more cost-effective results. Despite the various feature selection studies conducted for intrusion detection systems, it is difficult to automate feature selection because it is based on the knowledge of security experts. This paper proposes a feature selection technique to overcome the performance problems of intrusion detection systems. Focusing on feature selection, the first phase of the proposed system aims at constructing a feature subset using a sequential forward floating search (SFFS) to downsize the dimension of the variables. The second phase constructs a classification model with the selected feature subset using a random forest classifier (RFC) and evaluates the classification accuracy. Experiments were conducted with the NSL-KDD dataset using SFFS-RF, and the results indicated that feature selection techniques are a necessary preprocessing step to improve the overall system performance in systems that handle large datasets. They also verified that SFFS-RF could be used for data classification. In conclusion, SFFS-RF could be the key to improving the classification model performance in machine learning.

약물유전체학에서 약물반응 예측모형과 변수선택 방법 (Feature selection and prediction modeling of drug responsiveness in Pharmacogenomics)

  • 김규환;김원국
    • 응용통계연구
    • /
    • 제34권2호
    • /
    • pp.153-166
    • /
    • 2021
  • 약물유전체학 연구의 주요 목표는 고차원의 유전 변수를 기반으로 개인의 약물 반응성을 예측하는 것이다. 변수의 개수가 많기 때문에 변수의 개수를 줄이기 위해서는 변수 선택이 필요하며, 선택된 변수들은 머신러닝 알고리즘을 사용하여 예측 모델을 구축하는데 사용된다. 본 연구에서는 400명의 뇌전증 환자의 차세대 염기서열 분석 데이터에 로지스틱 회귀, ReliefF, TurF, 랜덤 포레스트, LASSO의 조합과 같은 여러 가지 혼합 변수 선택 방법을 적용하였다. 선택된 변수들에 랜덤포레스트, 그래디언트 부스팅, 서포트벡터머신을 포함한 머신러닝 방법들을 적용했고 스태킹을 통해 앙상블 모형을 구축하였다. 본 연구의 결과는 랜덤포레스트와 ReliefF의 혼합 변수 선택 방법을 이용한 스태킹 모형이 다른 모형보다 더 좋은 성능을 보인다는 것을 보여주었다. 5-폴드 교차 검증을 기반으로 하여 적합한 최적 모형의 평균 검증 정확도는 0.727이고 평균 검증 AUC 값은 0.761로 나타났다. 또한, 동일한 변수를 사용할 때 스태킹 모델이 단일 머신러닝 예측 모델보다 성능이 우수한 것으로 나타났다.

Hybrid Feature Selection Using Genetic Algorithm and Information Theory

  • Cho, Jae Hoon;Lee, Dae-Jong;Park, Jin-Il;Chun, Myung-Geun
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • 제13권1호
    • /
    • pp.73-82
    • /
    • 2013
  • In pattern classification, feature selection is an important factor in the performance of classifiers. In particular, when classifying a large number of features or variables, the accuracy and computational time of the classifier can be improved by using the relevant feature subset to remove the irrelevant, redundant, or noisy data. The proposed method consists of two parts: a wrapper part with an improved genetic algorithm(GA) using a new reproduction method and a filter part using mutual information. We also considered feature selection methods based on mutual information(MI) to improve computational complexity. Experimental results show that this method can achieve better performance in pattern recognition problems than other conventional solutions.

특징정보 분석을 통한 실시간 얼굴인식 (Realtime Face Recognition by Analysis of Feature Information)

  • 정재모;배현;김성신
    • 한국지능시스템학회:학술대회논문집
    • /
    • 한국퍼지및지능시스템학회 2001년도 추계학술대회 학술발표 논문집
    • /
    • pp.299-302
    • /
    • 2001
  • The statistical analysis of the feature extraction and the neural networks are proposed to recognize a human face. In the preprocessing step, the normalized skin color map with Gaussian functions is employed to extract the region of face candidate. The feature information in the region of the face candidate is used to detect the face region. In the recognition step, as a tested, the 120 images of 10 persons are trained by the backpropagation algorithm. The images of each person are obtained from the various direction, pose, and facial expression. Input variables of the neural networks are the geometrical feature information and the feature information that comes from the eigenface spaces. The simulation results of$.$10 persons show that the proposed method yields high recognition rates.

  • PDF

Comparison of Customers Perception of Feature and Smart Phone Users Mainly in 20s

  • Kim, Hyun-Jong
    • 디지털융복합연구
    • /
    • 제9권1호
    • /
    • pp.115-124
    • /
    • 2011
  • The property of the mobile phone is taking important role to choose it. In the present situation, exploring, comparing and analyzing the important properties of regular mobile phone(feature phone) and smart phone are very meaningful study. Therefore, the survey was carried out to get the properties of feature phone and smart phone and analyze the difference of those phones. And proposed the important variables for customer satisfaction which must be given priority. The result showed that 'design' and 'Quality' are important to both mobile phone user groups. The problems with mobile phones currently in use were 'poor performance' to feature phone users and 'expensive charge' and 'poor A/S' to smart phone users. Two groups also showed significant difference with the customer satisfactions, and smart phone user group showed higher satisfaction. For smart phone user group, four factors are induced from the properties but 'Hardware Quality' (representing 'call Quality', 'A/S', 'Convenience to use', 'Battery life') and 'Design & Function'(representing 'Internet', 'Convergence Functions', 'Design, 'Color') have significant and positive effects on Customer Satisfaction.

특징정보 분석을 통한 실시간 얼굴인식 (Realtime Face Recognition by Analysis of Feature Information)

  • 정재모;배현;김성신
    • 한국지능시스템학회논문지
    • /
    • 제11권9호
    • /
    • pp.822-826
    • /
    • 2001
  • The statistical analysis of the feature extraction and the neural networks are proposed to recognize a human face. In the preprocessing step, the normalized skin color map with Gaussian functions is employed to extract the region of face candidate. The feature information in the region of the face candidate is used to detect the face region. In the recognition step, as a tested, the 120 images of 10 persons are trained by the backpropagation algorithm. The images of each person are obtained from the various direction, pose, and facial expression. Input variables of the neural networks are the geometrical feature information and the feature information that comes from the eigenface spaces. The simulation results of 10 persons show that the proposed method yields high recognition rates.

  • PDF

Classification of High Dimensionality Data through Feature Selection Using Markov Blanket

  • Lee, Junghye;Jun, Chi-Hyuck
    • Industrial Engineering and Management Systems
    • /
    • 제14권2호
    • /
    • pp.210-219
    • /
    • 2015
  • A classification task requires an exponentially growing amount of computation time and number of observations as the variable dimensionality increases. Thus, reducing the dimensionality of the data is essential when the number of observations is limited. Often, dimensionality reduction or feature selection leads to better classification performance than using the whole number of features. In this paper, we study the possibility of utilizing the Markov blanket discovery algorithm as a new feature selection method. The Markov blanket of a target variable is the minimal variable set for explaining the target variable on the basis of conditional independence of all the variables to be connected in a Bayesian network. We apply several Markov blanket discovery algorithms to some high-dimensional categorical and continuous data sets, and compare their classification performance with other feature selection methods using well-known classifiers.

요골 맥파를 이용한 사상체질 판별 (Determination of Sasang Constitution from Artery Pulse Waves)

  • 조재경
    • 한국산학기술학회논문지
    • /
    • 제21권2호
    • /
    • pp.359-365
    • /
    • 2020
  • 한방병원에 건강검진을 목적으로 내원한 732명의 피험자를 대상으로 사상체질감별설문지 검사를 통하여 한의사가 감별한 사상체질 결과와 맥진기를 사용하여 촌, 관, 척 부위의 요골 맥파를 측정한 자료를 입수하였다. 입수한 맥파 자료로부터 맥파의 폭, 피크의 크기와 개수를 특징 변수로 추출했다. 특징 변수들 중 타당도와 신뢰도가 높은 것들을 판별변수로 선정했다. 5겹 교차 검증법을 적용하여, 맥파 자료를 사상체질을 알려주는 훈련 표본과 사상체질을 알려주지 않는 예측 표본으로 구분했다. 훈련 표본에 대해서 판별분석을 수행하여 판별 함수를 구한 후, 이것을 예측 표본에 적용하여 사상체질을 예측했다. 예측한 사상체질과 설문지 검사로 알고 있는 사상체질을 비교하여 사상체질 분류 정확도를 계산했다. 사상체질 분류 정확도는 나이별, 성별 보정 전에는 태음인: 73.6 %, 소음인: 68.4 %, 소양인: 74.2 %이었으며, 전체 분류 정확도는 72.5 %이었다. 한국 인구를 기반으로 한 성별, 나이별 가중치를 적용하여 보정을 한 후의 사상체질 분류 정확도는 태음인(70.4 %), 소음인(84.2 %), 소양인(67.7 %)이었으며, 전체 분류 정확도는 73.8 %이었다.