• Title/Summary/Keyword: bayesian neural networks

Path-Based Computation Encoder for Neural Architecture Search

  • Yang, Ying;Zhang, Xu;Pan, Hu
    • Journal of Information Processing Systems
    • /
    • v.18 no.2
    • /
    • pp.188-196
    • /
    • 2022
  • Neural architecture search (NAS) has received increasing attention because it can replace human experts in designing neural network architectures for different tasks, and it has achieved remarkable results on many challenging tasks. This study proposes a path-based computation neural architecture encoder (PCE). PCE first encodes the computation of information on each path in a neural network and then aggregates the encodings of all paths through an attention mechanism. This simulates how information is computed along paths in the network and encodes the computation performed by the network rather than the structure of its graph, which is more consistent with the computational properties of neural networks. We performed an extensive comparison with eight encoding methods on two commonly used NAS search spaces (NAS-Bench-101 and NAS-Bench-201). The comparison covered the predictive capability of performance predictors and the search capability of two search strategies (reinforcement learning-based and Bayesian optimization-based) when equipped with different encoders. The experimental evaluation shows that PCE is an efficient encoding method that effectively ranks and predicts neural architecture performance, thereby improving the search efficiency of neural architectures.
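
The path-then-attention idea in this abstract can be illustrated with a minimal sketch. It is not the authors' PCE: the per-path encoder here simply averages operation embeddings, and the names `enumerate_paths`, `encode_architecture`, `op_embed`, and `query` are hypothetical, as are the toy cell and the random embeddings.

```python
import numpy as np

def enumerate_paths(adj, src, dst):
    """Return every src -> dst path in a DAG given as an adjacency dict."""
    if src == dst:
        return [[dst]]
    paths = []
    for nxt in adj.get(src, []):
        for tail in enumerate_paths(adj, nxt, dst):
            paths.append([src] + tail)
    return paths

def encode_architecture(adj, ops, op_embed, query, src, dst):
    """Attention-pool per-path encodings into one architecture vector."""
    paths = enumerate_paths(adj, src, dst)
    # Assumed stand-in for the path encoder: mean of the operation embeddings.
    path_vecs = np.stack(
        [np.mean([op_embed[ops[n]] for n in p], axis=0) for p in paths]
    )
    scores = path_vecs @ query               # one attention score per path
    weights = np.exp(scores - scores.max())  # numerically stable softmax
    weights /= weights.sum()
    return weights @ path_vecs, paths, weights

# Toy cell: node 0 is the input, node 4 the output (illustrative only).
rng = np.random.default_rng(0)
op_names = ["input", "conv3x3", "conv1x1", "maxpool", "output"]
op_embed = {name: rng.normal(size=8) for name in op_names}
adj = {0: [1, 2], 1: [3], 2: [3], 3: [4]}
ops = {0: "input", 1: "conv3x3", 2: "conv1x1", 3: "maxpool", 4: "output"}
query = rng.normal(size=8)

vec, paths, weights = encode_architecture(adj, ops, op_embed, query, 0, 4)
print(paths, weights, vec.shape)
```

In a trained encoder the embeddings and the attention query would be learned jointly with the performance predictor; here they are random so that the sketch stays self-contained.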

Predicting concrete properties using neural networks (NN) with principal component analysis (PCA) technique

  • Boukhatem, B.;Kenai, S.;Hamou, A.T.;Ziou, Dj.;Ghrici, M.
    • Computers and Concrete
    • /
    • v.10 no.6
    • /
    • pp.557-573
    • /
    • 2012
  • This paper discusses the combined application of two techniques, Neural Networks (NN) and Principal Component Analysis (PCA), for improved prediction of concrete properties. Combining these approaches allowed the development of six neural network models for predicting the slump and compressive strength of concrete with mineral additives such as blast furnace slag, fly ash and silica fume. The Back-Propagation Multi-Layer Perceptron (BPMLP) with Bayesian regularization was used in all of these models. The models were built to capture the complex nonlinear relationship between the inputs and the output of the network. They were developed from a large experimental database on concrete organized in Mix-Property form. As a result, the concrete mixture data are highly correlated with one another. PCA is proposed to compress these data and remove the correlation between them. After applying PCA, the uncorrelated data were used to train the six models. The predictions of these models were compared with the actual experimental trials. The results showed that eliminating the correlation between the input parameters with PCA improved the generalisation performance of the predictive models while allowing smaller architectures and reduced dimensionality. The study also showed that using the developed models for numerical investigations of the parameters affecting the properties of concrete is promising.
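
The decorrelation step described here can be sketched with a plain SVD-based PCA. This is a minimal illustration under stated assumptions: the mix variables (`cement`, `water`, `slag`) and their synthetic values are invented for the example and do not come from the paper's database, and `pca_transform` is a hypothetical helper name.

```python
import numpy as np

def pca_transform(X, n_components):
    """Project standardized data onto its leading principal components."""
    Xs = (X - X.mean(axis=0)) / X.std(axis=0)   # standardize each column
    _, S, Vt = np.linalg.svd(Xs, full_matrices=False)
    scores = Xs @ Vt[:n_components].T           # decorrelated inputs
    explained = (S ** 2)[:n_components] / (S ** 2).sum()
    return scores, explained

# Synthetic, deliberately correlated mix variables (illustrative values only).
rng = np.random.default_rng(0)
cement = rng.uniform(250, 450, size=200)
water = 0.5 * cement + rng.normal(0, 5, size=200)
slag = rng.uniform(0, 150, size=200)
X = np.column_stack([cement, water, slag])

scores, explained = pca_transform(X, n_components=2)
corr = np.corrcoef(scores, rowvar=False)
print(explained, corr[0, 1])
```

The component scores, not the raw mix proportions, would then be fed to the network, which is why the input layer can be smaller after PCA.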

Recurrent Neural Network Modeling of Etch Tool Data: a Preliminary for Fault Inference via Bayesian Networks

  • Nawaz, Javeria;Arshad, Muhammad Zeeshan;Park, Jin-Su;Shin, Sung-Won;Hong, Sang-Jeen
    • Proceedings of the Korean Vacuum Society Conference
    • /
    • 2012.02a
    • /
    • pp.239-240
    • /
    • 2012
  • With advancements in semiconductor device technologies, manufacturing processes are becoming more complex, and maintaining tight process control has become more difficult. As the number of processing steps needed to fabricate complex chip structures increases, potential fault-inducing factors become more prevalent and their allowable margins are continuously reduced. Therefore, one of the keys to success in semiconductor manufacturing is highly accurate and fast fault detection and classification at each stage, to reduce undesired variation and identify the cause of a fault. Sensors in the equipment are used to monitor the state of the process. The idea is that whenever there is a fault in the process, it appears as a variation in the output of one or more of the sensors monitoring the process. These sensors may report information such as pressure, RF power, or gas flow in the equipment. By relating the sensor data to the process condition, abnormalities in the process can be identified, although some degree of uncertainty remains. Our approach in this research is to capture the features of equipment condition data from a library of healthy processes. The healthy data can then serve as a reference for upcoming processes, which is made possible by mathematically modeling the acquired data. In this work we demonstrate the use of a recurrent neural network (RNN) for this purpose. An RNN is a dynamic neural network whose output is a function of previous inputs. In our case we have etch equipment tool set data consisting of 22 parameters and 9 runs. The data were first synchronized using the Dynamic Time Warping (DTW) algorithm. The synchronized sensor data, in the form of time series, are then provided to the RNN, which trains and restructures itself according to the input and then predicts a value one step ahead in time, based on the past values of the data. Eight runs of process data were used to train the network, and one run was used as a test input to check its performance. Next, a mean squared error based probability generating function was used to assign a probability of fault to each parameter by comparing the predicted and actual values of the data. In the future we will use Bayesian Networks to classify the detected faults. Bayesian Networks use directed acyclic graphs that relate different parameters through their conditional dependencies in order to perform inference among them. The relationships between parameters in the data will be used to generate the structure of the Bayesian Network, and the posterior probability of different faults will then be calculated using inference algorithms.
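
The DTW synchronization step mentioned in this abstract can be sketched with the textbook dynamic-programming recurrence. This is a generic illustration, not the authors' implementation; `dtw_distance` and the two toy traces are hypothetical.

```python
import numpy as np

def dtw_distance(a, b):
    """Classic dynamic time warping cost between two 1-D sequences."""
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of: insertion, deletion, or match.
            cost[i, j] = d + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    return float(cost[n, m])

# A reference sensor trace and a time-stretched copy of it (toy values).
reference_run = [0.0, 1.0, 2.0, 1.0, 0.0]
stretched_run = [0.0, 0.0, 1.0, 2.0, 2.0, 1.0, 0.0]
print(dtw_distance(reference_run, stretched_run))
```

Because DTW allows one sample to align with several samples of the other sequence, runs of different duration can be compared point by point before they are passed to a time series model.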

Automated segmentation of concrete images into microstructures: A comparative study

  • Yazdi, Mehran;Sarafrazi, Katayoon
    • Computers and Concrete
    • /
    • v.14 no.3
    • /
    • pp.315-325
    • /
    • 2014
  • Concrete is an important material in most civil construction. Many properties of concrete can be determined through the analysis of concrete images, and image segmentation is the first step in most of these analyses. An automated system for segmenting concrete images into microstructures using texture analysis is proposed. The performance of five different classifiers was evaluated, and the results show that an Artificial Neural Network classifier is the best choice for automatic image segmentation of concrete.

Text Classification Using Parallel Word-level and Character-level Embeddings in Convolutional Neural Networks

  • Geonu Kim;Jungyeon Jang;Juwon Lee;Kitae Kim;Woonyoung Yeo;Jong Woo Kim
    • Asia pacific journal of information systems
    • /
    • v.29 no.4
    • /
    • pp.771-788
    • /
    • 2019
  • Deep learning techniques such as Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) outperform traditional approaches such as Support Vector Machines (SVMs) and Naïve Bayesian classifiers in text classification. When CNNs are used for text classification tasks, word embedding or character embedding transforms words or characters into fixed-size vectors before they are fed into the convolutional layers. In this paper, we propose a parallel word-level and character-level embedding approach in CNNs for text classification. The proposed approach can capture word-level and character-level patterns concurrently. To show the usefulness of the proposed approach, we perform experiments with two English and three Korean text datasets. The experimental results show that character-level embedding works better in Korean and word-level embedding performs well in English. The results also reveal that the proposed approach performs better than traditional CNNs with only word-level or only character-level embedding on both Korean and English documents. A more detailed investigation shows that the proposed approach tends to perform better than the traditional embedding approaches when a relatively small amount of data is available.

Sequence-Based Travel Route Recommendation Systems Using Deep Learning - A Case of Jeju Island - (딥러닝을 이용한 시퀀스 기반의 여행경로 추천시스템 -제주도 사례-)

  • Lee, Hee Jun;Lee, Won Sok;Choi, In Hyeok;Lee, Choong Kwon
    • Smart Media Journal
    • /
    • v.9 no.1
    • /
    • pp.45-50
    • /
    • 2020
  • With the development of deep learning, studies that apply deep artificial neural networks to recommendation systems are being actively conducted. In particular, recommendation systems based on RNNs (Recurrent Neural Networks) perform well because they consider the sequential characteristics of the data. This study proposes a travel route recommendation system using GRU (Gated Recurrent Unit) and session-based parallel mini-batches, both of which are RNN-based techniques. The study improved recommendation performance through an ensemble of the TOP1 and BPR (Bayesian personalized ranking) error functions. In addition, it was confirmed that the RNN-based recommendation system, by considering the sequential characteristics of the data, makes recommendations that reflect the meaning of each travel destination within the travel route.
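
The two ranking losses named in this abstract can be sketched in the form commonly used for session-based GRU recommenders. This is an assumption about the formulation; the abstract does not state the exact definitions or how the two losses are combined in the ensemble, and the function names and scores below are hypothetical.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def bpr_loss(pos_score, neg_scores):
    """BPR: push the target item's score above each sampled negative."""
    return float(-np.mean(np.log(sigmoid(pos_score - neg_scores))))

def top1_loss(pos_score, neg_scores):
    """TOP1: pairwise ranking term plus a penalty on large negative scores."""
    return float(np.mean(sigmoid(neg_scores - pos_score) + sigmoid(neg_scores ** 2)))

# Toy scores: one visited destination versus three sampled negatives.
neg_scores = np.array([0.5, -0.2, 0.1])
well_ranked, badly_ranked = 3.0, -1.0

print(bpr_loss(well_ranked, neg_scores), bpr_loss(badly_ranked, neg_scores))
print(top1_loss(well_ranked, neg_scores), top1_loss(badly_ranked, neg_scores))
```

Both losses decrease as the score of the true next destination rises above the sampled negatives, which is the behavior a next-item recommender is trained for.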

Spatial-Temporal Drought Analysis of South Korea Based On Neural Networks (신경망을 이용한 우리나라의 시공간적 가뭄의 해석)

  • Sin, Hyeon-Seok;Park, Mu-Jong
    • Journal of Korea Water Resources Association
    • /
    • v.32 no.1
    • /
    • pp.15-29
    • /
    • 1999
  • This paper introduces a new methodology for analyzing and quantifying regional meteorological drought based on annual precipitation data. Using the posterior probability estimator and the Bayesian classifier of the Spatial Analysis Neural Network (SANN), point drought probabilities categorized as extreme, severe, mild, and non-drought events are defined, and a Bayesian Drought Severity Index (BDSI) is introduced to classify the region of interest into four drought severities. In addition, to estimate the drought severity for the entire region, regional extreme, severe, mild, and non-drought probabilities, which are the areal averages of the point drought probabilities over the region, are computed and applied. The proposed methodology was applied to analyze regional drought in South Korea for the period 1967-1996. The drought severity for the whole of South Korea was determined spatially for each year, and each year was assigned a drought severity class. The results may help water managers understand drought in South Korea with respect to its spatial and temporal variation.

Fault Diagnosis in Semiconductor Etch Equipment Using Bayesian Networks

  • Nawaz, Javeria Muhammad;Arshad, Muhammad Zeeshan;Hong, Sang Jeen
    • JSTS:Journal of Semiconductor Technology and Science
    • /
    • v.14 no.2
    • /
    • pp.252-261
    • /
    • 2014
  • A Bayesian network (BN) based fault diagnosis framework for semiconductor etching equipment is presented. The suggested framework comprises data preprocessing, data synchronization, time series modeling, and BN inference, and the established BNs show the cause-and-effect relationships at the equipment module level. Statistically significant state variable identification (SVID) data from the etch equipment are preselected using principal component analysis (PCA), and derivative dynamic time warping (DDTW) is employed for data synchronization. Elman's recurrent neural networks (ERNNs) are constructed for individual SVID parameters, and the prediction errors of the ERNNs are then used to assign prior conditional probabilities in the BN inference for fault diagnosis. To demonstrate the proposed methodology, a 300 mm etch equipment model is reconstructed at the subsystem level, and several fault diagnosis scenarios are considered. The BNs for equipment fault diagnosis consist of three layers of nodes, namely root cause (RC), module (M), and data parameter (DP), and the constructed BN illustrates how an observed fault is related to possible root causes. Four out of five different types of fault scenarios are successfully diagnosed with the proposed inference methodology.
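
The three-layer RC -> M -> DP structure described here can be illustrated with inference by enumeration on a tiny chain. This is a sketch under strong assumptions: the root causes are treated as mutually exclusive, each node is binary, and every name and probability below (`gas_leak`, `rf_drift`, the table values) is invented for the example rather than taken from the paper.

```python
# Hypothetical prior over two mutually exclusive root causes (RC layer).
prior = {"gas_leak": 0.3, "rf_drift": 0.7}
# Hypothetical P(module fault | root cause) for one module (M layer).
p_module_fault = {"gas_leak": 0.9, "rf_drift": 0.2}
# Hypothetical P(data parameter abnormal | module fault state) (DP layer).
p_dp_abnormal = {True: 0.95, False: 0.05}

def posterior_root_cause(prior, p_module_fault, p_dp_abnormal):
    """P(root cause | data parameter abnormal), summing out the module node."""
    joint = {}
    for rc, p_rc in prior.items():
        p_m = p_module_fault[rc]
        # Marginalize over the module being faulty or healthy.
        likelihood = p_m * p_dp_abnormal[True] + (1.0 - p_m) * p_dp_abnormal[False]
        joint[rc] = p_rc * likelihood
    total = sum(joint.values())
    return {rc: p / total for rc, p in joint.items()}

posterior = posterior_root_cause(prior, p_module_fault, p_dp_abnormal)
print(posterior)
```

Even though `rf_drift` has the higher prior in this toy setup, observing an abnormal data parameter shifts the posterior toward the root cause that more often produces a module fault.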

Pattern Recognition using Robust Feedforward Neural Networks (로버스트 다층전방향 신경망을 이용한 패턴인식)

  • Hwang, Chang-Ha;Kim, Sang-Min
    • Journal of the Korean Data and Information Science Society
    • /
    • v.9 no.2
    • /
    • pp.345-355
    • /
    • 1998
  • The back propagation (BP) algorithm allows multilayer feedforward neural networks to learn input-output mappings from training samples. It iteratively adjusts the network parameters (weights) to minimize the sum of squared approximation errors using a gradient descent technique. However, the mapping acquired through the BP algorithm may be corrupted when erroneous training data are used. In this paper, two types of robust backpropagation algorithms are discussed, both from a theoretical point of view and in case studies of nonlinear regression function estimation and handwritten Korean character recognition. For future research, we suggest a Bayesian learning approach to neural networks and a comparison with the two robust backpropagation algorithms.

Multi-temporal Remote-Sensing Image Classification Using Artificial Neural Networks (인공신경망 이론을 이용한 위성영상의 카테고리분류)

  • Kang, Moon-Seong;Park, Seung-Woo;Lim, Jae-Chon
    • Proceedings of the Korean Society of Agricultural Engineers Conference
    • /
    • 2001.10a
    • /
    • pp.59-64
    • /
    • 2001
  • The objective of the thesis is to propose a pattern classification method for remote sensing data using an artificial neural network. First, we apply the error back propagation algorithm to classify the remote sensing data. In this case, the classification performance depends on the training data set. Using the training data set and the error back propagation algorithm, a layered neural network is trained so that the training patterns are classified with a specified accuracy. After the neural network is trained, pixels that are incorrectly classified are deleted from the original training data set, and a new training data set is built. Once training is complete, a testing data set is classified using the trained neural network. The classification results for Landsat TM data show that this approach produces excellent results that are more realistic and less noisy than those of a conventional Bayesian method.
