• Title/Abstract/Keywords: neural network training

Search results: 1,742 items (processing time: 0.026 s)

Effects of Hyper-parameters and Dataset on CNN Training

  • Nguyen, Huu Nhan;Lee, Chanho
    • 전기전자학회논문지 / Vol. 22, No. 1 / pp. 14-20 / 2018
  • The purpose of training a convolutional neural network (CNN) is to obtain weight factors that give high classification accuracy. The initial values of the hyper-parameters affect the training results, so it is important to train a CNN with a suitable hyper-parameter set consisting of a learning rate, a batch size, a weight-initialization method, and an optimizer. We investigate the effect of each hyper-parameter while the others are held fixed in order to obtain a hyper-parameter set that gives higher classification accuracy and requires a shorter training time. A proposed VGG-like CNN is used for training, since VGG is widely used. The CNN is trained on four datasets: CIFAR10, CIFAR100, GTSRB and DSDL-DB. The effects of normalization and data transformation of the datasets are also investigated, and a training scheme using merged datasets is proposed.
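
The one-at-a-time procedure described in this abstract (vary a single hyper-parameter, keep the rest fixed) can be sketched as follows. This is only an illustration under stated assumptions: `one_at_a_time_sweep`, `best_per_parameter` and `toy_score` are hypothetical names, and `toy_score` is a made-up stand-in for "train the CNN and return its accuracy", not the paper's network or results.

```python
from typing import Any, Callable, Dict, List, Tuple

Config = Dict[str, Any]


def one_at_a_time_sweep(
    baseline: Config,
    candidates: Dict[str, List[Any]],
    evaluate: Callable[[Config], float],
) -> Dict[str, List[Tuple[Any, float]]]:
    """Vary one hyper-parameter at a time; all others stay at the baseline."""
    results: Dict[str, List[Tuple[Any, float]]] = {}
    for name, values in candidates.items():
        trials = []
        for value in values:
            config = dict(baseline)  # copy so the baseline is never mutated
            config[name] = value     # change exactly one hyper-parameter
            trials.append((value, evaluate(config)))
        results[name] = trials
    return results


def best_per_parameter(results: Dict[str, List[Tuple[Any, float]]]) -> Config:
    """For each hyper-parameter, pick the value with the highest score."""
    return {
        name: max(trials, key=lambda trial: trial[1])[0]
        for name, trials in results.items()
    }


def toy_score(config: Config) -> float:
    """Hypothetical stand-in for 'train the CNN, return validation accuracy'."""
    score = 1.0 - 10.0 * abs(config["learning_rate"] - 0.01)
    score -= abs(config["batch_size"] - 64) / 1000.0
    return score


baseline = {"learning_rate": 0.1, "batch_size": 32}
candidates = {"learning_rate": [0.1, 0.01, 0.001], "batch_size": [32, 64, 128]}
sweep = one_at_a_time_sweep(baseline, candidates, toy_score)
best = best_per_parameter(sweep)
```

In practice `evaluate` would wrap a full training run, so the sweep costs one training run per candidate value; that cost is the reason a one-at-a-time design is often preferred over a full grid.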

공진화에 의한 신경회로망의 구조탐색 및 학습 (A Co-Evolutionary Approach for Learning and Structure Search of Neural Networks)

  • 이동욱;전효병;심귀보
    • 한국지능시스템학회: Conference Proceedings / 한국퍼지및지능시스템학회, Proceedings of the 1997 Fall Conference / pp. 111-114 / 1997
  • Evolutionary algorithms are usually considered efficient for optimal system design. However, the performance of the system is determined by the fitness function and the system environment. To overcome this limitation, we propose a co-evolutionary method in which two populations constantly interact and coevolve. We apply coevolution to the evolution of neural networks: one population is composed of neural network structures and the other is composed of training patterns. The neural network structures evolve toward an optimal structure and, at the same time, the training patterns coevolve into feature patterns. This method prevents the system's performance from being limited by random design of the neural network structure and inadequate selection of training patterns. The neural networks are trained by evolution strategies, which are applicable to unsupervised learning. For the coding of neural networks, we propose a method that maintains nonredundancy and character preservingness, which are essential factors of genetic coding. We show the validity and effectiveness of the proposed scheme by applying it to the visual servoing of RV-M2 robot manipulators.

Neural-based Blind Modeling of Mini-mill ASC Crown

  • Lee, Gang-Hwa;Lee, Dong-Il;Lee, Seung-Joon;Lee, Suk-Gyu;Kim, Shin-Il;Park, Hae-Doo;Park, Seung-Gap
    • 한국지능시스템학회논문지 / Vol. 12, No. 6 / pp. 577-582 / 2002
  • A neural network can be trained to approximate an arbitrary nonlinear function of multivariate data, such as the mini-mill crown values in Automatic Shape Control (ASC). The trained weights of the neural network can evaluate or generalize process data outside the training vectors. Blind modeling of the process data is sometimes necessary for comparison with the scattered analytical models of the mini-mill process, which exist in isolated electro-mechanical forms. To obtain a viable model, we propose a blind, neural-based, range-division, domain-clustering, piecewise-linear modeling scheme. The basic ideas are: 1) dividing the range of the target data, 2) clustering the corresponding input-space vectors, 3) training the neural network with the clustered prototypes to smooth convergence, and 4) solving the resulting matrix equations with a pseudo-inverse to alleviate ill-conditioning. The simulation results support the effectiveness of the proposed scheme, which opens a new approach to data analysis. Compared with statistical regression, the proposed scheme achieves more uniform modeling errors and considerably smaller error magnitudes, with approximately 10-fold better performance.

다변량 데이터의 분류 성능 향상을 위한 특질 추출 및 분류 기법을 통합한 신경망 알고리즘 (Feature Selecting and Classifying Integrated Neural Network Algorithm for Multi-variate Classification)

  • 윤현수;백준걸
    • 산업공학 / Vol. 24, No. 2 / pp. 97-104 / 2011
  • Multi-variate classification has been studied through two kinds of procedures: feature selection and classification. Feature selection techniques have been applied to select important features, while classification techniques have improved performance through the application of classifiers. In general, each technique has been studied independently; the interaction between the two procedures has not been widely explored, which leads to degraded performance. In this paper, classification performance is improved by integrating these two procedures. The proposed model takes advantage of KBANN (Knowledge-Based Artificial Neural Network), which uses prior knowledge as training information for an NN (Neural Network). Each NN learns the characteristics of the feature selection and classification techniques as training sets. The integrated NN can then be trained again to modify features appropriately and enhance classification performance. This innovative technique is called ALBNN (Algorithm Learning-Based Neural Network). The experimental results show improved performance on various classification problems.

신경회로망과 실험계획법을 이용한 칩형상 예측 (Prediction of Chip Forms using Neural Network and Experimental Design Method)

  • 한성종;최진필;이상조
    • 한국정밀공학회지 / Vol. 20, No. 11 / pp. 64-70 / 2003
  • This paper suggests a systematic methodology for predicting chip forms using the experimental design technique and a neural network. Significant factors determined by ANOVA are used as input variables of the neural network, which is trained with the back-propagation algorithm. It has been shown that cutting conditions and cutting tool shapes have distinct effects on chip forms and thus on chip breaking. Cutting tools are represented using the Z-map method, which differs from existing methods that use a few chip breaker parameters. After the neural network is trained with the selected input variables, chip forms are predicted and compared with the original chip forms obtained from experiments under the same input conditions; the chip forms are the same under all conditions. To verify the suggested model, one tool not used in training is chosen and input to the model. Under various cutting conditions, the predicted chip forms agree well with those obtained from cutting experiments. The suggested method could significantly reduce the cost and time of designing cutting tools and could replace the "trial-and-error" design method.

Damage assessment of cable stayed bridge using probabilistic neural network

  • Cho, Hyo-Nam;Choi, Young-Min;Lee, Sung-Chil;Hur, Choon-Kun
    • Structural Engineering and Mechanics / Vol. 17, No. 3_4 / pp. 483-492 / 2004
  • This paper presents an efficient algorithm for estimating damage location and severity in bridge structures using a Probabilistic Neural Network (PNN). Generally, damage detection methods based on the Back Propagation Neural Network (BPNN) need many training patterns for the learning process, and the optimum architecture of a BPNN is selected by trial and error. In this paper, the PNN is used as a pattern classifier instead of the conventional BPNN. The modal properties of a damaged structure are somewhat different from those of an undamaged one. The basic idea of the proposed algorithm is that the PNN classifies a test pattern, which consists of the modal characteristics of the damaged structure, according to how close it is to each training pattern, which is composed of the modal characteristics of various structural damage cases. In this algorithm, two PNNs are used sequentially. The first PNN estimates the damage location using the mode shape, and its results are fed into the second PNN, which estimates the damage severity using the natural frequency. The proposed damage assessment algorithm is applied to a cable-stayed bridge to verify its applicability.
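
The idea of classifying a test pattern by its closeness to stored training patterns, and of chaining two PNNs, can be sketched roughly as below. This is a minimal illustration, not the authors' implementation: it assumes a standard Gaussian-kernel PNN, and the function names, the smoothing parameter `sigma`, the way the first result selects the second stage's training set, and all numbers and labels are hypothetical.

```python
import math
from collections import defaultdict
from typing import Dict, List, Sequence, Tuple

Pattern = Tuple[Sequence[float], str]  # (feature vector, class label)


def pnn_classify(x: Sequence[float], patterns: List[Pattern], sigma: float = 0.1) -> str:
    """Return the class whose training patterns lie closest to x on average.

    Each training pattern contributes a Gaussian kernel value; the class
    with the largest mean kernel value wins (sigma is an assumed width).
    """
    kernel_sum: Dict[str, float] = defaultdict(float)
    count: Dict[str, int] = defaultdict(int)
    for features, label in patterns:
        sq_dist = sum((a - b) ** 2 for a, b in zip(x, features))
        kernel_sum[label] += math.exp(-sq_dist / (2.0 * sigma ** 2))
        count[label] += 1
    return max(kernel_sum, key=lambda label: kernel_sum[label] / count[label])


def assess_damage(
    mode_shape: Sequence[float],
    natural_freqs: Sequence[float],
    location_patterns: List[Pattern],
    severity_patterns: Dict[str, List[Pattern]],
) -> Tuple[str, str]:
    """Two PNNs in sequence: location from mode shape, then severity.

    Assumption: the first result is passed on by selecting the severity
    training patterns that belong to the estimated location.
    """
    location = pnn_classify(mode_shape, location_patterns)
    severity = pnn_classify(natural_freqs, severity_patterns[location])
    return location, severity


# Toy data with made-up values, purely for illustration.
location_patterns = [([1.0, 0.0], "cable_3"), ([0.0, 1.0], "girder_5")]
severity_patterns = {
    "cable_3": [([0.95], "10%"), ([0.80], "30%")],
    "girder_5": [([0.90], "10%")],
}
result = assess_damage([0.9, 0.1], [0.82], location_patterns, severity_patterns)
```

Unlike a back-propagation network, this classifier has no iterative training phase: the training patterns are simply stored, which is consistent with the abstract's remark about avoiding trial-and-error architecture selection.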

인공신경망을 이용한 삼차원 물체의 인식과 정확한 자세계산 (3D Object Recognition and Accurate Pose Calculation Using a Neural Network)

  • 박강
    • 대한기계학회논문집A / Vol. 23, No. 11 / pp. 1929-1939 / 1999
  • This paper presents a neural network approach, named PRONET, to 3D object recognition and pose calculation. 3D objects are represented using a set of centroidal profile patterns that describe the boundaries of the 2D views taken from evenly distributed viewpoints. PRONET consists of a training stage and an execution stage. In the training stage, a three-layer feed-forward neural network is trained with the centroidal profile patterns using the error back-propagation method. In the execution stage, the neural network matches the centroidal profile pattern of the given image with the best-fitting centroidal profile pattern, which yields the identity and approximate orientation of the real object, such as a workpiece in an arbitrary pose. In the matching procedure, line-to-line correspondence between image features and 3D CAD features is also obtained. An iterative model posing method then calculates a more exact pose of the object based on the initial orientation and the correspondence.

인공신경망 모형을 이용한 급속혼화공정에서 적정 응집제 주입농도 결정 및 응집처리후 탁도의 예측 (Prediction of Turbidity in Treated Water and the Estimation of the Optimum Feed Concentration of Coagulants in Rapid Mixing Process using an Artificial Neural Network Model)

  • 정동환;박규홍
    • 한국물환경학회지 / Vol. 21, No. 1 / pp. 21-28 / 2005
  • Training and prediction modeling with an artificial neural network was implemented to predict the turbidity of treated water and to estimate the optimum feed concentration of polyaluminium chloride (PACl) in a water treatment plant. The input-layer parameters were pH, temperature, turbidity and alkalinity, while the output-layer parameters were PACl and the turbidity of the treated water. The Levenberg-Marquardt method for feedforward back-propagation networks in the Neural Network Toolbox of MATLAB was used in this study. With four parameters at the input layer, 12 nodes in each of the first and second hidden layers, and two parameters (PACl and turbidity) at the output layer, the correlation coefficients between the training data and the measured data were 0.9997 for PACl and 0.6850 for turbidity, and those between the testing data and the measured data were 0.9140 for PACl and 0.3828 for turbidity. Although the predictability of PACl was improved compared with previous studies that used only the coagulant dose as the output, the turbidity of the treated water could not be predicted well. Acquiring more data over several years with an advanced on-line measuring system could make the artificial neural network useful and practical in actual water treatment plants.
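
The correlation coefficients reported in this abstract compare network outputs with measured values. The abstract does not say which coefficient was used; assuming the usual Pearson coefficient, the calculation can be sketched as follows (`pearson_r` is a hypothetical helper name, and the sample values are invented for illustration, not taken from the study).

```python
import math
from typing import Sequence


def pearson_r(predicted: Sequence[float], measured: Sequence[float]) -> float:
    """Pearson correlation between predicted and measured values."""
    n = len(predicted)
    if n != len(measured) or n < 2:
        raise ValueError("need two equally long sequences with at least 2 values")
    mean_p = sum(predicted) / n
    mean_m = sum(measured) / n
    # Sums of centered products and squares (the 1/n factors cancel out).
    cov = sum((p - mean_p) * (m - mean_m) for p, m in zip(predicted, measured))
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    var_m = sum((m - mean_m) ** 2 for m in measured)
    if var_p == 0.0 or var_m == 0.0:
        raise ValueError("correlation is undefined for constant data")
    return cov / math.sqrt(var_p * var_m)


# Invented example: predictions that track the measurements closely.
measured_dose = [10.0, 12.0, 15.0, 11.0, 18.0]
predicted_dose = [10.2, 11.9, 14.7, 11.3, 17.8]
r = pearson_r(predicted_dose, measured_dose)
```

A coefficient near 1 on training data but much lower on testing data, as reported for turbidity, indicates that the fitted relationship does not carry over to unseen samples.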

FAGON: Fake News Detection Model Using Grammatical Transformation on Deep Neural Network

  • Seo, Youngkyung;Han, Seong-Soo;Jeon, You-Boo;Jeong, Chang-Sung
    • KSII Transactions on Internet and Information Systems (TIIS) / Vol. 13, No. 10 / pp. 4958-4970 / 2019
  • As technology advances, the amount of fake news is steadily increasing for various reasons, such as political issues and exaggerated advertising. However, there has been very little research on fake news detection, especially research that uses grammatical transformation on a deep neural network. In this paper, we present a new fake news detection model called FAGON (Fake news detection model using Grammatical transformation On deep Neural network), which efficiently determines whether a proposition is true for a given article by learning grammatical transformation on a neural network. In particular, our model focuses on the Korean language. It consists of two modules: a sentence generator and a classifier. The former generates multiple sentences that have the same meaning as the proposition but different grammar, by training on grammatical transformation. The latter classifies the proposition as true or false by training with vectors generated from each sentence of the article and from the multiple sentences produced by the former module. We show that our model is designed to detect fake news effectively by exploiting various grammatical transformations and a proper classification structure.

확장칼만필터에 의하여 학습된 다층뉴럴네트워크를 이용한 헬리오스타트 태양추적오차의 모델링 (Modeling of Heliostat Sun Tracking Error Using Multilayered Neural Network Trained by the Extended Kalman Filter)

  • 이상은;박영칠
    • 제어로봇시스템학회논문지 / Vol. 16, No. 7 / pp. 711-719 / 2010
  • The heliostat, a concentrator that reflects incident solar energy to the receiver located on the tower, is the most important system in a tower-type solar thermal power plant, since it determines the efficiency and performance of the plant. Thus, good sun tracking ability as well as good optical properties are required. In this paper, we propose a method to compensate for the heliostat sun tracking error. We first model the sun tracking error, which can be measured using a BCS (Beam Characterization System), with a multilayered neural network. The extended Kalman filter is then employed to train the neural network. Finally, the model is used to compensate for the sun tracking errors. Simulation results show that the proposed method improves heliostat sun tracking performance dramatically. They also show that training the neural network with the extended Kalman filter provides faster convergence, more accurate estimation and better rejection of measurement noise than other training methods such as gradient descent.