• Title/Summary/Keyword: network recognition

검색결과 2,502건 처리시간 0.028초

Vehicle Image Recognition Using Deep Convolution Neural Network and Compressed Dictionary Learning

  • Zhou, Yanyan
    • Journal of Information Processing Systems
    • /
    • 제17권2호
    • /
    • pp.411-425
    • /
    • 2021
  • In this paper, a vehicle recognition algorithm based on deep convolutional neural network and compression dictionary is proposed. Firstly, the network structure of fine vehicle recognition based on convolutional neural network is introduced. Then, a vehicle recognition system based on multi-scale pyramid convolutional neural network is constructed. The contribution of different networks to the recognition results is adjusted by the adaptive fusion method that adjusts the network according to the recognition accuracy of a single network. The proportion of output in the network output of the entire multiscale network. Then, the compressed dictionary learning and the data dimension reduction are carried out using the effective block structure method combined with very sparse random projection matrix, which solves the computational complexity caused by high-dimensional features and shortens the dictionary learning time. Finally, the sparse representation classification method is used to realize vehicle type recognition. The experimental results show that the detection effect of the proposed algorithm is stable in sunny, cloudy and rainy weather, and it has strong adaptability to typical application scenarios such as occlusion and blurring, with an average recognition rate of more than 95%.

Human Motion Recognition Based on Spatio-temporal Convolutional Neural Network

  • Hu, Zeyuan;Park, Sange-yun;Lee, Eung-Joo
    • 한국멀티미디어학회논문지
    • /
    • 제23권8호
    • /
    • pp.977-985
    • /
    • 2020
  • Aiming at the problem of complex feature extraction and low accuracy in human action recognition, this paper proposed a network structure combining batch normalization algorithm with GoogLeNet network model. Applying Batch Normalization idea in the field of image classification to action recognition field, it improved the algorithm by normalizing the network input training sample by mini-batch. For convolutional network, RGB image was the spatial input, and stacked optical flows was the temporal input. Then, it fused the spatio-temporal networks to get the final action recognition result. It trained and evaluated the architecture on the standard video actions benchmarks of UCF101 and HMDB51, which achieved the accuracy of 93.42% and 67.82%. The results show that the improved convolutional neural network has a significant improvement in improving the recognition rate and has obvious advantages in action recognition.

시공간패턴인식 신경회로망의 설계 (Neural Network Design for Spatio-temporal Pattern Recognition)

  • 임정수;이종호
    • 대한전기학회논문지:전력기술부문A
    • /
    • 제48권11호
    • /
    • pp.1464-1471
    • /
    • 1999
  • This paper introduces complex-valued competitive learning neural network for spatio-temporal pattern recognition. There have been quite a few neural networks for spatio-temporal pattern recognition. Among them, recurrent neural network, TDNN, and avalanche model are acknowledged as standard neural network paradigms for spatio-temporal pattern recognition. Recurrent neural network has complicated learning rules and does not guarantee convergence to global minima. TDNN requires too many neurons, and can not be regarded to deal with spatio-temporal pattern basically. Grossberg's avalanche model is not able to distinguish long patterns, and has to be indicated which layer is to be used in learning. In order to remedy drawbacks of the above networks, unsupervised competitive learning using complex umber is proposed. Suggested neural network also features simultaneous recognition, time-shift invariant recognition, stable categorizing, and learning rate modulation. The network is evaluated by computer simulation with randomly generated patterns.

  • PDF

The Neural-Network Approach to Recognize Defect Pattern in LED Manufacturing

  • Chen, Wen-Chin;Tsai, Chih-Hung;Hsu, Shou-Wen
    • International Journal of Quality Innovation
    • /
    • 제7권3호
    • /
    • pp.58-69
    • /
    • 2006
  • This paper presents neural network-based recognition system for automatic light emitting diode (LED) inspection. The back-propagation neural network (BPNN) is proposed and tested. The current-voltage (I-V) characteristic data of LED from the inspection process is used for the network training and testing. This study selects 300 random samples as network training and employs 100 samples as network testing. The experimental results show that if the classification work is done well, the accuracy of recognition is 100%, and the testing speed of the proposed recognition system is almost one half faster than the traditional inspection system does. The proposed neural-network approach is successfully demonstrated by real data sets and can be effectively developed as a recognition system for a practical application purpose.

Recognition of English Calling Cards by Using Projection Method and Enhanced RBE Network

  • Kim, Kwang-Baek
    • 한국지능시스템학회논문지
    • /
    • 제13권4호
    • /
    • pp.474-479
    • /
    • 2003
  • In this paper, we proposed the novel method for the recognition of English calling cards by using the projection method and the enhanced RBF (Radial Basis Function) network. The recognition of calling cards consists of the extraction phase of character areas and the recognition phase of extracted characters. In the extraction phase, first of all, noises are removed from the images of calling cards, and the feature areas including character strings are separated from the calling card images by using the horizontal smearing method and the 8-directional contour tracking method. And using the image projection method, the feature areas are split into the areas of individual characters. We also proposed the enhanced RBF network that organizes the middle layer effectively by using the enhanced ART1 neural network adjusting the vigilance threshold dynamically according to the homogeneity between patterns. In the recognition phase, the proposed neural network is applied to recognize individual characters. Our experiment result showed that the proposed recognition algorithm has higher success rate of recognition and faster learning time than the existing neural network based recognition.

Mongolian Car Plate Recognition using Neural Network

  • Ragchaabazar, Bud;Kim, SooHyung;Na, In Seop
    • 스마트미디어저널
    • /
    • 제2권4호
    • /
    • pp.20-26
    • /
    • 2013
  • This paper presents an approach to Mongolian car plate recognition using artificial neural network. Our proposed method consists of two steps: detection and recognition. In detection step, we implement Flood fill algorithm. In recognition step we proceed to segment the plate for each Cyrillic character, and use an Artificial Neural Network (ANN) machine - learning algorithm to recognize the character. We have learned the theory of ANN and implemented it without using any library. A total of 150 vehicles images obtained from community entrance gates have been tested. The recognition algorithm shows an accuracy rate of 89.75%.

  • PDF

Human Activity Recognition Based on 3D Residual Dense Network

  • Park, Jin-Ho;Lee, Eung-Joo
    • 한국멀티미디어학회논문지
    • /
    • 제23권12호
    • /
    • pp.1540-1551
    • /
    • 2020
  • Aiming at the problem that the existing human behavior recognition algorithm cannot fully utilize the multi-level spatio-temporal information of the network, a human behavior recognition algorithm based on a dense three-dimensional residual network is proposed. First, the proposed algorithm uses a dense block of three-dimensional residuals as the basic module of the network. The module extracts the hierarchical features of human behavior through densely connected convolutional layers; Secondly, the local feature aggregation adaptive method is used to learn the local dense features of human behavior; Then, the residual connection module is applied to promote the flow of feature information and reduced the difficulty of training; Finally, the multi-layer local feature extraction of the network is realized by cascading multiple three-dimensional residual dense blocks, and use the global feature aggregation adaptive method to learn the features of all network layers to realize human behavior recognition. A large number of experimental results on benchmark datasets KTH show that the recognition rate (top-l accuracy) of the proposed algorithm reaches 93.52%. Compared with the three-dimensional convolutional neural network (C3D) algorithm, it has improved by 3.93 percentage points. The proposed algorithm framework has good robustness and transfer learning ability, and can effectively handle a variety of video behavior recognition tasks.

뉴로-퍼지 추론 시스템을 이용한 물체인식 (Object Recognition Using Neuro-Fuzzy Inference System)

  • 김형근;최갑석
    • 한국통신학회논문지
    • /
    • 제17권5호
    • /
    • pp.482-494
    • /
    • 1992
  • In this paper, the neuro-fuzzy inferene system for the effective object recognition is studied. The proposed neuro-fuzzy inference system combines learning capability of neural network with inference process of fuzzy theory, and the system executes the fuzzy inference by neural network automatically. The proposed system consists of the antecedence neural network, the consequent neural network, and the fuzzy operational part, For dissolving the ambiguity of recognition due to input variance in the neuro-fuzzy inference system, the antecedence’s fuzzy proposition of the inference rules are automatically produced by error back propagation learining rule. Therefore, when the fuzzy inference is made, the shape of membership functions os adaptively modified according to the variation. The antecedence neural netwerk constructs a separated MNN(Model Classification Neural Network)and LNN(Line segment Classification Neural Networks)for dissolving the degradation of recognition rate. The antecedence neural network can overcome the limitation of boundary decisoion characteristics of nrural network due to the similarity of extracted features. The increased recognition rate is gained by the consequent neural network which is designed to learn inference rules for the effective system output.

  • PDF

소프트컴퓨팅 기법을 이용한 다음절 단어의 음성인식 (Speech Recognition of Multi-Syllable Words Using Soft Computing Techniques)

  • 이종수;윤지원
    • 정보저장시스템학회논문집
    • /
    • 제6권1호
    • /
    • pp.18-24
    • /
    • 2010
  • The performance of the speech recognition mainly depends on uncertain factors such as speaker's conditions and environmental effects. The present study deals with the speech recognition of a number of multi-syllable isolated Korean words using soft computing techniques such as back-propagation neural network, fuzzy inference system, and fuzzy neural network. Feature patterns for the speech recognition are analyzed with 12th order thirty frames that are normalized by the linear predictive coding and Cepstrums. Using four models of speech recognizer, actual experiments for both single-speakers and multiple-speakers are conducted. Through this study, the recognizers of combined fuzzy logic and back-propagation neural network and fuzzy neural network show the better performance in identifying the speech recognition.

A Proposal of Shuffle Graph Convolutional Network for Skeleton-based Action Recognition

  • Jang, Sungjun;Bae, Han Byeol;Lee, HeanSung;Lee, Sangyoun
    • 한국정보전자통신기술학회논문지
    • /
    • 제14권4호
    • /
    • pp.314-322
    • /
    • 2021
  • Skeleton-based action recognition has attracted considerable attention in human action recognition. Recent methods for skeleton-based action recognition employ spatiotemporal graph convolutional networks (GCNs) and have remarkable performance. However, most of them have heavy computational complexity for robust action recognition. To solve this problem, we propose a shuffle graph convolutional network (SGCN) which is a lightweight graph convolutional network using pointwise group convolution rather than pointwise convolution to reduce computational cost. Our SGCN is composed of spatial and temporal GCN. The spatial shuffle GCN contains pointwise group convolution and part shuffle module which enhances local and global information between correlated joints. In addition, the temporal shuffle GCN contains depthwise convolution to maintain a large receptive field. Our model achieves comparable performance with lowest computational cost and exceeds the performance of baseline at 0.3% and 1.2% on NTU RGB+D and NTU RGB+D 120 datasets, respectively.