• 제목/요약/키워드: deep neural networks

검색결과 866건 처리시간 0.028초

Video Expression Recognition Method Based on Spatiotemporal Recurrent Neural Network and Feature Fusion

  • Zhou, Xuan
    • Journal of Information Processing Systems
    • /
    • 제17권2호
    • /
    • pp.337-351
    • /
    • 2021
  • Automatically recognizing facial expressions in video sequences is a challenging task because there is little direct correlation between facial features and subjective emotions in video. To overcome the problem, a video facial expression recognition method using spatiotemporal recurrent neural network and feature fusion is proposed. Firstly, the video is preprocessed. Then, the double-layer cascade structure is used to detect a face in a video image. In addition, two deep convolutional neural networks are used to extract the time-domain and airspace facial features in the video. The spatial convolutional neural network is used to extract the spatial information features from each frame of the static expression images in the video. The temporal convolutional neural network is used to extract the dynamic information features from the optical flow information from multiple frames of expression images in the video. A multiplication fusion is performed with the spatiotemporal features learned by the two deep convolutional neural networks. Finally, the fused features are input to the support vector machine to realize the facial expression classification task. The experimental results on cNTERFACE, RML, and AFEW6.0 datasets show that the recognition rates obtained by the proposed method are as high as 88.67%, 70.32%, and 63.84%, respectively. Comparative experiments show that the proposed method obtains higher recognition accuracy than other recently reported methods.

신경회로망을 이용한 연삭가공의 트러블 인식에 관한 연구(I) (A Study on the Monitoring System of the Grinding Troubles Utilizing Neural Networks(l))

  • 하만경;곽재섭;송지복;김건회;김희술
    • 한국정밀공학회지
    • /
    • 제13권9호
    • /
    • pp.149-155
    • /
    • 1996
  • Recent researches in the trouble monitoring system of grinding process have emphasized the use of deep knowledge. Such works include the monitoring and diagnostic systems for cylindrical grinding using sensors on chatter vibration and grinding burn during the process. But, since grinding operations are especially related with a lalrge amount of ambique parameters, it is effectively difficult to detect the grinding troubles occuring during the grinding process. In this paper, monitoring system for grinding utilizes the neural networks based on grinding power signatures. The monitoring system of grinding operations, which makes use of PDP neural networks, is presented. Then, the implementation results by computer simulations and experimental data with respect to chatter vibration and grinding burn are compared.

  • PDF

ResNet-Based Simulations for a Heat-Transfer Model Involving an Imperfect Contact

  • Guangxing, Wang;Gwanghyun, Jo;Seong-Yoon, Shin
    • Journal of information and communication convergence engineering
    • /
    • 제20권4호
    • /
    • pp.303-308
    • /
    • 2022
  • Simulating the heat transfer in a composite material is an important topic in material science. Difficulties arise from the fact that adjacent materials cannot match perfectly, resulting in discontinuity in the temperature variables. Although there have been several numerical methods for solving the heat-transfer problem in imperfect contact conditions, the methods known so far are complicated to implement, and the computational times are non-negligible. In this study, we developed a ResNet-type deep neural network for simulating a heat transfer model in a composite material. To train the neural network, we generated datasets by numerically solving the heat-transfer equations with Kapitza thermal resistance conditions. Because datasets involve various configurations of composite materials, our neural networks are robust to the shapes of material-material interfaces. Our algorithm can predict the thermal behavior in real time once the networks are trained. The performance of the proposed neural networks is documented, where the root mean square error (RMSE) and mean absolute error (MAE) are below 2.47E-6, and 7.00E-4, respectively.

딥러닝 기반 운동 자세 교정 시스템의 성능 (Performance of Exercise Posture Correction System Based on Deep Learning)

  • 황병선;김정호;이예람;경찬욱;선준호;선영규;김진영
    • 한국인터넷방송통신학회논문지
    • /
    • 제22권5호
    • /
    • pp.177-183
    • /
    • 2022
  • 최근 COVID-19로 인해 홈 트레이닝의 관심도가 증가하고 있다. 이에 따라 HAR(human activity recognition) 기술을 홈 트레이닝에 적용한 연구가 진행되고 있다. 기존 HAR 분야의 논문에서는 동적인 자세보다는 앉기, 일어서기와 같은 정적인 자세들을 분석한다. 본 논문은 동적인 운동 자세를 분석하여 사용자의 운동 자세 정확도를 보여주는 딥러닝 모델을 제안한다. AI hub의 피트니스 이미지를 blaze pose를 사용하여 사람의 자세 데이터를 분석한다. 3개의 딥러닝 모델: RNN(recurrnet neural networks), LSTM(long short-term memory networks), CNN(convolution neural networks)에 대하여 실험을 진행한다. RNN, LSTM, CNN 모델의 f1-score는 각각 0.49, 0.87, 0.98로 CNN 모델이 가장 적합하다는 것을 확인하였다. 이후 연구로는, 다양한 학습 데이터를 사용하여 더 많은 운동 자세를 분석할 예정이다.

깊은 신경망을 이용한 구조물의 유한요소모델 업데이팅 (Finite Element Model Updating of Structures Using Deep Neural Network)

  • 공밍;박원석
    • 대한토목학회논문집
    • /
    • 제39권1호
    • /
    • pp.147-154
    • /
    • 2019
  • 유한요소모델 업데이팅은 계측에 의한 구조물의 실제 응답과 가장 가까운 응답을 내는 유한요소모델의 매개변수를 찾는 문제로 정의할 수 있다. 기존 연구에서는 실 구조물과 해석 모델의 응답의 오차를 최소화하는 최적화에 기반 한 방법이 개발되었다. 이 연구에서는 목표 모드 정보로부터 유한요소 모델의 매개변수를 직접 얻을 수 있는 역 고유치 문제를 구성하고 역 고유치 문제를 빠르고 정확하게 풀기 위한 깊은 신경망(Deep Neural Network)을 구성하는 방법을 제안한다. 개발한 방법의 적용 예로서 현수교의 역 고유치 함수를 모사하는 신경망을 이용한 동적 유한요소모델 업데이트를 보인다. 해석 결과 제시한 방법은 매우 높은 정확도로 목표 모드에 대응하는 매개변수를 찾아낼 수 있음을 보였다.

Research on Forecasting Framework for System Marginal Price based on Deep Recurrent Neural Networks and Statistical Analysis Models

  • Kim, Taehyun;Lee, Yoonjae;Hwangbo, Soonho
    • 청정기술
    • /
    • 제28권2호
    • /
    • pp.138-146
    • /
    • 2022
  • Electricity has become a factor that dramatically affects the market economy. The day-ahead system marginal price determines electricity prices, and system marginal price forecasting is critical in maintaining energy management systems. There have been several studies using mathematics and machine learning models to forecast the system marginal price, but few studies have been conducted to develop, compare, and analyze various machine learning and deep learning models based on a data-driven framework. Therefore, in this study, different machine learning algorithms (i.e., autoregressive-based models such as the autoregressive integrated moving average model) and deep learning networks (i.e., recurrent neural network-based models such as the long short-term memory and gated recurrent unit model) are considered and integrated evaluation metrics including a forecasting test and information criteria are proposed to discern the optimal forecasting model. A case study of South Korea using long-term time-series system marginal price data from 2016 to 2021 was applied to the developed framework. The results of the study indicate that the autoregressive integrated moving average model (R-squared score: 0.97) and the gated recurrent unit model (R-squared score: 0.94) are appropriate for system marginal price forecasting. This study is expected to contribute significantly to energy management systems and the suggested framework can be explicitly applied for renewable energy networks.

Toward Practical Augmentation of Raman Spectra for Deep Learning Classification of Contamination in HDD

  • Seksan Laitrakun;Somrudee Deepaisarn;Sarun Gulyanon;Chayud Srisumarnk;Nattapol Chiewnawintawat;Angkoon Angkoonsawaengsuk;Pakorn Opaprakasit;Jirawan Jindakaew;Narisara Jaikaew
    • Journal of information and communication convergence engineering
    • /
    • 제21권3호
    • /
    • pp.208-215
    • /
    • 2023
  • Deep learning techniques provide powerful solutions to several pattern-recognition problems, including Raman spectral classification. However, these networks require large amounts of labeled data to perform well. Labeled data, which are typically obtained in a laboratory, can potentially be alleviated by data augmentation. This study investigated various data augmentation techniques and applied multiple deep learning methods to Raman spectral classification. Raman spectra yield fingerprint-like information about chemical compositions, but are prone to noise when the particles of the material are small. Five augmentation models were investigated to build robust deep learning classifiers: weighted sums of spectral signals, imitated chemical backgrounds, extended multiplicative signal augmentation, and generated Gaussian and Poisson-distributed noise. We compared the performance of nine state-of-the-art convolutional neural networks with all the augmentation techniques. The LeNet5 models with background noise augmentation yielded the highest accuracy when tested on real-world Raman spectral classification at 88.33% accuracy. A class activation map of the model was generated to provide a qualitative observation of the results.

Deep compression of convolutional neural networks with low-rank approximation

  • Astrid, Marcella;Lee, Seung-Ik
    • ETRI Journal
    • /
    • 제40권4호
    • /
    • pp.421-434
    • /
    • 2018
  • The application of deep neural networks (DNNs) to connect the world with cyber physical systems (CPSs) has attracted much attention. However, DNNs require a large amount of memory and computational cost, which hinders their use in the relatively low-end smart devices that are widely used in CPSs. In this paper, we aim to determine whether DNNs can be efficiently deployed and operated in low-end smart devices. To do this, we develop a method to reduce the memory requirement of DNNs and increase the inference speed, while maintaining the performance (for example, accuracy) close to the original level. The parameters of DNNs are decomposed using a hybrid of canonical polyadic-singular value decomposition, approximated using a tensor power method, and fine-tuned by performing iterative one-shot hybrid fine-tuning to recover from a decreased accuracy. In this study, we evaluate our method on frequently used networks. We also present results from extensive experiments on the effects of several fine-tuning methods, the importance of iterative fine-tuning, and decomposition techniques. We demonstrate the effectiveness of the proposed method by deploying compressed networks in smartphones.

심층 신경망 기반 딥 드로잉 공정 블랭크 두께 변화율 예측 (Prediction of Blank Thickness Variation in a Deep Drawing Process Using Deep Neural Network)

  • 박근태;박지우;곽민준;강범수
    • 소성∙가공
    • /
    • 제29권2호
    • /
    • pp.89-96
    • /
    • 2020
  • The finite element method has been widely applied in the sheet metal forming process. However, the finite element method is computationally expensive and time consuming. In order to tackle this problem, surrogate modeling methods have been proposed. An artificial neural network (ANN) is one such surrogate model and has been well studied over the past decades. However, when it comes to ANN with two or more layers, so called deep neural networks (DNN), there is distinct a lack of research. We chose to use DNNs our surrogate model to predict the behavior of sheet metal in the deep drawing process. Thickness variation is selected as an output of the DNN in order to evaluate workpiece feasibility. Input variables of the DNN are radius of die, die corner and blank holder force. Finite element analysis was conducted to obtain data for surrogate model construction and testing. Sampling points were determined by full factorial, latin hyper cube and monte carlo methods. We investigated the performance of the DNN according to its structure, number of nodes and number of layers, then it was compared with a radial basis function surrogate model using various sampling methods and numbers. The results show that our DNN could be used as an efficient surrogate model for the deep drawing process.

Empirical Comparison of Deep Learning Networks on Backbone Method of Human Pose Estimation

  • Rim, Beanbonyka;Kim, Junseob;Choi, Yoo-Joo;Hong, Min
    • 인터넷정보학회논문지
    • /
    • 제21권5호
    • /
    • pp.21-29
    • /
    • 2020
  • Accurate estimation of human pose relies on backbone method in which its role is to extract feature map. Up to dated, the method of backbone feature extraction is conducted by the plain convolutional neural networks named by CNN and the residual neural networks named by Resnet, both of which have various architectures and performances. The CNN family network such as VGG which is well-known as a multiple stacked hidden layers architecture of deep learning methods, is base and simple while Resnet which is a bottleneck layers architecture yields fewer parameters and outperform. They have achieved inspired results as a backbone network in human pose estimation. However, they were used then followed by different pose estimation networks named by pose parsing module. Therefore, in this paper, we present a comparison between the plain CNN family network (VGG) and bottleneck network (Resnet) as a backbone method in the same pose parsing module. We investigate their performances such as number of parameters, loss score, precision and recall. We experiment them in the bottom-up method of human pose estimation system by adapted the pose parsing module of openpose. Our experimental results show that the backbone method using VGG network outperforms the Resent network with fewer parameter, lower loss score and higher accuracy of precision and recall.