• Title/Summary/Keyword: 순환 신경망

Search Result 309, Processing Time 0.025 seconds

Peak Impact Force of Ship Bridge Collision Based on Neural Network Model (신경망 모델을 이용한 선박-교각 최대 충돌력 추정 연구)

  • Wang, Jian;Noh, Jackyou
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.28 no.1
    • /
    • pp.175-183
    • /
    • 2022
  • The collision between a ship and bridge across a waterway may result in extremely serious consequences that may endanger the safety of life and property. Therefore, factors affecting ship bridge collision must be investigated, and the impact force should be discussed based on various collision conditions. In this study, a finite element model of ship bridge collision is established, and the peak impact force of a ship bridge collision based on 50 operating conditions combined with three input parameters, i.e., ship loading condition, ship speed, and ship bridge collision angle, is calculated via numerical simulation. Using neural network models trained with the numerical simulation results, the prediction model of the peak impact force of ship bridge collision involving an extremely short calculation time on the order of milliseconds is established. The neural network models used in this study are the basic backpropagation neural network model and Elman neural network model, which can manage temporal information. The accuracy of the neural network models is verified using 10 test samples based on the operating conditions. Results of a verification test show that the Elman neural network model performs better than the backpropagation neural network model, with a mean relative error of 4.566% and relative errors of less than 5% in 8 among 10 test cases. The trained neural network can yield a reliable ship bridge collision force instantaneously only when the required parameters are specified and a nonlinear finite element solution process is not required. The proposed model can be used to predict whether a catastrophic collision will occur during ship navigation, and thus hence the safety of crew operating the ship.

A Comparison Study of RNN, CNN, and GAN Models in Sequential Recommendation (순차적 추천에서의 RNN, CNN 및 GAN 모델 비교 연구)

  • Yoon, Ji Hyung;Chung, Jaewon;Jang, Beakcheol
    • Journal of Internet Computing and Services
    • /
    • v.23 no.4
    • /
    • pp.21-33
    • /
    • 2022
  • Recently, the recommender system has been widely used in various fields such as movies, music, online shopping, and social media, and in the meantime, the recommender model has been developed from correlation analysis through the Apriori model, which can be said to be the first-generation model in the recommender system field. In 2005, many models have been proposed, including deep learning-based models, which are receiving a lot of attention within the recommender model. The recommender model can be classified into a collaborative filtering method, a content-based method, and a hybrid method that uses these two methods integrally. However, these basic methods are gradually losing their status as methodologies in the field as they fail to adapt to internal and external changing factors such as the rapidly changing user-item interaction and the development of big data. On the other hand, the importance of deep learning methodologies in recommender systems is increasing because of its advantages such as nonlinear transformation, representation learning, sequence modeling, and flexibility. In this paper, among deep learning methodologies, RNN, CNN, and GAN-based models suitable for sequential modeling that can accurately and flexibly analyze user-item interactions are classified, compared, and analyzed.

Shooting sound analysis using convolutional neural networks and long short-term memory (합성곱 신경망과 장단기 메모리를 이용한 사격음 분석 기법)

  • Kang, Se Hyeok;Cho, Ji Woong
    • The Journal of the Acoustical Society of Korea
    • /
    • v.41 no.3
    • /
    • pp.312-318
    • /
    • 2022
  • This paper proposes a model which classifies the type of guns and information about sound source location using deep neural network. The proposed classification model is composed of convolutional neural networks (CNN) and long short-term memory (LSTM). For training and test the model, we use the Gunshot Audio Forensic Dataset generated by the project supported by the National Institute of Justice (NIJ). The acoustic signals are transformed to Mel-Spectrogram and they are provided as learning and test data for the proposed model. The model is compared with the control model consisting of convolutional neural networks only. The proposed model shows high accuracy more than 90 %.

CRNN-Based Korean Phoneme Recognition Model with CTC Algorithm (CTC를 적용한 CRNN 기반 한국어 음소인식 모델 연구)

  • Hong, Yoonseok;Ki, Kyungseo;Gweon, Gahgene
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.8 no.3
    • /
    • pp.115-122
    • /
    • 2019
  • For Korean phoneme recognition, Hidden Markov-Gaussian Mixture model(HMM-GMM) or hybrid models which combine artificial neural network with HMM have been mainly used. However, current approach has limitations in that such models require force-aligned corpus training data that is manually annotated by experts. Recently, researchers used neural network based phoneme recognition model which combines recurrent neural network(RNN)-based structure with connectionist temporal classification(CTC) algorithm to overcome the problem of obtaining manually annotated training data. Yet, in terms of implementation, these RNN-based models have another difficulty in that the amount of data gets larger as the structure gets more sophisticated. This problem of large data size is particularly problematic in the Korean language, which lacks refined corpora. In this study, we introduce CTC algorithm that does not require force-alignment to create a Korean phoneme recognition model. Specifically, the phoneme recognition model is based on convolutional neural network(CNN) which requires relatively small amount of data and can be trained faster when compared to RNN based models. We present the results from two different experiments and a resulting best performing phoneme recognition model which distinguishes 49 Korean phonemes. The best performing phoneme recognition model combines CNN with 3hop Bidirectional LSTM with the final Phoneme Error Rate(PER) at 3.26. The PER is a considerable improvement compared to existing Korean phoneme recognition models that report PER ranging from 10 to 12.

Goal Oriented Dialogue System Based on Deep Recurrent Q Network (심층 순환 Q 네트워크 기반 목적 지향 대화 시스템)

  • Park, Geonwoo;Kim, Harksoo
    • Annual Conference on Human and Language Technology
    • /
    • 2018.10a
    • /
    • pp.147-150
    • /
    • 2018
  • 목적 지향 대화 시스템은 자연어 이해, 대화 관리자, 자연어 생성과 같은 세분화 모델들의 결합으로 이루어져있어 하위 모델에 대한 오류 전파에 취약하다. 이러한 문제점을 해결하기 위해 자연어 이해 모델과 대화 관리자를 하나의 네트워크로 구성하고 오류에 강건한 심층 Q 네트워크를 제안한다. 본 논문에서는 대화의 전체 흐름을 파악 할 수 있는 순환 신경망인 LSTM에 심층 Q 네트워크 적용한 심층 순환 Q 네트워크 기반 목적 지향 대화 시스템을 제안한다. 실험 결과, 제안한 심층 순환 Q 네트워크는 LSTM, 심층 Q 네트워크보다 각각 정밀도 1.0%p, 6.7%p 높은 성능을 보였다.

  • PDF

Recurrent Neural Network based Prediction System of Agricultural Photovoltaic Power Generation (영농형 태양광 발전소에서 순환신경망 기반 발전량 예측 시스템)

  • Jung, Seol-Ryung;Koh, Jin-Gwang;Lee, Sung-Keun
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.17 no.5
    • /
    • pp.825-832
    • /
    • 2022
  • In this paper, we discuss the design and implementation of predictive and diagnostic models for realizing intelligent predictive models by collecting and storing the power output of agricultural photovoltaic power generation systems. Our model predicts the amount of photovoltaic power generation using RNN, LSTM, and GRU models, which are recurrent neural network techniques specialized for time series data, and compares and analyzes each model with different hyperparameters, and evaluates the performance. As a result, the MSE and RMSE indicators of all three models were very close to 0, and the R2 indicator showed performance close to 1. Through this, it can be seen that the proposed prediction model is a suitable model for predicting the amount of photovoltaic power generation, and using this prediction, it was shown that it can be utilized as an intelligent and efficient O&M function in an agricultural photovoltaic system.

A Data-driven Classifier for Motion Detection of Soldiers on the Battlefield using Recurrent Architectures and Hyperparameter Optimization (순환 아키텍쳐 및 하이퍼파라미터 최적화를 이용한 데이터 기반 군사 동작 판별 알고리즘)

  • Joonho Kim;Geonju Chae;Jaemin Park;Kyeong-Won Park
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.1
    • /
    • pp.107-119
    • /
    • 2023
  • The technology that recognizes a soldier's motion and movement status has recently attracted large attention as a combination of wearable technology and artificial intelligence, which is expected to upend the paradigm of troop management. The accuracy of state determination should be maintained at a high-end level to make sure of the expected vital functions both in a training situation; an evaluation and solution provision for each individual's motion, and in a combat situation; overall enhancement in managing troops. However, when input data is given as a timer series or sequence, existing feedforward networks would show overt limitations in maximizing classification performance. Since human behavior data (3-axis accelerations and 3-axis angular velocities) handled for military motion recognition requires the process of analyzing its time-dependent characteristics, this study proposes a high-performance data-driven classifier which utilizes the long-short term memory to identify the order dependence of acquired data, learning to classify eight representative military operations (Sitting, Standing, Walking, Running, Ascending, Descending, Low Crawl, and High Crawl). Since the accuracy is highly dependent on a network's learning conditions and variables, manual adjustment may neither be cost-effective nor guarantee optimal results during learning. Therefore, in this study, we optimized hyperparameters using Bayesian optimization for maximized generalization performance. As a result, the final architecture could reduce the error rate by 62.56% compared to the existing network with a similar number of learnable parameters, with the final accuracy of 98.39% for various military operations.

Attention/LIME method to analyze decision process of RNN (Attention과 LIME기법을 활용한 순환신경망의 의사결정 요인 분석)

  • Yoon, Joo-Sung;Park, Jong-Cheol;Ha, Jong-Su;An, Jin-Hyeon;Kim, Hyeon-Cheol
    • Proceedings of The KACE
    • /
    • 2017.08a
    • /
    • pp.253-256
    • /
    • 2017
  • 딥러닝으로 만들어진 모델의 내부는 black box와 같은 특성을 가져 동작 규칙을 알기 어렵다. 최근 기계학습의 발전으로 인공지능이 전보다 더 복잡한 문제를 해결할 수 있으나 위와 같은 이유로, 모델이 내린 판단의 근거를 알기 어렵다. 그러므로 딥러닝의 동작 규칙을 사람이 이해할 수 있는 형식으로 나타내려는 노력이 필요하다. 본 연구에서는 Attention과 LIME 기법을 활용하여 IMDB 데이터를 감성 분석한 순환신경망의 의사결정 요인을 분석하였다. 각 기법을 활용했을 때의 장단점과 실제 구현에 있어 등장하는 문제에 대해 알아보고자 한다.

  • PDF

Emotion Recognition using Various Combinations of Audio Features and Textual Information (음성특징의 다양한 조합과 문장 정보를 이용한 감정인식)

  • Seo, Seunghyun;Lee, Bowon
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2019.11a
    • /
    • pp.137-139
    • /
    • 2019
  • 본 논문은 다양한 음성 특징과 텍스트를 이용한 멀티 모드 순환신경망 네트워크를 사용하여 음성을 통한 범주형(categorical) 분류 방법과 Arousal-Valence(AV) 도메인에서의 분류방법을 통해 감정인식 결과를 제시한다. 본 연구에서는 음성 특징으로는 MFCC, Energy, Velocity, Acceleration, Prosody 및 Mel Spectrogram 등의 다양한 특징들의 조합을 이용하였고 이에 해당하는 텍스트 정보를 순환신경망 기반 네트워크를 통해 융합하여 범주형 분류 방법과 과 AV 도메인에서의 분류 방법을 이용해 감정을 이산적으로 분류하였다. 실험 결과, 음성 특징의 조합으로 MFCC Energy, Velocity, Acceleration 각 13 차원과 35 차원의 Prosody 의 조합을 사용하였을 때 범주형 분류 방법에서는 75%로 다른 특징 조합들 보다 높은 결과를 보였고 AV 도메인 에서도 같은 음성 특징의 조합이 Arousal 55.3%, Valence 53.1%로 각각 가장 높은 결과를 보였다.

  • PDF