• 제목/요약/키워드: Deep Neural Network

검색결과 2,013건 처리시간 0.033초

Deep neural network 기반 오디오 표식을 위한 데이터 증강 방법 연구 (Study on data augmentation methods for deep neural network-based audio tagging)

  • 김범준;문현기;박성욱;박영철
    • 한국음향학회지
    • /
    • 제37권6호
    • /
    • pp.475-482
    • /
    • 2018
  • 본 논문에서는 DNN(Deep Neural Network) 기반 오디오 표식을 위한 데이터 증강 방법을 연구한다. 본 시스템에서는 오디오 신호를 멜-스펙트로그램으로 변환하여 오디오 표식을 위한 심층신경망의 입력으로 사용한다. 적은 수의 훈련 데이터를 사용하는 경우 발생하는 문제를 해결하기 위해, 타임 스트레칭, 피치 변화, 동적 영역 압축, 블록 혼합 등의 방법을 사용하여 훈련 데이터를 증강시켰다. 사용된 데이터 증강 기법의 최적 파라미터와 최적 조합을 오디오 표식 시뮬레이션을 통해 확인하였다.

가상현실 음향을 위한 심층신경망 기반 사운드 보간 기법 (A Sound Interpolation Method Using Deep Neural Network for Virtual Reality Sound)

  • 최재규;최승호
    • 방송공학회논문지
    • /
    • 제24권2호
    • /
    • pp.227-233
    • /
    • 2019
  • 본 논문은 가상현실 음향 구현을 위한 심층신경망 기반 사운드 보간 방법에 관한 것으로서, 이를 통해 두 지점에서 취득한 음향 신호들을 사용하여 두 지점 사이의 음향을 생성한다. 산술평균이나 기하평균 같은 통계적 방법으로 사운드 보간을 수행할 수 있지만 이는 실제 비선형 음향 특성을 반영하기에 미흡하다. 이러한 문제를 해결하기 위해서 본 연구에서는 두 지점과 목표 지점의 음향신호를 기반으로 심층신경망을 훈련하여 사운드 보간을 시도하였으며, 실험결과 통계적 방법에 비해 심층신경망 기반 사운드 보간 방법의 성능이 우수함을 보였다.

가중치 초기화 및 매개변수 갱신 방법에 따른 컨벌루션 신경망의 성능 비교 (Performance Comparison of Convolution Neural Network by Weight Initialization and Parameter Update Method1)

  • 박성욱;김도연
    • 한국멀티미디어학회논문지
    • /
    • 제21권4호
    • /
    • pp.441-449
    • /
    • 2018
  • Deep learning has been used for various processing centered on image recognition. One core algorithms of the deep learning, convolutional neural network is an deep neural network that specialized in image recognition. In this paper, we use a convolutional neural network to classify forest insects and propose an optimization method. Experiments were carried out by combining two weight initialization and six parameter update methods. As a result, the Xavier-SGD method showed the highest performance with an accuracy of 82.53% in the 12 different combinations of experiments. Through this, the latest learning algorithms, which complement the disadvantages of the previous parameter update method, we conclude that it can not lead to higher performance than existing methods in all application environments.

A Video Expression Recognition Method Based on Multi-mode Convolution Neural Network and Multiplicative Feature Fusion

  • Ren, Qun
    • Journal of Information Processing Systems
    • /
    • 제17권3호
    • /
    • pp.556-570
    • /
    • 2021
  • The existing video expression recognition methods mainly focus on the spatial feature extraction of video expression images, but tend to ignore the dynamic features of video sequences. To solve this problem, a multi-mode convolution neural network method is proposed to effectively improve the performance of facial expression recognition in video. Firstly, OpenFace 2.0 is used to detect face images in video, and two deep convolution neural networks are used to extract spatiotemporal expression features. Furthermore, spatial convolution neural network is used to extract the spatial information features of each static expression image, and the dynamic information feature is extracted from the optical flow information of multiple expression images based on temporal convolution neural network. Then, the spatiotemporal features learned by the two deep convolution neural networks are fused by multiplication. Finally, the fused features are input into support vector machine to realize the facial expression classification. Experimental results show that the recognition accuracy of the proposed method can reach 64.57% and 60.89%, respectively on RML and Baum-ls datasets. It is better than that of other contrast methods.

딥러닝 합성곱 신경망을 이용한 효율적인 홍채인식 (Efficient Iris Recognition using Deep-Learning Convolution Neural Network)

  • 최광미;정유정
    • 한국전자통신학회논문지
    • /
    • 제15권3호
    • /
    • pp.521-526
    • /
    • 2020
  • 본 논문은 홍채영상의 이동불변의 특징값 을추출에 탁월한 고차 국소 자동 상관함수를 적용하여 25개의 특징 값을 입력 값으로 적용한 일반적인 HOLP 신경망에 특징 값 25개의 평균값을 추가한 개선된 HOLP 신경망을 구현하여 인식률을 확인하여 보았다. 종류가 상이한 딥러닝 구조들과 비교하였을 때 음성과 영상분야에서 탁월한 성능을 보이는 Back-Propagation 신경망과 특징 추출기와 분류기를 통합한 합성 곱 신경망을 활용하여 홍채인식의 인식률을 비교하여 보았다.

Improvement of the Convergence Rate of Deep Learning by Using Scaling Method

  • Ho, Jiacang;Kang, Dae-Ki
    • International journal of advanced smart convergence
    • /
    • 제6권4호
    • /
    • pp.67-72
    • /
    • 2017
  • Deep learning neural network becomes very popular nowadays due to the reason that it can learn a very complex dataset such as the image dataset. Although deep learning neural network can produce high accuracy on the image dataset, it needs a lot of time to reach the convergence stage. To solve the issue, we have proposed a scaling method to improve the neural network to achieve the convergence stage in a shorter time than the original method. From the result, we can observe that our algorithm has higher performance than the other previous work.

Deep Neural Network 언어모델을 위한 Continuous Word Vector 기반의 입력 차원 감소 (Input Dimension Reduction based on Continuous Word Vector for Deep Neural Network Language Model)

  • 김광호;이동현;임민규;김지환
    • 말소리와 음성과학
    • /
    • 제7권4호
    • /
    • pp.3-8
    • /
    • 2015
  • In this paper, we investigate an input dimension reduction method using continuous word vector in deep neural network language model. In the proposed method, continuous word vectors were generated by using Google's Word2Vec from a large training corpus to satisfy distributional hypothesis. 1-of-${\left|V\right|}$ coding discrete word vectors were replaced with their corresponding continuous word vectors. In our implementation, the input dimension was successfully reduced from 20,000 to 600 when a tri-gram language model is used with a vocabulary of 20,000 words. The total amount of time in training was reduced from 30 days to 14 days for Wall Street Journal training corpus (corpus length: 37M words).

Application of artificial neural networks (ANNs) and linear regressions (LR) to predict the deflection of concrete deep beams

  • Mohammadhassani, Mohammad;Nezamabadi-pour, Hossein;Jumaat, Mohd Zamin;Jameel, Mohammed;Arumugam, Arul M.S.
    • Computers and Concrete
    • /
    • 제11권3호
    • /
    • pp.237-252
    • /
    • 2013
  • This paper presents the application of artificial neural network (ANN) to predict deep beam deflection using experimental data from eight high-strength-self-compacting-concrete (HSSCC) deep beams. The optimized network architecture was ten input parameters, two hidden layers, and one output. The feed forward back propagation neural network of ten and four neurons in first and second hidden layers using TRAINLM training function predicted highly accurate and more precise load-deflection diagrams compared to classical linear regression (LR). The ANN's MSE values are 40 times smaller than the LR's. The test data R value from ANN is 0.9931; thus indicating a high confidence level.

Scene-based Nonuniformity Correction by Deep Neural Network with Image Roughness-like and Spatial Noise Cost Functions

  • Hong, Yong-hee;Song, Nam-Hun;Kim, Dae-Hyeon;Jun, Chan-Won;Jhee, Ho-Jin
    • 한국컴퓨터정보학회논문지
    • /
    • 제24권6호
    • /
    • pp.11-19
    • /
    • 2019
  • In this paper, a new Scene-based Nonuniformity Correction (SBNUC) method is proposed by applying Image Roughness-like and Spatial Noise cost functions on deep neural network structure. The classic approaches for nonuniformity correction require generally plenty of sequential image data sets to acquire accurate image correction offset coefficients. The proposed method, however, is able to estimate offset from only a couple of images powered by the characteristic of deep neural network scheme. The real world SWIR image set is applied to verify the performance of proposed method and the result shows that image quality improvement of PSNR 70.3dB (maximum) is achieved. This is about 8.0dB more than the improved IRLMS algorithm which preliminarily requires precise image registration process on consecutive image frames.

심층신경망을 이용한 스마트 양식장용 어류 크기 자동 측정 시스템 (Automatic Fish Size Measurement System for Smart Fish Farm Using a Deep Neural Network)

  • 이윤호;전주현;주문갑
    • 대한임베디드공학회논문지
    • /
    • 제17권3호
    • /
    • pp.177-183
    • /
    • 2022
  • To measure the size and weight of the fish, we developed an automatic fish size measurement system using a deep neural network, where the YOLO (You Only Look Once)v3 model was used. To detect fish, an IP camera with infrared function was installed over the fish pool to acquire image data and used as input data for the deep neural network. Using the bounding box information generated as a result of detecting the fish and the structure for which the actual length is known, the size of the fish can be obtained. A GUI (Graphical User Interface) program was implemented using LabVIEW and RTSP (Real-Time Streaming protocol). The automatic fish size measurement system shows the results and stores them in a database for future work.