Search | Korea Science

Audio Event Detection Using Deep Neural Networks (깊은 신경망을 이용한 오디오 이벤트 검출)

Lim, Minkyu;Lee, Donghyun;Park, Hosung;Kim, Ji-Hwan
- Journal of Digital Contents Society
- /
- v.18 no.1
- /
- pp.183-190
- /
- 2017
This paper proposes an audio event detection method using Deep Neural Networks (DNN). The proposed method applies Feed Forward Neural Network (FFNN) to generate output probabilities of twenty audio events for each frame. Mel scale filter bank (FBANK) features are extracted from each frame, and its five consecutive frames are combined as one vector which is the input feature of the FFNN. The output layer of FFNN produces audio event probabilities for each input feature vector. More than five consecutive frames of which event probability exceeds threshold are detected as an audio event. An audio event continues until the event is detected within one second. The proposed method achieves as 71.8% accuracy for 20 classes of the UrbanSound8K and the BBC Sound FX dataset.
https://doi.org/10.9728/dcs.2017.18.1.183 인용 PDF KSCI

Human Walking Detection and Background Noise Classification by Deep Neural Networks for Doppler Radars (사람 걸음 탐지 및 배경잡음 분류 처리를 위한 도플러 레이다용 딥뉴럴네트워크)

Kwon, Jihoon;Ha, Seoung-Jae;Kwak, Nojun
- The Journal of Korean Institute of Electromagnetic Engineering and Science
- /
- v.29 no.7
- /
- pp.550-559
- /
- 2018
The effectiveness of deep neural networks (DNNs) for detection and classification of micro-Doppler signals generated by human walking and background noise sources is investigated. Previous research included a complex process for extracting meaningful features that directly affect classifier performance, and this feature extraction is based on experiences and statistical analysis. However, because a DNN gradually reconstructs and generates features through a process of passing layers in a network, the preprocess for feature extraction is not required. Therefore, binary classifiers and multiclass classifiers were designed and analyzed in which multilayer perceptrons (MLPs) and DNNs were applied, and the effectiveness of DNNs for recognizing micro-Doppler signals was demonstrated. Experimental results showed that, in the case of MLPs, the classification accuracies of the binary classifier and the multiclass classifier were 90.3% and 86.1%, respectively, for the test dataset. In the case of DNNs, the classification accuracies of the binary classifier and the multiclass classifier were 97.3% and 96.1%, respectively, for the test dataset.
https://doi.org/10.5515/KJKIEES.2018.29.7.550 인용 PDF KSCI

Design and Implementation of Intelligent Power Strip Using Deep Neural Network (DNN 신경망을 이용한 지능형 멀티탭 설계 및 구현)

Jong-Chan Lee
- Proceedings of the Korea Information Processing Society Conference
- /
- 2023.11a
- /
- pp.774-775
- /
- 2023
최근 인공지능 기술의 발달로 인하여 AI를 활용한 가정에서 이용할 수 있는 다양한 지능형 IoT 제품들이 시중에 출시되고 있다. 대표적으로 가정에서 사용하는 멀티탭 등 여러 가지 상품들이 있다. 본 논문에서는 전류 센서와 전압 센서값을 이용하여 가전제품을 예측하고 이를 시각화하여 전기 절약에 도움을 줄 수 있는 지능형 멀티탭을 제안한다.
https://doi.org/10.3745/PKIPS.y2023m11a.774 인용 PDF

Prediction of Barge Ship Roll Response Amplitude Operator Using Machine Learning Techniques

Lim, Jae Hwan;Jo, Hyo Jae
- Journal of Ocean Engineering and Technology
- /
- v.34 no.3
- /
- pp.167-179
- /
- 2020
Recently, the increasing importance of artificial intelligence (AI) technology has led to its increased use in various fields in the shipbuilding and marine industries. For example, typical scenarios for AI include production management, analyses of ships on a voyage, and motion prediction. Therefore, this study was conducted to predict a response amplitude operator (RAO) through AI technology. It used a neural network based on one of the types of AI methods. The data used in the neural network consisted of the properties of the vessel and RAO values, based on simulating the in-house code. The learning model consisted of an input layer, hidden layer, and output layer. The input layer comprised eight neurons, the hidden layer comprised the variables, and the output layer comprised 20 neurons. The RAO predicted with the neural network and an RAO created with the in-house code were compared. The accuracy was assessed and reviewed based on the root mean square error (RMSE), standard deviation (SD), random number change, correlation coefficient, and scatter plot. Finally, the optimal model was selected, and the conclusion was drawn. The ultimate goals of this study were to reduce the difficulty in the modeling work required to obtain the RAO, to reduce the difficulty in using commercial tools, and to enable an assessment of the stability of medium/small vessels in waves.
https://doi.org/10.26748/KSOE.2019.107 인용 PDF KSCI

Retrieval of Land Surface Temperature Using Landsat 8 Images with Deep Neural Networks (Landsat 8 영상을 이용한 심층신경망 기반의 지표면온도 산출)

Kim, Seoyeon;Lee, Soo-Jin;Lee, Yang-Won
- Korean Journal of Remote Sensing
- /
- v.36 no.3
- /
- pp.487-501
- /
- 2020
As a viable option for retrieval of LST (Land Surface Temperature), this paper presents a DNN (Deep Neural Network) based approach using 148 Landsat 8 images for South Korea. Because the brightness temperature and emissivity for the band 10 (approx. 11-㎛ wavelength) of Landsat 8 are derived by combining physics-based equations and empirical coefficients, they include uncertainties according to regional conditions such as meteorology, climate, topography, and vegetation. To overcome this, we used several land surface variables such as NDVI (Normalized Difference Vegetation Index), land cover types, topographic factors (elevation, slope, aspect, and ruggedness) as well as the T₀ calculated from the brightness temperature and emissivity. We optimized four seasonal DNN models using the input variables and in-situ observations from ASOS (Automated Synoptic Observing System) to retrieve the LST, which is an advanced approach when compared with the existing method of the bias correction using a linear equation. The validation statistics from the 1,728 matchups during 2013-2019 showed a good performance of the CC=0.910~0.917 and RMSE=3.245~3.365℃, especially for spring and fall. Also, our DNN models produced a stable LST for all types of land cover. A future work using big data from Landsat 5/7/8 with additional land surface variables will be necessary for a more reliable retrieval of LST for high-resolution satellite images.
https://doi.org/10.7780/kjrs.2020.36.3.8 인용 PDF KSCI HTML

Optimization of Action Recognition based on Slowfast Deep Learning Model using RGB Video Data (RGB 비디오 데이터를 이용한 Slowfast 모델 기반 이상 행동 인식 최적화)

Jeong, Jae-Hyeok;Kim, Min-Suk
- Journal of Korea Multimedia Society
- /
- v.25 no.8
- /
- pp.1049-1058
- /
- 2022
HAR(Human Action Recognition) such as anomaly and object detection has become a trend in research field(s) that focus on utilizing Artificial Intelligence (AI) methods to analyze patterns of human action in crime-ridden area(s), media services, and industrial facilities. Especially, in real-time system(s) using video streaming data, HAR has become a more important AI-based research field in application development and many different research fields using HAR have currently been developed and improved. In this paper, we propose and analyze a deep-learning-based HAR that provides more efficient scheme(s) using an intelligent AI models, such system can be applied to media services using RGB video streaming data usage without feature extraction pre-processing. For the method, we adopt Slowfast based on the Deep Neural Network(DNN) model under an open dataset(HMDB-51 or UCF101) for improvement in prediction accuracy.
https://doi.org/10.9717/kmms.2022.25.8.1049 인용 PDF KSCI

DG-based SPO tuple recognition using self-attention M-Bi-LSTM

Jung, Joon-young
- ETRI Journal
- /
- v.44 no.3
- /
- pp.438-449
- /
- 2022
This study proposes a dependency grammar-based self-attention multilayered bidirectional long short-term memory (DG-M-Bi-LSTM) model for subject-predicate-object (SPO) tuple recognition from natural language (NL) sentences. To add recent knowledge to the knowledge base autonomously, it is essential to extract knowledge from numerous NL data. Therefore, this study proposes a high-accuracy SPO tuple recognition model that requires a small amount of learning data to extract knowledge from NL sentences. The accuracy of SPO tuple recognition using DG-M-Bi-LSTM is compared with that using NL-based self-attention multilayered bidirectional LSTM, DG-based bidirectional encoder representations from transformers (BERT), and NL-based BERT to evaluate its effectiveness. The DG-M-Bi-LSTM model achieves the best results in terms of recognition accuracy for extracting SPO tuples from NL sentences even if it has fewer deep neural network (DNN) parameters than BERT. In particular, its accuracy is better than that of BERT when the learning data are limited. Additionally, its pretrained DNN parameters can be applied to other domains because it learns the structural relations in NL sentences.
https://doi.org/10.4218/etrij.2020-0460 인용 PDF KSCI

Comparison on of Activation Functions for Shrinkage Prediction Model using DNN (DNN을 활용한 콘크리트 건조수축 예측 모델의 활성화 함수 비교분석)

Han, Jun-Hui;Kim, Su-Hoo;Han, Soo-Hwan;Beak, Sung-Jin;Kim, Jong;Han, Min-Cheol
- Proceedings of the Korean Institute of Building Construction Conference
- /
- 2022.11a
- /
- pp.121-122
- /
- 2022
In this study, compared and analyzed various Activation Functions to present a methodology for developing a natural intelligence-based prediction system. As a result of the analysis, ELU was the best with RMSE: 62.87, R²: 0.96, and the error rate was 4%. However, it is considered desirable to construct a prediction system by combining each algorithm model for optimization.
PDF

Comparison on of Minimization of Loos function for strength Prediction Model using DNN (DNN을 활용한 강도예측모델의 손실함수 최소화 기법 비교분석)

Han, Jun-Hui;Kim, Su-Hoo;Beak, Sung-Jin;Han, Soo-Hwan;Kim, Jong;Han, Min-Cheol
- Proceedings of the Korean Institute of Building Construction Conference
- /
- 2022.04a
- /
- pp.182-183
- /
- 2022
In this study, compared and analyzed various loss function minimization techniques to present a methodology for developing a natural intelligence-based prediction system. As a result of the analysis, He Initialization was the best with RMSE: 3.78, R2: 0.94, and the error rate was 6%. However, it is considered desirable to construct a prediction system by combining each technique for optimization.
PDF

Deep Neural Network compression based on clustering of per layer in frequency domain (주파수 영역에서의 군집화 기반 계층별 딥 뉴럴 네트워크 압축)

Hong, Minsoo;Kim, Sungjei;Jeong, Jinwoo
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2020.11a
- /
- pp.64-67
- /
- 2020
최근 다양한 분야에서 딥 러닝 기반의 많은 연구가 진행되고 있으며 이에 따라 딥 러닝 모델의 경량화를 통해 제한된 메모리를 가진 하드웨어에 올릴 수 있는 경량화 된 딥 뉴럴 네트워크(DNN)를 개발하는 연구도 활발해졌다. 이에 본 논문은 주파수 영역에서의 군집화 기반 계층별 딥 뉴럴 네트워크 압축을 제안한다. 이산 코사인 변환, 양자화, 군집화, 적응적 엔트로피 코딩 과정을 각 모델의 계층에 순차적으로 적용하여 DNN이 차지하는 메모리를 줄인다. 제안한 알고리즘을 통해 VGG16을 손실률은 1% 미만의 손실에서 전체 가중치를 3.98%까지 압축, 약 25배가량 경량화 할 수 있었다.
PDF

Search Result 261, Processing Time 0.027 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)