Search | Korea Science

Sound Event Detection based on Deep Neural Networks (딥 뉴럴네트워크 기반의 소리 이벤트 검출)

Chung, Suk-Hwan;Chung, Yong-Joo
- The Journal of the Korea institute of electronic communication sciences
- /
- v.14 no.2
- /
- pp.389-396
- /
- 2019
In this paper, various architectures of deep neural networks were applied for sound event detection and their performances were compared using a common audio database. The FNN, CNN, RNN and CRNN were implemented using hyper-parameters optimized for the database as well as the architecture of each neural network. Among the implemented deep neural networks, CRNN performed best at all testing conditions and CNN followed CRNN in performance. Although RNN has a merit in tracking the time-correlations in audio signals, it showed poor performance compared with CNN and CRNN.
https://doi.org/10.13067/JKIECS.2019.14.2.389 인용 PDF KSCI HTML

Deep Neural Network compression based on clustering of per layer in frequency domain (주파수 영역에서의 군집화 기반 계층별 딥 뉴럴 네트워크 압축)

Hong, Minsoo;Kim, Sungjei;Jeong, Jinwoo
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2020.11a
- /
- pp.64-67
- /
- 2020
최근 다양한 분야에서 딥 러닝 기반의 많은 연구가 진행되고 있으며 이에 따라 딥 러닝 모델의 경량화를 통해 제한된 메모리를 가진 하드웨어에 올릴 수 있는 경량화 된 딥 뉴럴 네트워크(DNN)를 개발하는 연구도 활발해졌다. 이에 본 논문은 주파수 영역에서의 군집화 기반 계층별 딥 뉴럴 네트워크 압축을 제안한다. 이산 코사인 변환, 양자화, 군집화, 적응적 엔트로피 코딩 과정을 각 모델의 계층에 순차적으로 적용하여 DNN이 차지하는 메모리를 줄인다. 제안한 알고리즘을 통해 VGG16을 손실률은 1% 미만의 손실에서 전체 가중치를 3.98%까지 압축, 약 25배가량 경량화 할 수 있었다.
PDF

A Survey on Deep Neural Networks for 3D Reconstruction from a 2D Image (단일 이미지 기반 3D 모델 생성을 위한 딥-뉴럴 네트워크 분류 및 성능비교)

Kim, MinGeyung;Choi, Yoo-Joo
- Proceedings of the Korea Information Processing Society Conference
- /
- 2022.05a
- /
- pp.715-718
- /
- 2022
단일 이미지로부터 3D 모델을 생성하는 방법은 메타버스와 가상현실 콘텐츠에 대한 필요성이 높아짐에 따라, 보다 효율적인 모델 생성방법으로서 관심이 높아지고 있다. 본 논문에서는 단일 이미지로부터 3D 모델을 자동 생성하는 기존 딥-뉴럴 네트워크들을 대상으로, 생성되는 3D 모델의 유형에 따라 기존 네트워크들을 분류하고, 주요 딥-뉴럴 네트워크의 형태와 특징, 그리고 모델 생성의 성능을 분석하고자 한다.
https://doi.org/10.3745/PKIPS.y2022m05a.715 인용 PDF

Performance Improvement of Object Recognition System in Broadcast Media Using Hierarchical CNN (계층적 CNN을 이용한 방송 매체 내의 객체 인식 시스템 성능향상 방안)

Kwon, Myung-Kyu;Yang, Hyo-Sik
- Journal of Digital Convergence
- /
- v.15 no.3
- /
- pp.201-209
- /
- 2017
This paper is a smartphone object recognition system using hierarchical convolutional neural network. The overall configuration is a method of communicating object information to the smartphone by matching the collected data by connecting the smartphone and the server and recognizing the object to the convergence neural network in the server. It is also compared to a hierarchical convolutional neural network and a fractional convolutional neural network. Hierarchical convolutional neural networks have 88% accuracy, fractional convolutional neural networks have 73% accuracy and 15%p performance improvement. Based on this, it shows possibility of expansion of T-Commerce market connected with smartphone and broadcasting media.
https://doi.org/10.14400/JDC.2017.15.3.201 인용 PDF KSCI

Analysis and Study for Appropriate Deep Neural Network Structures and Self-Supervised Learning-based Brain Signal Data Representation Methods (딥 뉴럴 네트워크의 적절한 구조 및 자가-지도 학습 방법에 따른 뇌신호 데이터 표현 기술 분석 및 고찰)

Won-Jun Ko
- The Journal of the Korea institute of electronic communication sciences
- /
- v.19 no.1
- /
- pp.137-142
- /
- 2024
Recently, deep learning technology has become those methods as de facto standards in the area of medical data representation. But, deep learning inherently requires a large amount of training data, which poses a challenge for its direct application in the medical field where acquiring large-scale data is not straightforward. Additionally, brain signal modalities also suffer from these problems owing to the high variability. Research has focused on designing deep neural network structures capable of effectively extracting spectro-spatio-temporal characteristics of brain signals, or employing self-supervised learning methods to pre-learn the neurophysiological features of brain signals. This paper analyzes methodologies used to handle small-scale data in emerging fields such as brain-computer interfaces and brain signal-based state prediction, presenting future directions for these technologies. At first, this paper examines deep neural network structures for representing brain signals, then analyzes self-supervised learning methodologies aimed at efficiently learning the characteristics of brain signals. Finally, the paper discusses key insights and future directions for deep learning-based brain signal analysis.
https://doi.org/10.13067/JKIECS.2024.19.1.137 인용 PDF

Using CNN-LSTM for Effective Application of Dialogue Context to Emotion Classification (CNN-LSTM을 이용한 대화 문맥 반영과 감정 분류)

Shin, Dong-Won;Lee, Yeon-Soo;Jang, Jung-Sun;Rim, Hae-Chang
- 한국어정보학회:학술대회논문집
- /
- 2016.10a
- /
- pp.141-146
- /
- 2016
대화 시스템에서 사용자가 나타내는 발화에 내재된 감정을 분류하는 것은, 시스템이 적절한 응답과 서비스를 제공하는데 있어 매우 중요하다. 본 연구에서는 대화 내 감정 분류를 하는데 있어 직접적, 간접적으로 드러나는 감정 자질을 자동으로 학습하고 감정이 지속되는 대화 문맥을 효과적으로 반영하기 위해 CNN-LSTM 방식의 딥 뉴럴 네트워크 구조를 제안한다. 그리고 대량의 구어체 코퍼스를 이용한 사전 학습으로 데이터 부족 문제를 완화하였다. 실험 결과 제안하는 방법이 기존의 SVM이나, 단순한 RNN, CNN 네트워크 구조에 비해 전반전인 성능 향상을 보였고, 특히 감정이 있는 경우 더 잘 분류하는 것을 확인할 수 있었다.
PDF

Using CNN-LSTM for Effective Application of Dialogue Context to Emotion Classification (CNN-LSTM을 이용한 대화 문맥 반영과 감정 분류)

Shin, Dong-Won;Lee, Yeon-Soo;Jang, Jung-Sun;Rim, Hae-Chang
- Annual Conference on Human and Language Technology
- /
- 2016.10a
- /
- pp.141-146
- /
- 2016
대화 시스템에서 사용자가 나타내는 발화에 내재된 감정을 분류하는 것은, 시스템이 적절한 응답과 서비스를 제공하는데 있어 매우 중요하다. 본 연구에서는 대화 내 감정 분류를 하는데 있어 직접적, 간접적으로 드러나는 감정 자질을 자동으로 학습하고 감정이 지속되는 대화 문맥을 효과적으로 반영하기 위해 CNN-LSTM 방식의 딥 뉴럴 네트워크 구조를 제안한다. 그리고 대량의 구어체 코퍼스를 이용한 사전 학습으로 데이터 부족 문제를 완화하였다. 실험 결과 제안하는 방법이 기존의 SVM이나, 단순한 RNN, CNN 네트워크 구조에 비해 전반전인 성능 향상을 보였고, 특히 감정이 있는 경우 더 잘 분류하는 것을 확인할 수 있었다.
PDF

A Research Trend Study on Bio-Signal Processing using Attention Mechanism (어텐션 메카니즘을 이용한 생체신호처리 연구 동향 분석)

Yeong-Hyeon Byeon;Keun-Chang Kwak
- Proceedings of the Korea Information Processing Society Conference
- /
- 2023.05a
- /
- pp.630-632
- /
- 2023
어텐션 메커니즘은 딥 뉴럴네트워크에 결합하여 언어 생성 모델에서 성능을 개선하였고, 이러한 성공은 다양한 신호처리 분야에 응용 및 확장되고 있다. 특정 입력 신호 부분에 선택적으로 집중함으로써, 어텐션 모델은 음성 인식, 이미지와 비디오 처리, 그리고 생체인식 등의 분야에서 더 높은 성능을 보여주고 있다. 어텐션 기반 모델은 심전도 신호를 이용한 개인식별 및 부정맥검출, 뇌파도 신호를 이용한 발작유형분류 및 수면 단계 분류, 근전도 신호를 이용한 제스처 인식 등에 사용되고 있다. 어텐션 메커니즘은 딥 뉴럴네트워크의 해석 가능성과 설명 가능성을 향상시키기 위해 사용되기도 한다. 신호 처리 분야에서의 어텐션 모델 연구는 지속적으로 진행 중이며, 다른 분야에서의 잠재력 탐구에 대한 관심이 높아지고 있다. 따라서 본 논문은 어텐션 메카니즘을 이용한 생체신호처리 연구 동향 분석을 수행한다.
https://doi.org/10.3745/PKIPS.y2023m05a.630 인용 PDF

The Impact of Various Degrees of Composite Minimax ApproximatePolynomials on Convolutional Neural Networks over Fully HomomorphicEncryption (다양한 차수의 합성 미니맥스 근사 다항식이 완전 동형 암호 상에서의 컨볼루션 신경망 네트워크에 미치는 영향)

Junghyun Lee;Jong-Seon No
- Journal of the Korea Institute of Information Security & Cryptology
- /
- v.33 no.6
- /
- pp.861-868
- /
- 2023
One of the key technologies in providing data analysis in the deep learning while maintaining security is fully homomorphic encryption. Due to constraints in operations on fully homomorphically encrypted data, non-arithmetic functions used in deep learning must be approximated by polynomials. Until now, the degrees of approximation polynomials with composite minimax polynomials have been uniformly set across layers, which poses challenges for effective network designs on fully homomorphic encryption. This study theoretically proves that setting different degrees of approximation polynomials constructed by composite minimax polynomial in each layer does not pose any issues in the inference on convolutional neural networks.
https://doi.org/10.13089/JKIISC.2023.33.6.861 인용 PDF HTML

Deep learning-based Automatic Weed Detection on Onion Field (딥러닝을 이용한 양파 밭의 잡초 검출 연구)

Kim, Seo jeong;Lee, Jae Su;Kim, Hyong Suk
- Smart Media Journal
- /
- v.7 no.3
- /
- pp.16-21
- /
- 2018
This paper presents the design and implementation of a deep learning-based automated weed detector on onion fields. The system is based on a Convolutional Neural Network that specifically selects proposed regions. The detector initiates training with a dataset taken from agricultural onion fields, after which candidate regions with very high probability of suspicion are considered weeds. Non-maximum suppression helps preserving the less overlapped bounding boxes. The dataset collected from different onion farms is evaluated with the proposed classifier. Classification accuracy is about 99% for the dataset, indicating the proposed method's superior performance with regard to weed detection on the onion fields.
https://doi.org/10.30693/SMJ.2018.7.3.16 인용 PDF KSCI

Search Result 62, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)