• Title/Summary/Keyword: Algorithm Model


Dual CNN Structured Sound Event Detection Algorithm Based on Real Life Acoustic Dataset (실생활 음향 데이터 기반 이중 CNN 구조를 특징으로 하는 음향 이벤트 인식 알고리즘)

  • Suh, Sangwon;Lim, Wootaek;Jeong, Youngho;Lee, Taejin;Kim, Hui Yong
    • Journal of Broadcast Engineering / v.23 no.6 / pp.855-865 / 2018
  • Sound event detection is a research area that models human auditory cognition by recognizing events in an environment containing multiple acoustic events and determining the onset and offset time of each event. DCASE, a research group on acoustic scene classification and sound event detection, runs challenges to encourage researcher participation and to stimulate sound event detection research. However, the dataset provided by the DCASE Challenge is relatively small compared to ImageNet, a representative dataset for visual object recognition, and few open acoustic datasets are available. In this study, sound events that can occur indoors and outdoors were collected on a larger scale and annotated to construct a dataset. Furthermore, to improve performance on the sound event detection task, we developed a dual CNN structured sound event detection system by adding a supplementary neural network to a convolutional neural network to determine the presence of sound events. Finally, we conducted a comparative experiment against the baseline systems of DCASE 2016 and 2017.
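
The onset/offset step mentioned in this abstract can be illustrated with a minimal sketch. It assumes a detector that already outputs one probability per frame for a single event class; the function name `events_from_probs`, the 0.5 threshold, and the 20 ms hop are hypothetical choices for illustration and are not taken from the paper, and the dual CNN itself is not modeled here.

```python
import numpy as np

def events_from_probs(probs, threshold=0.5, hop_s=0.02):
    """Convert frame-wise probabilities for one class into (onset_s, offset_s) pairs.

    threshold and hop_s are illustrative assumptions, not values from the paper.
    """
    active = np.asarray(probs) >= threshold
    # Pad with inactive frames so every active run has a rising and a falling edge.
    padded = np.concatenate(([False], active, [False])).astype(int)
    edges = np.diff(padded)
    onsets = np.flatnonzero(edges == 1)    # first active frame of each run
    offsets = np.flatnonzero(edges == -1)  # first inactive frame after each run
    return [(int(a) * hop_s, int(b) * hop_s) for a, b in zip(onsets, offsets)]
```

Each returned pair marks the start of the first active frame and the start of the first inactive frame after it, so a run of two active frames spans two hops.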

Design of Immersive Walking Interaction Using Deep Learning for Virtual Reality Experience Environment of Visually Impaired People (시각 장애인 가상현실 체험 환경을 위한 딥러닝을 활용한 몰입형 보행 상호작용 설계)

  • Oh, Jiseok;Bong, Changyun;Kim, Jinmo
    • Journal of the Korea Computer Graphics Society / v.25 no.3 / pp.11-20 / 2019
  • In this study, a novel virtual reality (VR) experience environment is proposed to help visually impaired people adapt to walking. The core of the proposed VR environment is immersive walking interaction combined with deep learning-based braille block recognition. To provide a realistic walking experience from the perspective of visually impaired people, a tracker-based walking process is designed that determines the walking state by detecting marching in place, and a controller-based VR white cane is developed to serve as the walking assistance tool. Additionally, a learning model is developed for comprehensive decision-making: it recognizes and responds to braille blocks on the roads that are followed under the guidance of the VR white cane. Based on this, a VR application comprising an outdoor urban environment is designed for analyzing the VR walking experience. An experimental survey of the participants and a performance analysis were also conducted. The results corroborate that the proposed VR walking environment provides a walking experience with a high level of presence from the perspective of visually impaired people. Furthermore, the results verify that the proposed learning algorithm and process can recognize braille blocks situated on sidewalks and roadways with high accuracy.

A Study on Stealth Design for Exterior Equipment Arrangement Considering the Multi-Bounce Effect (다중반사를 고려한 함정의 외부 탑재 장비 최적배치 연구)

  • Hwang, Joon-Tae;Hong, Suk-Yoon;Kwon, Hyun-Wung;Kim, Jong-Chul;Song, Jee-Hun
    • Journal of the Korean Society of Marine Environment & Safety / v.23 no.7 / pp.918-925 / 2017
  • Multiple reflections on exterior equipment with complex shapes on naval ships cause unexpectedly high Radar Cross Section (RCS) distributions, and the directions of reradiated electromagnetic waves are hard to predict. Therefore, the optimum arrangement of exterior equipment should be considered according to the Radar Absorbing Structure (RAS) method. In this paper, the optimum arrangement of exterior equipment was determined so as to reduce multiple reflections and RCS even for complex shapes. The sequential descending arrangement method was used to establish an optimum arrangement algorithm. An LCS-2 type model was selected for the optimum exterior equipment arrangement. To reduce computational cost, RCS distribution and multiple reflection path analyses of the exterior equipment were carried out to select the equipment to be rearranged, and the optimum arrangement was determined by finding the positions with minimum RCS values. In addition, the RCS reduction effect was analyzed in terms of detectable radar range.
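
The link between RCS reduction and detectable radar range can be sketched with the standard monostatic radar range equation, in which the maximum detection range scales with the fourth root of the RCS when all other radar parameters stay fixed. This is a generic illustration under that assumption, not the analysis procedure used in the paper; the function name `detection_range_ratio` is hypothetical.

```python
def detection_range_ratio(rcs_reduction_db):
    """Ratio of new to original maximum detection range for an RCS reduction in dB.

    Assumes R_max is proportional to sigma**(1/4) (monostatic radar equation)
    with transmit power, gain, wavelength, and receiver sensitivity unchanged.
    """
    # A reduction of X dB scales sigma by 10**(-X/10); the fourth root gives 10**(-X/40).
    return 10 ** (-rcs_reduction_db / 40.0)
```

Under this scaling, a 10 dB RCS reduction shortens the detection range to roughly 56% of its original value, which is why large RCS reductions are needed for a substantial change in range.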

Improved VFM Method for High Accuracy Flight Simulation (고정밀 비행 시뮬레이션을 위한 개선 VFM 기법 연구)

  • Lee, Chiho;Kim, Mukyeom;Lee, Jae-Lyun;Jeon, Kwon-Su;Tyan, Maxim;Lee, Jae-Woo
    • Journal of the Korean Society for Aeronautical & Space Sciences / v.49 no.9 / pp.709-719 / 2021
  • Recent progress in analysis and flight simulation methods enables wider use of virtual certification and reduces the number of certification flight tests. The aerodynamic database (AeroDB) is one of the most important components of flight simulation. It is composed of aerodynamic coefficients over a range of flight conditions and control deflections. This paper proposes an efficient method for constructing an AeroDB that combines Gaussian Process-based Variable Fidelity Modeling with an adaptive sampling algorithm. A case study of virtual certification of an F-16 fighter is presented. Four AeroDBs were constructed using different numbers and distributions of high-fidelity data points. The constructed databases are then used to simulate gliding, short pitch, and roll response, and compliance with certification regulations is checked. The case study demonstrates that the proposed method can significantly reduce the number of high-fidelity data points while maintaining high simulation accuracy.

A study on ICO-based fund investment (ICO 기반 자금 투자에 대한 연구)

  • Yoo, Soonduck
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.19 no.5 / pp.25-32 / 2019
  • The purpose of this study is to investigate how to invest properly in ICOs. Previously, companies borrowed money from banks or obtained investments from venture capital (VC) and angel investors, but ICOs are now used as a new type of funding and financing model. An ICO raises the necessary funds by openly selling tokens or coins created on a blockchain online, and provides market value by paying out tokens or coins in proportion to the amount invested. According to this study, the limitations of the ICO market are (1) difficulties in evaluating the company, (2) uncertainties in investments, (3) lack of legal safeguards, and (4) measures to secure corporate stability after recruitment. At present, there is no systematic way to cope with these limitations because ICOs are not protected within the legal framework. Nevertheless, we investigated ways to invest properly in the existing ICO market. When investing in an ICO, investors should (1) consider investment methods and profitability, and (2) check for investment fraud through various channels (e.g., the homepage, team member profiles, etc.) and base their investment decisions on the results. By clarifying the newly emerged ICO market and the considerations involved in investing in it, this study will contribute to the formation of a healthy ICO market, to proper investor education, and to reducing widespread consumer harm caused by fraud. The limitation of this study is that domestic ICOs have not yet been examined within the legal framework, so further research is needed when policy changes occur in the future.

A Development of Road Crack Detection System Using Deep Learning-based Segmentation and Object Detection (딥러닝 기반의 분할과 객체탐지를 활용한 도로균열 탐지시스템 개발)

  • Ha, Jongwoo;Park, Kyongwon;Kim, Minsoo
    • The Journal of Society for e-Business Studies / v.26 no.1 / pp.93-106 / 2021
  • Many recent studies on deep learning-based road crack detection have shown significantly better performance than previous works using conventional algorithm-based approaches. However, many deep learning-based studies still focus on classifying the types of cracks. Classification of crack types is promising in that it can improve the crack detection process, which currently relies on manual intervention. In actual pavement maintenance planning, however, it is essential to calculate the severity of the cracks as well as to identify their type, and studies on road crack detection have not yet progressed to automated calculation of crack severity. To calculate the severity of a crack, both the type of the crack and its area in the image must be identified. This study presents a method that uses MobileNet-SSD, a deep learning-based object detection technique, to automate the simultaneous detection of crack types and crack areas. To improve the accuracy of object detection for road cracks, several experiments were conducted that combine U-Net, for automatic segmentation of the input image, with the object detection model, and the results are summarized. Image masking with U-Net maximized object detection performance, with an mAP of 0.9315. Based on the results of this study, the automation of crack detection in pavement management systems is expected to be further enhanced.
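
The image masking step can be sketched as follows, assuming the segmentation model has already produced a binary mask with the same height and width as the input image. The function name `apply_mask` and the choice to zero out background pixels are assumptions for illustration; the abstract does not specify how the mask is combined with the image.

```python
import numpy as np

def apply_mask(image, mask):
    """Zero out pixels that fall outside a binary segmentation mask.

    image: (H, W) or (H, W, C) array; mask: (H, W) array, nonzero means keep.
    Filling the background with zeros is an illustrative assumption.
    """
    image = np.asarray(image)
    mask = np.asarray(mask, dtype=bool)
    if image.ndim == 3:
        mask = mask[..., None]  # broadcast the mask over the color channels
    return np.where(mask, image, 0).astype(image.dtype)
```

The masked image would then be passed to the object detector in place of the raw image, so that the detector sees only the regions the segmentation model kept.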

Application of CFD to Design Procedure of Ammonia Injection System in DeNOx Facilities in a Coal-Fired Power Plant (석탄화력 발전소 탈질설비의 암모니아 분사시스템 설계를 위한 CFD 기법 적용에 관한 연구)

  • Kim, Min-Kyu;Kim, Byeong-Seok;Chung, Hee-Taeg
    • Clean Technology / v.27 no.1 / pp.61-68 / 2021
  • Selective catalytic reduction (SCR) is widely used to remove nitrogen oxides in large-capacity thermal power generation systems. Uniform mixing of the injected ammonia with the inlet flue gas is very important to the performance of the denitrification process in the catalyst bed. In the present study, a computational analysis technique was applied to the design process of the ammonia injection system of a denitrification facility. The applied model is the denitrification facility of an 800 MW class coal-fired power plant currently in operation. The flow field to be solved ranges from the inlet of the ammonia injection system to the end of the catalyst bed. The flow was analyzed in a two-dimensional domain and assumed to be incompressible. The steady-state turbulent flow was solved with the commercial software ANSYS Fluent. The nozzle arrangement gap and the injection flow rate of the ammonia injection system were chosen as the design parameters. A total of four cases were simulated and compared. The root mean square of the NH3/NO molar ratio at the inlet of the catalyst layer was chosen as the optimization parameter, and design of experiments was used as the basis of the optimization algorithm. The case in which the nozzle pitch and flow rate were adjusted at the same time was the best in terms of flow uniformity.
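
A uniformity metric of this kind can be sketched as below. The abstract only says that the root mean square of the NH3/NO molar ratio was used; the specific form here (RMS deviation from the mean, expressed as a percentage of the mean) is a common convention and is an assumption, as is the function name `rms_deviation_percent`.

```python
import numpy as np

def rms_deviation_percent(molar_ratio):
    """RMS deviation of sampled NH3/NO molar ratios, as a percentage of their mean.

    The normalization by the mean is an assumed convention, not stated in the paper.
    Lower values indicate more uniform mixing at the sampled plane.
    """
    r = np.asarray(molar_ratio, dtype=float)
    mean = r.mean()
    return 100.0 * np.sqrt(np.mean((r - mean) ** 2)) / mean
```

With samples taken across the catalyst inlet for each design case, the case with the smallest value would be ranked as the most uniform.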

Development of Fender Segmentation System for Port Structures using Vision Sensor and Deep Learning (비전센서 및 딥러닝을 이용한 항만구조물 방충설비 세분화 시스템 개발)

  • Min, Jiyoung;Yu, Byeongjun;Kim, Jonghyeok;Jeon, Haemin
    • Journal of the Korea institute for structural maintenance and inspection / v.26 no.2 / pp.28-36 / 2022
  • As port structures are exposed to various extreme external loads such as wind (typhoons), sea waves, and collisions with ships, it is important to evaluate their structural safety periodically. To monitor port structures, especially rubber fenders, this study proposes a fender segmentation system using a vision sensor and a deep learning method. For fender segmentation, a new deep learning network is proposed that improves the encoder-decoder framework by incorporating the receptive field block convolution module, inspired by the eccentric function of the human visual system, into the DenseNet format. To train the network, images of various fender types, such as BP, V, cell, cylindrical, and tire types, were collected and augmented with four methods: elastic distortion, horizontal flip, color jitter, and affine transforms. The proposed algorithm was trained and verified with the collected fender images, and the results showed that the system segmented fenders precisely in real time, with a high IoU (84%) and F1 score (90%) in comparison with the conventional segmentation model, VGG16 with U-Net. The trained network was applied to real images taken at a port in the Republic of Korea, and the fenders were found to be segmented with high accuracy even with a small dataset.
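
The two reported metrics can be computed for a pair of binary masks as in the sketch below. It assumes one predicted mask and one ground-truth mask of equal shape; how the paper aggregates the scores over images is not stated, and the function name `iou_and_f1` and the handling of two empty masks are illustrative choices.

```python
import numpy as np

def iou_and_f1(pred, target):
    """Return (IoU, F1) for two binary masks of the same shape."""
    pred = np.asarray(pred, dtype=bool)
    target = np.asarray(target, dtype=bool)
    tp = np.logical_and(pred, target).sum()    # pixels correctly marked as fender
    fp = np.logical_and(pred, ~target).sum()   # predicted fender, actually background
    fn = np.logical_and(~pred, target).sum()   # missed fender pixels
    union = tp + fp + fn
    if union == 0:
        return 1.0, 1.0  # both masks empty: treated here as perfect agreement
    iou = tp / union
    f1 = 2 * tp / (2 * tp + fp + fn)
    return float(iou), float(f1)
```

For a single mask pair the two scores are tied together by F1 = 2·IoU / (1 + IoU), so F1 is never lower than IoU.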

Road Extraction from Images Using Semantic Segmentation Algorithm (영상 기반 Semantic Segmentation 알고리즘을 이용한 도로 추출)

  • Oh, Haeng Yeol;Jeon, Seung Bae;Kim, Geon;Jeong, Myeong-Hun
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography / v.40 no.3 / pp.239-247 / 2022
  • Cities are becoming more complex due to rapid industrialization and population growth. In particular, urban areas are changing rapidly due to housing site development, reconstruction, and demolition. Accurate road information is therefore necessary for various purposes, such as high-definition maps for autonomous driving. In the Republic of Korea, accurate spatial information can be generated through the existing map production process, but covering a large area is limited by time and cost. Roads, one of the map elements, are a hub and an essential means of transportation that provides many different resources for human civilization. It is therefore essential to update road information accurately and quickly. This study uses semantic segmentation algorithms such as LinkNet, D-LinkNet, and NL-LinkNet to extract roads from drone images and then applies hyperparameter optimization to the models with the highest performance. As a result, the LinkNet model using a pre-trained ResNet-34 as the encoder achieved 85.125 mIoU. Subsequent studies should compare these results with those of studies using state-of-the-art object detection algorithms or semi-supervised semantic segmentation techniques. The results of this study can be applied to speed up the existing map update process.

Feasibility Study for Derivation of Tropospheric Ozone Motion Vector Using Geostationary Environmental Satellite Measurements (정지궤도 위성 대류권 오존 관측 자료를 이용한 대류권 이동벡터 산출 가능성 연구)

  • Shin, Daegeun;Kim, Somyoung;Bak, Juseon;Baek, Kanghyun;Hong, Sungjae;Kim, Jaehwan
    • Korean Journal of Remote Sensing / v.38 no.6_1 / pp.1069-1080 / 2022
  • Tropospheric ozone is a pollutant that causes a great deal of damage to humans and ecosystems worldwide. When ozone moves downwind from its source, a localized problem becomes a regional and global one. To enhance ozone monitoring efficiency, geostationary satellites with continuous diurnal observations have been developed. The objective of this study is to derive, for the first time, the Tropospheric Ozone Movement Vector (TOMV) from continuous geostationary satellite observations of tropospheric ozone. In the absence of Geostationary Environmental Monitoring Satellite (GEMS) tropospheric ozone observation data, values calculated by the GEOS-Chem model were used as synthetic data. Compared with GEOS-Chem, the TOMV algorithm overestimated wind speed but correctly calculated the wind direction represented by pollution movement. The ozone influx can also be calculated by multiplying the calculated ozone movement speed and direction by the observed ozone concentration. As an alternative to a backward trajectory method, this approach will provide better forecasting and analysis by monitoring tropospheric ozone inflow characteristics on a continuous basis. However, if the boundary of the ozone distribution is unclear, motion detection may not be accurate. Nevertheless, the TOMV method may prove useful for monitoring and forecasting pollution based on geostationary environmental satellites in the future.
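
The influx calculation described in this abstract (movement vector multiplied by concentration) can be sketched as below. It assumes the movement vector is given as eastward and northward components; the function name `ozone_flux`, the component convention, and the units are assumptions for illustration, and the derivation of the movement vector itself is not covered.

```python
import numpy as np

def ozone_flux(u, v, concentration):
    """Return (flux_x, flux_y, magnitude) from a movement vector and a concentration.

    u, v: eastward and northward movement speed (e.g. m/s, assumed convention).
    concentration: ozone amount per unit volume or column (units are an assumption).
    Works on scalars or on arrays of matching shape.
    """
    fx = np.asarray(u, dtype=float) * concentration
    fy = np.asarray(v, dtype=float) * concentration
    return fx, fy, np.hypot(fx, fy)
```

Applied per grid cell and per observation time, this gives a flux field whose direction follows the movement vector and whose magnitude scales with the local ozone amount.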