• Title/Summary/Keyword: 데이터 증강

Search Result 494, Processing Time 0.031 seconds

A Study of Implementation for Visualizing 3 Dimension Content Generation using Index (인덱스를 활용한 3차원 콘텐츠 생성 시각화 구현에 관한 연구)

  • Lee, Hyun-Chang;Shin, Seong-Yoon;Jang, Hee-Seon;Koh, Jin-Gwang
    • Journal of the Korea Society of Computer and Information
    • /
    • v.15 no.10
    • /
    • pp.11-17
    • /
    • 2010
  • Mobile device, one of typical devices in ubiquitous environment, is received attention owing to portability. In these days, technical researches on a kind of the device are focusing on applications of smart phone. For example, the techniques using geographical position and applied instances such as augmented reality techniques are gradually increasing. That makes data processing important. Mobile application services for users also require various application techniques based on moving objects. In addition, they require the techniques that processed data are needed to be shown in visualization. However, this is reality that it lacks of showing visualization works to improve the understanding of thing what it is. To reduce or solve the problems, in this paper we show the results to implement the R tree based 3 dimension index architecture in visualization. Further, we implemented and present creating objects, showing in 3D for the objects, catching spatial position on a node map through mini map function and improving the understanding of R tree by visualizing.

A Parametric Image Enhancement Technique for Contrast-Enhanced Ultrasonography (조영증강 의료 초음파 진단에서 파라미터 영상의 개선 기법)

  • Kim, Ho Joon;Gwak, Seong Hoon
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.3 no.6
    • /
    • pp.231-236
    • /
    • 2014
  • The transit time of contrast agents and the parameters of time-intensity curves in ultrasonography are important factors to diagnose various diseases of a digestive organ. We have implemented an automatic parametric imaging method to overcome the difficulty of the diagnosis by naked eyes. However, the micro-bubble noise and the respiratory motions may degrade the reliability of the parameter images. In this paper, we introduce an optimization technique based on MRF(Markov Random Field) model to enhance the quality of the parameter images, and present an image tracking algorithm to compensate the image distortion by respiratory motions. A method to extract the respiration periods from the ultrasound image sequence has been developed. We have implemented the ROI(Region of Interest) tracking algorithm using the dynamic weights and a momentum factor based on these periods. An energy function is defined for the Gibbs sampler of the image enhancement method. Through the experiments using the data to diagnose liver lesions, we have shown that the proposed method improves the quality of the parametric images.

Voice-to-voice conversion using transformer network (Transformer 네트워크를 이용한 음성신호 변환)

  • Kim, June-Woo;Jung, Ho-Young
    • Phonetics and Speech Sciences
    • /
    • v.12 no.3
    • /
    • pp.55-63
    • /
    • 2020
  • Voice conversion can be applied to various voice processing applications. It can also play an important role in data augmentation for speech recognition. The conventional method uses the architecture of voice conversion with speech synthesis, with Mel filter bank as the main parameter. Mel filter bank is well-suited for quick computation of neural networks but cannot be converted into a high-quality waveform without the aid of a vocoder. Further, it is not effective in terms of obtaining data for speech recognition. In this paper, we focus on performing voice-to-voice conversion using only the raw spectrum. We propose a deep learning model based on the transformer network, which quickly learns the voice conversion properties using an attention mechanism between source and target spectral components. The experiments were performed on TIDIGITS data, a series of numbers spoken by an English speaker. The conversion voices were evaluated for naturalness and similarity using mean opinion score (MOS) obtained from 30 participants. Our final results yielded 3.52±0.22 for naturalness and 3.89±0.19 for similarity.

VR-based Hiking System that supports Real-time Field Condition (등산로 조건을 실시간으로 지원하는 VR 기반의 사이버 등산 시스템)

  • Ko, Dae-sik
    • Journal of Platform Technology
    • /
    • v.6 no.4
    • /
    • pp.78-86
    • /
    • 2018
  • In this paper, a VR-based cyber hiking system was designed to provide virtual reality for famous mountains that can offer the real senses and feelings of hiking in supporting environmental factors of actual mountains such as the temperature, air, sound, echoes, etc., of the mountain the user wants to climb. The VR-based cyber hiking system that reflects real-time site conditions is largely consisted of the data collection module that collects data from the live site, multiple drive modules that enables the user to feel real senses using data from the sites, and sensor module to detect the stimuli provided by the drive modules and the user's physical body transition. Unlike existing VR-based hiking systems, the proposed cyber hiking system not only provides simple virtual reality for the wanted mountain, but can also provide the natural conditions of real mountains and implement the uphill and downhill of hiking routes. In particular, it has the effect of providing fun and game elements to users by excluding unnecessary conditions and risks that may arise in actual hiking and instead supporting augmented realities such as squirrels on actual hiking paths. In addition, in providing users with the changes in their body before and after hiking, it is expected to be effective in providing diverse feedback such as the height, gradient, and speed of mountain hiking.

Object Recognition Using Convolutional Neural Network in military CCTV (합성곱 신경망을 활용한 군사용 CCTV 객체 인식)

  • Ahn, Jin Woo;Kim, Dohyung;Kim, Jaeoh
    • Journal of the Korea Society for Simulation
    • /
    • v.31 no.2
    • /
    • pp.11-20
    • /
    • 2022
  • There is a critical need for AI assistance in guard operations of Army base perimeters, which is exacerbated by changes in the national defense and security environment such as force reduction. In addition, the possibility for human error inherent to perimeter guard operations attests to the need for an innovative revamp of current systems. The purpose of this study is to propose a real-time object detection AI tailored to military CCTV surveillance with three unique characteristics. First, training data suitable for situations in which relatively small objects must be recognized is used due to the characteristics of military CCTV. Second, we utilize a data augmentation algorithm suited for military context applied in the data preparation step. Third, a noise reduction algorithm is applied to account for military-specific situations, such as camouflaged targets and unfavorable weather conditions. The proposed system has been field-tested in a real-world setting, and its performance has been verified.

MAGICal Synthesis: Memory-Efficient Approach for Generative Semiconductor Package Image Construction (MAGICal Synthesis: 반도체 패키지 이미지 생성을 위한 메모리 효율적 접근법)

  • Yunbin Chang;Wonyong Choi;Keejun Han
    • Journal of the Microelectronics and Packaging Society
    • /
    • v.30 no.4
    • /
    • pp.69-78
    • /
    • 2023
  • With the rapid growth of artificial intelligence, the demand for semiconductors is enormously increasing everywhere. To ensure the manufacturing quality and quantity simultaneously, the importance of automatic defect detection during the packaging process has been re-visited by adapting various deep learning-based methodologies into automatic packaging defect inspection. Deep learning (DL) models require a large amount of data for training, but due to the nature of the semiconductor industry where security is important, sharing and labeling of relevant data is challenging, making it difficult for model training. In this study, we propose a new framework for securing sufficient data for DL models with fewer computing resources through a divide-and-conquer approach. The proposed method divides high-resolution images into pre-defined sub-regions and assigns conditional labels to each region, then trains individual sub-regions and boundaries with boundary loss inducing the globally coherent and seamless images. Afterwards, full-size image is reconstructed by combining divided sub-regions. The experimental results show that the images obtained through this research have high efficiency, consistency, quality, and generality.

Development of a Flooding Detection Learning Model Using CNN Technology (CNN 기술을 적용한 침수탐지 학습모델 개발)

  • Dong Jun Kim;YU Jin Choi;Kyung Min Park;Sang Jun Park;Jae-Moon Lee;Kitae Hwang;Inhwan Jung
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.23 no.6
    • /
    • pp.1-7
    • /
    • 2023
  • This paper developed a training model to classify normal roads and flooded roads using artificial intelligence technology. We expanded the diversity of learning data using various data augmentation techniques and implemented a model that shows good performance in various environments. Transfer learning was performed using the CNN-based Resnet152v2 model as a pre-learning model. During the model learning process, the performance of the final model was improved through various parameter tuning and optimization processes. Learning was implemented in Python using Google Colab NVIDIA Tesla T4 GPU, and the test results showed that flooding situations were detected with very high accuracy in the test dataset.

Building Fire Monitoring and Escape Navigation System Based on AR and IoT Technologies (AR과 IoT 기술을 기반으로 한 건물 화재 모니터링 및 탈출 내비게이션 시스템)

  • Wentao Wang;Seung-Yong Lee;Sanghun Park;Seung-Hyun Yoon
    • Journal of the Korea Computer Graphics Society
    • /
    • v.30 no.3
    • /
    • pp.159-169
    • /
    • 2024
  • This paper proposes a new real-time fire monitoring and evacuation navigation system by integrating Augmented Reality (AR) technology with Internet of Things (IoT) technology. The proposed system collects temperature data through IoT temperature measurement devices installed in buildings and automatically transmits it to a MySQL cloud database via an IoT platform, enabling real-time and accurate data monitoring. Subsequently, the real-time IoT data is visualized on a 3D building model generated through Building Information Modeling (BIM), and the model is represented in the real world using AR technology, allowing intuitive identification of the fire origin. Furthermore, by utilizing Vuforia engine's Device Tracking and Area Targets features, the system tracks the user's real-time location and employs an enhanced A* algorithm to find the optimal evacuation route among multiple exits. The paper evaluates the proposed system's practicality and demonstrates its effectiveness in rapid and safe evacuation through user experiments based on various virtual fire scenarios.

A COVID-19 Chest X-ray Reading Technique based on Deep Learning (딥 러닝 기반 코로나19 흉부 X선 판독 기법)

  • Ann, Kyung-Hee;Ohm, Seong-Yong
    • The Journal of the Convergence on Culture Technology
    • /
    • v.6 no.4
    • /
    • pp.789-795
    • /
    • 2020
  • Many deaths have been reported due to the worldwide pandemic of COVID-19. In order to prevent the further spread of COVID-19, it is necessary to quickly and accurately read images of suspected patients and take appropriate measures. To this end, this paper introduces a deep learning-based COVID-19 chest X-ray reading technique that can assist in image reading by providing medical staff whether a patient is infected. First of all, in order to learn the reading model, a sufficient dataset must be secured, but the currently provided COVID-19 open dataset does not have enough image data to ensure the accuracy of learning. Therefore, we solved the image data number imbalance problem that degrades AI learning performance by using a Stacked Generative Adversarial Network(StackGAN++). Next, the DenseNet-based classification model was trained using the augmented data set to develop the reading model. This classification model is a model for binary classification of normal chest X-ray and COVID-19 chest X-ray, and the performance of the model was evaluated using part of the actual image data as test data. Finally, the reliability of the model was secured by presenting the basis for judging the presence or absence of disease in the input image using Grad-CAM, one of the explainable artificial intelligence called XAI.

A Study for Generation of Artificial Lunar Topography Image Dataset Using a Deep Learning Based Style Transfer Technique (딥러닝 기반 스타일 변환 기법을 활용한 인공 달 지형 영상 데이터 생성 방안에 관한 연구)

  • Na, Jong-Ho;Lee, Su-Deuk;Shin, Hyu-Soung
    • Tunnel and Underground Space
    • /
    • v.32 no.2
    • /
    • pp.131-143
    • /
    • 2022
  • The lunar exploration autonomous vehicle operates based on the lunar topography information obtained from real-time image characterization. For highly accurate topography characterization, a large number of training images with various background conditions are required. Since the real lunar topography images are difficult to obtain, it should be helpful to be able to generate mimic lunar image data artificially on the basis of the planetary analogs site images and real lunar images available. In this study, we aim to artificially create lunar topography images by using the location information-based style transfer algorithm known as Wavelet Correct Transform (WCT2). We conducted comparative experiments using lunar analog site images and real lunar topography images taken during China's and America's lunar-exploring projects (i.e., Chang'e and Apollo) to assess the efficacy of our suggested approach. The results show that the proposed techniques can create realistic images, which preserve the topography information of the analog site image while still showing the same condition as an image taken on lunar surface. The proposed algorithm also outperforms a conventional algorithm, Deep Photo Style Transfer (DPST) in terms of temporal and visual aspects. For future work, we intend to use the generated styled image data in combination with real image data for training lunar topography objects to be applied for topographic detection and segmentation. It is expected that this approach can significantly improve the performance of detection and segmentation models on real lunar topography images.