• Title/Summary/Keyword: Synthetic Data

Generation of Synthetic Particle Images for Particle Image Velocimetry using Physics-Informed Neural Network (물리 기반 인공신경망을 이용한 PIV용 합성 입자이미지 생성)

  • Hyeon Jo Choi;Myeong Hyeon, Shin;Jong Ho, Park;Jinsoo Park
    • Journal of the Korean Society of Visualization / v.21 no.1 / pp.119-126 / 2023
  • Acquiring experimental data for PIV verification or for machine learning training is resource-intensive, which has driven growing interest in synthetic particle images as simulation data. Conventional synthetic particle image generation algorithms do not follow physical laws, while CFD is time-consuming and computationally expensive. In this study, we propose a new method for synthetic particle image generation based on a Physics-Informed Neural Network (PINN). The PINN infers the flow field, enabling the generation of synthetic particle images that follow physical laws, with less computation time than CFD and without its constraints on spatial resolution. The proposed method is expected to contribute to the verification of PIV algorithms.
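As a rough illustration of what a synthetic particle image pair is, the sketch below seeds random particles, moves them with a velocity field, and renders each particle as a Gaussian intensity spot. The `velocity` function is a hypothetical stand-in for the flow field that a trained PINN would supply, and the Gaussian spot model, `sigma`, and all function names are assumptions rather than details taken from the paper.

```python
import math
import random

def render(particles, width, height, sigma=1.0):
    """Render particles as Gaussian intensity spots on a width x height grid."""
    img = [[0.0] * width for _ in range(height)]
    reach = int(3 * sigma) + 1  # only touch pixels near each particle
    for xp, yp in particles:
        for y in range(max(0, int(yp) - reach), min(height, int(yp) + reach + 1)):
            for x in range(max(0, int(xp) - reach), min(width, int(xp) + reach + 1)):
                d2 = (x - xp) ** 2 + (y - yp) ** 2
                img[y][x] += math.exp(-d2 / (2 * sigma ** 2))
    return img

def velocity(x, y):
    """Hypothetical stand-in for a PINN-inferred field: uniform 2 px/frame in x."""
    return 2.0, 0.0

def make_pair(n, width, height, dt=1.0, seed=0):
    """Return two frames: particles before and after displacement by the flow."""
    rng = random.Random(seed)
    first = [(rng.uniform(0, width), rng.uniform(0, height)) for _ in range(n)]
    second = []
    for x, y in first:
        u, v = velocity(x, y)
        second.append((x + u * dt, y + v * dt))
    return render(first, width, height), render(second, width, height)

frame_a, frame_b = make_pair(20, 32, 32)
```

Because the displacement between the two frames is known exactly, such a pair can serve as ground truth when checking a PIV algorithm.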

Environmental Data Management and Supply Plan for Building Synthetic Battlefield Environment of Air Combat Simulation (항공 전투 시뮬레이션의 합성전장환경 구축을 위한 환경 데이터 관리 및 공급 방안)

  • Yang, Ka-Ram;Hwam, Won K.;Park, Sang C.
    • Journal of the Korea Society for Simulation / v.22 no.3 / pp.7-14 / 2013
  • This paper presents a method for supplying environmental data so that simulations of aviation weapon systems, run in a synthetic battlefield, reflect environmental effects. The results of an air engagement simulation can differ depending on environmental effects. The paper analyzes the real aviation battlefield and designs the synthetic battlefield based on the environmental factors derived from that analysis. To construct the designed synthetic battlefield, we collect real atmospheric data and structure it using GIS (geographic information system) interpolation. The constructed synthetic battlefield provides the environmental data requested by the distributed simulation system, allowing the system to reflect environmental effects in the simulation.

Mapping Digital Manufacturing Simulation to Synthetic Environment using SEDRIS (SEDRIS를 이용한 디지털 생산 시뮬레이션과 합성 환경 매핑)

  • Moon, Hong-Il;Han, Soon-Hung
    • Journal of the Korea Society for Simulation / v.14 no.2 / pp.15-24 / 2005
  • The goal of a distributed simulation such as a battlefield simulation is to combine all kinds of simulations in the same synthetic environment and to let people interact at the same time. Sharing the same synthetic environment among simulations is therefore a key issue. To support reusability and affordability in modeling and simulation, the DMSO (Defense Modeling and Simulation Office) of the USA developed concepts such as HLA (High Level Architecture) and SEDRIS (Synthetic Environment Data Representation and Interchange Specification). In industrial simulation, digital manufacturing is the mainstream. To reduce cost and reuse simulation environments, standardization has become a focus of digital manufacturing. This study proposes using SEDRIS to improve the interoperability of manufacturing data. The simulation data of DELMIA, a leading commercial digital manufacturing solution, is mapped and translated into the SEDRIS transmittal format. The mapping between the manufacturing simulation data and the synthetic environment is implemented and verified through experiments.

A Study on Synthetic Data Generation Based Safe Differentially Private GAN (차분 프라이버시를 만족하는 안전한 GAN 기반 재현 데이터 생성 기술 연구)

  • Kang, Junyoung;Jeong, Sooyong;Hong, Dowon;Seo, Changho
    • Journal of the Korea Institute of Information Security & Cryptology / v.30 no.5 / pp.945-956 / 2020
  • Publishing data is essential for receiving high-quality services from many applications. However, if the original data is published as is, sensitive information (political tendency, disease, etc.) may be revealed. Many studies have therefore proposed generating and publishing synthetic data instead of the original data to preserve privacy. Even so, simply generating and publishing synthetic data still carries a risk of privacy leakage through various attacks (linkage attack, inference attack, etc.). In this paper, to prevent the leakage of such sensitive information, we propose a synthetic data generation algorithm that preserves privacy by applying differential privacy, the latest privacy protection technique, to GAN, which is drawing attention as a generative model for synthetic data. The generative model uses CGAN for efficient learning of labeled data and applies Rényi differential privacy, a relaxation of differential privacy, in consideration of data utility. The utility of the generated data is validated and compared using various classifiers.
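The abstract does not say how the noise is injected during GAN training. One common way to make gradient-based training differentially private is to clip each example's gradient and add Gaussian noise before averaging, and the sketch below shows only that step. The function names, the parameters `max_norm` and `noise_multiplier`, and the choice of this mechanism are assumptions, and the Rényi privacy accounting is not shown.

```python
import math
import random

def clip(grad, max_norm):
    """Scale a gradient vector so its L2 norm is at most max_norm."""
    norm = math.sqrt(sum(g * g for g in grad))
    scale = min(1.0, max_norm / norm) if norm > 0 else 1.0
    return [g * scale for g in grad]

def private_mean_gradient(per_example_grads, max_norm, noise_multiplier, rng):
    """Clip each example's gradient, sum, add Gaussian noise, then average."""
    clipped = [clip(g, max_norm) for g in per_example_grads]
    dim = len(clipped[0])
    summed = [sum(g[i] for g in clipped) for i in range(dim)]
    # Noise standard deviation is tied to the clipping bound (the sensitivity).
    sigma = noise_multiplier * max_norm
    noisy = [s + rng.gauss(0.0, sigma) for s in summed]
    return [v / len(clipped) for v in noisy]

rng = random.Random(0)
grads = [[3.0, 4.0], [0.3, 0.4]]  # hypothetical per-example gradients
update = private_mean_gradient(grads, max_norm=1.0, noise_multiplier=1.1, rng=rng)
```

A larger `noise_multiplier` gives stronger privacy at the cost of noisier updates, which is the privacy and utility trade-off the abstract refers to.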

gMLP-based Self-Supervised Learning Anomaly Detection using a Simple Synthetic Data Generation Method (단순한 합성데이터 생성 방식을 활용한 gMLP 기반 자기 지도 학습 이상탐지 기법)

  • Ju-Hyo, Hwang;Kyo-Hong, Jin
    • Journal of the Korea Institute of Information and Communication Engineering / v.27 no.1 / pp.8-14 / 2023
  • CutPaste, an existing self-supervised learning method, generates synthetic data by cutting patches from normal images and pasting them elsewhere, and then performs anomaly detection. However, this method leaves a clearly visible difference at the patch boundary. NSA, proposed to solve this problem, achieved higher anomaly detection performance by generating natural-looking synthetic data through Poisson blending. However, NSA has the disadvantage of having many hyperparameters that must be tuned for each class. In this paper, synthetic data similar to normal data is generated by the simple method of making the synthetic patch very small. Because the patches are synthesized so locally, models that learn local features can easily overfit the synthetic data. We therefore performed anomaly detection using gMLP, which learns global features, and even with this simple synthesis method achieved higher performance than conventional self-supervised learning techniques.
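The small-patch idea can be sketched as a CutPaste-style copy with a very small patch size. The abstract gives no details on patch shape, source location, or blending, so the square patch, the uniform random placement, and the function name below are all assumptions.

```python
import random

def paste_small_patch(image, patch_size, rng):
    """Copy a small square patch from one random location to another.

    image is a 2D list of pixel values; returns the synthetic image and the
    top-left corner (row, col) of the pasted region.
    """
    height, width = len(image), len(image[0])
    out = [row[:] for row in image]  # leave the normal image untouched
    src_y = rng.randrange(height - patch_size + 1)
    src_x = rng.randrange(width - patch_size + 1)
    dst_y = rng.randrange(height - patch_size + 1)
    dst_x = rng.randrange(width - patch_size + 1)
    for i in range(patch_size):
        for j in range(patch_size):
            out[dst_y + i][dst_x + j] = image[src_y + i][src_x + j]
    return out, (dst_y, dst_x)

rng = random.Random(0)
normal = [[r * 8 + c for c in range(8)] for r in range(8)]  # toy 8x8 "image"
synthetic, corner = paste_small_patch(normal, patch_size=2, rng=rng)
```

In a self-supervised setup, `normal` would be labeled 0 and `synthetic` labeled 1, and the classifier trained to tell them apart.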

Studies on the Stochastic Generation of Long Term Runoff (2) (장기유출량의 추계학적 모의 발생에 관한 연구 (II))

  • 이순혁;맹승진;박종국
    • Magazine of the Korean Society of Agricultural Engineers / v.35 no.3 / pp.117-129 / 1993
  • This study was conducted to obtain reasonable and abundant hydrological time series of monthly flows, simulated by the best-fitting stochastic simulation model, to support the rational design and management of agricultural hydraulic structures including reservoirs. For six watersheds of the Yeong San and Seom Jin river systems, a comparative analysis was carried out on the statistical characteristics and synthetic monthly flows simulated by the multi-season first-order Markov model based on the Gamma distribution, which was confirmed as a good model in the first report of this study, and by the Harmonic synthetic model analyzed in this report. 1. The arithmetic mean values of synthetic monthly flows simulated with the Gamma distribution are much closer to those of the observed data than those of the Harmonic synthetic model in the applied watersheds. 2. In a comparison of the coefficients of variation, an index of fluctuation, for monthly flows simulated by the two synthetic models, those based on the Gamma distribution are closer to the observed data than those of the Harmonic synthetic model in both the Yeong San and Seom Jin river systems. 3. Synthetic monthly flows based on the Gamma distribution were found to give better results than those of the Harmonic synthetic model in the applied watersheds. 4. Further studies comparing other simulation techniques are desirable to establish a reasonable generation technique for synthetic monthly flows for the various river systems in Korea.
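A multi-season first-order Markov model is often written as a Thomas-Fiering-style recursion, in which each month's flow depends on the previous month's flow through the lag-one correlation. The sketch below uses that form with normally distributed noise. Whether the paper uses exactly this recursion is an assumption, the Gamma-distributed noise it describes is deliberately left out for brevity, and the monthly statistics shown are made-up placeholders.

```python
import math
import random

def simulate_monthly_flows(mean, std, corr, n_years, rng):
    """Generate monthly flows with a seasonal lag-one Markov recursion.

    mean[k], std[k]: statistics of month k (0..11).
    corr[j]: correlation between month j and the following month.
    """
    flows = []
    prev = mean[11]  # start from the December mean (assumption)
    for t in range(12 * n_years):
        j = (t - 1) % 12  # previous month
        k = t % 12        # month being generated
        slope = corr[j] * std[k] / std[j]
        noise = rng.gauss(0.0, 1.0)  # normal noise; the paper uses a Gamma model
        q = mean[k] + slope * (prev - mean[j]) \
            + noise * std[k] * math.sqrt(1.0 - corr[j] ** 2)
        prev = max(q, 0.0)  # flows cannot be negative
        flows.append(prev)
    return flows

# Placeholder statistics for illustration only, not data from the study.
mean = [10.0, 12.0, 20.0, 30.0, 40.0, 80.0, 150.0, 140.0, 90.0, 40.0, 20.0, 12.0]
std = [0.3 * m for m in mean]
corr = [0.5] * 12
series = simulate_monthly_flows(mean, std, corr, n_years=50, rng=random.Random(0))
```

The sample mean and coefficient of variation of `series` for each month can then be compared with the observed values, which is the kind of comparison the abstract reports.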

Synthetic data augmentation for pixel-wise steel fatigue crack identification using fully convolutional networks

  • Zhai, Guanghao;Narazaki, Yasutaka;Wang, Shuo;Shajihan, Shaik Althaf V.;Spencer, Billie F. Jr.
    • Smart Structures and Systems / v.29 no.1 / pp.237-250 / 2022
  • Structural health monitoring (SHM) plays an important role in ensuring the safety and functionality of critical civil infrastructure. In recent years, numerous researchers have conducted studies to develop computer vision and machine learning techniques for SHM purposes, offering the potential to reduce the laborious nature and improve the effectiveness of field inspections. However, high-quality vision data from various types of damaged structures is relatively difficult to obtain, because damaged structures occur rarely. The lack of data is particularly acute for fatigue cracks in steel bridge girders. As a result, the lack of training data is one of the main issues hindering wider application of these powerful techniques for SHM. To address this problem, this article proposes using synthetic data to augment the real-world datasets used for training neural networks that identify fatigue cracks in steel structures. First, random textures representing the surface of steel structures with fatigue cracks are created and mapped onto a 3D graphics model. Subsequently, this model is used to generate synthetic images for various lighting conditions and camera angles. A fully convolutional network is then trained for two cases: (1) using only real-world data, and (2) using both synthetic and real-world data. By employing synthetic data augmentation in the training process, the crack identification performance of the neural network on the test dataset improves from 35% to 40% for intersection over union (IoU) and from 49% to 62% for precision, demonstrating the efficacy of the proposed approach.
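The two reported metrics can be computed from pixel-wise binary masks as in the sketch below, where 1 marks a crack pixel. The function name and the convention of returning 1.0 when a denominator is zero are assumptions, and the tiny masks are illustrative only.

```python
def iou_and_precision(pred, truth):
    """Compute intersection over union and precision for binary masks."""
    tp = fp = fn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1  # crack pixel correctly predicted
            elif p and not t:
                fp += 1  # predicted crack where there is none
            elif t and not p:
                fn += 1  # missed crack pixel
    iou = tp / (tp + fp + fn) if (tp + fp + fn) else 1.0
    precision = tp / (tp + fp) if (tp + fp) else 1.0
    return iou, precision

pred = [[1, 1, 0],
        [0, 1, 0]]
truth = [[1, 0, 0],
         [0, 1, 1]]
iou, precision = iou_and_precision(pred, truth)  # iou = 0.5, precision = 2/3
```

IoU penalizes both false positives and missed crack pixels, while precision only penalizes false positives, which is why the two numbers in the abstract differ.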

A Statistical Methodology Study for Measuring Privacy Disclosure Risk in Open Data Environment (오픈 데이터 환경에서 개인정보 노출 위험 측정을 위한 통계적 방법론 연구)

  • Sieun Kim;Ieck-chae Euom
    • Journal of the Korea Institute of Information Security & Cryptology / v.34 no.2 / pp.323-333 / 2024
  • Recently, synthetic data has been in the spotlight as a technology that can protect personal information while maintaining the patterns and characteristics of actual data. Accordingly, technical and institutional research on synthetic data is actively being conducted, but the lack of clear standards and guidelines makes it difficult to use synthetic data in practice. This study is a preliminary study for quantifying the disclosure risk of synthetic data. It derives a privacy disclosure risk index through statistical methodology and suggests specific application measures to comply with the General Data Protection Regulation (GDPR). The privacy disclosure risk index of this study is expected to make it possible to control the balance between disclosure risk and data utility in an open data environment.

Generating Synthetic Raman Spectra of DMMP and 2-CEES by Mathematical Transforms and Deep Generative Models (수학적 변환과 심층 생성 모델을 활용한 DMMP와 2-CEES의 모의 라만 분광 생성)

  • Sungwon Park;Boseong Jeong;Hongjoong Kim
    • Journal of the Korea Institute of Military Science and Technology / v.26 no.5 / pp.422-430 / 2023
  • To build an automated system that detects toxic chemicals from Raman spectra, sufficient data on toxic chemicals must be obtained. However, gathering Raman spectra of toxic chemicals in diverse situations is usually costly. To tackle this problem, we develop methods to generate synthetic Raman spectra of DMMP and 2-CEES without actual experiments. First, we propose certain mathematical transforms to augment a few original Raman spectra. Then, we train deep generative models to generate more realistic and diverse data. By analyzing the synthetic Raman spectra of toxic chemicals generated by our methods through visualization, we qualitatively verify that the data are diverse and sufficiently similar to the original data. In conclusion, we obtain a synthetic dataset of DMMP and 2-CEES with the proposed algorithm.

Generation of Synthetic Time Series Wind Speed Data using Second-Order Markov Chain Model (2차 마르코프 사슬 모델을 이용한 시계열 인공 풍속 자료의 생성)

  • Ki-Wahn Ryu
    • Journal of Wind Energy / v.14 no.1 / pp.37-43 / 2023
  • In this study, synthetic time series wind data was generated numerically using a second-order Markov chain. One year of wind data measured in 2020 by the AWS on Wido Island was used to investigate the statistics of the measured wind data. Both the transition probability matrix and the cumulative transition probability matrix for annual hourly mean wind speed were obtained through statistical analysis. The probability density distribution over wind speed and the autocorrelation over time were compared for the first- and second-order Markov chains with various lengths of time series wind data. Probability density distributions for the measured wind data and for the synthetic wind data from the first- and second-order Markov chains were also compared with each other. For the second-order Markov chain, some improvement in the autocorrelation was verified. The autocorrelation was found to converge to zero as the wind speed increases when the data size is sufficiently large. The generation of artificial wind data is expected to be useful as input data for virtual digital twin wind turbines.
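The procedure described above can be sketched as follows: wind speeds are binned into discrete states, a cumulative transition probability row is estimated for every pair of consecutive states, and new states are drawn by comparing a uniform random number with that row. The bin width, the uniform fallback for unseen state pairs, the toy input series, and all function names are assumptions rather than details from the paper.

```python
import bisect
import random
from collections import defaultdict
from itertools import accumulate

def fit_cumulative_transitions(states, n_states):
    """Estimate cumulative transition probabilities for each state pair."""
    counts = defaultdict(lambda: [0] * n_states)
    for a, b, c in zip(states, states[1:], states[2:]):
        counts[(a, b)][c] += 1  # transition (a, b) -> c
    cumulative = {}
    for pair, row in counts.items():
        total = sum(row)
        cumulative[pair] = list(accumulate(x / total for x in row))
    return cumulative

def generate(cumulative, start_pair, length, n_states, rng):
    """Generate a state sequence from the second-order chain."""
    seq = list(start_pair)
    while len(seq) < length:
        row = cumulative.get((seq[-2], seq[-1]))
        if row is None:
            nxt = rng.randrange(n_states)  # unseen pair: uniform fallback
        else:
            # Inverse-CDF draw; min() guards against floating-point rounding.
            nxt = min(bisect.bisect_right(row, rng.random()), n_states - 1)
        seq.append(nxt)
    return seq

speeds = [3.2, 4.1, 5.6, 4.8, 3.9, 4.4, 6.1, 5.2, 3.3, 4.9]  # toy data, m/s
bin_width = 1.0                                               # assumed bin size
states = [int(v // bin_width) for v in speeds]
table = fit_cumulative_transitions(states, n_states=8)
synthetic = generate(table, (states[0], states[1]), 24, 8, random.Random(0))
```

A first-order chain would key the table on a single previous state instead of a pair; conditioning on two previous states is what the abstract credits with the improved autocorrelation.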