• 제목/요약/키워드: time series data generation

검색결과 108건 처리시간 0.025초

EVAPORATION DATA STOCHASTIC GENERATION FOR KING FAHAD DAM LAKE IN BISHAH, SAUDI ARABIA

  • Abdulmohsen A. Al-Shaikh
    • Water Engineering Research
    • /
    • 제2권4호
    • /
    • pp.209-218
    • /
    • 2001
  • Generation of evaporation data generally assists in planning, operation, and management of reservoirs and other water works. Annual and monthly evaporation series were generated for King Fahad Dam Lake in Bishah, Saudi Arabia. Data was gathered for period of 22 years. Tests of homogeneity and normality were conducted and results showed that data was homogeneous and normally distributed. For generating annual series, an Autoregressive first order model AR(1) was used and for monthly evaporation series method of fragments was used. Fifty replicates for annual series, and fifty replicates for each month series, each with 22 values length, were generated. Performance of the models was evaluated by comparing the statistical parameters of the generated series with those of the historical data. Annual and monthly models were found to be satisfactory in preserving the statistical parameters of the historical series. About 89% of the tested values of the considered parameters were within the assigned confidence limits

  • PDF

Development of a method of the data generation with maintaining quantile of the sample data

  • Joohyung Lee;Young-Oh Kim
    • 한국수자원학회:학술대회논문집
    • /
    • 한국수자원학회 2023년도 학술발표회
    • /
    • pp.244-244
    • /
    • 2023
  • Both the frequency and the magnitude of hydrometeorological extreme events such as severe floods and droughts are increasing. In order to prevent a damage from the climatic disaster, hydrological models are often simulated under various meteorological conditions. While performing the simulations, a synthetic data generated through time series models which maintains the key statistical characteristics of the sample data are widely applied. However, the synthetic data can easily maintains both the average and the variance of the sample data, but the quantile is not maintained well. In this study, we proposes a data generation method which maintains the quantile of the sample data well. The equations of the former maintenance of variance extension (MOVE) are expanded to maintain quantile rather than the average or the variance of the sample data. The equations are derived and the coefficients are determined based on the characteristics of the sample data that we aim to preserve. Monte Carlo simulation is utilized to assess the performance of the proposed data generation method. A time series data (data length of 500) is regarded as the sample data and selected randomly from the sample data to create the data set (data length of 30) for simulation. Data length of the selected data set is expanded from 30 to 500 by using the proposed method. Then, the average, the variance, and the quantile difference between the sample data, and the expanded data are evaluated with relative root mean square error for each simulation. As a result of the simulation, each equation which is designed to maintain the characteristic of data performs well. Moreover, expanded data can preserve the quantile of sample data more precisely than that those expanded through the conventional time series model.

  • PDF

BIVARIATE ANALYSIS에 의한 월류량에 모의발생에 관한 연구 (A STUDY ON SYNTHETIC GENERATION OF MONTHLY STREAMFLOW BY BIVARIATE ANALYSIS)

  • 서병하;윤용남;강관원
    • 물과 미래
    • /
    • 제12권2호
    • /
    • pp.63-69
    • /
    • 1979
  • The sequences of monthly streamflows constitute a non-statonary time series. The purely stochastic model has been applied to data generation of non-stationary time series. Tow different mothods--single site and multisite generation--have been used on the hydrologic time series. In this study the synthetic generation method by bivariate analysis, studied by Thomas Fiering, one of multi-site models, has been applied to the historical data on monthly streamflows at two sites in Nakdong River, and also for validity of this model the single site Thomas Fiering model applied. Through statistical analysis it has been shown that the performance of bivariate Thomas Fiering model was better than that of the other. By comparison of mean and standard deviaion between the historical and the generated, and cross correlogram interpretation, it has been known that the model used herein has good performance to simultaneously generate the monthly streamflows at two sites in a river hasin.

  • PDF

A Bayesian time series model with multiple structural change-points for electricity data

  • Kim, Jaehee
    • Journal of the Korean Data and Information Science Society
    • /
    • 제28권4호
    • /
    • pp.889-898
    • /
    • 2017
  • In this research multiple change-points estimation for South Korean electricity generation data is considered. We analyze the South Korean electricity data via deterministically trending dynamic time series model with multiple structural changes in trends in a Bayesian approach. The number of change-points and the timing are unknown. The goal is to find the best model with the appropriate number of change-points and the length of the segments. A genetic algorithm is implemented to solve this optimization problem with a variable dimension of parameters. We estimate the structural change-points for South Korean electricity generation data and Nile River flow data additionally.

임의의 표본상호상관함수와 비정규확률분포를 갖는 다중 난류시계열의 디지털 합성방법을 이용한 풍속데이터 시뮬레이션 (Wind Data Simulation Using Digital Generation of Non-Gaussian Turbulence Multiple Time Series with Specified Sample Cross Correlations)

  • 성승학;김욱;김경천;부정숙
    • 한국대기환경학회지
    • /
    • 제19권5호
    • /
    • pp.569-581
    • /
    • 2003
  • A method of synthetic time series generation was developed and applied to the simulation of homogeneous turbulence in a periodic 3 - D box and the hourly wind data simulation. The method can simulate almost exact sample auto and cross correlations of multiple time series and control non-Gaussian distribution. Using the turbulence simulation, influence of correlations, non-Gaussian distribution, and one-direction anisotropy on homogeneous structure were studied by investigating the spatial distribution of turbulence kinetic energy and enstrophy. An hourly wind data of Typhoon Robin was used to illustrate a capability of the method to simulate sample cross correlations of multiple time series. The simulated typhoon data shows a similar shape of fluctuations and almost exactly the same sample auto and cross correlations of the Robin.

태양광발전 단기예측모델 개발 (The Development of the Short-Term Predict Model for Solar Power Generation)

  • 김광득
    • 한국태양에너지학회 논문집
    • /
    • 제33권6호
    • /
    • pp.62-69
    • /
    • 2013
  • In this paper, Korea Institute of Energy Research, building integrated renewable energy monitoring system that utilizes solar power generation forecast data forecast model is proposed. Renewable energy integration of real-time monitoring system based on monitoring data were building a database and the database of the weather conditions and to study the correlation structure was tailoring. The weather forecast cloud cover data, generation data, and solar radiation data, a data mining and time series analysis using the method developed models to forecast solar power. The development of solar power in order to forecast model of weather forecast data it is important to secure. To this end, in three hours, including a three-day forecast today Meteorological data were used from the KMA(korea Meteorological Administration) site offers. In order to verify the accuracy of the predicted solar circle for each prediction and the actual environment can be applied to generation and were analyzed.

BST-IGT Model: Synthetic Benchmark Generation Technique Maintaining Trend of Time Series Data

  • Kim, Kyung Min;Kwak, Jong Wook
    • 한국컴퓨터정보학회논문지
    • /
    • 제25권2호
    • /
    • pp.31-39
    • /
    • 2020
  • 본 논문에서는 시계열 데이터를 기반으로 합성 벤치마크를 생성하는 기법을 소개한다. IoT 기기에서 측정되는 많은 데이터는 시간에 따른 수치 변화를 측정하는 시계열적 특성이 있다. 하지만 긴 기간 동안 측정되는 데이터를 일반화된 시계열 데이터로 모델링하기 힘든 문제점이 존재한다. 이런 문제를 개선하기 위해 본 논문에서는 BST-IGT 모델을 소개한다. BST-IGT 모델은 전체 데이터를 시계열 모델링이 쉬운 구간으로 분리하여 생성 데이터를 템플릿으로 수집하고 이를 기반으로 특성을 공유하거나 변형되는 새로운 합성 벤치마크를 생성한다. 제안된 모델링 기법을 이용하여 신규 벤치마크를 생성한 결과, 기존 데이터의 통계적 특성을 유지하는 합성 벤치마크와 다른 벤치마크와의 혼합으로 여러 특성을 가지는 벤치마크의 생성을 수행할 수 있었다.

2차 마르코프 사슬 모델을 이용한 시계열 인공 풍속 자료의 생성 (Generation of Synthetic Time Series Wind Speed Data using Second-Order Markov Chain Model)

  • 유기완
    • 풍력에너지저널
    • /
    • 제14권1호
    • /
    • pp.37-43
    • /
    • 2023
  • In this study, synthetic time series wind data was generated numerically using a second-order Markov chain. One year of wind data in 2020 measured by the AWS on Wido Island was used to investigate the statistics for measured wind data. Both the transition probability matrix and the cumulative transition probability matrix for annual hourly mean wind speed were obtained through statistical analysis. Probability density distribution along the wind speed and autocorrelation according to time were compared with the first- and the second-order Markov chains with various lengths of time series wind data. Probability density distributions for measured wind data and synthetic wind data using the first- and the second-order Markov chains were also compared to each other. For the case of the second-order Markov chain, some improvement of the autocorrelation was verified. It turns out that the autocorrelation converges to zero according to increasing the wind speed when the data size is sufficiently large. The generation of artificial wind data is expected to be useful as input data for virtual digital twin wind turbines.

적대적 생성망을 이용한 부동산 시계열 데이터 생성 방안 (A Methodology for Realty Time-series Generation Using Generative Adversarial Network)

  • 유재필;한창훈;신현준
    • 한국융합학회논문지
    • /
    • 제12권10호
    • /
    • pp.9-17
    • /
    • 2021
  • 최근 빅데이터 분석, 인공지능, 기계학습 등의 발전으로 인해서 데이터를 과학적으로 분석하는 기술이 발전하고 있으며 이는 의사결정 문제를 최적으로 해결해주고 있다. 그러나 특정 분야의 경우에는 데이터의 양이 부족해서 과학적 방식에 적용하는 것이 어렵다. 예컨대 부동산과 같은 데이터는 데이터 발표 시점이 최근이거나 비 유동성 자산이다 보니 발표 주기가 긴 경우가 많다. 따라서 본 연구에서는 이런 문제점을 극복하기 위해서 TimeGAN 모형을 통해 기존의 시계열의 확장 가능성에 대해서 연구하고자 한다. 이를 위해 부동산과 관련된 총 45개의 시계열을 데이터 셋에 맞게 2012년부터 2021년까지 주 단위로 데이터를 수집하고 시계열 간의 상관관계를 고려해서 총 15개의 최종 시계열을 선정한다. 15개의 시계열에 대해서 TimeGAN 모형을 통해 데이터 확장을한 결과, PCA 및 T-SNE 시각화 알고리즘을 통해 실제 데이터와 확장 데이터 간의 통계적 분포가 유사하다는 것을 확인할 수 있었다. 따라서 본 논문을 통해서 데이터의 과적합 또는 과소적합이라는 한계점을 극복할 수 있는 다양한 실험이 연구되기를 기대한다.

시간효과를 반영한 통행발생모형 개발 (The Trip Generation Models with Time-effects)

  • 김상록;김진희;김형진;정진혁
    • 대한교통학회지
    • /
    • 제30권1호
    • /
    • pp.103-112
    • /
    • 2012
  • 우리 시간의 흐름에 따라 사회 경제 구조, 생활양식 등 여러 요소가 변하면서 사람들의 통행목적과 통행행태도 변하게 된다. 하지만 기존의 통행발생모형은 장래 수요예측 시 기준연도에 추정된 parameter 값이 장래에도 동일하기 때문에 시간과 통행행태의 변화를 반영할 수 없는 문제점을 안고 있다. 이러한 문제를 개선하기 위하여 시간변화를 반영한 통행발생모형을 개발하였다. 가구통행실태조사는 특정 시점의 통행특성을 조사한 횡단면 자료이지만 그동안의 3번의 조사를 통해 시계열적 측면이 보완되었고, 조사자료를 기초로 하여 행정동을 기준으로 OD를 구축하기 때문에 이를 하나의 패널로 설정하여 패널분석을 수행할 수 있다. 따라서 본 연구에서는 1996년부터 2006년까지 조사된 가구통행실태조사자료 및 전수화 OD, 각 기준연도의 사회경제지표를 이용하여 패널분석을 통해 통행발생모형을 개발하였다. 분석 결과 시간이 지나면서 유의한 시간효과가 나타났다. 학원통행의 경우에는 시간이 지날수록 인구당 통행발생량이 증가하는 시간효과의 패턴이 나타났고, 쇼핑통행의 경우에는 감소하는 시간효과의 패턴이 나타났다.