• Title/Summary/Keyword: ensemble method

Search Result 507, Processing Time 0.038 seconds

Gaussian noise addition approaches for ensemble optimal interpolation implementation in a distributed hydrological model

  • Manoj Khaniya;Yasuto Tachikawa;Kodai Yamamoto;Takahiro Sayama;Sunmin Kim
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2023.05a
    • /
    • pp.25-25
    • /
    • 2023
  • The ensemble optimal interpolation (EnOI) scheme is a sub-optimal alternative to the ensemble Kalman filter (EnKF) with a reduced computational demand making it potentially more suitable for operational applications. Since only one model is integrated forward instead of an ensemble of model realizations, online estimation of the background error covariance matrix is not possible in the EnOI scheme. In this study, we investigate two Gaussian noise based ensemble generation strategies to produce dynamic covariance matrices for assimilation of water level observations into a distributed hydrological model. In the first approach, spatially correlated noise, sampled from a normal distribution with a fixed fractional error parameter (which controls its standard deviation), is added to the model forecast state vector to prepare the ensembles. In the second method, we use an adaptive error estimation technique based on the innovation diagnostics to estimate this error parameter within the assimilation framework. The results from a real and a set of synthetic experiments indicate that the EnOI scheme can provide better results when an optimal EnKF is not identified, but performs worse than the ensemble filter when the true error characteristics are known. Furthermore, while the adaptive approach is able to reduce the sensitivity to the fractional error parameter affecting the first (non-adaptive) approach, results are usually worse at ungauged locations with the former.

  • PDF

Uncertainty investigation and mitigation in flood forecasting

  • Nguyen, Hoang-Minh;Bae, Deg-Hyo
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2018.05a
    • /
    • pp.155-155
    • /
    • 2018
  • Uncertainty in flood forecasting using a coupled meteorological and hydrological model is arisen from various sources, especially the uncertainty comes from the inaccuracy of Quantitative Precipitation Forecasts (QPFs). In order to improve the capability of flood forecast, the uncertainty estimation and mitigation are required to perform. This study is conducted to investigate and reduce such uncertainty. First, ensemble QPFs are generated by using Monte - Carlo simulation, then each ensemble member is forced as input for a hydrological model to obtain ensemble streamflow prediction. Likelihood measures are evaluated to identify feasible member. These members are retained to define upper and lower limits of the uncertainty interval and assess the uncertainty. To mitigate the uncertainty for very short lead time, a blending method, which merges the ensemble QPFs with radar-based rainfall prediction considering both qualitative and quantitative skills, is proposed. Finally, blending bias ratios, which are estimated from previous time step, are used to update the members over total lead time. The proposed method is verified for the two flood events in 2013 and 2016 in the Yeonguol and Soyang watersheds that are located in the Han River basin, South Korea. The uncertainty in flood forecasting using a coupled Local Data Assimilation and Prediction System (LDAPS) and Sejong University Rainfall - Runoff (SURR) model is investigated and then mitigated by blending the generated ensemble LDAPS members with radar-based rainfall prediction that uses McGill algorithm for precipitation nowcasting by Lagrangian extrapolation (MAPLE). The results show that the uncertainty of flood forecasting using the coupled model increases when the lead time is longer. The mitigation method indicates its effectiveness for mitigating the uncertainty with the increases of the percentage of feasible member (POFM) and the ratio of the number of observations that fall into the uncertainty interval (p-factor).

  • PDF

Noise Correction of Remote Sensing Imageries: Application to KOMPSAT/OSMI Data

  • Kang, Y.Q.;Ahn, Y.H.
    • Proceedings of the KSRS Conference
    • /
    • 2003.11a
    • /
    • pp.694-696
    • /
    • 2003
  • The KOMPSAT/OSMI remote sending data of 800 km swath are collected by whisk broom method employing 96 charge coupled devices (CCDs). The stripping noise in the OSMI imageries, which arise mainly due to the non-uniform sensitivities of 96 CCDs, are the major hindrance for oceanographic applications of the OSMI data. The OSMI images are corrected by 'Ensemble Smoothness' method which is based on an assumption that the series of the averages and variances of digital numbers in each line should vary smoothly. The data of each line are corrected by linear regression model of which coefficients are obtained by Ensemble Smoothness method. Our algorithm can be applied not only to OSMI data but also for other remote sensing date collected by whisk broom or push broom.

  • PDF

Bankruptcy prediction using ensemble SVM model (앙상블 SVM 모형을 이용한 기업 부도 예측)

  • Choi, Ha Na;Lim, Dong Hoon
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.6
    • /
    • pp.1113-1125
    • /
    • 2013
  • Corporate bankruptcy prediction has been an important topic in the accounting and finance field for a long time. Several data mining techniques have been used for bankruptcy prediction. However, there are many limits for application to real classification problem with a single model. This study proposes ensemble SVM (support vector machine) model which assembles different SVM models with each different kernel functions. Our ensemble model is made and evaluated by v-fold cross-validation approach. The k top performing models are recruited into the ensemble. The classification is then carried out using the majority voting opinion of the ensemble. In this paper, we investigate the performance of ensemble SVM classifier in terms of accuracy, error rate, sensitivity, specificity, ROC curve, and AUC to compare with single SVM classifiers based on financial ratios dataset and simulation dataset. The results confirmed the advantages of our method: It is robust while providing good performance.

Improving an Ensemble Model by Optimizing Bootstrap Sampling (부트스트랩 샘플링 최적화를 통한 앙상블 모형의 성능 개선)

  • Min, Sung-Hwan
    • Journal of Internet Computing and Services
    • /
    • v.17 no.2
    • /
    • pp.49-57
    • /
    • 2016
  • Ensemble classification involves combining multiple classifiers to obtain more accurate predictions than those obtained using individual models. Ensemble learning techniques are known to be very useful for improving prediction accuracy. Bagging is one of the most popular ensemble learning techniques. Bagging has been known to be successful in increasing the accuracy of prediction of the individual classifiers. Bagging draws bootstrap samples from the training sample, applies the classifier to each bootstrap sample, and then combines the predictions of these classifiers to get the final classification result. Bootstrap samples are simple random samples selected from the original training data, so not all bootstrap samples are equally informative, due to the randomness. In this study, we proposed a new method for improving the performance of the standard bagging ensemble by optimizing bootstrap samples. A genetic algorithm is used to optimize bootstrap samples of the ensemble for improving prediction accuracy of the ensemble model. The proposed model is applied to a bankruptcy prediction problem using a real dataset from Korean companies. The experimental results showed the effectiveness of the proposed model.

Ensemble Design of Machine Learning Technigues: Experimental Verification by Prediction of Drifter Trajectory (앙상블을 이용한 기계학습 기법의 설계: 뜰개 이동경로 예측을 통한 실험적 검증)

  • Lee, Chan-Jae;Kim, Yong-Hyuk
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology
    • /
    • v.8 no.3
    • /
    • pp.57-67
    • /
    • 2018
  • The ensemble is a unified approach used for getting better performance by using multiple algorithms in machine learning. In this paper, we introduce boosting and bagging, which have been widely used in ensemble techniques, and design a method using support vector regression, radial basis function network, Gaussian process, and multilayer perceptron. In addition, our experiment was performed by adding a recurrent neural network and MOHID numerical model. The drifter data used for our experimental verification consist of 683 observations in seven regions. The performance of our ensemble technique is verified by comparison with four algorithms each. As verification, mean absolute error was adapted. The presented methods are based on ensemble models using bagging, boosting, and machine learning. The error rate was calculated by assigning the equal weight value and different weight value to each unit model in ensemble. The ensemble model using machine learning showed 61.7% improvement compared to the average of four machine learning technique.

Ensemble Downscaling of Soil Moisture Data Using BMA and ATPRK

  • Youn, Youjeong;Kim, Kwangjin;Chung, Chu-Yong;Park, No-Wook;Lee, Yangwon
    • Korean Journal of Remote Sensing
    • /
    • v.36 no.4
    • /
    • pp.587-607
    • /
    • 2020
  • Soil moisture is essential information for meteorological and hydrological analyses. To date, many efforts have been made to achieve the two goals for soil moisture data, i.e., the improvement of accuracy and resolution, which is very challenging. We presented an ensemble downscaling method for quality improvement of gridded soil moisture data in terms of the accuracy and the spatial resolution by the integration of BMA (Bayesian model averaging) and ATPRK (area-to-point regression kriging). In the experiments, the BMA ensemble showed a 22% better accuracy than the data sets from ESA CCI (European Space Agency-Climate Change Initiative), ERA5 (ECMWF Reanalysis 5), and GLDAS (Global Land Data Assimilation System) in terms of RMSE (root mean square error). Also, the ATPRK downscaling could enhance the spatial resolution from 0.25° to 0.05° while preserving the improved accuracy and the spatial pattern of the BMA ensemble, without under- or over-estimation. The quality-improved data sets can contribute to a variety of local and regional applications related to soil moisture, such as agriculture, forest, hydrology, and meteorology. Because the ensemble downscaling method can be applied to the other land surface variables such as temperature, humidity, precipitation, and evapotranspiration, it can be a viable option to complement the accuracy and the spatial resolution of satellite images and numerical models.

Generation of runoff ensemble members using the shot noise process based rainfall-runoff model (Shot Noise Process 기반 강우-유출 모형을 이용한 유출 앙상블 멤버 생성)

  • Kang, Minseok;Cho, Eunsaem;Yoo, Chulsang
    • Journal of Korea Water Resources Association
    • /
    • v.52 no.9
    • /
    • pp.603-613
    • /
    • 2019
  • This study proposes a method to generate runoff ensemble members using a rainfall-runoff model based on the shot noise process (hereafter the rainfall-runoff model). The proposed method was applied to generate runoff ensemble members for three drainage basins of Daerim 2, Guro 1 and the Jungdong, whose results were then compared with the observed. The parameters of the rainfall-runoff model were estimated using the empirical formulas like the Kerby, Kraven II and Russel, also the concept of modified rational formula. Gamma and exponential distributions were used to generate random numbers of the parameters of the rainfall-runoff model. Especially, the gamma distribution is found to be useful to generate various random numbers depending on the pre-assigned relationship between mean and standard deviation. Comparison between the generated runoff ensemble members and the observed shows that those runoff ensemble members generated using the gamma distribution with its standard deviation twice of the mean properly cover the observed runoff.

Diversity based Ensemble Genetic Programming for Improving Classification Performance (분류 성능 향상을 위한 다양성 기반 앙상블 유전자 프로그래밍)

  • Hong Jin-Hyuk;Cho Sung-Bae
    • Journal of KIISE:Software and Applications
    • /
    • v.32 no.12
    • /
    • pp.1229-1237
    • /
    • 2005
  • Combining multiple classifiers has been actively exploited to improve classification performance. It is required to construct a pool of accurate and diverse base classifier for obtaining a good ensemble classifier. Conventionally ensemble learning techniques such as bagging and boosting have been used and the diversify of base classifiers for the training set has been estimated, but there are some limitations in classifying gene expression profiles since only a few training samples are available. This paper proposes an ensemble technique that analyzes the diversity of classification rules obtained by genetic programming. Genetic programming generates interpretable rules, and a sample is classified by combining the most diverse set of rules. We have applied the proposed method to cancer classification with gene expression profiles. Experiments on lymphoma cancer dataset, prostate cancer dataset and ovarian cancer dataset have illustrated the usefulness of the proposed method. h higher classification accuracy has been obtained with the proposed method than without considering diversity. It has been also confirmed that the diversity increases classification performance.

Path Loss Prediction Using an Ensemble Learning Approach

  • Beom Kwon;Eonsu Noh
    • Journal of the Korea Society of Computer and Information
    • /
    • v.29 no.2
    • /
    • pp.1-12
    • /
    • 2024
  • Predicting path loss is one of the important factors for wireless network design, such as selecting the installation location of base stations in cellular networks. In the past, path loss values were measured through numerous field tests to determine the optimal installation location of the base station, which has the disadvantage of taking a lot of time to measure. To solve this problem, in this study, we propose a path loss prediction method based on machine learning (ML). In particular, an ensemble learning approach is applied to improve the path loss prediction performance. Bootstrap dataset was utilized to obtain models with different hyperparameter configurations, and the final model was built by ensembling these models. We evaluated and compared the performance of the proposed ensemble-based path loss prediction method with various ML-based methods using publicly available path loss datasets. The experimental results show that the proposed method outperforms the existing methods and can predict the path loss values accurately.