• Title/Summary/Keyword: multi?model ensemble

Development of the Selected Multi-model Consensus Technique for the Tropical Cyclone Track Forecast in the Western North Pacific (태풍 진로예측을 위한 다중모델 선택 컨센서스 기법 개발)

  • Jun, Sanghee;Lee, Woojeong;Kang, KiRyong;Yun, Won-Tae
    • Atmosphere
    • /
    • v.25 no.2
    • /
    • pp.375-387
    • /
    • 2015
  • A Selected Multi-model CONsensus (SMCON) technique was developed and verified for tropical cyclone track forecasts in the western North Pacific. The SMCON forecasts were produced by averaging the forecasts of those numerical models, among 21 models, whose latest 6-h prediction errors ranked in the lowest 70%. In a homogeneous comparison for 54 tropical cyclones in 2013 and 2014, the SMCON improvement rate was higher than that of the other forecasts, such as the Non-Selected Multi-model CONsensus (NSMCON) and other numerical models (i.e., GDAPS, GEPS, GFS, HWRF, ECMWF, ECMWF_H, ECMWF_EPS, JGSM, TEPS). However, the SMCON showed a lower or similar improvement rate compared with a few forecasts, including the ECMWF_EPS forecasts at 96 h in 2013 and at 72 h in 2014 and the TEPS forecast at 120 h in 2013. Mean track errors of the SMCON over the two years were smaller than those of the NSMCON, with differences of 0.4, 1.2, 5.9, 12.9, and 8.2 km at 24, 48, 72, 96, and 120 h, respectively. The SMCON error distributions showed a smaller central tendency than the NSMCON's, except for the 72- and 96-h forecasts in 2013. Similarly, in the kernel density estimation analysis, the density of smaller track errors was higher for the SMCON than for the NSMCON, except for the 72- and 96-h forecasts in 2013. In addition, the NSMCON had a larger range of errors above the third quartile and a larger standard deviation than the SMCON for the 72- and 96-h forecasts in 2013. The SMCON also showed a smaller cross-track bias than ECMWF_H. Thus, we concluded that the SMCON could provide more reliable information on tropical cyclone track forecasts by reflecting the real-time performance of the numerical models.
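
The selection-then-average step described in the abstract can be illustrated with a minimal sketch. It assumes each model supplies one forecast position per lead time and one latest 6-h track error. The function name `smcon_position`, the rounding rule for the 70% cut, and the sample values are hypothetical and are not taken from the paper.

```python
def smcon_position(forecasts, recent_errors_km, keep_fraction=0.7):
    """Average the positions of the models with the smallest recent errors.

    forecasts: {model_name: (lat_deg, lon_deg)} for one lead time.
    recent_errors_km: {model_name: latest 6-h track error in km}.
    """
    # Only models with both a forecast and a recent error can be ranked.
    candidates = [m for m in forecasts if m in recent_errors_km]
    if not candidates:
        raise ValueError("no model has both a forecast and a recent error")
    ranked = sorted(candidates, key=lambda m: recent_errors_km[m])
    # Assumption: round the cut to the nearest whole model, keep at least one.
    n_keep = max(1, round(len(ranked) * keep_fraction))
    selected = ranked[:n_keep]
    # Plain arithmetic mean of latitude and longitude (no dateline handling).
    lat = sum(forecasts[m][0] for m in selected) / n_keep
    lon = sum(forecasts[m][1] for m in selected) / n_keep
    return (lat, lon), selected


# Made-up example: model "D" has the largest recent error and is dropped.
forecasts = {"A": (20.0, 130.0), "B": (20.4, 130.6),
             "C": (19.9, 129.8), "D": (22.0, 133.0)}
errors = {"A": 12.0, "B": 18.0, "C": 25.0, "D": 90.0}
consensus, used = smcon_position(forecasts, errors)
```

A non-selected consensus in the same sketch would correspond to `keep_fraction=1.0`, which averages every available model.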

Effective viscosity of bidisperse suspensions

  • Koo Sangkyun;Song Kwang Ho
    • Korea-Australia Rheology Journal
    • /
    • v.17 no.1
    • /
    • pp.27-32
    • /
    • 2005
  • We determine the effective viscosity of suspensions with a bidisperse particle size distribution by modifying an effective-medium theory that was proposed by Acrivos and Chang (1987) for monodisperse suspensions. The modified theory uses a simple model that captures some important effects of multi-particle hydrodynamic interactions. The modifications are described in detail in the present study. Estimates of the effective viscosity from the modified theory are compared with the results of prior work for monodisperse and bidisperse suspensions. The estimates agree very well with experimental or other calculated results up to a normalized particle volume fraction of approximately 0.45, where the normalized fraction is the ratio of the volume fraction to the maximum volume fraction of particles for bidisperse suspensions.

Ensemble of Nested Dichotomies for Activity Recognition Using Accelerometer Data on Smartphone (Ensemble of Nested Dichotomies 기법을 이용한 스마트폰 가속도 센서 데이터 기반의 동작 인지)

  • Ha, Eu Tteum;Kim, Jeongmin;Ryu, Kwang Ryel
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.4
    • /
    • pp.123-132
    • /
    • 2013
  • As smartphones are equipped with various sensors such as the accelerometer, GPS, gravity sensor, gyroscope, ambient light sensor, and proximity sensor, there has been much research on using these sensors to create valuable applications. Human activity recognition is one such application, motivated by welfare applications such as support for the elderly, measurement of calorie consumption, and analysis of lifestyles and exercise patterns. One of the challenges in using smartphone sensors for activity recognition is that the number of sensors used should be minimized to save battery power. When the number of sensors is restricted, it is difficult to build a highly accurate activity recognizer or classifier, because it is hard to distinguish between subtly different activities from limited information. The difficulty becomes especially severe when the number of activity classes to be distinguished is large. In this paper, we show that a fairly accurate classifier can be built that distinguishes ten different activities using data from only a single sensor, the smartphone accelerometer. Our approach to this ten-class problem is the ensemble of nested dichotomies (END) method, which transforms a multi-class problem into multiple two-class problems. END builds a committee of binary classifiers in a nested fashion using a binary tree. At the root of the binary tree, the set of all classes is split into two subsets of classes by a binary classifier. At a child node of the tree, a subset of classes is again split into two smaller subsets by another binary classifier. Continuing in this way, we obtain a binary tree in which each leaf node contains a single class. This binary tree can be viewed as a nested dichotomy that can make multi-class predictions. 
Depending on how a set of classes is split into two subsets at each node, the final tree can differ. Since some classes may be correlated, a particular tree may perform better than the others. However, the best tree can hardly be identified without deep domain knowledge. The END method copes with this problem by building multiple dichotomy trees randomly during learning and then combining the predictions made by each tree during classification. The END method is generally known to perform well even when the base learner is unable to model complex decision boundaries. As the base classifier at each node of the dichotomy, we have used another ensemble classifier, the random forest. A random forest is built by repeatedly generating a decision tree, each time with a different random subset of features and a bootstrap sample. By combining bagging with random feature subset selection, a random forest has more diverse ensemble members than simple bagging. Overall, our ensemble of nested dichotomies can be seen as a committee of committees of decision trees that can handle a multi-class problem with high accuracy. The ten classes of activities that we distinguish in this paper are 'Sitting', 'Standing', 'Walking', 'Running', 'Walking Uphill', 'Walking Downhill', 'Running Uphill', 'Running Downhill', 'Falling', and 'Hobbling'. The features used for classifying these activities include not only the magnitude of the acceleration vector at each time point but also the maximum, the minimum, and the standard deviation of the vector magnitude within a time window of the last 2 seconds, among others. For experiments comparing the performance of END with that of other methods, accelerometer data were collected every 0.1 second for 2 minutes for each activity from 5 volunteers. 
Among the 5,900 ($=5{\times}(60{\times}2-2)/0.1$) data points collected for each activity (the data for the first 2 seconds are discarded because they do not have time window data), 4,700 have been used for training and the rest for testing. Although 'Walking Uphill' is often confused with some other similar activities, END has been found to classify all ten activities with a fairly high accuracy of 98.4%. In comparison, the accuracies achieved by a decision tree, a k-nearest neighbor classifier, and a one-versus-rest support vector machine were 97.6%, 96.5%, and 97.6%, respectively.
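
The nested-dichotomy idea described above (random binary trees over the class set, class probability as the product of branch probabilities, and averaging over trees) can be sketched as follows. This is an illustration only: the uniform random split, all function names, and the `p_left` callback, which stands in for the random forest trained at each node, are assumptions and not the authors' implementation.

```python
import random

ACTIVITIES = ["Sitting", "Standing", "Walking", "Running", "Walking Uphill",
              "Walking Downhill", "Running Uphill", "Running Downhill",
              "Falling", "Hobbling"]


def random_dichotomy(classes, rng):
    """Recursively split the class list at random into a binary tree.

    A leaf is a single class label; an internal node is a (left, right) tuple.
    """
    if len(classes) == 1:
        return classes[0]
    shuffled = list(classes)
    rng.shuffle(shuffled)
    cut = rng.randint(1, len(shuffled) - 1)  # both sides stay non-empty
    return (random_dichotomy(shuffled[:cut], rng),
            random_dichotomy(shuffled[cut:], rng))


def leaf_classes(tree):
    """Return all class labels stored below a node."""
    if not isinstance(tree, tuple):
        return [tree]
    return leaf_classes(tree[0]) + leaf_classes(tree[1])


def class_probabilities(tree, p_left, x):
    """Probability of each class: product of branch probabilities on its path.

    p_left(left_classes, right_classes, x) is a hypothetical stand-in for the
    binary classifier at a node; it returns P(x belongs to the left subset).
    """
    if not isinstance(tree, tuple):
        return {tree: 1.0}
    left, right = tree
    pl = p_left(frozenset(leaf_classes(left)), frozenset(leaf_classes(right)), x)
    probs = {c: pl * p for c, p in class_probabilities(left, p_left, x).items()}
    for c, p in class_probabilities(right, p_left, x).items():
        probs[c] = (1.0 - pl) * p
    return probs


def end_predict(trees, p_left, x):
    """Average the class probabilities of all trees and pick the best class."""
    total = {}
    for tree in trees:
        for c, p in class_probabilities(tree, p_left, x).items():
            total[c] = total.get(c, 0.0) + p / len(trees)
    return max(total, key=total.get), total


rng = random.Random(0)
trees = [random_dichotomy(ACTIVITIES, rng) for _ in range(10)]
```

In a real system, `p_left` would look up the classifier trained for that particular split; here it is left as a callback so that the tree construction and the probability combination can be read on their own.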

API Feature Based Ensemble Model for Malware Family Classification (악성코드 패밀리 분류를 위한 API 특징 기반 앙상블 모델 학습)

  • Lee, Hyunjong;Euh, Seongyul;Hwang, Doosung
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.29 no.3
    • /
    • pp.531-539
    • /
    • 2019
  • This paper proposes training features for malware family analysis and analyzes the multi-class classification performance of ensemble models. We construct training data by extracting API and DLL information from malware executables and use the Random Forest and XGBoost algorithms, which are based on decision trees. API, API-DLL, and DLL-CM features for malware detection and family classification are proposed by analyzing frequently used API and DLL information from malware and converting high-dimensional features to low-dimensional features. The proposed feature selection method provides the advantages of data dimension reduction and fast learning. In the performance comparison, the malware detection rate is 93.0% for Random Forest, the accuracy on the malware family dataset is 92.0% for XGBoost, and the false positive rate on the malware family dataset including benign samples is about 3.5% for both Random Forest and XGBoost.

A Sampling Stochastic Linear Programming Model for Coordinated Multi-Reservoir Operation (저수지군 연계운영을 위한 표본 추계학적 선형 계획 모형)

  • Lee, Yong-Dae;Kim, Sheung-Kown;Kim, Jae-Hee
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 2004.05a
    • /
    • pp.685-688
    • /
    • 2004
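  • This study proposes a Sampling Stochastic Linear Programming (SSLP) model for coordinated multi-reservoir operation. A typical stochastic model estimates the probability distribution of the random variables from historical data, divides it into several intervals to obtain discrete probability values, and derives the operating policy that maximizes the expected value. However, such a model is quite limited in reflecting the persistence effect and the joint spatial and temporal correlations (covariance between basins and across time steps) that should be considered when forecasting reservoir inflows. To overcome this, we present a stochastic linear programming model formulated as a simple recourse model and based on the sampling stochastic technique, which preserves the spatio-temporal correlations by using the historical records themselves as inflow scenarios. By applying scenarios from Ensemble Streamflow Prediction (ESP), the method used by the U.S. National Weather Service (NWS) to forecast possible inflow scenarios, the model is expected to yield more reliable coordinated multi-reservoir operation plans.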

Evaluating Changes and Uncertainty of Nitrogen Load from Rice Paddy according to the Climate Change Scenario Multi-Model Ensemble (기후변화시나리오 다중모형 앙상블에 따른 논 질소 유출 부하량 변동 및 불확실성 평가)

  • Choi, Soon-Kun;Jeong, Jaehak;Yeob, So-Jin;Kim, Minwook;Kim, Jin Ho;Kim, Min-Kyeong
    • Journal of The Korean Society of Agricultural Engineers
    • /
    • v.62 no.5
    • /
    • pp.47-62
    • /
    • 2020
  • Rice paddies account for approximately 52.5% of all farmland in South Korea and are closely related to the water environment. Climate change is expected to affect not only agricultural productivity but also water and nutrient circulation. This study therefore aimed to evaluate changes in the nitrogen load from rice paddies while considering the uncertainty of climate change scenarios. The APEX-Paddy model, which reflects the rice paddy environment by modifying the APEX (Agricultural Policy and Environmental eXtender) model, was used. Using AIMS (APCC Integrated Modeling Solution), offered by the APEC Climate Center, bias correction was conducted for 9 GCMs using non-parametric quantile mapping. The bias-corrected climate change scenarios were applied to the APEX-Paddy model. The changes and uncertainty in runoff and nitrogen load were evaluated using a multi-model ensemble. Comparing the 2085s (2071 to 2100) against the base period (1976 to 2005), paddy runoff changed by 23.1% for the RCP4.5 scenario and 45.5% for the RCP8.5 scenario. The nitrogen load increased by 43.9% for the RCP4.5 scenario and 76.0% for the RCP8.5 scenario. The uncertainty analysis showed that the annual standard deviation of nitrogen loads increased in the future, and the maximum entropy showed an increasing tendency. Duncan's analysis showed increasingly significant differences among GCMs further into the future. The results of this study are expected to serve as a basis for mid- and long-term policies for water resources and the water system environment under climate change.
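
Non-parametric (empirical) quantile mapping, the bias-correction approach named in the abstract, can be sketched in a few lines. The version below assumes a simple variant in which a model value is located within the sorted historical model sample by linear interpolation and then mapped to the same relative position in the observed sample. The function names and sample values are hypothetical, and the actual AIMS procedure may differ.

```python
from bisect import bisect_right


def fractional_rank(sorted_vals, v):
    """Position of v within sorted_vals, in [0, n-1], by linear interpolation."""
    n = len(sorted_vals)
    if v <= sorted_vals[0]:
        return 0.0
    if v >= sorted_vals[-1]:
        return float(n - 1)
    hi = bisect_right(sorted_vals, v)  # first index whose value exceeds v
    lo = hi - 1
    return lo + (v - sorted_vals[lo]) / (sorted_vals[hi] - sorted_vals[lo])


def quantile(sorted_vals, q):
    """Linearly interpolated quantile for q in [0, 1]."""
    n = len(sorted_vals)
    pos = q * (n - 1)
    lo = int(pos)
    hi = min(lo + 1, n - 1)
    frac = pos - lo
    return sorted_vals[lo] * (1.0 - frac) + sorted_vals[hi] * frac


def quantile_map(value, model_hist, obs_hist):
    """Map a model value to the observed value at the same empirical quantile.

    Values outside the historical model range are clamped to the ends of the
    observed range (a simplifying assumption). Needs at least two model values.
    """
    model_sorted = sorted(model_hist)
    obs_sorted = sorted(obs_hist)
    q = fractional_rank(model_sorted, value) / (len(model_sorted) - 1)
    return quantile(obs_sorted, q)


# Made-up example: the model runs 2 units too low across its whole range.
model_hist = [1.0, 2.0, 3.0, 4.0, 5.0]
obs_hist = [3.0, 4.0, 5.0, 6.0, 7.0]
corrected = quantile_map(2.5, model_hist, obs_hist)
```

Because the mapping is built from the empirical distributions rather than a fitted distribution, it corrects the shape of the model distribution as well as its mean.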

Two-Branch Classifier for Retinal Imaging Analysis (망막 영상 분석을 위한 두 갈래 분류기)

  • Oh, Young-tack;Park, Hyunjin
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference
    • /
    • 2021.05a
    • /
    • pp.614-616
    • /
    • 2021
  • The world faces difficulties in eye care, including treatment, quality of prevention, vision rehabilitation services, and a scarcity of trained eye care experts. However, it is difficult to develop a method for classifying various ocular diseases because existing publicly released retinal image datasets do not cover the variety of diseases found in clinical practice. We propose a method for classifying ocular diseases using the Retinal Fundus Multi-disease Image Dataset (RFMiD), a dataset published in the ISBI-2021 challenge. Our goal is to develop a robust and generalizable model for screening retinal images into normal and abnormal categories. The proposed model achieves an area under the curve (AUC) score of 0.9782 on the test dataset.

Evaluation of Agro-Climatic Index Using Multi-Model Ensemble Downscaled Climate Prediction of CMIP5 (상세화된 CMIP5 기후변화전망의 다중모델앙상블 접근에 의한 농업기후지수 평가)

  • Chung, Uran;Cho, Jaepil;Lee, Eun-Jeong
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.17 no.2
    • /
    • pp.108-125
    • /
    • 2015
  • The agro-climatic index is one way to assess the climate resources of particular agricultural areas with respect to agricultural production. It can be a key indicator of agricultural productivity because it provides the basic information required to implement various farming techniques and practices and to estimate the growth and yield of crops from climate resources such as air temperature, solar radiation, and precipitation. However, the agro-climatic index is not an absolute measure and can change. Recently, many studies that consider the uncertainty of future climate change have been conducted using the multi-model ensemble (MME) approach, by developing and improving dynamic and statistical downscaling of Global Climate Model (GCM) output. In this study, agro-climatic indices of the Korean Peninsula, namely growing degree day based on $5^{\circ}C$, plant period based on $5^{\circ}C$, crop period based on $10^{\circ}C$, and frost free day, were calculated to assess the spatio-temporal variations and uncertainties of the indices under climate change; the downscaled historical (1976-2005) and near-future (2011-2040) RCP climate scenarios of AR5 were applied to the calculation of the indices. The results showed that the four agro-climatic indices calculated from nine individual GCMs, as well as from the MME, agreed with the indices calculated from observed data. It was confirmed that the MME and each individual GCM reproduced the past climate well in the four major river basins of South Korea (Han, Nakdong, Geum, and Seomjin and Yeongsan). However, spatial downscaling still needs further improvement, since the agro-climatic indices of some individual GCMs varied differently from the observed indices in the changes of spatial distribution across the four river basins. The four agro-climatic indices of the Korean Peninsula were expected to increase in the nine individual GCMs and the MME under future climate scenarios. 
The differences and uncertainties of the agro-climatic indices were not reduced by combining an unlimited number of models in the multi-model ensemble. Further research is still required, although the differences started to improve when three or four individual GCMs were combined in this study. The agro-climatic indices derived and evaluated in this study will serve as the baseline for assessing agro-climatic abnormality indices and agro-productivity indices in subsequent research.
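
As an illustration of how one of these indices and its ensemble statistics might be computed, the sketch below uses a common textbook definition of growing degree days (the sum of daily mean temperature in excess of the base temperature). The abstract does not state the exact formula used, so this definition, the function names, and all sample values are assumptions.

```python
from statistics import mean, stdev


def growing_degree_days(daily_mean_temps_c, base_c=5.0):
    """Sum of daily mean temperature above the base; colder days add zero."""
    return sum(max(0.0, t - base_c) for t in daily_mean_temps_c)


def mme_summary(index_by_gcm):
    """Multi-model ensemble mean and spread (sample standard deviation)."""
    values = list(index_by_gcm.values())
    return mean(values), stdev(values)


# Made-up daily mean temperatures (deg C) for four days.
gdd = growing_degree_days([3.0, 5.0, 7.5, 10.0])

# Made-up seasonal GDD totals from three hypothetical GCMs.
ensemble_mean, ensemble_spread = mme_summary(
    {"gcm_a": 100.0, "gcm_b": 110.0, "gcm_c": 120.0}
)
```

The ensemble spread is one simple way to express the between-model uncertainty that the abstract discusses; it shrinks only if the added models agree with one another.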

An enhancement of GloSea5 ensemble weather forecast based on ANFIS (ANFIS를 활용한 GloSea5 앙상블 기상전망기법 개선)

  • Moon, Geon-Ho;Kim, Seon-Ho;Bae, Deg-Hyo
    • Journal of Korea Water Resources Association
    • /
    • v.51 no.11
    • /
    • pp.1031-1041
    • /
    • 2018
  • An ANFIS-based methodology for improving the GloSea5 ensemble weather forecast is developed and evaluated in this study. The proposed method consists of two steps: pre-processing and post-processing. In the pre-processing step, weights are assigned to the GloSea5 ensemble members based on the Optimal Weighting Method (OWM). In the post-processing step, the bias of the pre-processed results is corrected based on the Model Output Statistics (MOS) method. The watershed of the Chungju multi-purpose dam in South Korea is selected as the study area. The evaluation indicated that the results of the pre-processing step (CASE1), the post-processing step (CASE2), and the combined pre- and post-processing steps (CASE3) were significantly improved over the original GloSea5 bias correction (BC_GS5). Correction performance was best for CASE3, followed by CASE1 and then CASE2. The accuracy of pre-processing also improved during the season with high precipitation variability. The post-processing step reduced the error that could not be smoothed by the pre-processing step. We conclude that this methodology, using ANFIS, improved the skill of the GloSea5 ensemble weather forecast, especially for the summer season with high precipitation variability, when both pre- and post-processing steps were applied.

Investigating Data Preprocessing Algorithms of a Deep Learning Postprocessing Model for the Improvement of Sub-Seasonal to Seasonal Climate Predictions (계절내-계절 기후예측의 딥러닝 기반 후보정을 위한 입력자료 전처리 기법 평가)

  • Uran Chung;Jinyoung Rhee;Miae Kim;Soo-Jin Sohn
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.25 no.2
    • /
    • pp.80-98
    • /
    • 2023
  • This study explores the effectiveness of various data preprocessing algorithms for improving subseasonal to seasonal (S2S) climate predictions from six climate forecast models and their Multi-Model Ensemble (MME) using a deep learning-based postprocessing model. A pipeline of data transformation algorithms was constructed to convert raw S2S prediction data into training data processed with several statistical distributions. A dimensionality reduction algorithm was applied to select features by ranking the correlation coefficients between the observed and the input data. The training model in the study was designed with the TimeDistributed wrapper applied to all convolutional layers of U-Net: the TimeDistributed wrapper allows a U-Net convolutional layer to be applied directly to 5-dimensional time series data while maintaining the time axis of the data, but every input should be at least 3D in U-Net. We found that the Robust and Standard transformation algorithms are the most suitable for improving S2S predictions. The dimensionality reduction based on feature selection did not significantly improve predictions of daily precipitation for the six climate models and even worsened predictions of daily maximum and minimum temperatures. While deep learning-based postprocessing also improved MME S2S precipitation predictions, it did not have a significant effect on temperature predictions, particularly for lead times of weeks 1 and 2. Further research is needed to develop an optimal deep learning model for improving S2S temperature predictions by testing various models and parameters.
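
The abstract does not define the Robust and Standard transformations. The sketch below assumes they are the usual scalings: standard scaling subtracts the mean and divides by the standard deviation, and robust scaling subtracts the median and divides by the interquartile range. The function names and the sample data are hypothetical.

```python
from statistics import mean, median, pstdev, quantiles


def standard_transform(values):
    """Center on the mean and scale by the population standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)
    return [(v - mu) / sigma for v in values]


def robust_transform(values):
    """Center on the median and scale by the interquartile range (IQR)."""
    med = median(values)
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return [(v - med) / iqr for v in values]


# Made-up daily precipitation-like sample with one extreme value.
sample = [1.0, 2.0, 3.0, 4.0, 100.0]
standardized = standard_transform(sample)
robust = robust_transform(sample)
```

With a skewed variable such as daily precipitation, the robust version keeps the bulk of the data on a comparable scale because the median and IQR are barely affected by the extreme value, whereas the mean and standard deviation are not.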