• Title/Summary/Keyword: ensemble 평균 (average)


Measurement of cardiac output during treadmill exercise by impedance cardiography with a new ensemble average (새로운 앙상블 평균법에 의한 임피던스 심장기록법의 트래드밀 운동 중의 심박출량 측정)

  • Kim, Deok-W.;Song, Chul-G.;Oh, In-S.;Hwang, Soo-K.;Kim, Won-K.
    • Proceedings of the KOSOMBE Conference / v.1990 no.05 / pp.7-8 / 1990
  • In this study, a new ensemble averaging technique was developed to measure cardiac output during treadmill exercise. Each dZ/dt peak (C point) was used as the starting point for ensemble averaging, instead of the conventionally used R wave of the ECG, to prevent the peak of the dZ/dt waveform from blurring. When the R wave is used as the reference, the time interval from the R wave to the dZ/dt peak varies from beat to beat. Stroke volume, heart rate, and cardiac output of five male subjects were successfully measured under the Balke protocol using the new ensemble averaging technique.

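The blurring effect described in this abstract can be illustrated with a small sketch. It assumes a synthetic dZ/dt trace with one sharp peak per beat and hypothetical R-wave indices; the function name `ensemble_average` and the window sizes are illustrative and not taken from the paper.

```python
def ensemble_average(signal, anchors, before, after):
    """Average fixed-length windows of `signal` aligned on each anchor index."""
    windows = [
        signal[a - before : a + after + 1]
        for a in anchors
        if a - before >= 0 and a + after < len(signal)
    ]
    if not windows:
        raise ValueError("no complete window around any anchor")
    n = len(windows)
    return [sum(w[i] for w in windows) / n for i in range(before + after + 1)]


# Synthetic trace (assumption): three beats, each with a dZ/dt peak of height 1.0.
# The R-to-peak delay differs per beat (2, 3, and 4 samples).
r_waves = [5, 25, 45]
delays = [2, 3, 4]
signal = [0.0] * 60
for r, d in zip(r_waves, delays):
    signal[r + d] = 1.0

c_points = [r + d for r, d in zip(r_waves, delays)]

avg_c = ensemble_average(signal, c_points, before=3, after=3)  # aligned on the C point
avg_r = ensemble_average(signal, r_waves, before=0, after=6)   # aligned on the R wave

print(max(avg_c))  # 1.0: the peak survives averaging
print(max(avg_r))  # about 0.33: the peak is smeared across three sample positions
```

With C-point alignment every beat contributes its peak to the same sample, so the averaged peak keeps its height; with R-wave alignment the variable delay spreads the peak out.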

Analysis and Application of Power Consumption Patterns for Changing the Power Consumption Behaviors (전력소비행위 변화를 위한 전력소비패턴 분석 및 적용)

  • Jang, MinSeok;Nam, KwangWoo;Lee, YonSik
    • Journal of the Korea Institute of Information and Communication Engineering / v.25 no.4 / pp.603-610 / 2021
  • In this paper, we extract users' power consumption patterns and model optimal consumption patterns by taking the user's environment and emotion into account. Based on a comparative analysis of these two patterns, we present an efficient power consumption method that works by changing the user's power consumption behavior. To extract significant consumption patterns, vector standardization and binary data transformation are used, and ensemble-of-ensembles learning with k-means clustering is applied, with the support factor applied according to the value of k. The optimal power consumption pattern model is generated by applying forced and emotion-based control based on the learning results for ensemble aggregates with relatively low average consumption. Through experiments, we validate that the method can be applied to a variety of windows by adjusting the number or size of clusters, enabling forced and emotion-based control according to the user's intentions, by identifying the correlation between the number of clusters and the consistency ratios.

Improved Estimation of Hourly Surface Ozone Concentrations using Stacking Ensemble-based Spatial Interpolation (스태킹 앙상블 모델을 이용한 시간별 지상 오존 공간내삽 정확도 향상)

  • KIM, Ye-Jin;KANG, Eun-Jin;CHO, Dong-Jin;LEE, Si-Woo;IM, Jung-Ho
    • Journal of the Korean Association of Geographic Information Studies / v.25 no.3 / pp.74-99 / 2022
  • Surface ozone is produced by photochemical reactions of nitrogen oxides (NOx) and volatile organic compounds (VOCs) emitted from vehicles and industrial sites, and it adversely affects vegetation and the human body. In South Korea, ozone is monitored in real time at stations (i.e., point measurements), but it is difficult to monitor and analyze its continuous spatial distribution. In this study, surface ozone concentrations were interpolated hourly to a spatial resolution of 1.5 km using the stacking ensemble technique, followed by 5-fold cross-validation. The base models for the stacking ensemble were cokriging, multi-linear regression (MLR), random forest (RF), and support vector regression (SVR), while MLR was used as the meta model, taking all base model results as additional input variables. The results showed that the stacking ensemble model yielded better performance than the individual base models, with an averaged R of 0.76 and RMSE of 0.0065 ppm during the study period of 2020. The surface ozone concentration distribution generated by the stacking ensemble model had a wider range than those of the base models, with a spatial pattern similar to terrain and urbanization variables. The proposed model should not only be capable of producing the hourly spatial distribution of ozone but also be highly applicable to calculating the daily maximum 8-hour ozone concentrations.
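The stacking idea in this abstract (base models whose cross-validated outputs feed an MLR meta model) can be sketched in a few lines. This is a toy illustration under stated assumptions, not the authors' pipeline: the two base models are single-predictor linear regressions standing in for cokriging, RF, and SVR; the data are synthetic; and the meta model sees only the base-model outputs, whereas the paper adds them to the other input variables. All function names are hypothetical.

```python
import math


def solve(A, b):
    """Solve A x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(n):
            if r != c:
                f = M[r][c] / M[c][c]
                M[r] = [x - f * y for x, y in zip(M[r], M[c])]
    return [M[i][n] / M[i][i] for i in range(n)]


def fit_linear(X, y):
    """Least-squares fit of y ~ intercept + X via the normal equations."""
    Z = [[1.0] + list(row) for row in X]
    k = len(Z[0])
    A = [[sum(z[i] * z[j] for z in Z) for j in range(k)] for i in range(k)]
    b = [sum(z[i] * t for z, t in zip(Z, y)) for i in range(k)]
    return solve(A, b)


def predict_linear(w, row):
    return w[0] + sum(wi * xi for wi, xi in zip(w[1:], row))


def rmse(pred, truth):
    return math.sqrt(sum((p - t) ** 2 for p, t in zip(pred, truth)) / len(truth))


# Toy data (assumption): two predictors on a 4x4 grid, target linear in both.
X = [(float(a), float(b)) for a in range(4) for b in range(4)]
y = [1.0 + 2.0 * a + 3.0 * b for a, b in X]

# Two deliberately weak base models, each seeing a single predictor.
base_columns = [0, 1]
k = 4
oof = [[0.0] * len(base_columns) for _ in X]  # out-of-fold base predictions
for fold in range(k):
    test_idx = [i for i in range(len(X)) if i % k == fold]
    train_idx = [i for i in range(len(X)) if i % k != fold]
    for m, col in enumerate(base_columns):
        w = fit_linear([[X[i][col]] for i in train_idx], [y[i] for i in train_idx])
        for i in test_idx:
            oof[i][m] = predict_linear(w, [X[i][col]])

# Meta model: multiple linear regression on the out-of-fold base outputs.
meta_w = fit_linear(oof, y)
stacked = [predict_linear(meta_w, row) for row in oof]

base_rmse = [rmse([row[m] for row in oof], y) for m in range(len(base_columns))]
stack_rmse = rmse(stacked, y)
print(base_rmse, stack_rmse)
```

Fitting the meta model on out-of-fold predictions, rather than on predictions for data the base models were trained on, keeps it from rewarding base models that merely memorize their training set. The stacked error here is measured on the same rows used to fit the meta model, so it is optimistic.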

A Multimodal Profile Ensemble Approach to Development of Recommender Systems Using Big Data (빅데이터 기반 추천시스템 구현을 위한 다중 프로파일 앙상블 기법)

  • Kim, Minjeong;Cho, Yoonho
    • Journal of Intelligence and Information Systems / v.21 no.4 / pp.93-110 / 2015
  • A recommender system recommends products to the customers who are likely to be interested in them. Various recommender systems have been developed based on automated information filtering technology. Collaborative filtering (CF), one of the most successful recommendation algorithms, has been applied in a number of domains, such as recommending Web pages, books, movies, music, and products. However, CF is known to have a critical shortcoming. CF finds neighbors whose preferences are similar to those of the target customer and recommends the products those neighbors have liked most. Thus, CF works properly only when there is a sufficient number of ratings on common products from customers. When customer ratings are scarce, CF forms an inaccurate neighborhood, resulting in poor recommendations. To improve the performance of CF-based recommender systems, most related studies have focused on developing novel algorithms under the assumption of a single profile, created from the user's item ratings, purchase transactions, or Web access logs. With the advent of big data, companies have come to collect more data and to use a wide variety of large-scale information. Many companies therefore recognize the importance of utilizing big data, because it lets them improve their competitiveness and create new value. In particular, the use of personal big data in recommender systems is a growing issue, because personal big data facilitate more accurate identification of users' preferences and behaviors. The proposed recommendation methodology is as follows. First, multimodal user profiles are created from personal big data in order to grasp the preferences and behavior of users from various viewpoints. We derive five user profiles based on personal information: rating, site preference, demographic, Internet usage, and topic in text.
Next, the similarity between users is calculated from the profiles, and each user's neighbors are then identified from the results. One of three ensemble approaches is applied to calculate the similarity: the similarity of the combined profile, the average similarity across profiles, or the weighted average similarity across profiles. Finally, the products most preferred by the users in the neighborhood are recommended to the target users. For the experiments, we used demographic data and a very large volume of Web log transactions for 5,000 panel users of a company that specializes in analyzing Web site rankings. R and SAS E-miner were used to implement the proposed recommender system and to conduct the topic analysis using keyword search, respectively. To evaluate recommendation performance, we used 60% of the data for training and 40% for testing. 5-fold cross-validation was also conducted to enhance the reliability of our experiments. The F1 metric, a widely used combined metric that gives equal weight to recall and precision, was employed for evaluation. In the evaluation, the proposed methodology achieved a significant improvement over the single-profile CF algorithm. In particular, the ensemble approach using weighted average similarity showed the highest performance: the improvement in F1 is 16.9 percent for the ensemble approach using weighted average similarity and 8.1 percent for the ensemble approach using the average similarity of each profile. From these results, we conclude that the multimodal profile ensemble approach is a viable solution to the problems encountered when customer ratings are scarce. This study is significant in suggesting what kinds of information can be used to create profiles in a big data environment and how they can be combined and utilized effectively.
However, our methodology needs further study before real-world application. We need to compare the differences in recommendation accuracy when the proposed method is applied to different recommendation algorithms, and then identify which combination shows the best performance.
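The three ways of combining per-profile similarities named in this abstract can be sketched as follows. The abstract does not state the similarity measure, how the combined profile is built, or how weights are chosen; cosine similarity, simple concatenation, and the weights and profile vectors below are all assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def combined_similarity(p, q):
    """Similarity of one long profile formed by concatenating all profiles."""
    u = [x for name in sorted(p) for x in p[name]]
    v = [x for name in sorted(q) for x in q[name]]
    return cosine(u, v)


def average_similarity(p, q):
    """Unweighted mean of the per-profile similarities."""
    return sum(cosine(p[name], q[name]) for name in p) / len(p)


def weighted_similarity(p, q, weights):
    """Weighted mean of the per-profile similarities."""
    total = sum(weights[name] for name in p)
    return sum(weights[name] * cosine(p[name], q[name]) for name in p) / total


# Hypothetical users with two of the five profile types.
user_a = {"rating": [1.0, 0.0], "site": [1.0, 1.0]}
user_b = {"rating": [1.0, 0.0], "site": [1.0, 0.0]}
weights = {"rating": 3.0, "site": 1.0}  # illustrative weights, not from the paper

sim_combined = combined_similarity(user_a, user_b)
sim_average = average_similarity(user_a, user_b)
sim_weighted = weighted_similarity(user_a, user_b, weights)
print(sim_combined, sim_average, sim_weighted)
```

Whichever similarity is chosen then drives neighbor selection exactly as in single-profile CF, so the ensemble step can be swapped in without changing the rest of the recommender.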

Climate Change Impact Assessments on Korean Water Resources using Multi-Model Ensemble (MME(Multi-Model Ensemble)를 활용한 국가 수자원 기후변화 영향평가)

  • Bae, Deg-Hyo;Jeong, Il-Won;Lee, Byung-Ju;Jun, Tae-Hyun
    • Proceedings of the Korea Water Resources Association Conference / 2009.05a / pp.198-202 / 2009
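  • Climate change is known to alter precipitation and temperature and thereby have a profound impact on water resources. Stable water resources management therefore requires a quantitative assessment of climate change impacts. Studies of climate change impacts on water resources generally proceed through four steps: greenhouse gas emission scenarios, climate simulation with GCMs, downscaling to correct spatiotemporal bias, and production of runoff scenarios by applying runoff models. However, each step leading up to the runoff scenarios carries its own uncertainty, so the uncertainty of the final result grows considerably as it passes through the steps. The multi-model ensemble technique, which applies a simple average or weights across various emission scenarios, GCM results, and runoff models, is widely used in uncertainty assessment because it can present the range of values across cases. In this study, the multi-model ensemble was applied to 109 mid-sized watersheds in Korea's five major river basins to assess the impact of climate change on water resources. For the 120-year period from 1971 to 2100, three greenhouse gas emission scenarios and results from 13 GCMs were collected, giving 39 climate scenarios in total; these were applied to eight runoff models to produce 312 runoff scenarios. The runoff scenarios were divided into three future periods (2020s, 2050s, 2080s) and their rates of change relative to the baseline period (1971~2000) were analyzed. Both summer and winter runoff were projected to increase, but the winter runoff projections showed greater uncertainty than the summer ones. Spatially, the northern basins, where the Han River basin is located, showed greater uncertainty than the southern ones. Consequently, the spatiotemporal variation in runoff is expected to increase flood damage, and changes in monthly runoff are projected to make securing and managing water supply more difficult.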

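The scenario counts in this abstract follow from a simple cross product, and the ensemble summary is a mean plus a range. The sketch below reproduces only that bookkeeping: the scenario labels are placeholders and `toy_change` returns made-up numbers, not model output.

```python
from itertools import product

# Placeholder labels (assumption): the abstract gives only the counts 3, 13, and 8.
emissions = [f"E{i}" for i in range(1, 4)]
gcms = [f"GCM{i:02d}" for i in range(1, 14)]
runoff_models = [f"H{i}" for i in range(1, 9)]

climate_scenarios = list(product(emissions, gcms))                # 3 x 13 = 39
runoff_scenarios = list(product(emissions, gcms, runoff_models))  # 39 x 8 = 312


def toy_change(e, g, h):
    """Placeholder percent change in runoff; NOT real model output."""
    return float(emissions.index(e) + gcms.index(g) % 5 - runoff_models.index(h) % 3)


changes = {member: toy_change(*member) for member in runoff_scenarios}

# Simple-average ensemble and the spread across members.
simple_mean = sum(changes.values()) / len(changes)
spread = (min(changes.values()), max(changes.values()))

# Weighted ensemble; equal weights reproduce the simple average.
weights = {member: 1.0 for member in runoff_scenarios}
weighted_mean = sum(weights[m] * changes[m] for m in runoff_scenarios) / sum(weights.values())

print(len(climate_scenarios), len(runoff_scenarios), simple_mean, spread)
```

Reporting the spread alongside the mean is what lets the ensemble express uncertainty: two ensembles with the same mean can have very different ranges.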

Classification of Weather Patterns in the East Asia Region using the K-means Clustering Analysis (K-평균 군집분석을 이용한 동아시아 지역 날씨유형 분류)

  • Cho, Young-Jun;Lee, Hyeon-Cheol;Lim, Byunghwan;Kim, Seung-Bum
    • Atmosphere / v.29 no.4 / pp.451-461 / 2019
  • Medium-range forecasting is highly dependent on ensemble forecast data. However, operational weather forecasters do not have enough time to digest all of the detailed features revealed in ensemble forecast data. To utilize the ensemble data effectively in medium-range forecasting, this study defines representative weather patterns in East Asia. K-means clustering analysis is applied to classify the weather patterns objectively. The input data are daily Mean Sea Level Pressure (MSLP) anomalies from the ECMWF ReAnalysis-Interim (ERA-Interim) for 1981~2010 (30 years), provided by the European Centre for Medium-Range Weather Forecasts (ECMWF). Using the Explained Variance (EV), the optimal study area is defined as 20~60°N, 100~150°E. The number of clusters defined by the Explained Cluster Variance (ECV) is thirty (k = 30). The 30 representative weather patterns are summarized with their frequencies. Weather pattern #1 occurred in all seasons, but it was about 56% in summer (June~September). The relatively rare weather pattern #30 occurred mainly in winter. Additionally, we investigate the relationship between weather patterns and extreme weather events such as heat waves, cold waves, heavy rainfall, and snowfall. The weather patterns associated with heavy rainfall exceeding 110 mm day⁻¹ were #1, #4, and #9, with shares of days exceeding 10%. Heavy snowfall events exceeding 24 cm day⁻¹ mainly occurred in weather patterns #28 (4%) and #29 (6%). High and low temperature events (> 34℃ and < -14℃) were associated with weather patterns #1~4 (14~18%) and #28~29 (27~29%), respectively. These results suggest that the classification of weather patterns can serve as a reference for grouping all ensemble forecast data, which will be useful for scenario-based medium-range ensemble forecasting in the future.
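As a rough picture of how k-means turns daily fields into a small set of patterns with frequencies, the sketch below clusters a handful of two-value toy "fields". The data, the choice of k = 2, and the naive initialization from the first k points are assumptions; the study itself clusters gridded MSLP anomalies with k = 30.

```python
from collections import Counter


def kmeans(points, k, iterations=20):
    """Plain Lloyd's algorithm with squared Euclidean distance."""
    centers = [list(p) for p in points[:k]]  # naive init: first k points (assumption)
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point goes to its nearest center.
        labels = [
            min(range(k), key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))
            for p in points
        ]
        # Update step: each center moves to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return centers, labels


# Toy "daily anomaly fields" flattened to two values each (hypothetical data).
fields = [(1.0, 1.2), (5.0, 5.1), (0.8, 1.0), (5.2, 4.9), (1.1, 0.9)]
centers, labels = kmeans(fields, k=2)

# Pattern frequency: how many days fall into each cluster.
frequency = Counter(labels)
print(labels, dict(frequency))
```

The cluster centers play the role of representative patterns, and new ensemble members could be grouped by assigning each to its nearest center.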

Malicious Insider Detection Using Boosting Ensemble Methods (앙상블 학습의 부스팅 방법을 이용한 악의적인 내부자 탐지 기법)

  • Park, Suyun
    • Journal of the Korea Institute of Information Security & Cryptology / v.32 no.2 / pp.267-277 / 2022
  • Due to the increasing proportion of cloud and remote working environments, various information security incidents are occurring. Insider threats have emerged as a major issue, with cases in which corporate insiders attempt to leak confidential data by accessing it remotely. In response, insider threat detection approaches based on machine learning have been developed. However, existing machine learning methods used to detect insider threats do not take bias and variance into account, which limits their performance. In this paper, boosting-type ensemble learning algorithms are applied to verify the performance of malicious insider detection, analyze it closely, and take dataset imbalance into account in determining the final result. Through experiments, we show that ensemble learning achieves accuracy similar to or higher than that of existing malicious insider detection approaches while considering the bias-variance tradeoff. The experimental results show that ensemble learning using bagging and boosting methods reached an accuracy of over 98%, improving malicious insider detection performance by 5.62% compared to the average accuracy of the single learning models used.

Analysis of ensemble streamflow prediction effect on deriving dam releases for water supply (용수공급을 위한 댐 방류량 결정에서의 앙상블 유량 예측 효과 분석)

  • Kim, Yeonju;Kim, Gi Joo;Kim, Young-Oh
    • Journal of Korea Water Resources Association / v.56 no.12 / pp.969-980 / 2023
  • Since the 2000s, ensemble streamflow prediction (ESP) has been actively utilized in South Korea, primarily for hydrological forecasting purposes. Despite its notable success in hydrological forecasting, the original objective of enhancing water resources system management has been relatively overlooked. Consequently, this study aims to demonstrate the utility of ESP in water resources management by creating a simple hypothetical exercise for dam operators and applying it to actual multi-purpose dams in South Korea. The hypothetical exercise showed that even when the means of ESP are identical, different costs can result from varying standard deviations. Subsequently, using sampling stochastic dynamic programming (SSDP) and considering the capacity-inflow ratio (CIR), optimal release patterns were derived for Soyang Dam (CIR = 1.345) and Chungju Dam (CIR = 0.563) based on types W and P. For this analysis, Type W was defined with a standard deviation equal to the mean inflow, and Type P with a standard deviation ten times the mean inflow. Simulated operations were conducted from 2020 to 2022 using the derived optimal releases. The results indicate that for Chungju Dam, more aggressive optimal release patterns were derived under types with smaller standard deviations, and the simulated operations demonstrated satisfactory outcomes. Soyang Dam exhibited similar results in terms of optimal release, but there was no significant difference in the simulation between types W and P due to its large CIR. Ultimately, this study highlights that even with the same mean values, the standard deviation of ESP impacts optimal release patterns and simulation outcomes. Additionally, it underscores that systems with smaller CIRs are more sensitive to such uncertainties. Based on these findings, there is potential for improvements in South Korea's current operational practices, which rely solely on single representative values for water resources management.
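The point that identical ensemble means can still lead to different costs is easy to reproduce with a toy calculation. The sketch below is not the paper's exercise or its SSDP formulation; the shortage-penalty cost function, the storage and release values, and both inflow ensembles are hypothetical.

```python
from statistics import mean, pstdev


def expected_cost(inflows, release, storage, shortage_penalty=10.0):
    """Mean penalty over ensemble members when storage + inflow cannot cover the release."""
    costs = [shortage_penalty * max(0.0, release - (storage + q)) for q in inflows]
    return mean(costs)


# Two hypothetical inflow ensembles with the same mean but different spread.
narrow = [9.0, 10.0, 11.0]
wide = [0.0, 10.0, 20.0]

release, storage = 15.0, 5.0
cost_narrow = expected_cost(narrow, release, storage)
cost_wide = expected_cost(wide, release, storage)

print(mean(narrow), mean(wide))      # identical means
print(pstdev(narrow), pstdev(wide))  # different standard deviations
print(cost_narrow, cost_wide)        # the wider ensemble costs more
```

Because the penalty applies only on the shortage side, the cost is not a linear function of inflow, so a decision based on the ensemble mean alone would assign both ensembles zero cost and miss the difference.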

Use of the Moving Average of the Current Weather Data for the Solar Power Generation Amount Prediction (현재 기상 정보의 이동 평균을 사용한 태양광 발전량 예측)

  • Lee, Hyunjin
    • Journal of Korea Multimedia Society / v.19 no.8 / pp.1530-1537 / 2016
  • Recently, solar power generation has shown significant growth in the renewable energy field. Short-term prediction makes it possible to control electric power demand and plan the power generation of auxiliary devices. However, short-term prediction normally requires a weather forecast. When weather forecast information is unavailable, because of network disconnection on islands and in mountains or for security reasons, prediction accuracy is poor. Therefore, in this paper, we propose a system capable of short-term prediction of solar power generation using only locally collected weather information. We used temperature, humidity, and insolation as weather information. Because these variables are time series, we applied a moving average to each of them. The input features consisted of the minimum, maximum, and average of each variable, the differences between variables, and their gradients. An artificial neural network, an SVM, and an RBF network were used as prediction models and combined by an ensemble method. The results suggest that using a moving average during pre-processing and ensemble prediction models will maximize prediction accuracy.
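The moving-window features mentioned in this abstract (minimum, maximum, average, and gradient) might be computed along the following lines. The window length, the gradient definition (end-to-end slope over the window), and the sample temperatures are assumptions; the abstract does not specify them, and the cross-variable differences are omitted here.

```python
def window_features(series, window):
    """Sliding-window min, max, mean, and end-to-end gradient for one variable."""
    if window < 2 or window > len(series):
        raise ValueError("window must be between 2 and len(series)")
    features = []
    for end in range(window, len(series) + 1):
        w = series[end - window : end]
        features.append(
            {
                "min": min(w),
                "max": max(w),
                "mean": sum(w) / window,                     # the moving average itself
                "gradient": (w[-1] - w[0]) / (window - 1),   # average change per step
            }
        )
    return features


# Hypothetical locally measured temperatures.
temperature = [20.0, 21.0, 23.0, 22.0, 24.0]
feats = window_features(temperature, window=3)
print(feats[0])
```

The same function could be applied to humidity and insolation, with the resulting rows concatenated into the input vector for each prediction model.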

Prediction of Hindered Settling Velocity of Bidisperse Suspensions (이중 입도 분포를 가진 현탁액의 침강 속도 예측)

  • Koo, Sangkyun
    • Applied Chemistry for Engineering / v.19 no.6 / pp.609-616 / 2008
  • The present study is concerned with a simple numerical method for estimating the hindered settling velocity of noncolloidal suspensions with a bidisperse particle size distribution. The method is based on an effective-medium theory that uses conditional ensemble averages to describe the velocity fields or other physical quantities of interest in a suspension with randomly placed particles. The effective-medium theory originally developed by Acrivos and Chang [1] for monodisperse suspensions is modified for the bidisperse case. Using the radial distribution functions and stream functions, the hindered settling velocity of the suspended particles is calculated numerically. The predictions of the present method are compared with the previous experimental results of Davis and Birdsell [2] and Cheung et al. [3]. The estimates from the effective-medium model of the present study agree reasonably well with the experimental results.