• Title/Summary/Keyword: random variable

Search Result 904, Processing Time 0.031 seconds

CHAID Algorithm by Cube-based Sampling

  • Park, Hee-Chang;Cho, Kwang-Hyun
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 2003.10a
    • /
    • pp.239-247
    • /
    • 2003
  • Decision tree algorithms are used extensively for data mining in many domains such as retail target marketing, fraud dection, data reduction and variable screening, etc. CHAID(Chi-square Automatic Interaction Detector), is an exploratory method used to study the relationship between a dependent variable and a series of predictor variables. In this paper we propose and CHAID algorithm by cube-based sampling and explore CHAID algorithm in view of accuracy and speed by the number of variables.

  • PDF

Pitch Control Simulation of Horizontal Wind Power System (수평축 풍력발전시스템의 피치제어 시뮬레이션)

  • Hwang, Sung-Joon;Seo, Young-Taek;Oh, Chul-Soo
    • Proceedings of the KIEE Conference
    • /
    • 2000.07b
    • /
    • pp.846-848
    • /
    • 2000
  • In the early development of wind energy, the majority of wind turbines were operated at constant speed. Recently, the number of variable-speed wind turbines adopted pitch control apparatus has increased. This paper deal with a simulation of pitch control of variable-speed wind turbine and the response of pitch angle is traced in a given random wind speed.

  • PDF

Random Forest Based Abnormal ECG Dichotomization using Linear and Nonlinear Feature Extraction (선형-비선형 특징추출에 의한 비정상 심전도 신호의 랜덤포레스트 기반 분류)

  • Kim, Hye-Jin;Kim, Byeong-Nam;Jang, Won-Seuk;Yoo, Sun-K.
    • Journal of Biomedical Engineering Research
    • /
    • v.37 no.2
    • /
    • pp.61-67
    • /
    • 2016
  • This paper presented a method for random forest based the arrhythmia classification using both heart rate (HR) and heart rate variability (HRV) features. We analyzed the MIT-BIH arrhythmia database which contains half-hour ECG recorded from 48 subjects. This study included not only the linear features but also non-linear features for the improvement of classification performance. We classified abnormal ECG using mean_NN (mean of heart rate), SD1/SD2 (geometrical feature of poincare HRV plot), SE (spectral entropy), pNN100 (percentage of a heart rate longer than 100 ms) affecting accurate classification among combined of linear and nonlinear features. We compared our proposed method with Neural Networks to evaluate the accuracy of the algorithm. When we used the features extracted from the HRV as an input variable for classifier, random forest used only the most contributed variable for classification unlike the neural networks. The characteristics of random forest enable the dimensionality reduction of the input variables, increase a efficiency of classifier and can be obtained faster, 11.1% higher accuracy than the neural networks.

Probabilistic Approach of Stability Analysis for Rock Wedge Failure (확률론적 해석방법을 이용한 쐐기파괴의 안정성 해석)

  • Park, Hyuck-Jin
    • Economic and Environmental Geology
    • /
    • v.33 no.4
    • /
    • pp.295-307
    • /
    • 2000
  • Probabilistic analysis is a powerful method to quantify variability and uncertainty common in engineering geology fields. In rock slope engineering, the uncertainty and variation may be in the form of scatter in orientations and geometries of discontinuities, and also test results. However, in the deterministic analysis, the factor of safety which is used to ensure stability of rock slopes, is based on the fixed representative values for each parameter without a consideration of the scattering in data. For comparison, in the probabilistic analysis, these discontinuity parameters are considered as random variables, and therefore, the reliability and probability theories are utilized to evaluate the possibility of slope failure. Therefore, in the probabilistic analysis, the factor of safety is considered as a random variable and replaced by the probability of failure to measure the level of slope stability. In this study, the stochastic properties of discontinuity parameters are evaluated and the stability of rock slope is analyzed based on the random properties of discontinuity parameters. Then, the results between the deterministic analysis and the probabilistic analysis are compared and the differences between the two analysis methods are explained.

  • PDF

Emulation of Variable Wind Speed and Turbulance Effect in a Wind Turbine Simulator (가변 풍속과 터뷸런스를 고려한 가변속 풍력 발전 시스템 시뮬레이터 개발)

  • Song, Seung-Ho;Kim, Dong-Yong;Kim, In-Sun;Kyong, Nam-Ho
    • 한국신재생에너지학회:학술대회논문집
    • /
    • 2006.11a
    • /
    • pp.290-296
    • /
    • 2006
  • Control algorithms and implementation issues for a wind turbine simulator are presented for realistic emulation of variable wind characteristics using a lab-scale motor and generator set. When the average wind speed nd turbulence level is given, the torque reference of prime mover is decided through various blocks, such as random wind speed generator, blade characteristic curves, and tower effect compensation. The variable nature of wind can be implemented and tested by not only the computer simulation but also the hardware-in-loop-simulator (HILS). Some application examples of HILS include the development and test of turbine control software for more efficient and stable operation. Feasibility of the proposed simulator has verified by computer simulations and experiment.

  • PDF

On the Strong Law of Large Numbers for Arbitrary Random Variables

  • Nam, Eun-Woo
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2002.05a
    • /
    • pp.49-54
    • /
    • 2002
  • For arbitrary random variables {$X_{n},n{\geq}1$}, the order of growth of the series. $S_{n}\;=\;{\sum}_{j=1}^n\;X_{j}$ is studied in this paper. More specifically, when the series S_{n}$ diverges almost surely, the strong law of large numbers $S_{n}/g_{n}^{-1}$($A_{n}{\psi}(A_{n}))\;{\rightarrow}\;0$ a.s. is constructed by extending the results of Petrov (1973). On the other hand, if the series $S_{n}$ converges almost surely to a random variable S, then the tail series $T_{n}\;=\;S\;-\;S_{n-1}\;=\;{\sum}_{j=n}^{\infty}\;X_{j}$ is a well-defined sequence of random variables and converges to 0 almost surely. For the almost surely convergent series $S_{n}$, a tail series strong law of large numbers $T_{n}/g_{n}^{-1}(B_{n}{\psi}^{\ast}(B_{n}^{-1}))\;{\rightarrow}\;0$ a.s., which generalizes the result of Klesov (1984), is also established by investigating the duality between the limiting behavior of partial sums and that of tail series. In particular, an example is provided showing that the current work can prevail despite the fact that previous tail series strong law of large numbers does not work.

  • PDF

Comparison of Variable Importance Measures in Tree-based Classification (나무구조의 분류분석에서 변수 중요도에 대한 고찰)

  • Kim, Na-Young;Lee, Eun-Kyung
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.5
    • /
    • pp.717-729
    • /
    • 2014
  • Projection pursuit classification tree uses a 1-dimensional projection with the view of the most separating classes in each node. These projection coefficients contain information distinguishing two groups of classes from each other and can be used to calculate the importance measure of classification in each variable. This paper reviews the variable importance measure with increasing interest in line with growing data size. We compared the performances of projection pursuit classification tree with those of classification and regression tree(CART) and random forest. Projection pursuit classification tree are found to produce better performance in most cases, particularly with highly correlated variables. The importance measure of projection pursuit classification tree performs slightly better than the importance measure of random forest.

Assessment of Slope Stability With the Uncertainty in Soil Property Characterization (지반성질 불확실성을 고려한 사면안정 해석)

  • 김진만
    • Proceedings of the Korean Geotechical Society Conference
    • /
    • 2003.03a
    • /
    • pp.123-130
    • /
    • 2003
  • The estimation of key soil properties and subsequent quantitative assessment of the associated uncertainties has always been an important issue in geotechnical engineering. It is well recognized that soil properties vary spatially as a result of depositional and post-depositional processes. The stochastic nature of spatially varying soil properties can be treated as a random field. A practical statistical approach that can be used to systematically model various sources of uncertainty is presented in the context of reliability analysis of slope stability Newly developed expressions for probabilistic characterization of soil properties incorporate sampling and measurement errors, as well as spatial variability and its reduced variance due to spatial averaging. Reliability analyses of the probability of slope failure using the different statistical representations of soil properties show that the incorporation of spatial correlation and conditional simulation leads to significantly lower probability of failure than obtained using simple random variable approach.

  • PDF

Dynamic Analysis of Aircraft Landing Gear under Nonstationary Random Excitations (비정상 랜덤 가진력을 받는 항공기 착륙장치의 동특성 해석)

  • 황재혁;유병성;박명호
    • Journal of KSNVE
    • /
    • v.8 no.2
    • /
    • pp.251-259
    • /
    • 1998
  • The motion of an aircraft landing gear over rough runway at variable speed is nonstationary. In this paper, a method for the computation of nonstationary response variance is presented which uses a state space form for the combination of landing gear and runway excitation. The dynamic characteristics of the landing gear under nonstationary random excitations has also been analyzed using the proposed method. The formulation is for linear systems of arbitrary order and allows any deterministic velocity history. It has been found by a series of simulation that correlation parameter, damping coefficients of landing gear and tire, and velocity profiles play a prominent role on the dynamic characteristics.

  • PDF

Prediction of New Customer's Degree of Loyalty of Internet Shopping Mall Using Continuous Conditional Random Field (Continuous Conditional Random Field에 의한 인터넷 쇼핑몰 신규 고객등급 예측)

  • Ahn, Gil Seung;Hur, Sun
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.41 no.1
    • /
    • pp.10-16
    • /
    • 2015
  • In this study, we suggest a method to predict probability distribution of a new customer's degree of loyalty using C-CRF that reflects the RFM score and similarity to the neighbors of the customer. An RFM score prediction model is introduced to construct the first feature function of C-CRF. Integrating demographical similarity, purchasing characteristic similarity and purchase history similarity, we make a unified similarity variable to configure the second feature function of C-CRF. Then parameters of each feature function are estimated and we train our C-CRF model by training data set and suggest a probabilistic distribution to estimate a new customer's degree of loyalty. An example is provided to illustrate our model.