• Title/Summary/Keyword: multi-random variables

Search Result 93, Processing Time 0.022 seconds

Application of Random Forests to Assessment of Importance of Variables in Multi-sensor Data Fusion for Land-cover Classification

  • Park No-Wook;Chi kwang-Hoon
    • Korean Journal of Remote Sensing
    • /
    • v.22 no.3
    • /
    • pp.211-219
    • /
    • 2006
  • A random forests classifier is applied to multi-sensor data fusion for supervised land-cover classification in order to account for the importance of variable. The random forests approach is a non-parametric ensemble classifier based on CART-like trees. The distinguished feature is that the importance of variable can be estimated by randomly permuting the variable of interest in all the out-of-bag samples for each classifier. Two different multi-sensor data sets for supervised classification were used to illustrate the applicability of random forests: one with optical and polarimetric SAR data and the other with multi-temporal Radarsat-l and ENVISAT ASAR data sets. From the experimental results, the random forests approach could extract important variables or bands for land-cover discrimination and showed reasonably good performance in terms of classification accuracy.

ACCOUNTING FOR IMPORTANCE OF VARIABLES IN MUL TI-SENSOR DATA FUSION USING RANDOM FORESTS

  • Park No-Wook;Chi Kwang-Hoon
    • Proceedings of the KSRS Conference
    • /
    • 2005.10a
    • /
    • pp.283-285
    • /
    • 2005
  • To account for the importance of variable in multi-sensor data fusion, random forests are applied to supervised land-cover classification. The random forests approach is a non-parametric ensemble classifier based on CART-like trees. Its distinguished feature is that the importance of variable can be estimated by randomly permuting the variable of interest in all the out-of-bag samples for each classifier. Supervised classification with a multi-sensor remote sensing data set including optical and polarimetric SAR data was carried out to illustrate the applicability of random forests. From the experimental result, the random forests approach could extract important variables or bands for land-cover discrimination and showed good performance, as compared with other non-parametric data fusion algorithms.

  • PDF

A FUNCTIONAL CENTRAL LIMIT THEOREM FOR LINEAR RANDOM FIELD GENERATED BY NEGATIVELY ASSOCIATED RANDOM FIELD

  • Ryu, Dae-Hee
    • Journal of the Chungcheong Mathematical Society
    • /
    • v.22 no.3
    • /
    • pp.507-517
    • /
    • 2009
  • We prove a functional central limit theorem for a linear random field generated by negatively associated multi-dimensional random variables. Under finite second moment condition we extend the result in Kim, Ko and Choi[Kim,T.S, Ko,M.H and Choi, Y.K.,2008. The invariance principle for linear multi-parameter stochastic processes generated by associated fields. Statist. Probab. Lett. 78, 3298-3303] to the negatively associated case.

  • PDF

Multi-Objective Stochastic Optimization in Water Resources System

  • Shim, Soon Bo
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.8 no.1
    • /
    • pp.41-59
    • /
    • 1983
  • The purpose of this paper is to present a method of multi-objective, stochastic optimization in water resources system which investigates the development of potential non-normal deterministic equivalents for subsequent use in multiobjective stochastic programming methods, Given probability statement involving a function of several random variables, it is often possible to obtain a deterministic equivalent of it that does not include any orginal random variables. A Stochastic trade-off technique-MSTOT is suggested to help a decision maker achieve satisfactory levels for several objective functions. This makes use of deterministic equivalents to handle random variables in the objective functions. The emphasis is in the development of non-normal deterministic equivalents for use in multiobjective stochastic techniques.

  • PDF

Conditional Probability of a 'Choseong', a 'Jungseong', and a 'Jongseong' Between Syllables in Multi-Syllable Korean Words (한국어 다음절 단어의 초성, 중성, 종성단위의 음절간 조건부 확률)

  • 이재홍;이재학
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.28B no.9
    • /
    • pp.692-703
    • /
    • 1991
  • A Korean word is composed of syllables. A Korean syllable is regarded as a random variable according to its probabilistic property in occurrence. A Korean syllable is divided into 'choseong', 'jungseong', and 'jongseong' which are regarded as random variables. We can consider teh conditional probatility of syllable as an index which represents the occurrence correlation between syllables in Korean words. Since the number of syllables is enormous, we use the conditional probability of a' choseong', a 'jungseong', and a 'jongseong' between syllables as an index which represents the occurrence correlation between syllables in Korean words. The length distribution of Korean woeds is computed according to frequency and to kind. Form the cumulative frequency of a Korean syllable computed from multi-syllable Korean woeds, all probabilities and conditiona probabilities are computed for the three random variables. The conditional probabilities of 'choseong'- 'choseong', 'jungseong'- 'jungseong', 'jongseong'-'jongseong', 'jongseong'-'choseong' between adjacent syllables in multi-syllable Korean woeds are computed.

  • PDF

Variable Selection with Regression Trees

  • Chang, Young-Jae
    • The Korean Journal of Applied Statistics
    • /
    • v.23 no.2
    • /
    • pp.357-366
    • /
    • 2010
  • Many tree algorithms have been developed for regression problems. Although they are regarded as good algorithms, most of them suffer from loss of prediction accuracy when there are many noise variables. To handle this problem, we propose the multi-step GUIDE, which is a regression tree algorithm with a variable selection process. The multi-step GUIDE performs better than some of the well-known algorithms such as Random Forest and MARS. The results based on simulation study shows that the multi-step GUIDE outperforms other algorithms in terms of variable selection and prediction accuracy. It generally selects the important variables correctly with relatively few noise variables and eventually gives good prediction accuracy.

A Dual Problem of Calibration of Design Weights Based on Multi-Auxiliary Variables

  • Al-Jararha, J.
    • Communications for Statistical Applications and Methods
    • /
    • v.22 no.2
    • /
    • pp.137-146
    • /
    • 2015
  • Singh (2013) considered the dual problem to the calibration of design weights to obtain a new generalized linear regression estimator (GREG) for the finite population total. In this work, we have made an attempt to suggest a way to use the dual calibration of the design weights in case of multi-auxiliary variables; in other words, we have made an attempt to give an answer to the concern in Remark 2 of Singh (2013) work. The same idea is also used to generalize the GREG estimator proposed by Deville and S$\ddot{a}$rndal (1992). It is not an easy task to find the optimum values of the parameters appear in our approach; therefore, few suggestions are mentioned to select values for such parameters based on a random sample. Based on real data set and under simple random sampling without replacement design, our approach is compared with other approaches mentioned in this paper and for different sample sizes. Simulation results show that all estimators have negligible relative bias, and the multivariate case of Singh (2013) estimator is more efficient than other estimators.

Dynamic reliability of structures: the example of multi-grid composite walls

  • Liu, Pei;Yaoa, Qian-Feng
    • Structural Engineering and Mechanics
    • /
    • v.36 no.4
    • /
    • pp.463-479
    • /
    • 2010
  • Based on damage accumulation of multi-grid composite walls, a method of dynamic reliability estimations is proposed. The multi-grid composite wall is composed of edge frame beam, edge frame columns, grid beams, grid columns and filling blocks. The equations including stiffness, shear forces at filling blocks cracking and multi-grid composite walls yielding, ultimate displacement, and damage index are obtained through tests of 13 multi-grid composite wall specimens. Employing these equations in reliability calculations, procedures of dynamic reliability estimations based on damage accumulation of multi-grid composite walls subjected to random earthquake excitations are proposed. Finally the proposed method is applied to the typical composite wall specimen subjected to random earthquake excitations which can be specified by a finite number of input random variables. The dynamic reliability estimates, when filling blocks crack under earthquakes corresponding to 63% exceedance in 50 years and when the composite wall reach limit state under earthquakes corresponding to 2-3% exceedance in 50 years, are obtained using the proposed method by taking damage indexes as thresholds. The results from the proposed method which show good agreement with those from Monte-Carlo simulations demonstrate the proposed method is effective.

Study of Virtual Goods Purchase Model Applying Dynamic Social Network Structure Variables (동적 소셜네트워크 구조 변수를 적용한 가상 재화 구매 모형 연구)

  • Lee, Hee-Tae;Bae, Jungho
    • Journal of Distribution Science
    • /
    • v.17 no.3
    • /
    • pp.85-95
    • /
    • 2019
  • Purpose - The existing marketing studies using Social Network Analysis have assumed that network structure variables are time-invariant. However, a node's network position can fluctuate considerably over time and the node's network structure can be changed dynamically. Hence, if such a dynamic structural network characteristics are not specified for virtual goods purchase model, estimated parameters can be biased. In this paper, by comparing a time-invariant network structure specification model(base model) and time-varying network specification model(proposed model), the authors intend to prove whether the proposed model is superior to the base model. In addition, the authors also intend to investigate whether coefficients of network structure variables are random over time. Research design, data, and methodology - The data of this study are obtained from a Korean social network provider. The authors construct a monthly panel data by calculating the raw data. To fit the panel data, the authors derive random effects panel tobit model and multi-level mixed effects model. Results - First, the proposed model is better than that of the base model in terms of performance. Second, except for constraint, multi-level mixed effects models with random coefficient of every network structure variable(in-degree, out-degree, in-closeness centrality, out-closeness centrality, clustering coefficient) perform better than not random coefficient specification model. Conclusion - The size and importance of virtual goods market has been dramatically increasing. Notwithstanding such a strategic importance of virtual goods, there is little research on social influential factors which impact the intention of virtual good purchase. Even studies which investigated social influence factors have assumed that social network structure variables are time-invariant. However, the authors show that network structure variables are time-variant and coefficients of network structure variables are random over time. Thus, virtual goods purchase model with dynamic network structure variables performs better than that with static network structure model. Hence, if marketing practitioners intend to use social influences to sell virtual goods in social media, they had better consider time-varying social influences of network members. In addition, this study can be also differentiated from other related researches using survey data in that this study deals with actual field data.

Mutual Information and Redundancy for Categorical Data

  • Hong, Chong-Sun;Kim, Beom-Jun
    • Communications for Statistical Applications and Methods
    • /
    • v.13 no.2
    • /
    • pp.297-307
    • /
    • 2006
  • Most methods for describing the relationship among random variables require specific probability distributions and some assumptions of random variables. The mutual information based on the entropy to measure the dependency among random variables does not need any specific assumptions. And the redundancy which is a analogous version of the mutual information was also proposed. In this paper, the redundancy and mutual information are explored to multi-dimensional categorical data. It is found that the redundancy for categorical data could be expressed as the function of the generalized likelihood ratio statistic under several kinds of independent log-linear models, so that the redundancy could also be used to analyze contingency tables. Whereas the generalized likelihood ratio statistic to test the goodness-of-fit of the log-linear models is sensitive to the sample size, the redundancy for categorical data does not depend on sample size but its cell probabilities itself.