• Title/Summary/Keyword: many variables

Search Result 2,500, Processing Time 0.034 seconds

A Study on Participation Factors of Police Service in the Region Inhabitants (지역주민의 치안서비스 참여요인에 관한 연구)

  • Kim, Young-Oh
    • Korean Security Journal
    • /
    • no.6
    • /
    • pp.235-254
    • /
    • 2003
  • The Police main roles are order maintenance, law enforcement, community service etc. Recently it is increasing service function to community that in the cource of diverse roles. This study analyze recognition and attitudes of citizens on the Policing, participation factors, relationship among many factors between independent variables and dependent variables. The research method examine theoretical literatures and survey questionnaire for empirical analysis. The spss(win) 10.0 program is using analyze the data.

  • PDF

CHAID Algorithm by Cube-based Sampling

  • Park, Hee-Chang;Cho, Kwang-Hyun
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 2003.10a
    • /
    • pp.239-247
    • /
    • 2003
  • Decision tree algorithms are used extensively for data mining in many domains such as retail target marketing, fraud dection, data reduction and variable screening, etc. CHAID(Chi-square Automatic Interaction Detector), is an exploratory method used to study the relationship between a dependent variable and a series of predictor variables. In this paper we propose and CHAID algorithm by cube-based sampling and explore CHAID algorithm in view of accuracy and speed by the number of variables.

  • PDF

Dynamic Progrmming Application in Determining the Optimal Replacement Time of Equipment (동적계획에 의한 장비최적교체시기 결정)

  • Jeong, Hyeon-Tae
    • Journal of Korean Society for Quality Management
    • /
    • v.14 no.1
    • /
    • pp.61-66
    • /
    • 1986
  • Many papers have suggested various models how to decide an optimal time for replacing equipment. In this study, Dynamic Programming is applied to establish a model of replacing equipment and a new algorithm is developed for computerization to meet the increased number of variables. It is possible to predict the real situation with higher accuracy by employing the proposed model including more variables such as planning horizon, original cost, salvage value, decreasing rate, operating and maintenance costs, increasing rate, and so on.

  • PDF

Discriminant Analysis of Binary Data by Using the Maximum Entropy Distribution

  • Lee, Jung Jin;Hwang, Joon
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.3
    • /
    • pp.909-917
    • /
    • 2003
  • Although many classification models have been used to classify binary data, none of the classification models dominates all varying circumstances depending on the number of variables and the size of data(Asparoukhov and Krzanowski (2001)). This paper proposes a classification model which uses information on marginal distributions of sub-variables and its maximum entropy distribution. Classification experiments by using simulation are discussed.

A Study on Data Mining Using the Spline Basis

  • Lee, Sun-Geune;Sim, Songyong;Koo, Ja-Yong
    • Communications for Statistical Applications and Methods
    • /
    • v.11 no.2
    • /
    • pp.255-264
    • /
    • 2004
  • Due to a computerized data processing, there are many cases when we encounter a huge data set. On the other hand, advances in computing technologies make it possible to deal with a huge data set. One important area is the data mining. In this paper we consider data mining when the dependent variable is binary. The proposed method is to use the poly-class model when the independent variables consists of continuous and discrete variables. An example is provided.

Discovery of Association Rules Using Latent Variables

  • Park, Hee-Chang;Cho, Kwang-Hyun
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 2005.10a
    • /
    • pp.177-188
    • /
    • 2005
  • Association rule mining searches for interesting relationships among items in a given large data set. Association rules are frequently used by retail stores to assist in marketing, advertising, floor placement, and inventory control. There are three primary threshold measures in association rule; support and confidence and lift. In the case of appling real world to association rules, we have some difficulties in data interpretation because we obtain many rules. In this paper, we develop the model of association rules using latent variables for environmental survey data.

  • PDF

A System Design for Evolutionary Optimizer (Evolutionary Optimizer를 위한 시스템 설계)

  • Rhee Chang-Kwon;Byun Jai-Hyun
    • Proceedings of the Korean Society for Quality Management Conference
    • /
    • 2004.04a
    • /
    • pp.503-506
    • /
    • 2004
  • Evolutionary operation is useful to improve on-line full-scale manufacturing processes by systematically changing the levels of the process variables without jeopardizing the product. This paper presents a system design for an evolutionary operation software called 'evolutionary optimizer'. The system design is based primarily on data flow diagram. Evolutionary optimizer consists of four modules: factorial design module, many variables module, mixture Production module, and mean/dispersion module.

  • PDF

Robust Variable Selection in Classification Tree

  • Jang Jeong Yee;Jeong Kwang Mo
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2001.11a
    • /
    • pp.89-94
    • /
    • 2001
  • In this study we focus on variable selection in decision tree growing structure. Some of the splitting rules and variable selection algorithms are discussed. We propose a competitive variable selection method based on Kruskal-Wallis test, which is a nonparametric version of ANOVA F-test. Through a Monte Carlo study we note that CART has serious bias in variable selection towards categorical variables having many values, and also QUEST using F-test is not so powerful to select informative variables under heavy tailed distributions.

  • PDF

Discovery of Association Rules Using Latent Variables

  • Park, Hee-Chang;Cho, Kwang-Hyun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.1
    • /
    • pp.149-160
    • /
    • 2006
  • Association rule mining searches for interesting relationships among items in a given large data set. Association rules are frequently used by retail stores to assist in marketing, advertising, floor placement, and inventory control. There are three primary threshold measures in association rule; support and confidence and lift. In the case of appling real world to association rules, we have some difficulties in data interpretation because we obtain many rules. In this paper, we develop the model of association rules using latent variables for environmental survey data.

  • PDF

On the clustering of huge categorical data

  • Kim, Dae-Hak
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.6
    • /
    • pp.1353-1359
    • /
    • 2010
  • Basic objective in cluster analysis is to discover natural groupings of items. In general, clustering is conducted based on some similarity (or dissimilarity) matrix or the original input data. Various measures of similarities between objects are developed. In this paper, we consider a clustering of huge categorical real data set which shows the aspects of time-location-activity of Korean people. Some useful similarity measure for the data set, are developed and adopted for the categorical variables. Hierarchical and nonhierarchical clustering method are applied for the considered data set which is huge and consists of many categorical variables.