Search | Korea Science

Kim, Young-Oh
- Korean Security Journal
- /
- no.6
- /
- pp.235-254
- /
- 2003
The Police main roles are order maintenance, law enforcement, community service etc. Recently it is increasing service function to community that in the cource of diverse roles. This study analyze recognition and attitudes of citizens on the Policing, participation factors, relationship among many factors between independent variables and dependent variables. The research method examine theoretical literatures and survey questionnaire for empirical analysis. The spss(win) 10.0 program is using analyze the data.
PDF

Park, Hee-Chang;Cho, Kwang-Hyun
- 한국데이터정보과학회:학술대회논문집
- /
- 2003.10a
- /
- pp.239-247
- /
- 2003
Decision tree algorithms are used extensively for data mining in many domains such as retail target marketing, fraud dection, data reduction and variable screening, etc. CHAID(Chi-square Automatic Interaction Detector), is an exploratory method used to study the relationship between a dependent variable and a series of predictor variables. In this paper we propose and CHAID algorithm by cube-based sampling and explore CHAID algorithm in view of accuracy and speed by the number of variables.
PDF

Jeong, Hyeon-Tae
- Journal of Korean Society for Quality Management
- /
- v.14 no.1
- /
- pp.61-66
- /
- 1986
Many papers have suggested various models how to decide an optimal time for replacing equipment. In this study, Dynamic Programming is applied to establish a model of replacing equipment and a new algorithm is developed for computerization to meet the increased number of variables. It is possible to predict the real situation with higher accuracy by employing the proposed model including more variables such as planning horizon, original cost, salvage value, decreasing rate, operating and maintenance costs, increasing rate, and so on.
PDF

Lee, Jung Jin;Hwang, Joon
- Communications for Statistical Applications and Methods
- /
- v.10 no.3
- /
- pp.909-917
- /
- 2003
Although many classification models have been used to classify binary data, none of the classification models dominates all varying circumstances depending on the number of variables and the size of data(Asparoukhov and Krzanowski (2001)). This paper proposes a classification model which uses information on marginal distributions of sub-variables and its maximum entropy distribution. Classification experiments by using simulation are discussed.
https://doi.org/10.5351/CKSS.2003.10.3.909 인용 PDF KSCI

Lee, Sun-Geune;Sim, Songyong;Koo, Ja-Yong
- Communications for Statistical Applications and Methods
- /
- v.11 no.2
- /
- pp.255-264
- /
- 2004
Due to a computerized data processing, there are many cases when we encounter a huge data set. On the other hand, advances in computing technologies make it possible to deal with a huge data set. One important area is the data mining. In this paper we consider data mining when the dependent variable is binary. The proposed method is to use the poly-class model when the independent variables consists of continuous and discrete variables. An example is provided.
https://doi.org/10.5351/CKSS.2004.11.2.255 인용 PDF KSCI

Park, Hee-Chang;Cho, Kwang-Hyun
- 한국데이터정보과학회:학술대회논문집
- /
- 2005.10a
- /
- pp.177-188
- /
- 2005
Association rule mining searches for interesting relationships among items in a given large data set. Association rules are frequently used by retail stores to assist in marketing, advertising, floor placement, and inventory control. There are three primary threshold measures in association rule; support and confidence and lift. In the case of appling real world to association rules, we have some difficulties in data interpretation because we obtain many rules. In this paper, we develop the model of association rules using latent variables for environmental survey data.
PDF

Rhee Chang-Kwon;Byun Jai-Hyun
- Proceedings of the Korean Society for Quality Management Conference
- /
- 2004.04a
- /
- pp.503-506
- /
- 2004
Evolutionary operation is useful to improve on-line full-scale manufacturing processes by systematically changing the levels of the process variables without jeopardizing the product. This paper presents a system design for an evolutionary operation software called 'evolutionary optimizer'. The system design is based primarily on data flow diagram. Evolutionary optimizer consists of four modules: factorial design module, many variables module, mixture Production module, and mean/dispersion module.
PDF

Jang Jeong Yee;Jeong Kwang Mo
- Proceedings of the Korean Statistical Society Conference
- /
- 2001.11a
- /
- pp.89-94
- /
- 2001
In this study we focus on variable selection in decision tree growing structure. Some of the splitting rules and variable selection algorithms are discussed. We propose a competitive variable selection method based on Kruskal-Wallis test, which is a nonparametric version of ANOVA F-test. Through a Monte Carlo study we note that CART has serious bias in variable selection towards categorical variables having many values, and also QUEST using F-test is not so powerful to select informative variables under heavy tailed distributions.
PDF

Park, Hee-Chang;Cho, Kwang-Hyun
- Journal of the Korean Data and Information Science Society
- /
- v.17 no.1
- /
- pp.149-160
- /
- 2006
Association rule mining searches for interesting relationships among items in a given large data set. Association rules are frequently used by retail stores to assist in marketing, advertising, floor placement, and inventory control. There are three primary threshold measures in association rule; support and confidence and lift. In the case of appling real world to association rules, we have some difficulties in data interpretation because we obtain many rules. In this paper, we develop the model of association rules using latent variables for environmental survey data.
PDF

Kim, Dae-Hak
- Journal of the Korean Data and Information Science Society
- /
- v.21 no.6
- /
- pp.1353-1359
- /
- 2010
Basic objective in cluster analysis is to discover natural groupings of items. In general, clustering is conducted based on some similarity (or dissimilarity) matrix or the original input data. Various measures of similarities between objects are developed. In this paper, we consider a clustering of huge categorical real data set which shows the aspects of time-location-activity of Korean people. Some useful similarity measure for the data set, are developed and adopted for the categorical variables. Hierarchical and nonhierarchical clustering method are applied for the considered data set which is huge and consists of many categorical variables.
PDF KSCI