Search | Korea Science

Tree-structured Clustering for Continuous Data (연속형 자료에 대한 나무형 군집화)

Huh Myung-Hoe;Yang Kyung-Sook
- The Korean Journal of Applied Statistics
- /
- v.18 no.3
- /
- pp.661-671
- /
- 2005
The aim of this study is to propose a clustering method, called tree-structured clustering, by recursively partitioning continuous multivariate dat a based on overall $R^2$ criterion with a practical node-splitting decision rule. The clustering method produces easily interpretable clustering rules of tree types with the variable selection function. In numerical examples (Fisher's iris data and a Telecom case), we note several differences between tree-structured clustering and K-means clustering.
https://doi.org/10.5351/KJAS.2005.18.3.661 인용 PDF KSCI

Tree-structured Clustering for Mixed Data (혼합형 데이터에 대한 나무형 군집화)

Yang Kyung-Sook;Huh Myung-Hoe
- The Korean Journal of Applied Statistics
- /
- v.19 no.2
- /
- pp.271-282
- /
- 2006
The aim of this study is to propose a tree-structured clustering for mixed data. We suggest a scaling method to reduce the variable selection bias among categorical variables. In numerical examples such as credit data, German credit data, we note several differences between tree-structured clustering and K-means clustering.
https://doi.org/10.5351/KJAS.2006.19.2.271 인용 PDF KSCI

The Construction of Tree-structured Database and Tree Search Strategies in Distribution Systems (트리구조의 배전계통 데이타베이스 구성과 트리탐색기법)

Kim, S.H.;Ryu, H.S.;Choi, B.Y.;Cho, S.H.;Moon, Y.H.
- Proceedings of the KIEE Conference
- /
- 1992.07a
- /
- pp.172-175
- /
- 1992
This paper proposes the methods to construct the tree-structured database and analyze the distribution system network. In order to cope with an extensive amount of data and the frequent breaker switching operations in distribution systems, the database for system configuration is constructed using binary trees. Once the tree-structured database has been built, the system tracing of distribution network can be rapidly performed. This remarkably enhances the efficiency of data search and easily adapts to system changes due to switching operations. The computation method of fast power flow using tree search strategies is presented. The methods in the paper may be available in the field of distribution system operation.
PDF

A Tombstone Filtered LSM-Tree for Stable Performance of KVS (키밸류 저장소 성능 제어를 위한 삭제 키 분리 LSM-Tree)

Lee, Eunji
- The Journal of the Institute of Internet, Broadcasting and Communication
- /
- v.22 no.4
- /
- pp.17-22
- /
- 2022
With the spread of web services, data types are becoming more diversified. In addition to the form of storing data such as images, videos, and texts, the number and form of properties and metadata expressing the data are different for each data. In order to efficiently process such unstructured data, a key-value store is widely used for state-of-the-art applications. LSM-Tree (Log Structured Merge Tree) is the core data structure of various commercial key-value stores. LSM-Tree is optimized to provide high performance for small writes by recording all write and delete operations in a log manner. However, there is a problem in that the delay time and processing speed of user requests are lowered as batches of deletion operations for expired data are inserted into the LSM-Tree as special key-value data. This paper presents a Filtered LSM-Tree (FLSM-Tree) that solves the above problem by separating the deleted key from the main tree structure while maintaining all the advantages of the existing LSM-Tree. The proposed method is implemented in LevelDB, a commercial key-value store and it shows that the read performance is improved by up to 47% in performance evaluation.
https://doi.org/10.7236/JIIBC.2022.22.4.17 인용 PDF KSCI HTML

An Efficient Disk Block Allocation Method for XML Data (XML 데이타를 위한 효율적인 디스크 블록 할당 방법)

Kim, Jung-Hoon;Son, Jin-Hyun;Chung, Yon-Dohn;Kim, Myoung-Ho
- Journal of KIISE:Databases
- /
- v.34 no.5
- /
- pp.465-472
- /
- 2007
With the recent proliferation of the use of semi-structured data such as XML, it becomes more important to efficiently store and manage the semi-structured data. The XML data can be logically modelled as a rooted tree e.g., the DOM tree. In order to process a query on the XML data, we traverse the tree structure. In this paper we present an algorithm that places the XML data to disk blocks. The proposed algorithm assigns a number to each node of the tree in a bottom-up fashion. Then, the nodes are allocated to disk blocks using the assigned number. The proposed algorithm does not need access pattern information, and provides good performance for any access pattern. The characteristics of the proposed method are presented with analysis. Through experiments, we evaluate the performance of the proposed method.
PDF KSCI

Note on classification and regression tree analysis (분류와 회귀나무분석에 관한 소고)

임용빈;오만숙
- Journal of Korean Society for Quality Management
- /
- v.30 no.1
- /
- pp.152-161
- /
- 2002
The analysis of large data sets with hundreds of thousands observations and thousands of independent variables is a formidable computational task. A less parametric method, capable of identifying important independent variables and their interactions, is a tree structured approach to regression and classification. It gives a graphical and often illuminating way of looking at data in classification and regression problems. In this paper, we have reviewed and summarized tile methodology used to construct a tree, multiple trees and the sequential strategy for identifying active compounds in large chemical databases.
PDF KSCI

B2V-Tree: An Indexing Scheme for Partial Match Queries on Wireless Data Streams (B2V-Tree: 무선 데이타 스트림에서 부분 부합 질의를 위한 색인 기법)

Chung, Yon-Dohn;Lee, Ji-Yeon
- Journal of KIISE:Databases
- /
- v.32 no.3
- /
- pp.285-296
- /
- 2005
In mobile distributed systems the data on the air can be accessed by a lot of mobile clients. And, we need an indexing scheme in order to energy-efficiently access the data on the wireless broadcast stream. In conventional indexing schemes, they use the values of primary key attributes and construct tree-structured index. Therefore, the conventional indexing schemes do not support content-based retrieval queries such as partial-match queries. In this paper we propose an indexing scheme, called B2V-Tree, which supports partial match queries on wireless broadcast data stream. For this purpose, we construct a tree-structured index which is composed of bit-vectors, where the bit-vectors are generated from data records through multi-attribute hashing.
PDF KSCI

Development of Decision Tree Program based on Web for Analyzing Clinical Information of Sasang Constitutional Medicine (사상체질 임상정보 분석을 위한 웹 기반의 의사결정 나무 프로그램 개발)

Jin, Hee-Jeong;Kim, Myoung-Geun;Kim, Jong-Yeol
- Korean Journal of Oriental Medicine
- /
- v.14 no.3
- /
- pp.81-87
- /
- 2008
Sasanag Contitution Medicine(SCM) is the traditional medicine theory based on constitutional medicine in Korea. It is most import ant that a personal SCM type is determined accurately ahead of applying any Sasang treatments. For this, many researches have been studied to diagnose the SCM type using constitutional clinical data. The decision tree is a tree-structured data-mining methodology. Recently, in the Korean traditional medicine society, there have been several efforts to find diagnosing tools using the decision tree method. So, we developed a decision tree program based on web for analyzing constitutional clinical information. It can use various clinical data as input data, offer filtering function to select clinical data to be used. We can find useful factor to be influential on SCM types using this program.
PDF

Model Selection for Tree-Structured Regression

Kim, Sung-Ho
- Journal of the Korean Statistical Society
- /
- v.25 no.1
- /
- pp.1-24
- /
- 1996
In selecting a final tree, Breiman, Friedman, Olshen, and Stone(1984) compare the prediction risks of a pair of tree, where one contains the other, using the standard error of the prediction risk of the larger one. This paper proposes an approach to selection of a final tree by using the standard error of the difference of the prediction risks between a pair of trees rather than the standard error of the larger one. This approach is compared with CART's for simulated data from a simple regression model. Asymptotic results of the approaches are also derived and compared to each other. Both the asymptotic and the simulation results indicate that final trees by CART tend to be smaller than desired.
PDF

Wage Determinants Analysis by Quantile Regression Tree

Chang, Young-Jae
- Communications for Statistical Applications and Methods
- /
- v.19 no.2
- /
- pp.293-301
- /
- 2012
Quantile regression proposed by Koenker and Bassett (1978) is a statistical technique that estimates conditional quantiles. The advantage of using quantile regression is the robustness in response to large outliers compared to ordinary least squares(OLS) regression. A regression tree approach has been applied to OLS problems to fit flexible models. Loh (2002) proposed the GUIDE algorithm that has a negligible selection bias and relatively low computational cost. Quantile regression can be regarded as an analogue of OLS, therefore it can also be applied to GUIDE regression tree method. Chaudhuri and Loh (2002) proposed a nonparametric quantile regression method that blends key features of piecewise polynomial quantile regression and tree-structured regression based on adaptive recursive partitioning. Lee and Lee (2006) investigated wage determinants in the Korean labor market using the Korean Labor and Income Panel Study(KLIPS). Following Lee and Lee, we fit three kinds of quantile regression tree models to KLIPS data with respect to the quantiles, 0.05, 0.2, 0.5, 0.8, and 0.95. Among the three models, multiple linear piecewise quantile regression model forms the shortest tree structure, while the piecewise constant quantile regression model has a deeper tree structure with more terminal nodes in general. Age, gender, marriage status, and education seem to be the determinants of the wage level throughout the quantiles; in addition, education experience appears as the important determinant of the wage level in the highly paid group.
https://doi.org/10.5351/CKSS.2012.19.2.293 인용 PDF KSCI

Search Result 90, Processing Time 0.035 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)