• Title/Summary/Keyword: Data Tree

The guideline for choosing the right-size of tree for boosting algorithm (부스팅 트리에서 적정 트리사이즈의 선택에 관한 연구)

  • Kim, Ah-Hyoun;Kim, Ji-Hyun;Kim, Hyun-Joong
    • Journal of the Korean Data and Information Science Society / v.23 no.5 / pp.949-959 / 2012
  • This article seeks the decision tree size that gives the best performance for a boosting algorithm. We first define the tree size D as the depth of a decision tree, then compare the performance of boosting with different tree sizes experimentally. Although it is usual practice to keep trees small in boosting, we found that the choice of D has a significant influence on performance, and that D needs to be sufficiently large for some datasets. The experimental results show that an optimal D exists for each dataset and that choosing the right D is important for improving boosting performance. We also sought a model for estimating the right D using variables that describe the nature of a given dataset. The suggested model indicates that the optimal tree size D for a given dataset can be estimated from the error rate of a stump tree, the number of classes, the depth of a single tree, and the Gini impurity.
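
Two of the predictors named in this abstract, the Gini impurity and the error rate of a stump tree, can be computed directly from labeled data. The following is a minimal sketch under stated assumptions: the function names are hypothetical, the stump is restricted to a single numeric feature, and the abstract does not give the paper's estimation model for D, so none is reproduced here.

```python
from collections import Counter


def gini_impurity(labels):
    """Gini impurity 1 - sum(p_k^2) over the class proportions p_k."""
    n = len(labels)
    if n == 0:
        return 0.0
    counts = Counter(labels)
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


def stump_error_rate(xs, labels):
    """Lowest training error of a depth-1 tree on one numeric feature.

    Assumes a non-empty dataset. Each side of the split predicts its
    majority class; the no-split baseline is also considered.
    """
    pairs = sorted(zip(xs, labels))
    n = len(pairs)
    best = n - max(Counter(labels).values())  # no split: predict majority
    for i in range(1, n):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # cannot split between equal feature values
        left = Counter(label for _, label in pairs[:i])
        right = Counter(label for _, label in pairs[i:])
        errors = (i - max(left.values())) + (n - i - max(right.values()))
        best = min(best, errors)
    return best / n
```

A dataset on which the stump error is already near zero gives little reason to grow deeper trees, which is the intuition behind using it as a predictor.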

Detection of Individual Trees and Estimation of Mean Tree Height using Airborne LIDAR Data (항공 라이다데이터를 이용한 개별수목탐지 및 평균수고추정)

  • Hwang, Se-Ran;Lee, Mi-Jin;Lee, Im-Pyeong
    • Spatial Information Research / v.20 no.3 / pp.27-38 / 2012
  • As the need for forest conservation and management has increased, various forest studies using LIDAR data have been actively performed. These studies often use tree height as an important parameter for measuring forests quantitatively. This study therefore applies two representative methods of estimating tree height from airborne LIDAR data and compares the results. The first method, based on detecting individual trees with a local maximum filter, estimates the number of trees, the positions and heights of the individual trees, and the mean tree height. The other method estimates the maximum and mean tree height and the mean crown height, for each grid cell or for the entire area, from the canopy height model (CHM) and the height histogram. In comparison with field measurements, 76.6% of the individual trees are detected correctly, and the estimated heights of all trees and of conifers only show RMSEs of 1.91 m and 0.75 m, respectively. The mean tree heights estimated from the CHM have an RMSE of about 1 to 2 m, and the histogram method underestimates the mean tree height by about 0.6 m. For more accurate derivation of diverse forest information, we should select and integrate complementary methods appropriate to the tree types and the parameters being estimated.
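
The first method in this abstract, individual tree detection with a local maximum filter, can be sketched on a gridded canopy height model. This is an illustrative sketch only: the function names, the square window, and the `min_height` threshold are assumptions, not parameters taken from the paper.

```python
def detect_tree_tops(chm, window=1, min_height=2.0):
    """Return (row, col, height) for cells that are strict local maxima.

    chm is a list of rows of canopy heights in metres. A cell counts as a
    tree top if it is at least min_height and strictly higher than every
    other cell within `window` cells in each direction.
    """
    rows, cols = len(chm), len(chm[0])
    tops = []
    for r in range(rows):
        for c in range(cols):
            h = chm[r][c]
            if h < min_height:
                continue  # ignore ground and low vegetation
            neighbours = [
                chm[i][j]
                for i in range(max(0, r - window), min(rows, r + window + 1))
                for j in range(max(0, c - window), min(cols, c + window + 1))
                if (i, j) != (r, c)
            ]
            if all(h > v for v in neighbours):
                tops.append((r, c, h))
    return tops


def mean_tree_height(tops):
    """Mean height of the detected tree tops (0.0 if none were found)."""
    return sum(h for _, _, h in tops) / len(tops) if tops else 0.0


chm = [
    [0.0, 1.0, 0.0, 0.0, 0.0],
    [1.0, 8.0, 1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 3.0, 4.0],
    [0.0, 0.0, 0.0, 5.0, 12.0],
]
tops = detect_tree_tops(chm)
```

The window size controls the trade-off between splitting one crown into several trees (window too small) and merging neighbouring trees (window too large).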

Dynamic Extension of Genetic Tree Maps (유전 목 지도의 동적 확장)

  • Ha, Seong-Wook;Kwon, Kee-Hang;Kang, Dae-Seong
    • Journal of KIISE:Software and Applications / v.29 no.6 / pp.386-395 / 2002
  • In this paper, we propose dynamic genetic tree-maps (DGTM), which use optimal features for recognizing data. DGTM applies a genetic algorithm to the importance of features, which is rarely considered in conventional neural networks, and introduces genetic tree-maps (GTM), which use a tree structure ordered by feature priority. We then propose an extended formulation, dynamic GTM (DGTM), which has dynamic functions that separate and merge the neurons of the neural network according to the similarity of features.

CST-Tree with improved Mint system multimedia content using the refine (CST-트리를 채택한 Mint 시스템을 이용한 멀티미디어 콘텐츠 검색 개선)

  • Jung, Myoung Jin;Cho, Sung Je
    • Journal of Korea Society of Digital Industry and Information Management / v.9 no.3 / pp.145-153 / 2013
  • Existing methods for searching multimedia content in mobile computing environments tend to incur considerable overhead when searching large volumes of data. The MINT system has been studied as a way to resolve this problem. However, the MINT system itself incurs considerable overhead because it searches multimedia content using a B-Tree. The suggested method is the CST-MCR search system, which is based on the CST-Tree. Its distinguishing characteristic is that it improves the MINT system by applying the CST-Tree index. In the performance evaluation, CST-MCR improved on the MINT system by 4.27% on average. The results indicate that the suggested method is superior.

I-Tree: A Frequent Patterns Mining Approach without Candidate Generation or Support Constraint

  • Tanbeer, Syed Khairuzzaman;Sarkar, Jehad;Jeong, Byeong-Soo;Lee, Young-Koo;Lee, Sung-Young
    • Proceedings of the Korea Information Processing Society Conference / 2007.05a / pp.31-33 / 2007
  • Devising an efficient one-pass frequent pattern mining algorithm has been an issue in data mining research in the recent past. Pattern growth algorithms such as FP-Growth, which have been found to be more efficient than candidate generation-and-test algorithms, still require two database scans. Moreover, the FP-Growth approach requires rebuilding the base tree when mining with different support counts. In this paper we propose an item-based tree, called I-Tree, that not only mines frequent patterns efficiently with a single database scan but also provides multiple mining scopes with multiple support thresholds. The 'build-once-mine-many' property of I-Tree allows it to construct the tree only once and perform the mining operation several times with different support count values.
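
The 'build-once-mine-many' property can be illustrated with a plain prefix tree. The abstract does not describe the I-Tree structure itself, so the sketch below is a stand-in rather than the authors' algorithm: it stores transactions in one pass using lexicographic item order (an assumption chosen because it does not depend on any support threshold), then mines by simple pattern growth at whatever threshold is requested. All names are hypothetical.

```python
from collections import defaultdict


class Node:
    def __init__(self):
        self.children = {}
        self.end_count = 0  # number of transactions ending at this node


def build_tree(transactions):
    """Build the prefix tree in a single pass over the database."""
    root = Node()
    for t in transactions:
        node = root
        for item in sorted(set(t)):  # support-independent canonical order
            node = node.children.setdefault(item, Node())
        node.end_count += 1
    return root


def weighted_transactions(node, prefix=()):
    """Yield (sorted item tuple, multiplicity) for each stored transaction."""
    if node.end_count:
        yield prefix, node.end_count
    for item, child in node.children.items():
        yield from weighted_transactions(child, prefix + (item,))


def mine(root, min_support):
    """Return {itemset tuple: support} for all itemsets meeting min_support."""
    result = {}

    def grow(db, suffix):
        counts = defaultdict(int)
        for items, weight in db:
            for item in items:
                counts[item] += weight
        for item in sorted(counts):
            if counts[item] < min_support:
                continue
            pattern = suffix + (item,)
            result[pattern] = counts[item]
            # Project onto transactions containing `item`, keeping only the
            # items that come after it, so each itemset is grown exactly once.
            projected = [
                (items[items.index(item) + 1:], weight)
                for items, weight in db
                if item in items
            ]
            grow(projected, pattern)

    grow(list(weighted_transactions(root)), ())
    return result


tree = build_tree([["a", "b", "c"], ["a", "b"], ["a", "c"], ["b", "c"], ["c", "b", "a"]])
loose = mine(tree, 2)   # same tree, lower threshold
strict = mine(tree, 4)  # same tree, higher threshold
```

The tree is built once and then queried with two different thresholds, which is the behaviour the abstract contrasts with rebuilding an FP-tree for each support count.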

A Study on Decision Tree for Multiple Binary Responses

  • Lee, Seong-Keon
    • Communications for Statistical Applications and Methods / v.10 no.3 / pp.971-980 / 2003
  • The tree method can be extended to multivariate responses, such as repeated measures and longitudinal data, by modifying the split function to accommodate multiple responses. Decision trees for multiple responses have been constructed by Segal (1992) and Zhang (1998). Segal suggested a tree that can analyze continuous longitudinal responses using the Mahalanobis distance as the within-node homogeneity measure, and Zhang suggested a tree that can analyze multiple binary responses using a generalized entropy criterion, which is proportional to the maximum likelihood of the joint distribution of the multiple binary responses. In this paper, we modify the CART procedure and suggest a new tree-based method that can analyze multiple binary responses using similarity measures.

Annual Precipitation Reconstruction Based on Tree-ring Data at Seorak (설악산 지역의 Tree-ring 자료를 이용한 연 강수량 재생성)

  • Kwak, Jae Won;Han, Heechan;Lee, Minjung;Kim, Hung Soo;Mun, Jangwon
    • Journal of Korean Society on Water Environment / v.31 no.1 / pp.19-28 / 2015
  • The purpose of this research is to reconstruct annual precipitation from the tree-ring series at Seorak Mountain and to examine the effectiveness of the reconstruction. To do so, we tested the tree-ring series for nonlinear time series characteristics and reconstructed the annual precipitation of Gangneung from 1687 to 1911 using an artificial neural network and a nonlinear autoregressive exogenous input (NARX) model, which reflects stochastic properties. The tree-ring series at Seorak Mountain shows nonlinear time series properties, and the reconstructed annual precipitation series from the NARX model is similar in its statistical characteristics to the observed annual series.

On Design and Implementation of Incremental LR Parsing Algorithm Using Changed Threed Tree (변화된 스레드 트리를 이용한 점진적 LR 파싱 알고리즘 구현 및 설계)

  • Lee, Dae-Sik
    • Convergence Security Journal / v.5 no.4 / pp.19-25 / 2005
  • A threaded tree is a data structure that can express the parse stack as well as the parse tree using the LR parsing table. Larchevêque builds a threaded tree and performs incremental parsing with a stack. This paper suggests an algorithm that uses a changed threaded tree without a stack in order to reduce the number of reparsed nodes and the parsing time. It also suggests an incremental parsing algorithm that eliminates the reparsing process at nodes.

PSN: A Dynamic Numbering Scheme for W3C XQuery Update Facility

  • Hong, Dong-Kweon
    • International Journal of Fuzzy Logic and Intelligent Systems / v.8 no.2 / pp.121-125 / 2008
  • Maintaining hierarchical information properly is essential for efficient XML query processing. A well-known approach to representing the hierarchy of an XML tree is to assign a specific node number to each node. Nodes can be inserted or deleted at any position in a dynamic XML tree. A dynamic numbering scheme allows nodes to be added to or deleted from an XML tree without relabeling, or with relabeling of only a few existing nodes, while still executing XML queries efficiently. According to the W3C XQuery Update Facility specification, a node can be added as the first or last child of an existing node in an XML tree. Generating a new number for the last child requires referencing the number of the previous last child, and obtaining that number is very costly with previous approaches. We have developed a new dynamic numbering scheme, PSN, which is very effective for inserting a node as the last child. Our approach dramatically reduces the time needed to find the last child by removing the sorting of children.
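
The cost the abstract describes, looking up the previous last child before numbering a new one, can be avoided if each parent caches the ordinals of its first and last children. The sketch below shows that idea with Dewey-style tuple labels. It is not the PSN scheme, whose details the abstract does not give; the class, method, and attribute names are all hypothetical.

```python
class XmlNode:
    """Node labeled with a tuple of ordinals, one per ancestor level."""

    def __init__(self, name, label=()):
        self.name = name
        self.label = label
        self.children = []       # storage order is irrelevant; labels carry order
        self.first_ordinal = 1   # ordinal of the current first child
        self.last_ordinal = 0    # ordinal of the current last child (none yet)

    def append_last(self, name):
        """Add a last child in O(1): no sorting or scanning of children."""
        self.last_ordinal += 1
        child = XmlNode(name, self.label + (self.last_ordinal,))
        self.children.append(child)
        return child

    def insert_first(self, name):
        """Add a first child in O(1) by extending the ordinals downwards."""
        self.first_ordinal -= 1
        child = XmlNode(name, self.label + (self.first_ordinal,))
        self.children.append(child)
        return child


def document_order(root):
    """All descendants of root, sorted into document order by label."""
    nodes, stack = [], list(root.children)
    while stack:
        node = stack.pop()
        nodes.append(node)
        stack.extend(node.children)
    return sorted(nodes, key=lambda n: n.label)


root = XmlNode("root")
a = root.append_last("a")
b = root.append_last("b")
z = root.insert_first("z")   # no existing label changes
c = a.append_last("c")
names = [n.name for n in document_order(root)]
```

Because Python compares tuples lexicographically and a prefix sorts before its extensions, a parent always precedes its children and existing labels never need to change on either kind of insertion.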

Evaluation Method of College English Education Effect Based on Improved Decision Tree Algorithm

  • Dou, Fang
    • Journal of Information Processing Systems / v.18 no.4 / pp.500-509 / 2022
  • With the rapid development of educational informatization, teaching methods have become diversified, but the large volume of information data restricts the evaluation of teaching subjects and objects in terms of the effect of English education. Therefore, this study adopts the concept of incremental learning and an eigenvalue interval algorithm to improve the weighted decision tree, and builds an English education effect evaluation model based on association rules. According to the results, the average accuracy of information classification of the improved decision tree algorithm is 96.18%, the classification error rate can be as low as 0.02%, and the anti-fitting performance is good. The classification error rate between the improved decision tree algorithm and the original decision tree does not exceed 1%. The proposed educational evaluation method can effectively provide early warning in academic situation analysis, accelerate the improvement of teachers' professional skills, and help perfect the education system.