• Title/Summary/Keyword: Tree type

Search Result 1,070, Processing Time 0.026 seconds

Research on Comparing System with Syntactic-Semantic Tree in Subjective-type Grading (주관식 문제 채점에서의 구문의미트리 비교 시스템에 대한 연구)

  • Kang, WonSeog
    • The Journal of Korean Association of Computer Education
    • /
    • v.20 no.5
    • /
    • pp.79-88
    • /
    • 2017
  • To upgrade the subjective question grading, we need the syntactic-semantic analysis to analyze syntatic-semantic relation between words in answering. However, since the syntactic-semantic tree has structural and semantic relation between words, we can not apply the method calculating the similarity between vectors. This paper suggests the comparing system with syntactic-semantic tree which has structural and semantic relation between words. In this thesis, we suggest similarity calculation principles for comparing the trees and verify the principles through experiments. This system will help the subjective question grading by comparing the trees and be utilized in distinguishing similar documents.

CANCER CLASSIFICATION AND PREDICTION USING MULTIVARIATE ANALYSIS

  • Shon, Ho-Sun;Lee, Heon-Gyu;Ryu, Keun-Ho
    • Proceedings of the KSRS Conference
    • /
    • v.2
    • /
    • pp.706-709
    • /
    • 2006
  • Cancer is one of the major causes of death; however, the survival rate can be increased if discovered at an early stage for timely treatment. According to the statistics of the World Health Organization of 2002, breast cancer was the most prevalent cancer for all cancers occurring in women worldwide, and it account for 16.8% of entire cancers inflicting Korean women today. In order to classify the type of breast cancer whether it is benign or malignant, this study was conducted with the use of the discriminant analysis and the decision tree of data mining with the breast cancer data disclosed on the web. The discriminant analysis is a statistical method to seek certain discriminant criteria and discriminant function to separate the population groups on the basis of observation values obtained from two or more population groups, and use the values obtained to allow the existing observation value to the population group thereto. The decision tree analyzes the record of data collected in the part to show it with the pattern existing in between them, namely, the combination of attribute for the characteristics of each class and make the classification model tree. Through this type of analysis, it may obtain the systematic information on the factors that cause the breast cancer in advance and prevent the risk of recurrence after the surgery.

  • PDF

Research on Subjective-type Grading System Using Syntactic-Semantic Tree Comparator (구문의미트리 비교기를 이용한 주관식 문항 채점 시스템에 대한 연구)

  • Kang, WonSeog
    • The Journal of Korean Association of Computer Education
    • /
    • v.21 no.6
    • /
    • pp.83-92
    • /
    • 2018
  • The subjective question is appropriate for evaluation of deep thinking, but it is not easy to score. Since, regardless of same scoring criterion, the graders are able to produce different scores, we need the objective automatic evaluation system. However, the system has the problem of Korean analysis and comparison. This paper suggests the Korean syntactic analysis and subjective grading system using the syntactic-semantic tree comparator. This system is the hybrid grading system of word based and syntactic-semantic tree based grading. This system grades the answers on the subjective question using the syntactic-semantic comparator. This proposed system has the good result. This system will be utilized in Korean syntactic-semantic analysis, subjective question grading, and document classification.

Discovering Relationships between Skin Type and Life Style Using Data Mining Techniques: A Case Study of Korea

  • Kim, Taeheung;Ha, Jihyun;Lee, Jong-Seok;Oh, Younhak;Cho, Yong Ju
    • Industrial Engineering and Management Systems
    • /
    • v.15 no.1
    • /
    • pp.110-121
    • /
    • 2016
  • With the growing interest in skincare and maintenance, there are increasing numbers of studies on the classification of skin type and the factors influencing each type. This study presents a novel methodology by using data mining, for the determination of the relationships between skin type, lifestyle, and patterns of cosmetic utilization. Eight skin-specific factors, which are moisture, sebum in U-zone (both cheeks), sebum in T-zone (forehead, nose, and chin), pore, melanin, wrinkle, acne, hemoglobin, were measured in 1,246 subjects living in South Korea, in conjunction with a questionnaire survey analyzing their lifestyles and pattern of cosmetic utilization. Using various multivariate statistical methods and data mining techniques, we classified the skin types based on the skin-specific values, determined the relationship between skin type and lifestyle, and accordingly sorted the subjects into clusters. Logistic regression analysis revealed gender-related differences in the skin; therefore, separate analyses were performed for males and females. Using the Gaussian Mixture Modeling (GMM) technique, we classified the subjects based on skin type (two male and four female). Using the ANOVA and decision tree techniques, we attempted to characterize the relationship between each skin type and the lifestyles of the subjects. Menstruation, eating habits, stress, and smoking were identified as the major factors affecting the skin.

A Feature Analysis of Industrial Accidents Using C4.5 Algorithm (C4.5 알고리즘을 이용한 산업 재해의 특성 분석)

  • Leem, Young-Moon;Kwag, Jun-Koo;Hwang, Young-Seob
    • Journal of the Korean Society of Safety
    • /
    • v.20 no.4 s.72
    • /
    • pp.130-137
    • /
    • 2005
  • Decision tree algorithm is one of the data mining techniques, which conducts grouping or prediction into several sub-groups from interested groups. This technique can analyze a feature of type on groups and can be used to detect differences in the type of industrial accidents. This paper uses C4.5 algorithm for the feature analysis. The data set consists of 24,887 features through data selection from total data of 25,159 taken from 2 year observation of industrial accidents in Korea For the purpose of this paper, one target value and eight independent variables are detailed by type of industrial accidents. There are 222 total tree nodes and 151 leaf nodes after grouping. This paper Provides an acceptable level of accuracy(%) and error rate(%) in order to measure tree accuracy about created trees. The objective of this paper is to analyze the efficiency of the C4.5 algorithm to classify types of industrial accidents data and thereby identify potential weak points in disaster risk grouping.

Target Object Search Algorithm under Dynamic Programming in the Tree-Type Maze (Dynamic Programming을 적용한 트리구조 미로내의 목표물 탐색 알고리즘)

  • Lee Dong-Hoon;Yoon Han-Ul;Sim Kwee-Bo
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.15 no.5
    • /
    • pp.626-631
    • /
    • 2005
  • This paper presents the target object search algorithm under dynamic programming (DP) in the Tree-type maze. We organized an experimental environment with the concatenation Y-shape diverged way, small mobile robot, and a target object. By the principle of optimality, the backbone of DP, an agent recognizes that a given whole problem can be solved if the values of the best solution of certain ancillary problem can be determined according to the principle of optimality. In experiment, we used two different control algorithms: a left-handed method and DP. Finally we verified the efficiency of DP in the practical application using a real robot.

Analyzing Migration Decision-Making Characteristics Based on Population Change Pattern and Distribution of Basic Living Services in Rural Areas (농촌지역 인구변화 특성 및 기초생활서비스 분포 특성을 고려한 이주 의사 결정 요인 분석)

  • Kim, Suyeon;Choi, Jin-Ah
    • Journal of Korean Society of Rural Planning
    • /
    • v.28 no.4
    • /
    • pp.1-9
    • /
    • 2022
  • Rural decline due to the decrease of the local population is an inevitable phenomenon, and a vicious cycle has been formed between a lack of basic living services and a population decrease in rural areas. Therefore, the study aims to derive the migration decision-making characteristics based on basic living service infrastructure data in rural areas. To do this, the population change over the past 20 years was categorized into six types, and the relationship between the classified population change types and the number of basic living service infrastructures was analyzed using decision tree analysis. Of the total 3,501 regions, 801 regions were the continuous decline type, of which 740 were rural areas. On the other hand, among 569 regions that were the continuous increase type, 401 regions were urban areas, confirming the population imbalance between rural and urban areas. As a result of the decision tree analysis on the relationship between population change types and the distribution of basic living service infrastructure, the number of daycare centers was derived as an important variable to classify the continuous increase type. Hospitals, parks, and public transportation were also found to be major basic living services affecting the classification of population change types.

Incremental Generation of A Decision Tree Using Global Discretization For Large Data (대용량 데이터를 위한 전역적 범주화를 이용한 결정 트리의 순차적 생성)

  • Han, Kyong-Sik;Lee, Soo-Won
    • The KIPS Transactions:PartB
    • /
    • v.12B no.4 s.100
    • /
    • pp.487-498
    • /
    • 2005
  • Recently, It has focused on decision tree algorithm that can handle large dataset. However, because most of these algorithms for large datasets process data in a batch mode, if new data is added, they have to rebuild the tree from scratch. h more efficient approach to reducing the cost problem of rebuilding is an approach that builds a tree incrementally. Representative algorithms for incremental tree construction methods are BOAT and ITI and most of these algorithms use a local discretization method to handle the numeric data type. However, because a discretization requires sorted numeric data in situation of processing large data sets, a global discretization method that sorts all data only once is more suitable than a local discretization method that sorts in every node. This paper proposes an incremental tree construction method that efficiently rebuilds a tree using a global discretization method to handle the numeric data type. When new data is added, new categories influenced by the data should be recreated, and then the tree structure should be changed in accordance with category changes. This paper proposes a method that extracts sample points and performs discretiration from these sample points to recreate categories efficiently and uses confidence intervals and a tree restructuring method to adjust tree structure to category changes. In this study, an experiment using people database was made to compare the proposed method with the existing one that uses a local discretization.

Characteristic of Soil and Cambial Electrical Resistance for Investigation on Defect Cause of Planting Tree in Apartment

  • Cho, Chi-Woung;Yoo, Sun-Ah;Kim, Jeong-Ho
    • Journal of Environmental Science International
    • /
    • v.21 no.11
    • /
    • pp.1307-1320
    • /
    • 2012
  • The purpose of this paper is to provide information on planting construction for healthy plant growth. To achieve this purpose, this study analyzed the planting type, planting density, withering rate, soil characteristics, and cambium electrical resistance (CER) of withered trees in an apartment complex with a high withering rate. The major plant groups examined consisted of native broad-leaved tree species (39.3%), native narrow-leaved tree species (24.2%), and native broad-leaved - exotic narrow-leaved tree species (16.4%). The planting density of the green area, where trees were planted from 0.0 to 0.3 trees per unit area, was measured as 98.4%. Withered trees were found in 19 of the 20 planted species, and the withering rate was 41.8% (610 withered/1,461 planted). Withering rates for tree species were measured as follows: Sophora japonica and Salix babylonica (100.0%), Magmolia denudata (84.3%), Lindera obtusiloba (74.7%), cornus kousa (69.3%), acer triflorum (69.2%), diospyros kaki (66.7%), Prunus yedoensis (62.8%), Acer palmatum (52.6%), Prunus armeniaca (51.1%), Chaenomeles sinensis (43.7%), Ginkgo biloba (40.9%), Zelkova serrata (31.0%), Cornus officinalis (28.6%), Taxus cuspidata (25.6%), Pinus densiflora (21.4%), Pinus parviflora (15.2%), Pinus strobus (14.6%), and Abies holophylla (10.3%). Soil chemical analyses for 18 samples revealed that as the withering rate increased, the following occurred: (a) the ratio of silt and clay in soil increased; (b) the soil pH, organic matter rate, nitrogen, available phosphorus, and cation exchange capacity (CEC) in samples were graded as "inadequate," based on the plant grading evaluation; and (c) the NaCl and cation exchange capacity were evaluated as "somewhat satisfactory." The measurement of CER for withering rate shows electrical resistance for higher withering rate are higher, which could predict that a tree will not grow well.