Search | Korea Science

The guideline for choosing the right-size of tree for boosting algorithm (부스팅 트리에서 적정 트리사이즈의 선택에 관한 연구)

Kim, Ah-Hyoun;Kim, Ji-Hyun;Kim, Hyun-Joong
- Journal of the Korean Data and Information Science Society
- /
- v.23 no.5
- /
- pp.949-959
- /
- 2012
This article is to find the right size of decision trees that performs better for boosting algorithm. First we defined the tree size D as the depth of a decision tree. Then we compared the performance of boosting algorithm with different tree sizes in the experiment. Although it is an usual practice to set the tree size in boosting algorithm to be small, we figured out that the choice of D has a significant influence on the performance of boosting algorithm. Furthermore, we found out that the tree size D need to be sufficiently large for some dataset. The experiment result shows that there exists an optimal D for each dataset and choosing the right size D is important in improving the performance of boosting. We also tried to find the model for estimating the right size D suitable for boosting algorithm, using variables that can explain the nature of a given dataset. The suggested model reveals that the optimal tree size D for a given dataset can be estimated by the error rate of stump tree, the number of classes, the depth of a single tree, and the gini impurity.
https://doi.org/10.7465/jkdi.2012.23.5.949 인용 PDF KSCI

Oriental Medicine-based Health Pre-Diagnosis System using Fuzzy Decision Tree (퍼지 의사 결정 트리를 이용한 한의학 기반의 건강 사전 진단 시스템)

Kim, Kwang Baek
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.25 no.11
- /
- pp.1519-1524
- /
- 2021
In this paper, we propose a method that uses fuzzy decision tree based health pre-diagnosis system of oriental medicine. The proposed fuzzy decision tree based health pre-diagnosis system uses the data from the past which has been pre-trained to get the boundary values based on entropy then, when the user inputs the symptoms, the top 5 diseases that causes those symptoms are extracted. With the extracted top 5 diseases, the system provides information on those diseases with the cause and how to treat them with folk remedies. The database of the diseases and their symptoms is established with the information based on the various books that the oriental doctor recommended then reviewed by the oriental doctor for confirmation. By utilizing the data from the past to train the symptoms of the diseases, the proposed oriental medicine-based health pre-diagnosis system method could provide more accurate diagnosis results faster.
https://doi.org/10.6109/jkiice.2021.25.11.1519 인용 PDF KSCI

Extraction of Blood Velocity Using FCM and Fuzzy Decision Trees in Doppler Ultrasound Images of Brachial Artery (상완동맥 색조 도플러 초음파 영상에서 FCM과 퍼지 의사 결정 트리를 이용한 혈류 속도 추출)

Kim, Kwang Baek;Jung, Young Jin;Nam, Youn Man;Lee, Jae Yeol
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2019.07a
- /
- pp.19-22
- /
- 2019
상완동맥은 어깨에서부터 팔꿈치까지 내려오는 상완골의 내측부에 존재하며 혈압을 측정할 때 사용되는 혈관이다. 이 혈관은 골절로 인해 찢어지거나, 또는 혈액순환에 문제가 생겨 혈관이 막히는 경우가 발생한다. 이러한 경우 혈관의 상태를 확인하기 위하여 색조 도플러 초음파 검사를 사용하지만, 사용자에 따라 영상을 통한 판단 기준이 다르다는 문제점이 발생한다. 따라서 본 논문에서는 FCM과 Fuzzy Decision Tree를 이용한 영상 처리를 통해 일관성 있는 판단기준을 세우기 위한 혈류의 속도를 제안한다. 색조 도플러 초음파 영상에서의 상완 동맥을 추출하여 기울기를 이용한 FCM 알고리즘을 통해 소속도를 추출한 뒤 퍼지 룰에 적용하여 의사 결정 트리로 등급을 분류하고 결과적으로 혈류 속도를 추출한다. 색조 도플러 초음파 영상에서 환자의 개인 정보를 보호하기 위해 개인 정보 영역을 제거하여 ROI 영역을 추출하고 ROI 영역을 이진화를 통하여 상완동맥이 있는 영역을 추출한다. 이진화 된 ROI 영역에서 혈관 영상의 혈류 방향으로의 무게중심을 설정하고 각각의 픽셀과 무게중심 선과의 거리를 이용하여 소속도를 추출한 후 FCM을 사용하여 최적의 기울기를 선정한다. FCM을 통해 추출한 최종 소속도를 이용하여 퍼지 룰에 적용한 뒤 계산된 T-norm과 소속도의 분산을 이용하여 의사 결정 트리를 형성 트리의 단말 노드들은 각 픽셀을 분류한다. 분류되어진 데이터들의 노드별 소속도 평균을 구한 뒤 디퍼지화를 통해 COG(Center of Gravity)를 계산한다. 마지막으로 그 값을 이용하여 혈류 속도에 영향을 미치는 정도를 계산한 뒤 최종 혈류의 속도를 제안한다.
PDF

Prediction Model for Unpaid Customers Using Big Data (빅 데이터 기반의 체납 수용가 예측 모델)

Jeong, Jaean;Lee, Kyouhwan;Jung, Hoekyung
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.24 no.7
- /
- pp.827-833
- /
- 2020
In this paper, to reduce the unpaid rate of local governments, the internal data elements affecting the arrears in Water-INFOS are searched through interviews with meter readers in certain local governments. Candidate data affecting arrears from national statistical data were derived. The influence of the independent variable on the dependent variable was sampled by examining the disorder of the dependent variable in the data set called information gain. We also evaluated the higher prediction rates of decision tree and logistic regression using n-fold cross-validation. The results confirmed that the decision tree can find more accurate customer payment patterns than logistic regression. In the process of developing an analysis algorithm model using machine learning, the optimal values of two environmental variables, the minimum number of data and the maximum purity, which directly affect the complexity and accuracy of the decision tree, are derived to improve the accuracy of the algorithm.
https://doi.org/10.6109/jkiice.2020.24.7.827 인용 PDF KSCI

Decision Tree Based Application Recommendation System (의사결정트리 기반 애플리케이션 추천 시스템)

Kim, Doo-Hyeong;Shin, Jae-Myong;Park, Sang-Won
- Proceedings of the Korean Information Science Society Conference
- /
- 2012.06d
- /
- pp.140-142
- /
- 2012
최근 상황인지에 관한 연구가 활발히 진행되고 있으며 스마트폰의 각종 센서를 통해 사용자의 컨텍스트 파악이 가능해졌다. 이에 따라서 스마트폰의 컨텍스트 파악을 통해서 사용자에게 각종 친화적 서비스 모델이 많이 생겨 나고 있다. 사용자의 경로 추론, 실내에서의 사용자의 위치파악, 사용자 위치기반 편의시설 추천 등이 그 예이며, 그 중 애플리케이션 추천은 대표적인 서비스라 할 수 있다. 애플리케이션 추천은 사용자의 컨텍스트에 따라서 애플리케이션 사용내역을 로그 데이터로 만들고, 로그 데이터를 기반으로 컨텍스트에 따라서 사용자의 애플리케이션 추천을 해주는 시스템이다. 여기서 로그 데이터를 가공하지 않고 통계를 통해 추천이 가능하지만, 로그 데이터를 사용하여 의사 결정 트리를 만들게 되면 보다 정확하고, 빠르게 추천이 가능하며 적은 로그 데이터로 더 많은 컨텍스트에 적용하여 추천 할 수 있다는 이점이 있다. 본 논문에서는 사용자의 컨텍스트 추출하고 이 데이터를 기반으로 의사결정트리를 만들어 앱을 추천하는 시스템을 제안한다. 이러한 컨텍스트 수집 방법과 추론모델을 이용한 애플리케이션 추천 시스템은 추후 사용자 친화적 서비스 연구에 많은 도움이 될 것이다.

Korean Caption Extraction with Decision Tree (의사결정 트리를 이용한 한글 자막 추출)

Jung, Je-Hee;Lee, Seun-Hoon;Kim, Jae-Kwang;Lee, Jee-Hyong
- Proceedings of the Korean Information Science Society Conference
- /
- 2008.06c
- /
- pp.527-532
- /
- 2008
자막은 영상과 관련이 있는 정보를 포함한다. 이러한 영상의 정보를 이용하기 위해서 자막을 추출하는 연구가 진행되고 있다. 기존의 자막 추출 연구는 언어 독립적인 특징으로 자막을 이루는 획의 에지는 일정한 간격을 유지하거나 수평라인으로 존재하는 글자의 분포를 이용한 방법을 제안하였다. 이러한 방법들은 획의 간격이 일정한 자막이나 하나의 글자가 하나의 획으로 이루어진 글자에서만 정상적인 동작을 보장하였다. 본 논문에서는 한글 자막 특징을 고려한 자막 추출 방법을 제안한다. 먼저, 한글 자막의 특징인 가로 획의 다수 분포를 고려한 적응형 에지 이진화를 수행하여 에지 영상을 생성하고 에지 연결 객체를 생성한다. 그 후에 생성한 연결 객체를 특징을 추출하여 사전에 생성한 의사결정 트리로 연결 객체를 자막과 비자막 연결객체로 분류한다. 의사결정 트리를 생성하기 위해서 사용한 연결 객체는 뉴스, 다큐멘터리 프로그램에서 획득하였으며, 성능 평가를 위해서 뉴스, 다큐멘터리, 스포츠 프로그램과 같은 대중 방송에서 획득한 영상에서 자막을 추출하였다. 평가 방법은 찾아진 연결 객체 중에 자막 연결 객체의 비율과 전체 자막 중에서 찾아진 자막 연결 객체의 비율로 분석하였다. 실험 결과에서는 제안한 방법이 한글 자막의 추출에 적용 가능함을 보여준다.
PDF

An Automatic Method for Selecting Comparative Standard Land Parcels in Land Price Appraisal Using a Decision Tree (의사결정트리를 이용한 개별 공시지가 비교표준지의 자동 선정)

Kim, Jong-Yoon;Park, Soo-Hong
- Journal of the Korean Association of Geographic Information Studies
- /
- v.7 no.1
- /
- pp.9-19
- /
- 2004
The selection of comparative standard parcels should be objective and reasonable, which is an important task in the individual land price appraisal procedure. However, the current procedure is mainly done manually by government officials. Therefore, the efficiency and objectiveness of this selection procedure is not guaranteed and questionable. In this study, we first defined the problem by analyzing the current comparative standard land parcel selection method. In addition, we devised a decision tree-based method using a machine learning algorithm that is considered to be efficient and objective compared to the current selection procedure. Finally the proposed method is then applied to the study area for evaluating the appropriateness and accuracy.
PDF

Box Office Hit Prediction Using Data mining and Text mining (데이터마이닝과 텍스트마이닝을 활용한 영화 흥행 예측)

Jo, Hyo-jung
- Proceedings of the Korea Information Processing Society Conference
- /
- 2021.05a
- /
- pp.316-318
- /
- 2021
영화 수익에 있어 영화의 흥행 여부는 중요한 영향을 끼친다. 영화 흥행 요인은 영화 산업의 규모가 커지면서 많은 제작사들 및 투자자들이 고려해야 하는 사항이 되었다. 따라서 영화의 흥행을 예측하기 위한 많은 모델이 연구되었다. 본 연구의 목적은 선행연구에서 흥행에 유의미한 영향을 끼친다고 밝혀진 스크린 수, 감독명, 제작사명 등의 내재적인 속성과 더불어 온라인 구전 변수를 사용하여 영화 흥행 예측 모델을 만드는 것이다. 이때 기사 수, 블로그 수와 같이 온라인 구전의 크기를 나타내는 변수들을 사용하는 대신 개봉 후 첫 주간의 관람객 리뷰를 텍스트마이닝을 이용하여 전체 리뷰 중 긍정 리뷰의 비율에 따라 점수를 매긴 후 독립변수로 사용한다. 그 후, 데이터 마이닝 기법을 활용하여 만든 모델에 앞서 언급한 독립변수를 입력 값으로 사용하여 영화의 흥행을 예측한다. 최종적으로 의사결정트리와 로지스틱회귀를 수행한 결과 영화 흥행에 영향을 주는 독립변수를 찾고 모델의 성능을 평가하였다. 로지스틱회귀의 결과 관객 수, 평점이 영화의 흥행에 특히 유의한 영향을 끼치는 변수로 선정되었고 리뷰 역시 유의한 변수로 선정되었다. 이때 만들어진 모델은 약 90%의 높은 수준의 정확도를 보여주었다. 의사결정트리의 결과 관객 수가 가장 중요한 변수로 선정되었다.
https://doi.org/10.3745/PKIPS.y2021m05a.316 인용 PDF

The Extended Cube Tree for Distribution Area Query Processing in Spatial Data Warehouses (공간 데이터 웨어하우스에서 분포 지역 질의 처리를 위한 확장된 큐브 트리 기법)

최준호;유병섭;박순영;배해영
- Proceedings of the Korean Information Science Society Conference
- /
- 2004.10b
- /
- pp.76-78
- /
- 2004
최근 원격 탐사 시스템 등이 발전함에 따라 축적된 공간 데이터의 양이 증가했고 이를 공간 데이터 웨어하우스 분야에서 의사 결정에 활용하는 방안이 중요한 이슈가 되고 있다. 기존의 활용 방법은 주어진 영역을 기준으로 공간 범위-집계를 검색하는 형태였지만, 최근 특정 성향 분석을 위해 분포 질의를 요청하고 그 결과 지역에 대한 공간 분석을 통한 의사결정의 필요성이 대두되었다. 하지만 기존의 처리 방법으로 비공간 질의를 처리하기 위해서는 모든 데이터를 검색해야 하므로 분포 질의를 처리하기 위한 비용이 증가하게 된다. 본 논문에서는 분포 지역 질의 처리를 위한 확장된 큐브 트리 기법을 제안한다. 제안하는 기법은 분석하고자 하는 사실 테이블의 비공간 속성을 큐브 트리의 키로 사용하고, 이 속성과 관련된 공간 데이터의 포인터 집합을 관리한다. 본 논문의 제안 기법을 공간 데이터 웨어하우스에 적용함으로써 비공간 속성 질의를 통해 공간 객체를 결과로 요청하는 형태의 질의를 지원할 수 있게 되며 사실 컬럼을 계층화시킴으로서 사용자에게 좀 더 다각적인 분석을 지원할 수 있다.
PDF

Context Aware Environment based U-Health Service of Recommendation Factors Identity and Decision-Making Model Creation (상황인지 환경 기반 유헬스 서비스의 추천 요인 식별 및 의사결정 모델 생성)

Kim, Jae-Kwon;Lee, Young-Ho
- Journal of Digital Convergence
- /
- v.11 no.5
- /
- pp.429-436
- /
- 2013
Context aware environment u-health service is to provide health service with recognition of a computer. The computer recognizes that a patient can contact real life in many context. Context aware environment service for recommend have to definition of context data and service recommendations related to factors shall be identified. In this paper, Context aware environment of u-health service will be provide context data related to identifies recommendations factors using multivariate analysis method and recommendations factors creation to decision tree, association rule based decision model. health service recommend for significantly context data can be distinguish through recommendation factors of identify. Also, context data of patient can know preference factors through preference decision model.
https://doi.org/10.14400/JDPM.2013.11.5.429 인용 PDF

Search Result 242, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)