• Title/Summary/Keyword: 거리기반학습

Search Result 255, Processing Time 0.025 seconds

A distance metric of nominal attribute based on conditional probability (조건부 확률에 기반한 범주형 자료의 거리 측정)

  • 이재호;우종하;오경환
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2003.09b
    • /
    • pp.53-56
    • /
    • 2003
  • 유사도 혹은 자료간의 거리 개념은 많은 기계학습 알고리즘에서 사용되고 있는 중요한 측정개념이다 하지만 입력되는 자료의 속성들중 순서가 정의되지 않은 범주형 속성이 포함되어 있는 경우, 자료간의 유사도나 거리 측정에 어려움이 따른다. 비거리 기반의 알고리즘들의 경우-C4.5, CART-거리의 측정없이 작동할 수 있지만, 거리기반의 알고리즘들의 경우 범주형 속성의 거리 정보 결여로 효과적으로 적용될 수 없는 문제점을 갖고 있다. 본 논문에서는 이러한 범주형 자료들간 거리 측정을 자료 집합의 특성을 충분히 고려한 방법을 제안한다. 이를 위해 자료 집합의 선험적인 정보를 필요로 한다. 이런 선험적 정보인 조건부 확률을 기반으로한 거리 측정방법을 제시하고 오류 피드백을 통해서 속성 간 거리 측정을 최적화 하려고 노력한다. 주어진 자료 집합에 대해 서로 다른 두 범주형 값이 목적 속성에 대해서 유사한 분포를 보인다면 이들 값들은 비교적 가까운 거리로 결정한다 이렇게 결정된 거리를 기반으로 학습 단계를 진행하며 이때 발생한 오류들에 대해 피드백 작업을 진행한다. UCI Machine Learning Repository의 자료들을 이용한 실험 결과를 통해 제안한 거리 측정 방법의 우수한 성능을 확인하였다.

  • PDF

k-Nearest Neighbor Learning with Varying Norms (놈(Norm)에 따른 k-최근접 이웃 학습의 성능 변화)

  • Kim, Doo-Hyeok;Kim, Chan-Ju;Hwang, Kyu-Baek
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2008.06c
    • /
    • pp.371-375
    • /
    • 2008
  • 예제 기반 학습(instance-based learning) 방법 중 하나인 k-최근접 이웃(k-nearest reighbor, k-NN) 학습은 간단하고 예측 정확도가 비교적 높아 분류 및 회귀 문제 해결을 위한 기반 방법론으로 널리 적용되고 있다. k-NN 학습을 위한 알고리즘은 기본적으로 유클리드 거리 혹은 2-놈(norm)에 기반하여 학습예제들 사이의 거리를 계산한다. 본 논문에서는 유클리드 거리를 일반화한 개념인 p-놈의 사용이 k-NN 학습의 성능에 어떠한 영향을 미치는지 연구하였다. 구체적으로 합성데이터와 다수의 기계학습 벤치마크 문제 및 실제 데이터에 다양한 p-놈을 적용하여 그 일반화 성능을 경험적으로 조사하였다. 실험 결과, 데이터에 잡음이 많이 존재하거나 문제가 어려운 경우에 p의 값을 작게 하는 것이 성능을 향상시킬 수 있었다.

  • PDF

The Relations of Learning Effectiveness and the Level of Learner's Structure Perception of Transactional Distance in Online Learning Environment (온라인 학습에서 교류거리의 구조지각수준과 학습효과의 관계)

  • Kim, Jungkyum;Lee, Sungil
    • The Journal of Korean Association of Computer Education
    • /
    • v.11 no.6
    • /
    • pp.85-94
    • /
    • 2008
  • This study is to find strategies to promote distance learning effectiveness in the online learning environment. The research showed that the level of learner's perception of the structure of transactional distance was not significantly different according to sex (p>.05). There was significant correlation between the level of learner's perception of the structure and academic satisfaction(p<.01). And, the level of learner's perception of the structure and learning durability appeared to have statistically significant correlation. However there was no significant difference(p>.05) between academic achievement. Among the three subordinate factors, the course interaction organization had the most influence on the learner's academic satisfaction and the learner's learning durability was influenced the most by the content organization.

  • PDF

An Incremental Multi Partition Averaging Algorithm Based on Memory Based Reasoning (메모리 기반 추론 기법에 기반한 점진적 다분할평균 알고리즘)

  • Yih, Hyeong-Il
    • Journal of IKEEE
    • /
    • v.12 no.1
    • /
    • pp.65-74
    • /
    • 2008
  • One of the popular methods used for pattern classification is the MBR (Memory-Based Reasoning) algorithm. Since it simply computes distances between a test pattern and training patterns or hyperplanes stored in memory, and then assigns the class of the nearest training pattern, it is notorious for memory usage and can't learn additional information from new data. In order to overcome this problem, we propose an incremental learning algorithm (iMPA). iMPA divides the entire pattern space into fixed number partitions, and generates representatives from each partition. Also, due to the fact that it can not learn additional information from new data, we present iMPA which can learn additional information from new data and not require access to the original data, used to train. Proposed methods have been successfully shown to exhibit comparable performance to k-NN with a lot less number of patterns and better result than EACH system which implements the NGE theory using benchmark data sets from UCI Machine Learning Repository.

  • PDF

A Study on the Storage Requirement and Incremental Learning of the k-NN Classifier (K_NN 분류기의 메모리 사용과 점진적 학습에 대한 연구)

  • 이형일;윤충화
    • The Journal of Information Technology
    • /
    • v.1 no.1
    • /
    • pp.65-84
    • /
    • 1998
  • The MBR (Memory Based Reasoning) is a supervised learning method that utilizes the distances among the input and trained patterns in its classification, and is also called a distance based learning algorithm. The MBR is based on the k-NN classifier, in which teaming is performed by simply storing training patterns in the memory without any further processing. This paper proposes a new learning algorithm which is more efficient than the traditional k-NN classifier and has incremental learning capability, Furthermore, our proposed algorithm is insensitive to noisy patterns, and guarantees more efficient memory usage.

  • PDF

A Comparison of Distance Metric Learning Methods for Face Recognition (얼굴인식을 위한 거리척도학습 방법 비교)

  • Suvdaa, Batsuri;Ko, Jae-Pil
    • Journal of Korea Multimedia Society
    • /
    • v.14 no.6
    • /
    • pp.711-718
    • /
    • 2011
  • The k-Nearest Neighbor classifier that does not require a training phase is appropriate for a variable number of classes problem like face recognition, Recently distance metric learning methods that is trained with a given data set have reported the significant improvement of the kNN classifier. However, the performance of a distance metric learning method is variable for each application, In this paper, we focus on the face recognition and compare the performance of the state-of-the-art distance metric learning methods, Our experimental results on the public face databases demonstrate that the Mahalanobis distance metric based on PCA is still competitive with respect to both performance and time complexity in face recognition.

Estimating Farmland Prices Using Distance Metrics and an Ensemble Technique (거리척도와 앙상블 기법을 활용한 지가 추정)

  • Lee, Chang-Ro;Park, Key-Ho
    • Journal of Cadastre & Land InformatiX
    • /
    • v.46 no.2
    • /
    • pp.43-55
    • /
    • 2016
  • This study estimated land prices using instance-based learning. A k-nearest neighbor method was utilized among various instance-based learning methods, and the 10 distance metrics including Euclidean distance were calculated in k-nearest neighbor estimation. One distance metric prediction which shows the best predictive performance would be normally chosen as final estimate out of 10 distance metric predictions. In contrast to this practice, an ensemble technique which combines multiple predictions to obtain better performance was applied in this study. We applied the gradient boosting algorithm, a sort of residual-fitting model to our data in ensemble combining. Sales price data of farm lands in Haenam-gun, Jeolla Province were used to demonstrate advantages of instance-based learning as well as an ensemble technique. The result showed that the ensemble prediction was more accurate than previous 10 distance metric predictions.

Long Distance Face Recognition System using the Automatic Face Image Creation by Distance (거리별 얼굴영상 자동 생성 방법을 이용한 원거리 얼굴인식 시스템)

  • Moon, Hae Min;Pan, Sung Bum
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.51 no.11
    • /
    • pp.137-145
    • /
    • 2014
  • This paper suggests an LDA-based long distance face recognition algorithm for intelligent surveillance system. The existing face recognition algorithm using single distance face image as training images caused a problem that face recognition rate is decreased with increasing distance. The face recognition algorithm using face images by actual distance as training images showed good performance. However, this also causes user inconvenience as it requires the user to move one to five meters in person to acquire face images for initial user registration. In this paper, proposed method is used for training images by using single distance face image to automatically create face images by various distances. The test result showed that the proposed face recognition technique generated better performance by average 16.3% in short distance and 18.0% in long distance than the technique using the existing single distance face image as training. When it was compared with the technique that used face images by distance as training, the performance fell 4.3% on average at a close distance and remained the same at a long distance.

A Machine learning Approach for Knowledge Base Construction Incorporating GIS Data for land Cover Classification of Landsat ETM+ Image (지식 기반 시스템에서 GIS 자료를 활용하기 위한 기계 학습 기법에 관한 연구 - Landsat ETM+ 영상의 토지 피복 분류를 사례로)

  • Kim, Hwa-Hwan;Ku, Cha-Yang
    • Journal of the Korean Geographical Society
    • /
    • v.43 no.5
    • /
    • pp.761-774
    • /
    • 2008
  • Integration of GIS data and human expert knowledge into digital image processing has long been acknowledged as a necessity to improve remote sensing image analysis. We propose inductive machine learning algorithm for GIS data integration and rule-based classification method for land cover classification. Proposed method is tested with a land cover classification of a Landsat ETM+ multispectral image and GIS data layers including elevation, aspect, slope, distance to water bodies, distance to road network, and population density. Decision trees and production rules for land cover classification are generated by C5.0 inductive machine learning algorithm with 350 stratified random point samples. Production rules are used for land cover classification integrated with unsupervised ISODATA classification. Result shows that GIS data layers such as elevation, distance to water bodies and population density can be effectively integrated for rule-based image classification. Intuitive production rules generated by inductive machine learning are easy to understand. Proposed method demonstrates how various GIS data layers can be integrated with remotely sensed imagery in a framework of knowledge base construction to improve land cover classification.

Enhanced FCM Based Hybrid Network for Effective Pattern Classification (효과적인 패턴분류를 위한 개선된 FCM 기반 하이브리드 네트워크)

  • Kim, Tae-Hyung;Cha, Eui-Young;Kim, Kwang-Baek
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2009.01a
    • /
    • pp.35-40
    • /
    • 2009
  • FCM 알고리즘은 입력 벡터와 각 클러스터의 유클리드 거리를 이용하여 구해진 소속도만를 비교하여 데이터를 분류하기 때문에 클러스터링 된 공간에서의 데이터들의 분포에 따라 바람직하지 못한 클러스터링 결과를 보일 수 있다. 이러한 문제점을 개선하기 위해 대칭적 성질을 이용하는 대칭성 측도에 퍼지 이론을 적용하여 군집간의 거리에 따른 변화와 군집 중심의 위치, 그리고 군집 형태에 따라 영향을 덜 받는 개선된 FCM이 제안되었다. 본 논문에서는 효과적으로 패턴을 분류하기 위해 개선된 FCM 알고리즘을 적용한 개선된 하이브리드 네트워크를 제안한다. 제안된 하이브리드 네트워크는 개선된 FCM 알고리즘을 입력층과 중간층의 학습구조 적용하고 중간층과 출력층의 학습구조는 일반화된 델타학습법을 적용한다. 제안된 방법의 인식성능을 평가하기 위해 2차원 좌표평면 상의 데이터를 기존의 Max_Min 신경망을 이용한 FCM 기반 RBF 네트워크와 FCM 기반 RBF 네트워크, HCM 기반 네트워크와 제안된 방법 간의 학습 및 인식 성능을 비교 및 분석하였다.

  • PDF