• Title/Summary/Keyword: AdaBoost 알고리즘

Search Result 88, Processing Time 0.025 seconds

An Improved AdaBoost Algorithm by Clustering Samples (샘플 군집화를 이용한 개선된 아다부스트 알고리즘)

  • Baek, Yeul-Min;Kim, Joong-Geun;Kim, Whoi-Yul
    • Journal of Broadcast Engineering
    • /
    • v.18 no.4
    • /
    • pp.643-646
    • /
    • 2013
  • We present an improved AdaBoost algorithm to avoid overfitting phenomenon. AdaBoost is widely known as one of the best solutions for object detection. However, AdaBoost tends to be overfitting when a training dataset has noisy samples. To avoid the overfitting phenomenon of AdaBoost, the proposed method divides positive samples into K clusters using k-means algorithm, and then uses only one cluster to minimize the training error at each iteration of weak learning. Through this, excessive partitions of samples are prevented. Also, noisy samples are excluded for the training of weak learners so that the overfitting phenomenon is effectively reduced. In our experiment, the proposed method shows better classification and generalization ability than conventional boosting algorithms with various real world datasets.

Prediction of Citizens' Emotions on Home Mortgage Rates Using Machine Learning Algorithms (기계학습 알고리즘을 이용한 주택 모기지 금리에 대한 시민들의 감정예측)

  • Kim, Yun-Ki
    • Journal of Cadastre & Land InformatiX
    • /
    • v.49 no.1
    • /
    • pp.65-84
    • /
    • 2019
  • This study attempted to predict citizens' emotions regarding mortgage rates using machine learning algorithms. To accomplish the research purpose, I reviewed the related literature and then set up two research questions. To find the answers to the research questions, I classified emotions according to Akman's classification and then predicted citizens' emotions on mortgage rates using six machine learning algorithms. The results showed that AdaBoost was the best classifier in all evaluation categories. However, the performance level of Naive Bayes was found to be lower than those of other classifiers. Also, this study conducted a ROC analysis to identify which classifier predicts each emotion category well. The results demonstrated that AdaBoost was the best predictor of the residents' emotions on home mortgage rates in all emotion categories. However, in the sadness class, the performance levels of the six algorithms used in this study were much lower than those in the other emotion categories.

Face Detection using Color Information and AdaBoost Algorithm (색상정보와 AdaBoost 알고리즘을 이용한 얼굴검출)

  • Na, Jong-Won;Kang, Dae-Wook;Bae, Jong-Sung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.12 no.5
    • /
    • pp.843-848
    • /
    • 2008
  • Most of face detection technique uses information from the face of the movement. The traditional face detection method is to use difference picture method ate used to detect movement. However, most do not consider this mathematical approach using real-time or real-time implementation of the algorithm is complicated, not easy. This paper, the first to detect real-time facial image is converted YCbCr and RGB video input. Next, you convert the difference between video images of two adjacent to obtain and then to conduct Glassfire Labeling. Labeling value compared to the threshold behavior Area recognizes and converts video extracts. Actions to convert video to conduct face detection, and detection of facial characteristics required for the extraction and use of AdaBoost algorithm.

Text Filtering using Iterative Boosting Algorithms (반복적 부스팅 학습을 이용한 문서 여과)

  • Hahn, Sang-Youn;Zang, Byoung-Tak
    • Journal of KIISE:Software and Applications
    • /
    • v.29 no.4
    • /
    • pp.270-277
    • /
    • 2002
  • Text filtering is a task of deciding whether a document has relevance to a specified topic. As Internet and Web becomes wide-spread and the number of documents delivered by e-mail explosively grows the importance of text filtering increases as well. The aim of this paper is to improve the accuracy of text filtering systems by using machine learning techniques. We apply AdaBoost algorithms to the filtering task. An AdaBoost algorithm generates and combines a series of simple hypotheses. Each of the hypotheses decides the relevance of a document to a topic on the basis of whether or not the document includes a certain word. We begin with an existing AdaBoost algorithm which uses weak hypotheses with their output of 1 or -1. Then we extend the algorithm to use weak hypotheses with real-valued outputs which was proposed recently to improve error reduction rates and final filtering performance. Next, we attempt to achieve further improvement in the AdaBoost's performance by first setting weights randomly according to the continuous Poisson distribution, executing AdaBoost, repeating these steps several times, and then combining all the hypotheses learned. This has the effect of mitigating the ovefitting problem which may occur when learning from a small number of data. Experiments have been performed on the real document collections used in TREC-8, a well-established text retrieval contest. This dataset includes Financial Times articles from 1992 to 1994. The experimental results show that AdaBoost with real-valued hypotheses outperforms AdaBoost with binary-valued hypotheses, and that AdaBoost iterated with random weights further improves filtering accuracy. Comparison results of all the participants of the TREC-8 filtering task are also provided.

Face Recognition using AdaBoost Algorithm and Development of Surveillance Robot for a Ship (AdaBoost 알고리즘을 이용한 얼굴인식 및 선박용 감시로봇 개발)

  • Go, Seok-Jo;Park, Jang-Sik;Jang, Yong-Seo;Choi, Moon-Ho
    • The Journal of Korea Robotics Society
    • /
    • v.3 no.3
    • /
    • pp.219-225
    • /
    • 2008
  • This study developed a surveillance robot for a ship. The developed robot consists of ultrasonic sensors, an actuator, a lighting fixture and a camera. The ultrasonic sensors are used to avoid collision with obstacles in the environment. The actuator is a servo motor system. The developed robot has four drive wheels for driving. The lighting fixture is used to guide the robot in a dark environment. To transmit an image, a camera with a pan moving and a tilt moving is equipped on the upper part of the robot. AdaBoost algorithm trained with 15 features, is used for face recognition. In order to evaluate the face recognition of the developed robot, experiments were performed.

  • PDF

A Method to Improve the Performance of Weak Classifier in AdaBoost by Considering Features Distribution (특징분포를 고려한 AdaBoost 약분류기의 성능 개선방법)

  • Lee, Gyung-Ju;Choi, Hyung-Il;Kim, Gye-Young
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2012.01a
    • /
    • pp.209-211
    • /
    • 2012
  • 본 논문에서는 AdaBoost 알고리즘에서 약분류기(Weak Classifier)의 성능을 개선하기 위한 임계값 설정 방법을 제안한다. 일반적으로 약분류기에 사용되는 임계값은 특징들의 평균값을 많이 사용하지만 이는 특징들의 분포가 고려되지 않았기 때문에 분별력이 많이 떨어진다. 그러므로 각 특징들의 분포를 고려한 약분류기의 임계값 설정방법을 제안한다. 이는 얼굴에 대한 간단한 학습 및 테스트를 통하여 기존 방법에 비하여 더 나은 성능을 보임을 입증한다.

  • PDF

Ensemble Learning for Solving Data Imbalance in Bankruptcy Prediction (기업부실 예측 데이터의 불균형 문제 해결을 위한 앙상블 학습)

  • Kim, Myoung-Jong
    • Journal of Intelligence and Information Systems
    • /
    • v.15 no.3
    • /
    • pp.1-15
    • /
    • 2009
  • In a classification problem, data imbalance occurs when the number of instances in one class greatly outnumbers the number of instances in the other class. Such data sets often cause a default classifier to be built due to skewed boundary and thus the reduction in the classification accuracy of such a classifier. This paper proposes a Geometric Mean-based Boosting (GM-Boost) to resolve the problem of data imbalance. Since GM-Boost introduces the notion of geometric mean, it can perform learning process considering both majority and minority sides, and reinforce the learning on misclassified data. An empirical study with bankruptcy prediction on Korea companies shows that GM-Boost has the higher classification accuracy than previous methods including Under-sampling, Over-Sampling, and AdaBoost, used in imbalanced data and robust learning performance regardless of the degree of data imbalance.

  • PDF

Vehicle Detection Using Edge Analysis and AdaBoost Algorithm (에지 분석과 에이다부스트 알고리즘을 이용한 차량검출)

  • Song, Gwang-Yul;Lee, Ki-Yong;Lee, Joon-Woong
    • Transactions of the Korean Society of Automotive Engineers
    • /
    • v.17 no.1
    • /
    • pp.1-11
    • /
    • 2009
  • This paper proposes an algorithm capable of detecting vehicles in front or in rear using a monocular camera installed in a vehicle. The vehicle detection has been regarded as an important part of intelligent vehicle technologies. The proposed algorithm is mainly composed of two parts: 1)hypothesis generation of vehicles, and 2)hypothesis verification. The hypotheses of vehicles are generated by the analysis of vertical and horizontal edges and the detection of symmetry axis. The hypothesis verification, which determines vehicles among hypotheses, is done by the AdaBoost algorithm. The proposed algorithm is proven to be effective through experiments performed on various images captured on the roads.

The Optimization of Ensembles for Bankruptcy Prediction (기업부도 예측 앙상블 모형의 최적화)

  • Myoung Jong Kim;Woo Seob Yun
    • Information Systems Review
    • /
    • v.24 no.1
    • /
    • pp.39-57
    • /
    • 2022
  • This paper proposes the GMOPTBoost algorithm to improve the performance of the AdaBoost algorithm for bankruptcy prediction in which class imbalance problem is inherent. AdaBoost algorithm has the advantage of providing a robust learning opportunity for misclassified samples. However, there is a limitation in addressing class imbalance problem because the concept of arithmetic mean accuracy is embedded in AdaBoost algorithm. GMOPTBoost can optimize the geometric mean accuracy and effectively solve the category imbalance problem by applying Gaussian gradient descent. The samples are constructed according to the following two phases. First, five class imbalance datasets are constructed to verify the effect of the class imbalance problem on the performance of the prediction model and the performance improvement effect of GMOPTBoost. Second, class balanced data are constituted through data sampling techniques to verify the performance improvement effect of GMOPTBoost. The main results of 30 times of cross-validation analyzes are as follows. First, the class imbalance problem degrades the performance of ensembles. Second, GMOPTBoost contributes to performance improvements of AdaBoost ensembles trained on imbalanced datasets. Third, Data sampling techniques have a positive impact on performance improvement. Finally, GMOPTBoost contributes to significant performance improvement of AdaBoost ensembles trained on balanced datasets.

Design and Implementation of Electrocardiogram Data Interpretation system using AdaBoost Algorithm (AdaBoost 알고리즘을 이용한 심전도 정보 판독 시스템의 설계 및 구현)

  • Lim, Myung-Jae;Hong, Jin-Kyoung;Kim, Kyu-Ho;Choi, Mi-Lim
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.10 no.2
    • /
    • pp.129-134
    • /
    • 2010
  • Diseases such as cardiovascular illnesses, according to the National Statistical Office opened reveals that 600-800 people were killed, blood pressure, arteriosclerosis, heart disease, stroke, etc. will be a flow of blood disorders that occur in cardiovascular illnesses today are fulfilling the Master / Slave samangryulin disease appears high. Died of cardiovascular disease also told them the correct first aid survival when patients are accounted for approximately 40% of emergency rapid response is required. Therefore, this paper, the weak classifier in the AdaBoost algorithm to generate a strong classifier by combining effects throughout the analysis to measure the ECG, and cardiovascular disease that occurred to you as soon as the emergency management system that can deliver on the proposed Desk was. The electrocardiogram data measured by the ZigBee-based sensors, communication devices and emergency transport for emergency alarms in the determination and monitoring of the management desk by providing health services to enable the delivery was fast.