• Title/Summary/Keyword: Random forest

Search Result 951, Processing Time 0.035 seconds

Medical Image Classification and Retrieval Using BoF Feature Histogram with Random Forest Classifier (Random Forest 분류기와 Bag-of-Feature 특징 히스토그램을 이용한 의료영상 자동 분류 및 검색)

  • Son, Jung Eun;Ko, Byoung Chul;Nam, Jae Yeal
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.2 no.4
    • /
    • pp.273-280
    • /
    • 2013
  • This paper presents novel OCS-LBP (Oriented Center Symmetric Local Binary Patterns) based on orientation of pixel gradient and image retrieval system based on BoF (Bag-of-Feature) and random forest classifier. Feature vectors extracted from training data are clustered into code book and each feature is transformed new BoF feature using code book. BoF features are applied to random forest for training and random forest having N classes is constructed by combining several decision trees. For testing, the same OCS-LBP feature is extracted from a query image and BoF is applied to trained random forest classifier. In contrast to conventional retrieval system, query image selects similar K-nearest neighbor (K-NN) classes after random forest is performed. Then, Top K similar images are retrieved from database images that are only labeled K-NN classes. Compared with other retrieval algorithms, the proposed method shows both fast processing time and improved retrieval performance.

Korean Text Classification Using Randomforest and XGBoost Focusing on Seoul Metropolitan Civil Complaint Data (RandomForest와 XGBoost를 활용한 한국어 텍스트 분류: 서울특별시 응답소 민원 데이터를 중심으로)

  • Ha, Ji-Eun;Shin, Hyun-Chul;Lee, Zoon-Ky
    • The Journal of Bigdata
    • /
    • v.2 no.2
    • /
    • pp.95-104
    • /
    • 2017
  • In 2014, Seoul Metropolitan Government launched a response service aimed at responding promptly to civil complaints. The complaints received are categorized based on their content and sent to the department in charge. If this part can be automated, the time and labor costs will be reduced. In this study, we collected 17,700 cases of complaints for 7 years from June 1, 2010 to May 31, 2017. We compared the XGBoost with RandomForest and confirmed the suitability of Korean text classification. As a result, the accuracy of XGBoost compared to RandomForest is generally high. The accuracy of RandomForest was unstable after upsampling and downsampling using the same sample, while XGBoost showed stable overall accuracy.

  • PDF

Forest smoke detection using Random Forest (Random Forest를 이용한 산불연기 감지)

  • Kwak, Joon-Young;Kim, Deok-Yeon;Ko, Byoung-Chul;Nam, Jae-Yeal
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2011.06c
    • /
    • pp.351-353
    • /
    • 2011
  • 본 논문에서는 CCD카메라로부터 입력된 동영상에서 Random Forest를 이용하여 산불 연기를 검출하는 알고리즘을 제안한다. 산불 연기의 느린 움직임을 보완하기 위해 모든 프레임 대신에 변화가 큰 프레임들을 키 프레임으로 지정하고 지정된 키 프레임의 이전 100프레임 동안의 가 특징 값을 누적시켜 특징 백터를 추출한다. 이후, 학습 데이터들로부터 추출된 특징백터의 훈련과정을 통해 50개의 결정 트리를 갖는 Random Forest를 생성한다. Random Forest는 산불 연기의 정도에 따라 4개의 상태를 나타내는 클래스들로 분류하도록 학습되었으며 Random Forest에 의한 분류결과에 따라 해당 영역이 연기인지 아닌지를 최종 판단한다.

Rice yield prediction in South Korea by using random forest (Random Forest를 이용한 남한지역 쌀 수량 예측 연구)

  • Kim, Junhwan;Lee, Juseok;Sang, Wangyu;Shin, Pyeong;Cho, Hyeounsuk;Seo, Myungchul
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.21 no.2
    • /
    • pp.75-84
    • /
    • 2019
  • In this study, the random forest approach was used to predict the national mean rice yield of South Korea by using mean climatic factors at a national scale. A random forest model that used monthly climate variable and year as an important predictor in predicting crop yield. Annual yield change would be affected by technical improvement for crop management as well as climate. Year as prediction factor represent technical improvement. Thus, it is likely that the variables of importance identified for the random forest model could result in a large error in prediction of rice yield in practice. It was also found that elimination of the trend of yield data resulted in reasonable accuracy in prediction of yield using the random forest model. For example, yield prediction using the training set (data obtained from 1991 to 2005) had a relatively high degree of agreement statistics. Although the degree of agreement statistics for yield prediction for the test set (2006-2015) was not as good as those for the training set, the value of relative root mean square error (RRMSE) was less than 5%. In the variable importance plot, significant difference was noted in the importance of climate factors between the training and test sets. This difference could be attributed to the shifting of the transplanting date, which might have affected the growing season. This suggested that acceptable yield prediction could be achieved using random forest, when the data set included consistent planting or transplanting dates in the predicted area.

Object Classification Method Using Dynamic Random Forests and Genetic Optimization

  • Kim, Jae Hyup;Kim, Hun Ki;Jang, Kyung Hyun;Lee, Jong Min;Moon, Young Shik
    • Journal of the Korea Society of Computer and Information
    • /
    • v.21 no.5
    • /
    • pp.79-89
    • /
    • 2016
  • In this paper, we proposed the object classification method using genetic and dynamic random forest consisting of optimal combination of unit tree. The random forest can ensure good generalization performance in combination of large amount of trees by assigning the randomization to the training samples and feature selection, etc. allocated to the decision tree as an ensemble classification model which combines with the unit decision tree based on the bagging. However, the random forest is composed of unit trees randomly, so it can show the excellent classification performance only when the sufficient amounts of trees are combined. There is no quantitative measurement method for the number of trees, and there is no choice but to repeat random tree structure continuously. The proposed algorithm is composed of random forest with a combination of optimal tree while maintaining the generalization performance of random forest. To achieve this, the problem of improving the classification performance was assigned to the optimization problem which found the optimal tree combination. For this end, the genetic algorithm methodology was applied. As a result of experiment, we had found out that the proposed algorithm could improve about 3~5% of classification performance in specific cases like common database and self infrared database compare with the existing random forest. In addition, we had shown that the optimal tree combination was decided at 55~60% level from the maximum trees.

Fast Random-Forest-Based Human Pose Estimation Using a Multi-scale and Cascade Approach

  • Chang, Ju Yong;Nam, Seung Woo
    • ETRI Journal
    • /
    • v.35 no.6
    • /
    • pp.949-959
    • /
    • 2013
  • Since the recent launch of Microsoft Xbox Kinect, research on 3D human pose estimation has attracted a lot of attention in the computer vision community. Kinect shows impressive estimation accuracy and real-time performance on massive graphics processing unit hardware. In this paper, we focus on further reducing the computation complexity of the existing state-of-the-art method to make the real-time 3D human pose estimation functionality applicable to devices with lower computing power. As a result, we propose two simple approaches to speed up the random-forest-based human pose estimation method. In the original algorithm, the random forest classifier is applied to all pixels of the segmented human depth image. We first use a multi-scale approach to reduce the number of such calculations. Second, the complexity of the random forest classification itself is decreased by the proposed cascade approach. Experiment results for real data show that our method is effective and works in real time (30 fps) without any parallelization efforts.

Human action recognition using HOOF and Random Forest (HOOF와 Random Forest를 이용한 휴먼 행동 인식)

  • Hong, June-Hyoek;Ko, Byoung-Chul;Nam, Jae-Yeal
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2012.06b
    • /
    • pp.450-452
    • /
    • 2012
  • 본 논문에서는 CCD 카메라에 입력된 동영상에서 Random Forest를 이용하여 휴먼 행동을 인식하는 알고리즘을 제안한다. 행동 인식을 위한 특징 벡터 추출을 위해 가장 최근의 N개의 비디오 프레임들을 하나의 액션 볼륨으로 생성하고, 액션 볼륨 내에서 객체 트랙킹 된 영역을 서브 볼륨으로 생성한다. 이후 서브불륨을 $N{\times}N$개의 블록으로 나누고 각 블록에서 HOOF (Histogram of oriented optical flow)를 특징 벡터로 추출한다. 각 휴먼의 행동인식을 위해 사용된 Random Forest 분류기는 걷기, 뛰기, 발차기, 주먹질, 앉기, 쓰러지기, 넘어지기 7개의 행동을 나타내는 클래스로 분류하도록 학습되었으며 Random Forest에 의한 분류결과에 따라 어떤 행동을 취하는지 최종 판단한다.

Road Extraction Based on Random Forest and Color Correlogram (랜덤 포레스트와 칼라 코렐로그램을 이용한 도로추출)

  • Choi, Ji-Hye;Song, Gwang-Yul;Lee, Joon-Woong
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.17 no.4
    • /
    • pp.346-352
    • /
    • 2011
  • This paper presents a system of road extraction for traffic images from a single camera. The road in the images is subject to large changes in appearance because of environmental effects. The proposed system is based on the integration of color correlograms and random forest. The color correlogram depicts the color properties of an image properly. Using the random forest, road extraction is formulated as a learning paradigm. The combined effects of color correlograms and random forest create a robust system capable of extracting the road in very changeable situations.

Classification of cardiotocograms using random forest classifier and selection of important features from cardiotocogram signal

  • Arif, Muhammad
    • Biomaterials and Biomechanics in Bioengineering
    • /
    • v.2 no.3
    • /
    • pp.173-183
    • /
    • 2015
  • In obstetrics, cardiotocography is a procedure to record the fetal heartbeat and the uterine contractions usually during the last trimester of pregnancy. It helps to monitor patterns associated with the fetal activity and to detect the pathologies. In this paper, random forest classifier is used to classify normal, suspicious and pathological patterns based on the features extracted from the cardiotocograms. The results showed that random forest classifier can detect these classes successfully with overall classification accuracy of 93.6%. Moreover, important features are identified to reduce the feature space. It is found that using seven important features, similar classification accuracy can be achieved by random forest classifier (93.3%).

A rubber o-ring defect detection system using data augmentation based on the SinGAN and random forest algorithm (SinGAN기반 데이터 증강과 random forest알고리즘을 이용한 고무 오링 결함 검출 시스템)

  • Lee, Yong Eun;Lee, Han Sung;Kim, Dae Won;Kim, Kyung Chun
    • Journal of the Korean Society of Visualization
    • /
    • v.19 no.3
    • /
    • pp.63-68
    • /
    • 2021
  • In this study, data was augmentation through the SinGAN algorithm using small image data, and defects in rubber O-rings were detected using the random forest algorithm. Unlike the commonly used data augmentation image rotation method to solve the data imbalance problem, the data imbalance problem was solved by using the SinGAN algorithm. A study was conducted to distinguish between normal products and defective products of rubber o-ring by using the random forest algorithm. A total of 20,000 image date were divided into transit and testing datasets, and an accuracy result was obtained to distinguish 97.43% defects as a result of the test.