• 제목/요약/키워드: Classification Algorithms

검색결과 1,182건 처리시간 0.028초

Discriminating Eggs from Two Local Breeds Based on Fatty Acid Profile and Flavor Characteristics Combined with Classification Algorithms

  • Dong, Xiao-Guang;Gao, Li-Bing;Zhang, Hai-Jun;Wang, Jing;Qiu, Kai;Qi, Guang-Hai;Wu, Shu-Geng
    • 한국축산식품학회지
    • /
    • 제41권6호
    • /
    • pp.936-949
    • /
    • 2021
  • This study discriminated fatty acid profile and flavor characteristics of Beijing You Chicken (BYC) as a precious local breed and Dwarf Beijing You Chicken (DBYC) eggs. Fatty acid profile and flavor characteristics were analyzed to identify differences between BYC and DBYC eggs. Four classification algorithms were used to build classification models. Arachidic acid, oleic acid (OA), eicosatrienoic acid, docosapentaenoic acid (DPA), hexadecenoic acid, monounsaturated fatty acids (MUFA), polyunsaturated fatty acids (PUFA), unsaturated fatty acids (UFA) and 35 volatile compounds had significant differences in fatty acids and volatile compounds by gas chromatography-mass spectrometry (GC-MS) (p<0.05). For fatty acid data, k-nearest neighbor (KNN) and support vector machine (SVM) got 91.7% classification accuracy. SPME-GC-MS data failed in classification models. For electronic nose data, classification accuracy of KNN, linear discriminant analysis (LDA), SVM and decision tree was all 100%. The overall results indicated that BYC and DBYC eggs could be discriminated based on electronic nose with suitable classification algorithms. This research compared the differentiation of the fatty acid profile and volatile compounds of various egg yolks. The results could be applied to evaluate egg nutrition and distinguish avian eggs.

SMOTE와 분류 기법을 활용한 산사태 위험 지역 결정 방법 (Method for Assessing Landslide Susceptibility Using SMOTE and Classification Algorithms)

  • 윤형구
    • 한국지반공학회논문집
    • /
    • 제39권6호
    • /
    • pp.5-12
    • /
    • 2023
  • 산사태 위험 지역을 사전에 조사하여 설정하는 것은 다수의 피해를 줄이기 위해 필요하다. 해당 연구의 목적은 machine learning 기법 중 분류 알고리즘을 활용하여 대상 지반의 안전율 분류를 수행할 수 있는 방법론을 제시하는 것이다. 산사태 위험 지역은 high risk area(HRA) 모델을 적용하였으며, 8개의 지반공학 물성치를 통해 위험 지역을 판단하였다. 분류 알고리즘은 decision tree(DT), K-Nearest Neighbor(KNN), logistic regression(LR) 그리고 random forest(RF)의 4가지가 활용 되었으며, 안전율 1.2~2.0 범위에 8가지 지반공학 물성치의 분류 정확도를 계산하였다. 정확도는 안전율이 1.2~1.7 범위에서 신뢰성 높게 나타났지만, 그 외 범위인 1.8~2.0 사이에서는 상대적으로 낮은 정확도를 보였다. 이를 극복하기 위하여 synthetic minority over-sampling technique(SMOTE) 알고리즘을 적용하여 데이터 개수를 증폭하였으며, 증폭한 데이터를 통해 분류 알고리즘을 적용하면 안전율 1.8~2.0 범위에서 정확도가 평균적으로 약 250% 증가한 것으로 나타났다. 해당 연구 결과는 SMOTE 알고리즘이 데이터 개수를 향상시켜 분류 알고리즘의 정확도가 개선된 것을 보여주며, 타 분야에도 정확도 향상에 적용 가능하다고 판단된다.

생태계 모방 알고리즘 기반 특징 선택 방법의 성능 개선 방안 (Performance Improvement of Feature Selection Methods based on Bio-Inspired Algorithms)

  • 윤철민;양지훈
    • 정보처리학회논문지B
    • /
    • 제15B권4호
    • /
    • pp.331-340
    • /
    • 2008
  • 특징 선택은 기계 학습에서 분류의 성능을 높이기 위해 사용되는 방법이다. 여러 방법들이 개발되고 사용되어 오고 있으나, 전체 데이터에서 최적화된 특징 부분집합을 구성하는 문제는 여전히 어려운 문제로 남아있다. 생태계 모방 알고리즘은 생물체들의 행동 원리 등을 기반으로하여 만들어진 진화적 알고리즘으로, 최적화된 해를 찾는 문제에서 매우 유용하게 사용되는 방법이다. 특징 선택 문제에서도 생태계 모방 알고리즘을 이용한 해결방법들이 제시되어 오고 있으며, 이에 본 논문에서는 생태계 모방 알고리즘을 이용한 특징 선택 방법을 개선하는 방안을 제시한다. 이를 위해 잘 알려진 생태계 모방 알고리즘인 유전자 알고리즘(GA)과 파티클 집단 최적화 알고리즘(PSO)을 이용하여 데이터에서 가장분류 성능이 우수한 특징 부분집합을 만들어 내도록 하고, 최종적으로 개별 특징의 사전 중요도를 설정하여 생태계 모방 알고리즘을 개선하는 방법을 제안하였다. 이를 위해 개별 특징의 우수도를 구할 수 있는 mRMR이라는 방법을 이용하였다. 이렇게 설정한 사전 중요도를 이용하여 GA와 PSO의 진화 연산을 수정하였다. 데이터를 이용한 실험을 통하여 제안한 방법들의 성능을 검증하였다. GA와 PSO를 이용한 특징 선택 방법은 그 분류 정확도에 있어서 뛰어난 성능을 보여주었다. 그리고 최종적으로 제시한 사전 중요도를 이용해 개선된 방법은 그 진화 속도와 분류 정확도 면에서 기존의 GA와 PSO 방법보다 더 나아진 성능을 보여주는 것을 확인하였다.

Monitoring of Graveyards in Mountainous Areas with Simulated KOMPSAT-2 imagery

  • Chang, Eun-Mi;Kim, Min-Ho;Lee, Byung-Whan;Heo, Min
    • 대한원격탐사학회:학술대회논문집
    • /
    • 대한원격탐사학회 2003년도 Proceedings of ACRS 2003 ISRS
    • /
    • pp.1409-1411
    • /
    • 2003
  • The application of simulated KOMPSAT-2 imagery to monitor graveyards is to be developed. Positions calculated from image were compared with those obtained from Geographic Positioning System. With 24 checkpoints, the position of graveyards showed within 5-meter range. Unsupervised classification, supervised classification, and objected-orientation classification algorithms were used to extract the graveyard. Unsupervised classification with masking processes based on National topographic data gives the best result. The graveyards were categorized with four types in field studies while the two types of graveyards were shown in descriptive statistics. Cluster Analysis and discriminant analysis showed the consistency with two types of tombs. It was hard to get a specific spectral signature of graveyards, as they are covered with grasses at different levels and shaded from the surrounding trees. The slopes and aspects of location of graveyards did not make any difference in the spectral signatures. This study gives the basic spectral characteristics for further development of objected-oriented classification algorithms and plausibility of KOMPSAT-2 images for management of mountainous areas in the aspect of position accuracy and classification accuracy.

  • PDF

Comparison of Three Land Cover Classification Algorithms -ISODATA, SMA, and SOM - for the Monitoring of North Korea with MODIS Multi-temporal Data

  • Kim, Do-Hyung;Jeong, Seung-Gyu;Park, Chong-Hwa
    • 대한원격탐사학회지
    • /
    • 제23권3호
    • /
    • pp.181-188
    • /
    • 2007
  • The objective of this research was to investigate the optimal land cover classification algorithm for the monitoring of North Korea with MODIS multi-temporal data based on monthly phenological characteristics. Three frequently used land cover classification algorithms, ISODATA1), SMA2), and SOM3) were employed for this study; the land cover categories were forest, grass, agricultural, wetland, barren, built-up, and water body. The outcomes of the study can be summarized as follows. First, the overall classification accuracy of ISODATA, SMA, and SOM was 69.03%, 64.28%, and 73.57%, respectively. Second, ISODATA and SMA resulted in a higher classification accuracy of forest and agricultural categories, but SOM performed better for the built-up area, bare soil, grassland, and water. A possible explanation for this difference would be related to the difference of sensitivity against the vegetation activity. This would be related to the capability of SOM to express all of their values without any loss of data by maintaining the topology between pixels of primitive data after classification, while ISODATA and SMA retain limited amount of data after normalization process. Third, we can conclude that SOM is the best algorithm for monitoring the land cover change of North Korea.

신경회로망과 유전 알고리즘을 이용한 유전자 추출법과 이의 암 분류법에의 적용 (Gene selection method using neural networks and genetic algorithm and its applications to classification of cancers)

  • 조현성;김태선;전성모;위재우;이종호
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 2002년도 하계학술대회 논문집 D
    • /
    • pp.2815-2817
    • /
    • 2002
  • Classification method of cancers using cDNA microarrays data was developed using genetic algorithms and neural networks. For gene selection, 2308 genes were ranked using genetic algorithms, and selected by frequency number of selection from 1000 of genetic iterative runs. To calculate fitness values, artificial neural networks are used as classifier. The small, round blue cell tumors (SRBCTs) which is difficult to distinguish via pathological single test was used as test diseases for classification, and the test results showed the 96% of exact classification capability for 25 test samples.

  • PDF

Case based Reasoning System with Two Dimensional Reduction Technique for Customer Classification Model

  • Kim, Kyoung-Jae;Ahn, Hyun-Chul
    • 한국정보통신학회:학술대회논문집
    • /
    • 한국해양정보통신학회 2005년도 추계종합학술대회
    • /
    • pp.383-386
    • /
    • 2005
  • This study proposes a case based reasoning system with two dimensional reduction techniques. In this study, vertical and horizontal dimensions of the research data are reduced through hybrid feature and instance selection process using genetic algorithms. We applied the proposed model to customer classification model which utilizes customers' demographic characteristics as inputs to predict their buying behavior for the specific product. Experimental results show that the proposed technique may improve the classification accuracy and outperform various optimized models of typical CBR system.

  • PDF

Hybridized Decision Tree methods for Detecting Generic Attack on Ciphertext

  • Alsariera, Yazan Ahmad
    • International Journal of Computer Science & Network Security
    • /
    • 제21권7호
    • /
    • pp.56-62
    • /
    • 2021
  • The surge in generic attacks execution against cipher text on the computer network has led to the continuous advancement of the mechanisms to protect information integrity and confidentiality. The implementation of explicit decision tree machine learning algorithm is reported to accurately classifier generic attacks better than some multi-classification algorithms as the multi-classification method suffers from detection oversight. However, there is a need to improve the accuracy and reduce the false alarm rate. Therefore, this study aims to improve generic attack classification by implementing two hybridized decision tree algorithms namely Naïve Bayes Decision tree (NBTree) and Logistic Model tree (LMT). The proposed hybridized methods were developed using the 10-fold cross-validation technique to avoid overfitting. The generic attack detector produced a 99.8% accuracy, an FPR score of 0.002 and an MCC score of 0.995. The performances of the proposed methods were better than the existing decision tree method. Similarly, the proposed method outperformed multi-classification methods for detecting generic attacks. Hence, it is recommended to implement hybridized decision tree method for detecting generic attacks on a computer network.

Transfer Learning Using Convolutional Neural Network Architectures for Glioma Classification from MRI Images

  • Kulkarni, Sunita M.;Sundari, G.
    • International Journal of Computer Science & Network Security
    • /
    • 제21권2호
    • /
    • pp.198-204
    • /
    • 2021
  • Glioma is one of the common types of brain tumors starting in the brain's glial cell. These tumors are classified into low-grade or high-grade tumors. Physicians analyze the stages of brain tumors and suggest treatment to the patient. The status of the tumor has an importance in the treatment. Nowadays, computerized systems are used to analyze and classify brain tumors. The accurate grading of the tumor makes sense in the treatment of brain tumors. This paper aims to develop a classification of low-grade glioma and high-grade glioma using a deep learning algorithm. This system utilizes four transfer learning algorithms, i.e., AlexNet, GoogLeNet, ResNet18, and ResNet50, for classification purposes. Among these algorithms, ResNet18 shows the highest classification accuracy of 97.19%.

An Application of Support Vector Machines to Customer Loyalty Classification of Korean Retailing Company Using R Language

  • 응위엔푸티엔;이영찬
    • 한국정보시스템학회지:정보시스템연구
    • /
    • 제26권4호
    • /
    • pp.17-37
    • /
    • 2017
  • Purpose Customer Loyalty is the most important factor of customer relationship management (CRM). Especially in retailing industry, where customers have many options of where to spend their money. Classifying loyal customers through customers' data can help retailing companies build more efficient marketing strategies and gain competitive advantages. This study aims to construct classification models of distinguishing the loyal customers within a Korean retailing company using data mining techniques with R language. Design/methodology/approach In order to classify retailing customers, we used combination of support vector machines (SVMs) and other classification algorithms of machine learning (ML) with the support of recursive feature elimination (RFE). In particular, we first clean the dataset to remove outlier and impute the missing value. Then we used a RFE framework for electing most significant predictors. Finally, we construct models with classification algorithms, tune the best parameters and compare the performances among them. Findings The results reveal that ML classification techniques can work well with CRM data in Korean retailing industry. Moreover, customer loyalty is impacted by not only unique factor such as net promoter score but also other purchase habits such as expensive goods preferring or multi-branch visiting and so on. We also prove that with retailing customer's dataset the model constructed by SVMs algorithm has given better performance than others. We expect that the models in this study can be used by other retailing companies to classify their customers, then they can focus on giving services to these potential vip group. We also hope that the results of this ML algorithm using R language could be useful to other researchers for selecting appropriate ML algorithms.