• 제목/요약/키워드: Classification model

검색결과 4,060건 처리시간 0.035초

Document Classification Model Using Web Documents for Balancing Training Corpus Size per Category

  • Park, So-Young;Chang, Juno;Kihl, Taesuk
    • Journal of information and communication convergence engineering
    • /
    • 제11권4호
    • /
    • pp.268-273
    • /
    • 2013
  • In this paper, we propose a document classification model using Web documents as a part of the training corpus in order to resolve the imbalance of the training corpus size per category. For the purpose of retrieving the Web documents closely related to each category, the proposed document classification model calculates the matching score between word features and each category, and generates a Web search query by combining the higher-ranked word features and the category title. Then, the proposed document classification model sends each combined query to the open application programming interface of the Web search engine, and receives the snippet results retrieved from the Web search engine. Finally, the proposed document classification model adds these snippet results as Web documents to the training corpus. Experimental results show that the method that considers the balance of the training corpus size per category exhibits better performance in some categories with small training sets.

InceptionV3 기반의 심장비대증 분류 정확도 향상 연구 (A Study on the Improvement of Accuracy of Cardiomegaly Classification Based on InceptionV3)

  • 정우연;김정훈
    • 대한의용생체공학회:의공학회지
    • /
    • 제43권1호
    • /
    • pp.45-51
    • /
    • 2022
  • The purpose of this study is to improve the classification accuracy compared to the existing InceptionV3 model by proposing a new model modified with the fully connected hierarchical structure of InceptionV3, which showed excellent performance in medical image classification. The data used for model training were trained after data augmentation on a total of 1026 chest X-ray images of patients diagnosed with normal heart and Cardiomegaly at Kyungpook National University Hospital. As a result of the experiment, the learning classification accuracy and loss of the InceptionV3 model were 99.57% and 1.42, and the accuracy and loss of the proposed model were 99.81% and 0.92. As a result of the classification performance evaluation for precision, recall, and F1 score of Inception V3, the precision of the normal heart was 78%, the recall rate was 100%, and the F1 score was 88. The classification accuracy for Cardiomegaly was 100%, the recall rate was 78%, and the F1 score was 88. On the other hand, in the case of the proposed model, the accuracy for a normal heart was 100%, the recall rate was 92%, and the F1 score was 96. The classification accuracy for Cardiomegaly was 95%, the recall rate was 100%, and the F1 score was 97. If the chest X-ray image for normal heart and Cardiomegaly can be classified using the model proposed based on the study results, better classification will be possible and the reliability of classification performance will gradually increase.

성장곡선모형의 판별분석에서 균형이차분류법의 적용 (An Application of the Balanced Quadratic Classification Rule on the Discriminant Analysis in Growth Curve Model)

  • 심규박
    • 품질경영학회지
    • /
    • 제23권2호
    • /
    • pp.53-67
    • /
    • 1995
  • The problem considered here is to find the optimal discriminant analysis method in growth curve model. It has been studied how to find correct prior probability for the effective classification in discriminant analysis. We use the balanced condition to calculate prior probability. From the informative simulation study, new classification rule for the growth curve model is suggested. The suggested classification rule has better classification result than the other previously suggested method in terms of error rate criterion.

  • PDF

ANFIS 기반 분류모형의 설계 및 성능평가 (Design and Evaluation of ANFIS-based Classification Model)

  • 송희석;김재경
    • 지능정보연구
    • /
    • 제15권3호
    • /
    • pp.151-165
    • /
    • 2009
  • 퍼지신경망 모형은 인공신경망의 네트워크 구조 표현방법 및 학습알고리듬과 퍼지시스템의 추론방법을 통합한 모형으로 제어 및 예측분야에 성공적으로 적용되고 있다. 본 연구에서는 퍼지신경망 모형 중 우수한 예측정확도로 인해 최근 각광받고 있는ANFIS (Adaptive Network-based Fuzzy Inference System)모형을 기반으로 하는 분류모형을 설계하고 기존의 분류기법(C5.0 의사결정나무)과 비교하여 분류 정확성 관점에서 평가한다. ANFIS 추론의 경우, 최종 결과값이 계급값이 아닌 연속형 변수값을 취하게 되므로 산출된 결과값을 이용하여 적절한 계급값을 할당하는 과정이 필요하다. 본 연구에서는 의사결정나무기법을 이용하여 계급값을 할당하는 방식과 군집분석을 이용하여 계급값을 할당하는 두 가지 방식을 제안하고 두 가지 데이터 세트에 적용하여 ANFIS를 기반으로 한 분류모형의 정확도를 평가하였다.

  • PDF

다집단 분류 인공신경망 모형의 아키텍쳐 튜닝 (Tuning the Architecture of Neural Networks for Multi-Class Classification)

  • 정철우;민재형
    • 한국경영과학회지
    • /
    • 제38권1호
    • /
    • pp.139-152
    • /
    • 2013
  • The purpose of this study is to claim the validity of tuning the architecture of neural network models for multi-class classification. A neural network model for multi-class classification is basically constructed by building a series of neural network models for binary classification. Building a neural network model, we are required to set the values of parameters such as number of hidden nodes and weight decay parameter in advance, which draws special attention as the performance of the model can be quite different by the values of the parameters. For better performance of the model, it is absolutely necessary to have a prior process of tuning the parameters every time the neural network model is built. Nonetheless, previous studies have not mentioned the necessity of the tuning process or proved its validity. In this study, we claim that we should tune the parameters every time we build the neural network model for multi-class classification. Through empirical analysis using wine data, we show that the performance of the model with the tuned parameters is superior to those of untuned models.

One-dimensional CNN Model of Network Traffic Classification based on Transfer Learning

  • Lingyun Yang;Yuning Dong;Zaijian Wang;Feifei Gao
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제18권2호
    • /
    • pp.420-437
    • /
    • 2024
  • There are some problems in network traffic classification (NTC), such as complicated statistical features and insufficient training samples, which may cause poor classification effect. A NTC architecture based on one-dimensional Convolutional Neural Network (CNN) and transfer learning is proposed to tackle these problems and improve the fine-grained classification performance. The key points of the proposed architecture include: (1) Model classification--by extracting normalized rate feature set from original data, plus existing statistical features to optimize the CNN NTC model. (2) To apply transfer learning in the classification to improve NTC performance. We collect two typical network flows data from Youku and YouTube, and verify the proposed method through extensive experiments. The results show that compared with existing methods, our method could improve the classification accuracy by around 3-5%for Youku, and by about 7 to 27% for YouTube.

기술금융을 위한 부실 가능성 예측 최적 판별모형에 대한 연구 (A Study on the Optimal Discriminant Model Predicting the likelihood of Insolvency for Technology Financing)

  • 성웅현
    • 기술혁신학회지
    • /
    • 제10권2호
    • /
    • pp.183-205
    • /
    • 2007
  • 본 연구는 기술력평가에 근거해서 중소기업 부실예측 가능성을 사전에 예측할 수 있는 최적 판별 모형을 개발 제안하였다. 판별모형에 포함될 설명변수는 요인분석과 판별모형의 단계별 선택방법에 의하여 선정되었다. 분석결과 선형판별모형이 로지스틱판별모형보다 임계확률 관점에서 적절한 것으로 나타났다. 최적 선형판별모형의 분류 정분류율은 70.4%, 분류 예측력은 67.5%로 나타났다. 최적 선형판별모형의 활용도를 높이기 위해서 확실 범주와 유보범주를 구분할 수 있는 경계값을 설정하였다. 분석결과를 활용하면 기술금융 취급기관은 부실위험 평가와 더불어 기술금융 신청기업의 순위를 부여할 때 유용하게 사용할 수 있을 것으로 기대된다.

  • PDF

연관 분류 마이닝 기법을 활용한 지식기반 신체활동 평가 모델 (A Knowledge Based Physical Activity Evaluation Model Using Associative Classification Mining Approach)

  • 손창식;최락현;강원석
    • 대한임베디드공학회논문지
    • /
    • 제13권4호
    • /
    • pp.215-223
    • /
    • 2018
  • Recently, as interest of wearable devices has increased, commercially available smart wristbands and applications have been used as a tool for personal healthy management. However most previous studies have focused on evaluating the accuracy and reliability of the technical problems of wearable devices, especially step counts, walking distance, and energy consumption measured from the smart wristbands. In this study, we propose a physical activity evaluation model using classification rules, induced from the associative classification mining approach. These rules associated with five physical activities were generated by considering activities and walking times in target heart rate zones such as 'Out-of Zone', 'Fat Burn Zone', 'Cardio Zone', and 'Peak Zone'. In the experiment, we evaluated the prediction power of classification rules and verified its effectiveness by comparing classification accuracies between the proposed model and support vector machine.

A Novel Image Classification Method for Content-based Image Retrieval via a Hybrid Genetic Algorithm and Support Vector Machine Approach

  • Seo, Kwang-Kyu
    • 반도체디스플레이기술학회지
    • /
    • 제10권3호
    • /
    • pp.75-81
    • /
    • 2011
  • This paper presents a novel method for image classification based on a hybrid genetic algorithm (GA) and support vector machine (SVM) approach which can significantly improve the classification performance for content-based image retrieval (CBIR). Though SVM has been widely applied to CBIR, it has some problems such as the kernel parameters setting and feature subset selection of SVM which impact the classification accuracy in the learning process. This study aims at simultaneously optimizing the parameters of SVM and feature subset without degrading the classification accuracy of SVM using GA for CBIR. Using the hybrid GA and SVM model, we can classify more images in the database effectively. Experiments were carried out on a large-size database of images and experiment results show that the classification accuracy of conventional SVM may be improved significantly by using the proposed model. We also found that the proposed model outperformed all the other models such as neural network and typical SVM models.

비디오 분류에 기반 해석가능한 딥러닝 알고리즘 (An Explainable Deep Learning Algorithm based on Video Classification)

  • 김택위;조인휘
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2023년도 추계학술발표대회
    • /
    • pp.449-452
    • /
    • 2023
  • The rapid development of the Internet has led to a significant increase in multimedia content in social networks. How to better analyze and improve video classification models has become an important task. Deep learning models have typical "black box" characteristics. The model requires explainable analysis. This article uses two classification models: ConvLSTM and VGG16+LSTM models. And combined with the explainable method of LRP, generate visualized explainable results. Finally, based on the experimental results, the accuracy of the classification model is: ConvLSTM: 75.94%, VGG16+LSTM: 92.50%. We conducted explainable analysis on the VGG16+LSTM model combined with the LRP method. We found VGG16+LSTM classification model tends to use the frames biased towards the latter half of the video and the last frame as the basis for classification.