• 제목/요약/키워드: Multi-class Prediction

검색결과 40건 처리시간 0.026초

판별분석에 의한 기업부실예측력 평가: 서울지역 특1급 호텔 사례 분석 (Evaluation of Corporate Distress Prediction Power using the Discriminant Analysis: The Case of First-Class Hotels in Seoul)

  • 김시중
    • 한국산학기술학회논문지
    • /
    • 제17권10호
    • /
    • pp.520-526
    • /
    • 2016
  • 본 연구는 서울지역 특1급 호텔을 대상으로 2015년도 재무비율을 변수로 활용하여 표준재무비율을 산출하며, 다변량 판별분석에 의한 부실예측모형 개발 및 부실예측력 평가에 목적이 있다. 서울소재 19개 특1급 호텔의 14개 재무비율을 분석대상으로 선정하여 실증분석을 실시하였으며 분석결과는 다음과 같다. 첫째, 분석결과 우수기업과 부실기업을 판별하는 7개 재무비율은 유동비율, 차입금의존도, 영업이익대비 이자보상비율, 매출액영업이익율, 자기자본순이익율, 영업현금흐름비율, 총자산회전율로 나타났다. 둘째, 7개 재무비율을 활용하여 우수기업과 부실기업을 판별하는 판별함수를 다변량판별분석에 의해 추정하였으며, 추정된 판별함수를 실제 소속집단과 예측집단으로 분류가 가능한가의 예측력 검정 결과, 예측 판별력의 정확도는 87.9%로 분석되었다. 셋째, 추정된 판별함수의 예측 판별력의 정확도 검증결과 판별분석에 의한 부실예측모형의 예측력은 78.95%로 분석되었다. 이러한 분석결과, 호텔 경영진은 호텔기업의 부실기업집단을 판별하는 7개 재무비율을 중점적으로 관리해야 함을 시사하고 있다. 또한 호텔기업이 타 산업과는 뚜렷한 재무구조의 차이와 부실예측 지표가 상이하며, 이에 호텔기업 대상의 신용평가시스템 구축 시 호텔기업의 재무적 특성을 반영한 시스템 구축이 필요함을 시사하고 있다.

Online Selective-Sample Learning of Hidden Markov Models for Sequence Classification

  • Kim, Minyoung
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • 제15권3호
    • /
    • pp.145-152
    • /
    • 2015
  • We consider an online selective-sample learning problem for sequence classification, where the goal is to learn a predictive model using a stream of data samples whose class labels can be selectively queried by the algorithm. Given that there is a limit to the total number of queries permitted, the key issue is choosing the most informative and salient samples for their class labels to be queried. Recently, several aggressive selective-sample algorithms have been proposed under a linear model for static (non-sequential) binary classification. We extend the idea to hidden Markov models for multi-class sequence classification by introducing reasonable measures for the novelty and prediction confidence of the incoming sample with respect to the current model, on which the query decision is based. For several sequence classification datasets/tasks in online learning setups, we demonstrate the effectiveness of the proposed approach.

A Hybrid Multi-Level Feature Selection Framework for prediction of Chronic Disease

  • G.S. Raghavendra;Shanthi Mahesh;M.V.P. Chandrasekhara Rao
    • International Journal of Computer Science & Network Security
    • /
    • 제23권12호
    • /
    • pp.101-106
    • /
    • 2023
  • Chronic illnesses are among the most common serious problems affecting human health. Early diagnosis of chronic diseases can assist to avoid or mitigate their consequences, potentially decreasing mortality rates. Using machine learning algorithms to identify risk factors is an exciting strategy. The issue with existing feature selection approaches is that each method provides a distinct set of properties that affect model correctness, and present methods cannot perform well on huge multidimensional datasets. We would like to introduce a novel model that contains a feature selection approach that selects optimal characteristics from big multidimensional data sets to provide reliable predictions of chronic illnesses without sacrificing data uniqueness.[1] To ensure the success of our proposed model, we employed balanced classes by employing hybrid balanced class sampling methods on the original dataset, as well as methods for data pre-processing and data transformation, to provide credible data for the training model. We ran and assessed our model on datasets with binary and multivalued classifications. We have used multiple datasets (Parkinson, arrythmia, breast cancer, kidney, diabetes). Suitable features are selected by using the Hybrid feature model consists of Lassocv, decision tree, random forest, gradient boosting,Adaboost, stochastic gradient descent and done voting of attributes which are common output from these methods.Accuracy of original dataset before applying framework is recorded and evaluated against reduced data set of attributes accuracy. The results are shown separately to provide comparisons. Based on the result analysis, we can conclude that our proposed model produced the highest accuracy on multi valued class datasets than on binary class attributes.[1]

멀티콥터의 추진 시스템 실험 결과를 통한 제자리 비행 성능 예측 및 검증 (Prediction and Verification of Hover Performance through Multi-Copter Propulsion System Test Results)

  • 박승호;고영주;이재하;최종수
    • 한국항공우주학회지
    • /
    • 제46권7호
    • /
    • pp.527-534
    • /
    • 2018
  • 본 논문은 최근에 각광 받고 있는 드론(멀티콥터)과 관련하여 비행체의 비행 성능, 특히 비행 가능 시간에 대한 예측과 실험 데이터를 통한 검증 등을 목표로 하여, 제자리 비행에 대한 연구 결과를 제시 하고 있다. 연구 방법을 드론 시스템을 구성하고 있는 여러 부품들을 기능별로 분류하여 부품 수준에서의 제원과 기능에 대한 정리 및 수학적 수식화를 통하여 단위 부품의 성능을 분석 및 실험 데이터를 확보하고, 이들 단위 부품 데이터의 조합을 통하여 드론 시스템의 제자리비행 성능을 예측하는 연구 결과를 보여 주고 있다. 또한 5kg급 쿼드콥터를 이용하여 제자리 비행에 대한 분석을 통하여 비행시간에 따른 전압 변화를 예측, 검증 하였으며 해당 방법을 통해 제자리 비행시간을 예측하였다.

A Study on Comparison of Lung Cancer Prediction Using Ensemble Machine Learning

  • NAM, Yu-Jin;SHIN, Won-Ji
    • 한국인공지능학회지
    • /
    • 제7권2호
    • /
    • pp.19-24
    • /
    • 2019
  • Lung cancer is a chronic disease which ranks fourth in cancer incidence with 11 percent of the total cancer incidence in Korea. To deal with such issues, there is an active study on the usefulness and utilization of the Clinical Decision Support System (CDSS) which utilizes machine learning. Thus, this study reviews existing studies on artificial intelligence technology that can be used in determining the lung cancer, and conducted a study on the applicability of machine learning in determination of the lung cancer by comparison and analysis using Azure ML provided by Microsoft. The results of this study show different predictions yielded by three algorithms: Support Vector Machine (SVM), Two-Class Support Decision Jungle and Multiclass Decision Jungle. This study has its limitations in the size of the Big data used in Machine Learning. Although the data provided by Kaggle is the most suitable one for this study, it is assumed that there is a limit in learning the data fully due to the lack of absolute figures. Therefore, it is claimed that if the agency's cooperation in the subsequent research is used to compare and analyze various kinds of algorithms other than those used in this study, a more accurate screening machine for lung cancer could be created.

Application of machine learning in optimized distribution of dampers for structural vibration control

  • Li, Luyu;Zhao, Xuemeng
    • Earthquakes and Structures
    • /
    • 제16권6호
    • /
    • pp.679-690
    • /
    • 2019
  • This paper presents machine learning methods using Support Vector Machine (SVM) and Multilayer Perceptron (MLP) to analyze optimal damper distribution for structural vibration control. Regarding different building structures, a genetic algorithm based optimization method is used to determine optimal damper distributions that are further used as training samples. The structural features, the objective function, the number of dampers, etc. are used as input features, and the distribution of dampers is taken as an output result. In the case of a few number of damper distributions, multi-class prediction can be performed using SVM and MLP respectively. Moreover, MLP can be used for regression prediction in the case where the distribution scheme is uncountable. After suitable post-processing, good results can be obtained. Numerical results show that the proposed method can obtain the optimized damper distributions for different structures under different objective functions, which achieves better control effect than the traditional uniform distribution and greatly improves the optimization efficiency.

DEA를 적용한 문화관광축제의 효율성 등급 예측모형 (The Efficiency Rating Prediction for Cultural Tourism Festival Based of DEA)

  • 김은미;홍태호
    • 한국정보시스템학회지:정보시스템연구
    • /
    • 제29권3호
    • /
    • pp.145-157
    • /
    • 2020
  • Purpose This study proposed an approach for predicting the efficiency rating of the cultural tourism festivals using DEA and machine learning techniques. The cultural tourism festivals are selected for the best festivals through peer reviews by tourism experts. However, only 10% of the festivals which are held in a year could be evaluated in the view of effectiveness without considering the efficiency of festivals. Design/methodology/approach Efficiency scores were derived from the results of DEA for the prediction of efficiency ratings. This study utilized BCC models to reflect the size effect of festivals and classified the festivals into four ratings according the efficiency scores. Multi-classification method were considered to build the prediction of four ratings for the festivals in this study. We utilized neural networks and SVMs with OAO(one-against-one), OAR(one-against-rest), C&S(crammer & singer) with Korea festival data from 2013 to 2018. Findings The number of total visitors in low efficient rating of DEA is more larger than the number of total visitors in high efficient ratings although the total expenditure of visitors is the highest in the most efficient rating when we analyzed the results of DEA for the characteristics of four ratings. SVM with OAO model showed the most superior performance in accuracy as SVM with OAR model was not trained well because of the imbalanced distribution between efficient rating and the other ratings. Our approach could predict the efficiency of festivals which were not included in the review process of culture tourism festivals without rebuilding DEA models each time. This enables us to manage the festivals efficiently with the proposed machine learning models.

고밀도 암면 다중판의 차음성능 (Sound Insulation performance of multi-layered panels with high density mineral wool)

  • 강현주;김재승;김현실;김봉기;김상렬
    • 한국소음진동공학회:학술대회논문집
    • /
    • 한국소음진동공학회 2000년도 추계학술대회논문집
    • /
    • pp.421-424
    • /
    • 2000
  • This paper investigates the characteristics of analytical models in prediction of sound transmission loss for the multi-layered panels with high density mineral wools. The results show that the sandwich model is more adequate to account for sound insulation performance of those panels than the poro-elastic model. In order to improve STC(Sound Transmission Class), the effect of fiber directions of mineral wools is examined, analytically and experimentally. From the comparison of the measurements with the predictions, it is evident that the vertical fiber directions of mineral wools enhance STC value up to 6 dB, compared to that of the horizontal fiber directions.

  • PDF

네이버 영화 리뷰 데이터를 이용한 의미 분석(semantic analysis) (Semantic analysis via application of deep learning using Naver movie review data)

  • 김소진;송종우
    • 응용통계연구
    • /
    • 제35권1호
    • /
    • pp.19-33
    • /
    • 2022
  • SNS의 등장으로 인터넷 이용자들이 온라인에 남기는 텍스트의 양이 방대해지고 그 중요성이 강조되고있다. 특히 네이버의 영화 탭에서 볼 수 있는 영화 평점이나 리뷰는 실제로 관객들이 영화를 보기 전 해당 영화를 볼 것인지 결정하는 데 주요 요인이 되기도 한다. 본 연구는 실제 네이버 영화 리뷰 데이터를 가지고 평점을 예측하는 분석을 수행했다. 영화 리뷰 데이터를 분석하기 위해 평점의 분포를 통해 데이터 특성을 살펴보았고, 텍스트의 의미를 분석하기 위해 형태소 분석을 통한 한국어 자연어처리를 수행했다. 또한 평점 예측에 활용할 모델 선택을 위해 2-Class와 multi-Class 문제들에 대해 머신러닝과 딥러닝, 회귀와 분류 분석을 비교했으며, 오분류의 원인을 영화 리뷰 데이터 특성과 연관시켜 서술했다.

DATA MININING APPROACH TO PARAMETRIC COST ESTIMATE IN EARLY DESIGN STAGE AND ANALYTICAL CHARACTERIZATION ON OLAP (ON-LINE ANALYTICAL PROCESSING)

  • JaeHo Cho;HyunKyun Jung;JaeYoul Chun
    • 국제학술발표논문집
    • /
    • The 4th International Conference on Construction Engineering and Project Management Organized by the University of New South Wales
    • /
    • pp.176-181
    • /
    • 2011
  • A role of cost modeler is that of facilitating design process by the systematic application of cost factors so as to maintain sensible and economic relationships between cost, quantity, utility and appearance. These relationships help to achieve the client's requirements within an agreed budget. The purpose of this study is to develop a parametric cost estimating model for the early design stage by using the multi-dimensional system of OLAP (On-line Analytical Processing) based on the case of quantity data related to architectural design features. The parametric cost estimating models have been adopted to support decision making in the early design stage. These models typically use a similar instance or a pattern of historical case. In order to effectively use this type of data model, it is required to set data classification and prediction methods. One of the methods is to find the similar class in line with attribute selection measure in the multi-dimensional data model. Therefore, this research is to analyze the relevance attribute influenced by architectural design features with the subject of case-based quantity data used for the parametric cost estimating model. The relevance attributes can be analyzed by Analytical Characterization. It helps determine what attributes to be included in the OLAP multi-dimension.

  • PDF