
The Prediction of DEA based Efficiency Rating for Venture Business Using Multi-class SVM  

Park, Ji-Young (College of Business Administration, Pusan National University)
Hong, Tae-Ho (College of Business Administration, Pusan National University)
Publication Information
Asia Pacific Journal of Information Systems / v.19, no.2, 2009, pp. 139-155
Abstract
For the last few decades, many studies have tried to identify venture companies' success factors and distinctive features in order to explain the sources of their competitive advantages over rivals. Venture companies have tended to deliver high returns to investors, generally by making the best use of information technology, and for this reason many of them are keen to attract investors' attention. Investors generally make their investment decisions by carefully examining the alternatives against evaluation criteria. For them, the credit rating information provided by international rating agencies such as Standard and Poor's, Moody's, and Fitch is a crucial source on pivotal concerns such as a company's stability, growth, and risk status. However, this type of information is generated only for companies that issue corporate bonds, not for venture companies. Therefore, this study proposes a method for evaluating venture businesses and presents empirical results obtained from financial data of Korean venture companies listed on the KOSDAQ market of the Korea Exchange. In addition, this paper uses multi-class SVM to predict the DEA-based efficiency rating of venture businesses derived from the proposed method. Our approach sheds light on ways to locate efficient companies that generate a high level of profits. Above all, in determining effective ways to evaluate a venture firm's efficiency, it is important to understand the major factors that contribute to that efficiency. Therefore, this paper is built on the following two ideas for classifying which venture companies are more efficient: i) constructing a DEA-based multi-class rating for the sample companies and ii) developing a multi-class SVM-based efficiency prediction model for classifying all companies.
First, Data Envelopment Analysis (DEA) is a non-parametric, multiple input-output efficiency technique that measures the relative efficiency of decision making units (DMUs) using a linear programming-based model. It is non-parametric because it requires no assumption about the shape or parameters of the underlying production function. DEA has already been widely applied to evaluating the relative efficiency of DMUs. Recently, a number of DEA-based studies have evaluated the efficiency of various types of companies, such as internet companies and venture companies, and DEA has also been applied to corporate credit ratings. In this study, we used DEA to sort venture companies into efficiency-based ratings. The Support Vector Machine (SVM), on the other hand, is a popular technique for solving data classification problems. In this paper, we employed SVM to classify the efficiency ratings of IT venture companies according to the results of DEA. The SVM method was first developed by Vapnik (1995). As one of many machine learning techniques, SVM is based on statistical learning theory. Thus far, the method has shown good performance, especially in its generalization capacity in classification tasks, resulting in numerous applications in many areas of business. SVM is basically an algorithm that finds the maximum margin hyperplane, that is, the hyperplane that gives the maximum separation between classes. The support vectors are the training points closest to the maximum margin hyperplane. If the classes cannot be separated linearly, a kernel function can be used: in the case of nonlinear class boundaries, the original input space is mapped into a high-dimensional dot-product feature space. Many studies have applied SVM to bankruptcy prediction, financial time series forecasting, and credit rating estimation. In this study, we employed SVM to develop a data mining-based efficiency prediction model.
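As a rough illustration of how DEA scores can be turned into ordinal efficiency ratings, the sketch below covers only the single-input, single-output special case, where the constant-returns-to-scale efficiency of a DMU reduces to its output/input ratio divided by the best ratio in the sample. The multiple input-output setting described above requires solving a linear program for each DMU, which is not shown here. The function names, the sample data, and the rating cutoffs are all hypothetical and are not taken from the study.

```python
# Minimal sketch (assumption: one input and one output per DMU).
# In this special case no linear program is needed: efficiency is the
# DMU's output/input ratio relative to the best ratio in the sample.

def dea_efficiency(inputs, outputs):
    """Return a relative efficiency score in (0, 1] for each DMU."""
    if not inputs or len(inputs) != len(outputs):
        raise ValueError("inputs and outputs must be non-empty and of equal length")
    if any(x <= 0 for x in inputs):
        raise ValueError("inputs must be strictly positive")
    ratios = [y / x for x, y in zip(inputs, outputs)]
    best = max(ratios)
    return [r / best for r in ratios]


def efficiency_rating(score, cutoffs=(0.75, 0.5, 0.25)):
    """Map a score to an ordinal class, 1 being the most efficient.

    The cutoffs are hypothetical; the study's own rating boundaries
    are not given in the abstract.
    """
    for rank, cutoff in enumerate(cutoffs, start=1):
        if score >= cutoff:
            return rank
    return len(cutoffs) + 1


# Hypothetical DMUs: four firms with one input and one output each.
inputs = [10.0, 20.0, 40.0, 80.0]
outputs = [20.0, 20.0, 20.0, 20.0]

scores = dea_efficiency(inputs, outputs)
ratings = [efficiency_rating(s) for s in scores]
print(scores)   # relative efficiency of each firm
print(ratings)  # ordinal efficiency class of each firm
```

The resulting ordinal classes play the role of the target labels that a multi-class classifier would then be trained to predict from financial variables.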
We used the Gaussian radial basis function as the kernel function of SVM. For multi-class SVM, we adopted the one-against-one approach, which is built from binary classifiers, and two all-together methods, proposed by Weston and Watkins (1999) and Crammer and Singer (2000), respectively. In this research, we used corporate information on 154 companies listed on the KOSDAQ market of the Korea Exchange. We obtained the companies' financial information for 2005 from KIS (Korea Information Service, Inc.). Using this data, we constructed a multi-class rating from DEA efficiency and built a data mining-based multi-class prediction model. Among the three multi-class classification methods, the Weston and Watkins method achieved the best hit ratio on the test data set. In multi-class problems such as the efficiency rating of venture businesses, it is very useful for investors to know the class to within a one-class error when the exact class is difficult to determine in the actual market. We therefore also report accuracy within a one-class error, and the Weston and Watkins method showed 85.7% accuracy on our test samples. We conclude that the DEA-based multi-class approach to venture business generates more information than the binary classification problem, notwithstanding its efficiency level. We believe this model can help investors in decision making, as it provides a reliable tool for evaluating venture companies in the financial domain. For future research, we see a need to improve areas such as the variable selection process, the parameter selection of the kernel function, the generalization, and the sample size of the multi-class problem.
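The ingredients named above can be sketched in a few lines: a Gaussian radial basis function kernel, majority voting over the k(k-1)/2 class pairs used by the one-against-one approach, and the two evaluation measures (hit ratio and accuracy within a one-class error). This is only a sketch under stated assumptions: the `gamma` value, the stand-in pairwise rule, and the label vectors are hypothetical, and no SVM is actually trained here.

```python
import math
from itertools import combinations


def gaussian_rbf_kernel(x, z, gamma=0.5):
    """Gaussian RBF kernel exp(-gamma * ||x - z||^2); gamma is an assumed value."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, z))
    return math.exp(-gamma * sq_dist)


def one_against_one_vote(pairwise_winner, classes):
    """Combine binary decisions over all class pairs by majority vote.

    pairwise_winner(a, b) stands in for a trained binary classifier and
    must return either a or b. Ties go to the class listed first.
    """
    votes = {c: 0 for c in classes}
    for a, b in combinations(classes, 2):
        votes[pairwise_winner(a, b)] += 1
    return max(classes, key=lambda c: votes[c])


def hit_ratio(actual, predicted):
    """Share of samples whose predicted class equals the actual class."""
    hits = sum(1 for a, p in zip(actual, predicted) if a == p)
    return hits / len(actual)


def within_one_class_accuracy(actual, predicted):
    """Share of samples predicted exactly or off by one ordinal class."""
    close = sum(1 for a, p in zip(actual, predicted) if abs(a - p) <= 1)
    return close / len(actual)


# Hypothetical ordinal ratings (1 = most efficient) for eight test firms.
actual = [1, 2, 3, 4, 2, 3, 1, 4]
predicted = [1, 3, 3, 2, 2, 4, 1, 4]

exact = hit_ratio(actual, predicted)
relaxed = within_one_class_accuracy(actual, predicted)

# Stand-in pairwise rule that always prefers the lower class label.
winner = one_against_one_vote(lambda a, b: min(a, b), [1, 2, 3, 4])
```

The relaxed measure is never lower than the hit ratio, since every exact hit also counts as being within one class.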
Keywords
Multi-class Support Vector Machines(SVM); Data Envelopment Analysis(DEA); Business Efficiency; Venture Business;
Citations & Related Records
Times Cited By KSCI: 2
1 Ahn, H., Kim, K.-j, and Han, I., 'Intelligent Credit Rating Model for Korean Companies using Multiclass Support Vector Machines,' Korean Management Review, Vol. 35, No. 5, 2006, pp. 1479-1496
2 Banker, R.D., Charnes, A., and Cooper, W.W., 'Some Models for Estimating Technical and Scale Inefficiencies in Data Envelopment Analysis,' Management Science, Vol. 30, No. 9, 1984, pp. 1078-1092
3 Barua, A., Brockett, P.L., Cooper, W.W., Deng, H., Parker, B.R., Ruefli, T.W., and Whinston, A., 'DEA evaluations of long- and short-run efficiencies of digital vs. physical product 'dot com' companies,' Socio-Economic Planning Sciences, Vol. 38, No. 4, 2004, pp. 233-253
4 Hong, T., Park, J., and Kim, E., 'Using DEA-logit Approach to Evaluate Efficiency of IT Venture Business,' The Journal of Internet Electronic Commerce Research, Vol. 7, No. 3, 2007, pp. 429-449
5 Huang, Z., Chen, H., Hsu, C-J., Chen, W-H., and Wu, S., 'Credit rating analysis with support vector machines and neural network: a market comparative study,' Decision Support Systems, Vol. 37, 2004, pp. 543-558
6 Kim, J.J., 'Research Papers: The Effect of DEA Efficiency on ROE of KOSDAQ Firms,' TAX ACCOUNTING RESEARCH, Vol. 14, 2004, pp. 153-170
7 Lee, Y.-C., 'Credit Risk Evaluation Using Worst Practice DEA,' DAEHAN Association of Business Administration, Vol. 19, No. 5, 2006, pp. 183-203
8 Min, J.H. and Jeong, C.W., 'Nonparametric Approach to Bankruptcy Prediction: Developing Cross Peeling Technique integrating DEA and Negative DEA,' Korean Management Review, Vol. 35, No. 4, 2006, pp. 1157-1180
9 Tay, F.E.H. and Cao, L.J., 'Application of support vector machines in financial time series forecasting,' Omega, Vol. 29, No. 4, 2001, pp. 309-317
10 Vapnik, V., Statistical Learning Theory, New York, Wiley, 1998
11 Zhu, J., 'Multi-factor performance measure model with an application to Fortune 500 companies,' European Journal of Operational Research, Vol. 123, No. 1, 2000, pp. 105-124
12 Platt, J.C., Cristianini, N., and Shawe-Taylor, J., 'Large Margin DAGs for Multiclass Classification,' in Advances in Neural Information Processing Systems. Cambridge, MA: MIT Press, Vol. 12, 2000, pp. 547-553
13 Shin, K.S. and Han, I., 'A case-based approach using inductive indexing for corporate bond rating,' Decision Support Systems, Vol. 32, No. 1, 2001, pp. 41-52
14 Zhu, J., 'Multi-factor performance measure model with an application to Fortune 500 companies,' European Journal of Operational Research, Vol. 123, No. 1, 2000, pp. 105-124
15 Hu, M.Y., Zhang, G., Christine, X.J., and Patuwo, B.E., 'A cross-validation analysis of neural network out-of-sample performance in exchange rate forecasting,' Decision Sciences, Vol. 30, No. 1, 1999, pp. 197-216
16 Hwang, W.W. and Yang, B.S., 'Fault Diagnosis of Rotating Machinery Using Multiclass Support Vector Machines,' Transactions of the Korean Society for Noise and Vibration Engineering, Vol. 14, No. 12, 2004, pp. 1233-1240
17 Zhu, J., Quantitative Model For Performance Evaluation And Benchmarking, Kluwer Academic Publishers, 2003
18 Hong, T. and Park, J., 'A DEA-Based Data Mining for the Evaluation of the Efficiency in the IT Venture Business,' Proceedings of the 13th Asia Pacific Management Conference, 2007
19 Hsu, C.W. and Lin, C.J., 'A Comparison of Methods for Multiclass Support Vector Machines,' IEEE Transactions on Neural Networks, Vol. 13, No. 2, 2002, pp. 415-425
20 Lee, K.-J., Kim, J.-J., and Geon, C., 'The Evaluation of the Efficiency of Internet Companies Using a DEA Model,' DAEHAN Association of Business Administration, Vol. 20, No. 1, 2007, pp. 109-136
21 Kim, K.J., 'Productivity Analysis of Software Industry in Korea,' Productivity Review, Vol. 19, No. 1, 2005, pp. 95-111
22 Koo, C.O., Kim, G.J., and Park, J.W., 'The Productivity Analysis of KOSDAQ Software Companies,' Journal of Finance and Accounting Information, Vol. 6, No. 1, 2006, pp. 27-51
23 Carlos, S.C., Yolanda, F.C., and Cecilio, M.M., 'Measuring DEA efficiency in Internet companies,' Decision Support Systems, Vol. 38, No. 4, 2005, pp. 557-573
24 Crammer, K. and Singer, Y., 'On the learnability and design of output codes for multiclass problems,' Comput. Learning Theory, 2000, pp. 35-46
25 Garavaglia, S., 'An application of a Counter Propagation Neural Networks: Simulating the Standard and Poor's Corporate Bond Rating Systems,' Proceedings of the First International Conference on Artificial Intelligence on Wall Street, 1991, pp. 278-287
26 Hong, H.K., Shin, C.K., Park, S.C., and Kim, S.H., 'Evaluating the efficiency of system integration projects using data envelopment analysis(DEA) and machine learning,' Expert Systems with Applications, Vol. 16, 1999, pp. 283-296
27 Paradi, J.C., Asmild, M., and Simak, P.C., 'Using DEA and worst practice DEA in credit risk evaluation,' Journal of Productivity Analysis, Vol. 21, No. 2, 2004, pp. 153-165
28 Kwon, Y.S., Han, I.G., and Lee, K.C., 'Ordinal Pairwise Partitioning(OPP) approach to neural networks training in bond rating,' Intelligent Systems in Accounting, Finance and Management, Vol. 6, 1997, pp. 23-40
29 Song, K.M., Bahn, S.S., and Kim, S.T., 'The Significance of Financial Model in Venture Credit Rating and Its Comparison with the Case of Large Corporate,' The Korean Venture Management Review, Vol. 10, No. 2, 2007, pp. 1-23
30 Farrell, M.J., 'The Measurement of Productive Efficiency,' Journal of the Royal Statistical Society, Vol. 120, 1957, pp. 253-281
31 Hearst, M.A., Dumais, S.T., Osuna, E., Platt, J., and Scholkopf, B., 'Support vector machines,' IEEE Intelligent Systems, Vol. 13, No. 4, 1998, pp. 18-28
32 Kim, J.W., 'Expert systems for bond rating: a comparative analysis of statistical, rule-based and neural network systems,' Expert Systems, Vol. 10, 1993, pp. 167-171
33 Kim, K.S., 'Analyzing the Technical Efficiency of Korean System Integration Firms Using DEA and Malmquist Productivity Analysis,' Journal of the Korea society of IT services, Vol. 5, No. 1, 2006, pp. 1-16
34 Nguyen, M.H. and Rajapakse, J.C., 'MultiClass Support Vector Machines for Protein Secondary Structure Prediction,' Genome Informatics, Vol. 14, 2003, pp. 218-227
35 Weston, J. and Watkins, C., 'Multi-class support vector machines,' presented at the Proc. ESANN99, M. Verleysen, Ed., Brussels, Belgium, 1999
36 Charnes, A., Cooper, W.W., and Rhodes, E., 'Measuring the Efficiency of Decision Making Units,' European Journal of Operational Research, Vol. 2, No. 6, 1978, pp. 429-444