• Title/Summary/Keyword: four classes

Search Result 964, Processing Time 0.022 seconds

Optimization of Multiclass Support Vector Machine using Genetic Algorithm: Application to the Prediction of Corporate Credit Rating (유전자 알고리즘을 이용한 다분류 SVM의 최적화: 기업신용등급 예측에의 응용)

  • Ahn, Hyunchul
    • Information Systems Review
    • /
    • v.16 no.3
    • /
    • pp.161-177
    • /
    • 2014
  • Corporate credit rating assessment consists of complicated processes in which various factors describing a company are taken into consideration. Such assessment is known to be very expensive since domain experts should be employed to assess the ratings. As a result, the data-driven corporate credit rating prediction using statistical and artificial intelligence (AI) techniques has received considerable attention from researchers and practitioners. In particular, statistical methods such as multiple discriminant analysis (MDA) and multinomial logistic regression analysis (MLOGIT), and AI methods including case-based reasoning (CBR), artificial neural network (ANN), and multiclass support vector machine (MSVM) have been applied to corporate credit rating.2) Among them, MSVM has recently become popular because of its robustness and high prediction accuracy. In this study, we propose a novel optimized MSVM model, and appy it to corporate credit rating prediction in order to enhance the accuracy. Our model, named 'GAMSVM (Genetic Algorithm-optimized Multiclass Support Vector Machine),' is designed to simultaneously optimize the kernel parameters and the feature subset selection. Prior studies like Lorena and de Carvalho (2008), and Chatterjee (2013) show that proper kernel parameters may improve the performance of MSVMs. Also, the results from the studies such as Shieh and Yang (2008) and Chatterjee (2013) imply that appropriate feature selection may lead to higher prediction accuracy. Based on these prior studies, we propose to apply GAMSVM to corporate credit rating prediction. As a tool for optimizing the kernel parameters and the feature subset selection, we suggest genetic algorithm (GA). GA is known as an efficient and effective search method that attempts to simulate the biological evolution phenomenon. By applying genetic operations such as selection, crossover, and mutation, it is designed to gradually improve the search results. Especially, mutation operator prevents GA from falling into the local optima, thus we can find the globally optimal or near-optimal solution using it. GA has popularly been applied to search optimal parameters or feature subset selections of AI techniques including MSVM. With these reasons, we also adopt GA as an optimization tool. To empirically validate the usefulness of GAMSVM, we applied it to a real-world case of credit rating in Korea. Our application is in bond rating, which is the most frequently studied area of credit rating for specific debt issues or other financial obligations. The experimental dataset was collected from a large credit rating company in South Korea. It contained 39 financial ratios of 1,295 companies in the manufacturing industry, and their credit ratings. Using various statistical methods including the one-way ANOVA and the stepwise MDA, we selected 14 financial ratios as the candidate independent variables. The dependent variable, i.e. credit rating, was labeled as four classes: 1(A1); 2(A2); 3(A3); 4(B and C). 80 percent of total data for each class was used for training, and remaining 20 percent was used for validation. And, to overcome small sample size, we applied five-fold cross validation to our dataset. In order to examine the competitiveness of the proposed model, we also experimented several comparative models including MDA, MLOGIT, CBR, ANN and MSVM. In case of MSVM, we adopted One-Against-One (OAO) and DAGSVM (Directed Acyclic Graph SVM) approaches because they are known to be the most accurate approaches among various MSVM approaches. GAMSVM was implemented using LIBSVM-an open-source software, and Evolver 5.5-a commercial software enables GA. Other comparative models were experimented using various statistical and AI packages such as SPSS for Windows, Neuroshell, and Microsoft Excel VBA (Visual Basic for Applications). Experimental results showed that the proposed model-GAMSVM-outperformed all the competitive models. In addition, the model was found to use less independent variables, but to show higher accuracy. In our experiments, five variables such as X7 (total debt), X9 (sales per employee), X13 (years after founded), X15 (accumulated earning to total asset), and X39 (the index related to the cash flows from operating activity) were found to be the most important factors in predicting the corporate credit ratings. However, the values of the finally selected kernel parameters were found to be almost same among the data subsets. To examine whether the predictive performance of GAMSVM was significantly greater than those of other models, we used the McNemar test. As a result, we found that GAMSVM was better than MDA, MLOGIT, CBR, and ANN at the 1% significance level, and better than OAO and DAGSVM at the 5% significance level.

Study on Current Curriculum Analysis of Clinical Dental Hygiene for Dental Hygiene Students in Korea (국내 치위생(학)과 임상치위생학 교육과정 운영현황 분석)

  • Choi, Yong-Keum;Han, Yang-Keum;Bae, Soo-Myoung;Kim, Jin;Kim, Hye-Jin;Ahn, Se-Youn;Lim, Kun-Ok;Lim, Hee Jung;Jang, Sun-Ok;Jang, Yun-Jung;Jung, Jin-Ah;Jeon, Hyun-Sun;Park, Ji-Eun;Lee, Hyo-Jin;Shin, Bo-Mi
    • Journal of dental hygiene science
    • /
    • v.17 no.6
    • /
    • pp.523-532
    • /
    • 2017
  • The purpose of this study was to provide basic data to standardize the clinical dental hygiene curriculum, based on analysis of current clinical dental hygiene curricula in Korea. We emailed questionnaires to 12 schools to investigate clinical dental hygiene curricula, from February to March, 2017. We analyzed the clinical dental hygiene curricula in 5 schools with a 3-year program and in 7 schools with a 4-year program. The questionnaire comprised nine items on topics relating to clinical dental hygiene, and four items relating to the dental hygiene process and oral prophylaxis. The questionnaire included details regarding the subject name, the grade/semester/credit system, course content and class hours, the number of senior professors, and the number of patients available for dental hygiene clinical training purposes. In total, there were 96 topics listed in the curricula relating to clinical dental hygiene training, and topics varied between the schools. There was an average of 20.4 topic credits, and more credits and hours were allocated to the 4-year program than to the 3-year program. On average, the ratio of students to professors was 21.4:1. Course content included infection control, concepts for dental hygiene processes, dental hygiene assessment, intervention and evaluation, case studies, and periodontal instrumentation. An average of 2 hours per patient was spent on dental hygiene practice, with an average of 1.9 visits. On average, student clinical training involved 19 patients and 26.6 patients in the 3-year and 4-year programs, respectively. The average participation time per student per topic was 38.0 hours and 53.1 hours, in the 3-year and 4-year programs, respectively. Standardizing the clinical dental hygiene curricula in Korea will require consensus guidelines on topics, the number of classes required to achieve core competencies as a dental hygienist, and theory and practice time.

Summative Evaluation of 1993, 1994 Discussion Contest of Scientific Investigation (제 1, 2회 학생 과학 공동탐구 토론대회의 종합적 평가)

  • Kim, Eun-Sook;Yoon, Hye-Gyoung
    • Journal of The Korean Association For Science Education
    • /
    • v.16 no.4
    • /
    • pp.376-388
    • /
    • 1996
  • The first and the second "Discussion Contest of Scientific Investigation" was evaluated in this study. This contest was a part of 'Korean Youth Science Festival' held in 1993 and 1994. The evaluation was based on the data collected from the middle school students of final teams, their teachers, a large number of middle school students and college students who were audience of the final competition. Questionnaires, interviews, reports of final teams, and video tape of final competition were used to collect data. The study focussed on three research questions. The first was about the preparation and the research process of students of final teams. The second was about the format and the proceeding of the Contest. The third was whether participating the Contest was useful experience for the students and the teachers of the final teams. The first area, the preparation and the research process of students, were investigated in three aspects. One was the level of cooperation, participation, support and the role of teachers. The second was the information search and experiment, and the third was the report writing. The students of the final teams from both years, had positive opinion about the cooperation, students' active involvement, and support from family and school. Students considered their teachers to be a guide or a counsellor, showing their level of active participation. On the other hand, the interview of 1993 participants showed that there were times that teachers took strong leading role. Therefore one can conclude that students took active roles most of the time while the room for improvement still exists. To search the information they need during the period of the preparation, student visited various places such as libraries, bookstores, universities, and research institutes. Their search was not limited to reading the books, although the books were primary source of information. Students also learned how to organize the information they found and considered leaning of organizing skill useful and fun. Variety of experiments was an important part of preparation and students had positive opinion about it. Understanding related theory was considered most difficult and important, while designing and building proper equipments was considered difficult but not important. This reflects the students' school experience where the equipments were all set in advance and students were asked to confirm the theories presented in the previous class hours. About the reports recording the research process, students recognize the importance and the necessity of the report but had difficulty in writing it. Their reports showed tendency to list everything they did without clear connection to the problem to be solved. Most of the reports did not record the references and some of them confused report writing with story telling. Therefore most of them need training in writing the reports. It is also desirable to describe the process of student learning when theory or mathematics that are beyond the level of middle school curriculum were used because it is part of their investigation. The second area of evaluation was about the format and the proceeding of the Contest, the problems given to students, and the process of student discussion. The format of the Contests, which consisted of four parts, presentation, refutation, debate and review, received good evaluation from students because it made students think more and gave more difficult time but was meaningful and helped to remember longer time according to students. On the other hand, students said the time given to each part of the contest was too short. The problems given to students were short and open ended to stimulate students' imagination and to offer various possible routes to the solution. This type of problem was very unfamiliar and gave a lot of difficulty to students. Student had positive opinion about the research process they experienced but did not recognize the fact that such a process was possible because of the oneness of the task. The level of the problems was rated as too difficult by teachers and college students but as appropriate by the middle school students in audience and participating students. This suggests that it is possible for student to convert the problems to be challengeable and intellectually satisfactory appropriate for their level of understanding even when the problems were difficult for middle school students. During the process of student discussion, a few problems were observed. Some problems were related to the technics of the discussion, such as inappropriate behavior for the role he/she was taking, mismatching answers to the questions. Some problems were related to thinking. For example, students thinking was off balanced toward deductive reasoning, and reasoning based on experimental data was weak. The last area of evaluation was the effect of the Contest. It was measured through the change of the attitude toward science and science classes, and willingness to attend the next Contest. According to the result of the questionnaire, no meaningful change in attitude was observed. However, through the interview several students were observed to have significant positive change in attitude while no student with negative change was observed. Most of the students participated in Contest said they would participate again or recommend their friend to participate. Most of the teachers agreed that the Contest should continue and they would recommend their colleagues or students to participate. As described above, the "Discussion Contest of Scientific Investigation", which was developed and tried as a new science contest, had positive response from participating students and teachers, and the audience. Two among the list of results especially demonstrated that the goal of the Contest, "active and cooperative science learning experience", was reached. One is the fact that students recognized the experience of cooperation, discussion, information search, variety of experiments to be fun and valuable. The other is the fact that the students recognized the format of the contest consisting of presentation, refutation, discussion and review, required more thinking and was challenging, but was more meaningful. Despite a few problems such as, unfamiliarity with the technics of discussion, weakness in inductive and/or experiment based reasoning, and difficulty in report writing, The Contest demonstrated the possibility of new science learning environment and science contest by offering the chance to challenge open tasks by utilizing student science knowledge and ability to inquire and to discuss rationally and critically with other students.

  • PDF

A Study of Intangible Cultural Heritage Communities through a Social Network Analysis - Focused on the Item of Jeongseon Arirang - (소셜 네트워크 분석을 통한 무형문화유산 공동체 지식연결망 연구 - 정선아리랑을 중심으로 -)

  • Oh, Jung-shim
    • Korean Journal of Heritage: History & Science
    • /
    • v.52 no.3
    • /
    • pp.172-187
    • /
    • 2019
  • Knowledge of intangible cultural heritage is usually disseminated through word-of-mouth and actions rather than written records. Thus, people assemble to teach others about it and form communities. Accordingly, to understand and spread information about intangible cultural heritage properly, it is necessary to understand not only their attributes but also a community's relational characteristics. Community members include specialized transmitters who work under the auspices of institutions, and general transmitters who enjoy intangible cultural heritage in their daily lives. They converse about intangible cultural heritage in close relationships. However, to date, research has focused only on professionals. Thus, this study focused on the roles of general transmitters of intangible cultural heritage information by investigating intangible cultural heritage communities centering around Jeongseon Arirang; a social network analysis was performed. Regarding the research objectives presented in the introduction, the main findings of the study are summarized as follows. First, there were 197 links between 74 members of the Jeongseon Arirang Transmission Community. One individual had connections with 2.7 persons on average, and all were connected through two steps in the community. However, the density and the clustering coefficient were low, 0.036 and 0.32, respectively; therefore, the cohesiveness of this community was low, and the relationships between the members were not strong. Second, 'Young-ran Yu', 'Nam-gi Kim' and 'Gil-ja Kim' were found to be the prominent figures of the Jeongseon Arirang Transmission Community, and the central structure of the network was concentrated around these three individuals. Being located in the central structure of the network indicates that a person is popular and ranked high. Also, it means that a person has an advantage in terms of the speed and quantity of the acquisition of information and resources, and is in a relatively superior position in terms of bargaining power. Third, to understand the replaceability of the roles of Young-ran Yu, Nam-gi Kim, and Gil-ja Kim, who were found to be the major figures through an analysis of the central structure, structural equivalence was profiled. The results of the analysis showed that the positions and roles of Young-ran Yu, Nam-gi Kim, and Gil-ja Kim were unrivaled and irreplaceable in the Jeongseon Arirang Transmission Community. However, considering that these three members were in their 60s and 70s, it seemed that it would be necessary to prepare measures for the smooth maintenance and operation of the community. Fourth, to examine the subgroup hidden in the network of the Jeongseon Arirang Transmission Community, an analysis of communities was conducted. A community refers to a subgroup clearly differentiated based on modularity. The results of the analysis identified the existence of four communities. Furthermore, the results of an analysis of the central structure showed that the communities were formed and centered around Young-ran Yu, Hyung-jo Kim, Nam-gi Kim, and Gil-ja Kim. Most of the transmission TAs recommended by those members, students who completed a course, transmission scholarship holders, and the general members taught in the transmission classes of the Jeongseon Arirang Preservation Society were included as members of the communities. Through these findings, it was discovered that it is possible to maintain the transmission genealogy, making an exchange with the general members by employing the present method for the transmission of Jeongseon Arirang, the joint transmission method. It is worth paying attention to the joint transmission method as it overcomes the demerits of the existing closed one-on-one apprentice method and provides members with an opportunity to learn their masters' various singing styles. This study is significant for the following reasons: First, by collecting and examining data using a social network analysis method, this study analyzed phenomena that had been difficult to investigate using existing statistical analyses. Second, by adopting a different approach to the previous method in which the genealogy was understood, looking at oral data, this study analyzed the structures of the transmitters' relationships with objective and quantitative data. Third, this study visualized and presented the abstract structures of the relationships among the transmitters of intangible cultural heritage information on a 2D spring map. The results of this study can be utilized as a baseline for the development of community-centered policies for the protection of intangible cultural heritage specified in the UNESCO Convention for the Safeguarding of Intangible Cultural Heritage. To achieve this, it would be necessary to supplement this study through case studies and follow-up studies on more aspects in the future.