• Title/Summary/Keyword: multiple-training

Search Result 1,082, Processing Time 0.03 seconds

Cavitation signal detection based on time-series signal statistics (시계열 신호 통계량 기반 캐비테이션 신호 탐지)

  • Haesang Yang;Ha-Min Choi;Sock-Kyu Lee;Woojae Seong
    • The Journal of the Acoustical Society of Korea
    • /
    • v.43 no.4
    • /
    • pp.400-405
    • /
    • 2024
  • When cavitation noise occurs in ship propellers, the level of underwater radiated noise abruptly increases, which can be a critical threat factor as it increases the probability of detection, particularly in the case of naval vessels. Therefore, accurately and promptly assessing cavitation signals is crucial for improving the survivability of submarines. Traditionally, techniques for determining cavitation occurrence have mainly relied on assessing acoustic/vibration levels measured by sensors above a certain threshold, or using the Detection of Envelop Modulation On Noise (DEMON) method. However, technologies related to this rely on a physical understanding of cavitation phenomena and subjective criteria based on user experience, involving multiple procedures, thus necessitating the development of techniques for early automatic recognition of cavitation signals. In this paper, we propose an algorithm that automatically detects cavitation occurrence based on simple statistical features reflecting cavitation characteristics extracted from acoustic signals measured by sensors attached to the hull. The performance of the proposed technique is evaluated depending on the number of sensors and model test conditions. It was confirmed that by sufficiently training the characteristics of cavitation reflected in signals measured by a single sensor, the occurrence of cavitation signals can be determined.

Estimation of surface nitrogen dioxide mixing ratio in Seoul using the OMI satellite data (OMI 위성자료를 활용한 서울 지표 이산화질소 혼합비 추정 연구)

  • Kim, Daewon;Hong, Hyunkee;Choi, Wonei;Park, Junsung;Yang, Jiwon;Ryu, Jaeyong;Lee, Hanlim
    • Korean Journal of Remote Sensing
    • /
    • v.33 no.2
    • /
    • pp.135-147
    • /
    • 2017
  • We, for the first time, estimated daily and monthly surface nitrogen dioxide ($NO_2$) volume mixing ratio (VMR) using three regression models with $NO_2$ tropospheric vertical column density (OMIT-rop $NO_2$ VCD) data obtained from Ozone Monitoring Instrument (OMI) in Seoul in South Korea at OMI overpass time (13:45 local time). First linear regression model (M1) is a linear regression equation between OMI-Trop $NO_2$ VCD and in situ $NO_2$ VMR, whereas second linear regression model (M2) incorporates boundary layer height (BLH), temperature, and pressure obtained from Atmospheric Infrared Sounder (AIRS) and OMI-Trop $NO_2$ VCD. Last models (M3M & M3D) are a multiple linear regression equations which include OMI-Trop $NO_2$ VCD, BLH and various meteorological data. In this study, we determined three types of regression models for the training period between 2009 and 2011, and the performance of those regression models was evaluated via comparison with the surface $NO_2$ VMR data obtained from in situ measurements (in situ $NO_2$ VMR) in 2012. The monthly mean surface $NO_2$ VMRs estimated by M3M showed good agreements with those of in situ measurements(avg. R = 0.77). In terms of the daily (13:45LT) $NO_2$ estimation, the highest correlations were found between the daily surface $NO_2$ VMRs estimated by M3D and in-situ $NO_2$ VMRs (avg. R = 0.55). The estimated surface $NO_2$ VMRs by three modelstend to be underestimated. We also discussed the performance of these empirical modelsfor surface $NO_2$ VMR estimation with respect to otherstatistical data such asroot mean square error (RMSE), mean bias, mean absolute error (MAE), and percent difference. This present study shows a possibility of estimating surface $NO_2$ VMR using the satellite measurement.

A Study on Factors Influencing The State of Adaptation of The Hemiplegic Patients (편마비 환자의 퇴원후 적응상태와 관련요인에 대한 분석적 연구)

  • 서문자
    • Journal of Korean Academy of Nursing
    • /
    • v.20 no.1
    • /
    • pp.88-117
    • /
    • 1990
  • The purposes of this study are to delineate a profile of the state of a stroke patient's adaptation at 3 months after hospitalization and to explore the relationship between the level of adaptation and the variables which influence the adaptation of hemiplegic patients. To these ends, theoretical framework was derived basically from the stress adaptation model. The basic assumption underlying the level of adaptation is influenced by the presenting focal, contextual and residual stimuli. This group of stimuli is further operationalized and represented by a perception of stress. which is the perceived effect of the disability and by the mediating variables such as sociodemographic factors as an external conditioning variables and perceived social support and hardiness personality characteristics as an internal intervening variables. The dependent varibales in this study is the level of physical, psychological and social adaptation and is hypothesized to be a function of the interaction between 3 sets of variables namely, the perceived disability effect, external conditioning variables and internal intevening varibles. A total of fourty three subjects from 3 general hospitals in Seoul were observed and interviewed with the aid of 7 structured instruments. The data were collected twice on each subject : first at the pre-discharge period arid at 3 months post-discharge from hospital for the second time. The study was carried out for the period from February to August, 1988. The instruments used for the study include 4 existing scales and 3 scales developed by the researcher for this study. They are : 1) The ADL dependency scale and the scale of the clinical physical functions for the assessment of physical adaptation. 2) the SDS(self report of depression) to measure the level of psychological adaptation. 3) The scale for the amount of social activities for the measurement of the level of social adaptation. 4) The scale for the perceived effect of disability for the measurement of the focal stimuli. 5) The health related hardiness scale and the perceived interpersonal support self evaluation list(ISEL) for the measurement of the hardiness personality character and the perceived social support. The data obtained were analyzed using percentage, oneway ANOVA, Pearson coefficients correlation and stepwise multiple regression. The findings provide valuable information about the present level of physical adaptation at 3 months after discharge. The patient revealed a decreased ADL dependency and lowered limitation of physical function as compared with pre - discharge state. Psycholcgically, the average degree of depression at follow up was within normal range of depression. Socially, the amount of social activities was very low. The one way ANOVA and the correlational analysis revealed the relationship between the 3 sets of variables and the adaptation level as follows : 1) The perceived disability effect was related to the degree of the depression and the amount of social activities but was not related to the physical adaptation. 2) Among the sociodemographic variables, sex and education were related to the difference of ADL dependency and the change of physical function. These factors indicate that women more than men and educated more than the less educated were found more independent. The education was also related to the degree of depression suggesting that the higher the educational level, the more well adapted the patients were both physically and psychologically. Age, marital status and job state were not found to be related to the patient's adaptation level. 3) Among the internal intervening variables, the health related hardiness characteristic was related to the differences of ADL dependency, physical functions and the social activities, indicating that the higher the hardiness character the higher the level of physical and social adaptation. 4) The perceived social support, another internal intervening variable, was related to the degree of depression and the social activities. This data suggest that the higher the perception of social support, the better adapted the patients were psychogically and socially. In summarizing the results of the correlational analysis, the level of physical adaptation was influenced by sex, the years of education and the hardiness character. The level of psychological adaptation was influenced by the years of education, the perceived disability effect and the perceived social support. And the level of social adaptation was influenced by the perceived disability effect, the hardiness character and the perceived social support. The stepwise multiple regression analysis shows findings as follows : 1) The most important factor to explain the difference of ADL dependency was sex, indicating females were more independent than males. 2) The most important factor to explain the difference of physical function and the degree of depression was the patient's education level. 3) The strongest explaining factor for the amount of social activities was perceived self esteem(one of the subconcepts of perceived social support). Thus the most important factors influencing the level of adaptation were found to be sex, education, the hardiness character and self esteem. From the above findings, the significance of this study can be delineated as follows : 1) Corroboration of the assumed relationship between the various variables and the adaptation level as suggested in the conceptual model. 2) Support for the feasibility of the cognitive approach for nursing intervention such as hardness character training, counselling and teaching for self-care in the chronic patients.

  • PDF

Comparison of shaping ability between single length technique and crown-down technique using Mtwo rotary file (Mtwo 전동 파일을 사용한 single length technique과 crown-down technique의 근관성형 효율 비교)

  • Lim, Yoo-Kyoung;Park, Jeong-Kil;Hur, Bock;Kim, Hyeon-Cheol
    • Restorative Dentistry and Endodontics
    • /
    • v.32 no.4
    • /
    • pp.385-396
    • /
    • 2007
  • The aims of this study were to compare the shaping effect and safety between single length technique recommended by manufacturer and crown-down technique using Mtwo rotary file and to present a modified method in use of Mtwo file. Sixty simulated root canal resin blocks were used. The canals were divided into three groups according to instrument and the manner of using methods. Each group had 20 specimens. Group MT was instrumented with single length technique of Mtwo, group MC was instrumented with crown-down technique of Mtwo and group PT was instrumented with crown-down technique of ProTaper. All of the rotary files used in this study were operated by an electric motor. The scanned canal images of before and after preparation were superimposed. These superimposed images were evaluated at apical 1 to 8 mm levels Angle changes were calculated. The preparation time, weight loss, instrument failure and binding, canal aberrations, and centering ratio were measured. Statistical analysis of the three experimental groups was performed with ANOVA and Duncan's multiple range tests for post-hoc comparison and Fisher's exact test was done for the frequency comparison. In total preparation time, group MT and group MC were less than group PT. In the aberrations, group MT had more elbows than those of group MC and group PT. The binding of group MC was least and group MT was less than group PT (P < 0.05). Under the condition of this study, crown-down technique using Mtwo rotary file is better and safer method than single length technique recommended by the manufacturer.

Public Sentiment Analysis of Korean Top-10 Companies: Big Data Approach Using Multi-categorical Sentiment Lexicon (국내 주요 10대 기업에 대한 국민 감성 분석: 다범주 감성사전을 활용한 빅 데이터 접근법)

  • Kim, Seo In;Kim, Dong Sung;Kim, Jong Woo
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.3
    • /
    • pp.45-69
    • /
    • 2016
  • Recently, sentiment analysis using open Internet data is actively performed for various purposes. As online Internet communication channels become popular, companies try to capture public sentiment of them from online open information sources. This research is conducted for the purpose of analyzing pulbic sentiment of Korean Top-10 companies using a multi-categorical sentiment lexicon. Whereas existing researches related to public sentiment measurement based on big data approach classify sentiment into dimensions, this research classifies public sentiment into multiple categories. Dimensional sentiment structure has been commonly applied in sentiment analysis of various applications, because it is academically proven, and has a clear advantage of capturing degree of sentiment and interrelation of each dimension. However, the dimensional structure is not effective when measuring public sentiment because human sentiment is too complex to be divided into few dimensions. In addition, special training is needed for ordinary people to express their feeling into dimensional structure. People do not divide their sentiment into dimensions, nor do they need psychological training when they feel. People would not express their feeling in the way of dimensional structure like positive/negative or active/passive; rather they express theirs in the way of categorical sentiment like sadness, rage, happiness and so on. That is, categorial approach of sentiment analysis is more natural than dimensional approach. Accordingly, this research suggests multi-categorical sentiment structure as an alternative way to measure social sentiment from the point of the public. Multi-categorical sentiment structure classifies sentiments following the way that ordinary people do although there are possibility to contain some subjectiveness. In this research, nine categories: 'Sadness', 'Anger', 'Happiness', 'Disgust', 'Surprise', 'Fear', 'Interest', 'Boredom' and 'Pain' are used as multi-categorical sentiment structure. To capture public sentiment of Korean Top-10 companies, Internet news data of the companies are collected over the past 25 months from a representative Korean portal site. Based on the sentiment words extracted from previous researches, we have created a sentiment lexicon, and analyzed the frequency of the words coming up within the news data. The frequency of each sentiment category was calculated as a ratio out of the total sentiment words to make ranks of distributions. Sentiment comparison among top-4 companies, which are 'Samsung', 'Hyundai', 'SK', and 'LG', were separately visualized. As a next step, the research tested hypothesis to prove the usefulness of the multi-categorical sentiment lexicon. It tested how effective categorial sentiment can be used as relative comparison index in cross sectional and time series analysis. To test the effectiveness of the sentiment lexicon as cross sectional comparison index, pair-wise t-test and Duncan test were conducted. Two pairs of companies, 'Samsung' and 'Hanjin', 'SK' and 'Hanjin' were chosen to compare whether each categorical sentiment is significantly different in pair-wise t-test. Since category 'Sadness' has the largest vocabularies, it is chosen to figure out whether the subgroups of the companies are significantly different in Duncan test. It is proved that five sentiment categories of Samsung and Hanjin and four sentiment categories of SK and Hanjin are different significantly. In category 'Sadness', it has been figured out that there were six subgroups that are significantly different. To test the effectiveness of the sentiment lexicon as time series comparison index, 'nut rage' incident of Hanjin is selected as an example case. Term frequency of sentiment words of the month when the incident happened and term frequency of the one month before the event are compared. Sentiment categories was redivided into positive/negative sentiment, and it is tried to figure out whether the event actually has some negative impact on public sentiment of the company. The difference in each category was visualized, moreover the variation of word list of sentiment 'Rage' was shown to be more concrete. As a result, there was huge before-and-after difference of sentiment that ordinary people feel to the company. Both hypotheses have turned out to be statistically significant, and therefore sentiment analysis in business area using multi-categorical sentiment lexicons has persuasive power. This research implies that categorical sentiment analysis can be used as an alternative method to supplement dimensional sentiment analysis when figuring out public sentiment in business environment.

A Study on Market Size Estimation Method by Product Group Using Word2Vec Algorithm (Word2Vec을 활용한 제품군별 시장규모 추정 방법에 관한 연구)

  • Jung, Ye Lim;Kim, Ji Hui;Yoo, Hyoung Sun
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.1
    • /
    • pp.1-21
    • /
    • 2020
  • With the rapid development of artificial intelligence technology, various techniques have been developed to extract meaningful information from unstructured text data which constitutes a large portion of big data. Over the past decades, text mining technologies have been utilized in various industries for practical applications. In the field of business intelligence, it has been employed to discover new market and/or technology opportunities and support rational decision making of business participants. The market information such as market size, market growth rate, and market share is essential for setting companies' business strategies. There has been a continuous demand in various fields for specific product level-market information. However, the information has been generally provided at industry level or broad categories based on classification standards, making it difficult to obtain specific and proper information. In this regard, we propose a new methodology that can estimate the market sizes of product groups at more detailed levels than that of previously offered. We applied Word2Vec algorithm, a neural network based semantic word embedding model, to enable automatic market size estimation from individual companies' product information in a bottom-up manner. The overall process is as follows: First, the data related to product information is collected, refined, and restructured into suitable form for applying Word2Vec model. Next, the preprocessed data is embedded into vector space by Word2Vec and then the product groups are derived by extracting similar products names based on cosine similarity calculation. Finally, the sales data on the extracted products is summated to estimate the market size of the product groups. As an experimental data, text data of product names from Statistics Korea's microdata (345,103 cases) were mapped in multidimensional vector space by Word2Vec training. We performed parameters optimization for training and then applied vector dimension of 300 and window size of 15 as optimized parameters for further experiments. We employed index words of Korean Standard Industry Classification (KSIC) as a product name dataset to more efficiently cluster product groups. The product names which are similar to KSIC indexes were extracted based on cosine similarity. The market size of extracted products as one product category was calculated from individual companies' sales data. The market sizes of 11,654 specific product lines were automatically estimated by the proposed model. For the performance verification, the results were compared with actual market size of some items. The Pearson's correlation coefficient was 0.513. Our approach has several advantages differing from the previous studies. First, text mining and machine learning techniques were applied for the first time on market size estimation, overcoming the limitations of traditional sampling based- or multiple assumption required-methods. In addition, the level of market category can be easily and efficiently adjusted according to the purpose of information use by changing cosine similarity threshold. Furthermore, it has a high potential of practical applications since it can resolve unmet needs for detailed market size information in public and private sectors. Specifically, it can be utilized in technology evaluation and technology commercialization support program conducted by governmental institutions, as well as business strategies consulting and market analysis report publishing by private firms. The limitation of our study is that the presented model needs to be improved in terms of accuracy and reliability. The semantic-based word embedding module can be advanced by giving a proper order in the preprocessed dataset or by combining another algorithm such as Jaccard similarity with Word2Vec. Also, the methods of product group clustering can be changed to other types of unsupervised machine learning algorithm. Our group is currently working on subsequent studies and we expect that it can further improve the performance of the conceptually proposed basic model in this study.

Application of Support Vector Regression for Improving the Performance of the Emotion Prediction Model (감정예측모형의 성과개선을 위한 Support Vector Regression 응용)

  • Kim, Seongjin;Ryoo, Eunchung;Jung, Min Kyu;Kim, Jae Kyeong;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.3
    • /
    • pp.185-202
    • /
    • 2012
  • .Since the value of information has been realized in the information society, the usage and collection of information has become important. A facial expression that contains thousands of information as an artistic painting can be described in thousands of words. Followed by the idea, there has recently been a number of attempts to provide customers and companies with an intelligent service, which enables the perception of human emotions through one's facial expressions. For example, MIT Media Lab, the leading organization in this research area, has developed the human emotion prediction model, and has applied their studies to the commercial business. In the academic area, a number of the conventional methods such as Multiple Regression Analysis (MRA) or Artificial Neural Networks (ANN) have been applied to predict human emotion in prior studies. However, MRA is generally criticized because of its low prediction accuracy. This is inevitable since MRA can only explain the linear relationship between the dependent variables and the independent variable. To mitigate the limitations of MRA, some studies like Jung and Kim (2012) have used ANN as the alternative, and they reported that ANN generated more accurate prediction than the statistical methods like MRA. However, it has also been criticized due to over fitting and the difficulty of the network design (e.g. setting the number of the layers and the number of the nodes in the hidden layers). Under this background, we propose a novel model using Support Vector Regression (SVR) in order to increase the prediction accuracy. SVR is an extensive version of Support Vector Machine (SVM) designated to solve the regression problems. The model produced by SVR only depends on a subset of the training data, because the cost function for building the model ignores any training data that is close (within a threshold ${\varepsilon}$) to the model prediction. Using SVR, we tried to build a model that can measure the level of arousal and valence from the facial features. To validate the usefulness of the proposed model, we collected the data of facial reactions when providing appropriate visual stimulating contents, and extracted the features from the data. Next, the steps of the preprocessing were taken to choose statistically significant variables. In total, 297 cases were used for the experiment. As the comparative models, we also applied MRA and ANN to the same data set. For SVR, we adopted '${\varepsilon}$-insensitive loss function', and 'grid search' technique to find the optimal values of the parameters like C, d, ${\sigma}^2$, and ${\varepsilon}$. In the case of ANN, we adopted a standard three-layer backpropagation network, which has a single hidden layer. The learning rate and momentum rate of ANN were set to 10%, and we used sigmoid function as the transfer function of hidden and output nodes. We performed the experiments repeatedly by varying the number of nodes in the hidden layer to n/2, n, 3n/2, and 2n, where n is the number of the input variables. The stopping condition for ANN was set to 50,000 learning events. And, we used MAE (Mean Absolute Error) as the measure for performance comparison. From the experiment, we found that SVR achieved the highest prediction accuracy for the hold-out data set compared to MRA and ANN. Regardless of the target variables (the level of arousal, or the level of positive / negative valence), SVR showed the best performance for the hold-out data set. ANN also outperformed MRA, however, it showed the considerably lower prediction accuracy than SVR for both target variables. The findings of our research are expected to be useful to the researchers or practitioners who are willing to build the models for recognizing human emotions.

A Comparative Study of Domestic and International regulation on Mixed-fleet Flying of Flight crew (운항승무원의 항공기 2개 형식 운항관련 국내외 기준 비교 연구)

  • Lee, Koo-Hee
    • The Korean Journal of Air & Space Law and Policy
    • /
    • v.30 no.2
    • /
    • pp.403-425
    • /
    • 2015
  • The Chicago Convention and Annexes have become the basis of aviation safety regulations for every contracting state. Generally, the State's aviation safety regulations refer to the Standards and Recommended Practices(SARPs) provided in the Annexes of the Chicago Convention. In order to properly reflect international aviation safety regulations, constant studies of the aviation fields are of paramount importance. This Paper is intended to identify the main differences between korean and foreign regulation and suggest a few amendment proposals on Mixed-fleet Flying(at or more two aircraft type operation) of flight crew. Comparing with these regulations, the korean regulations and implementations have some insufficiency points. I suggest some amendment proposals of korean regulations concerning Mixed-fleet Flying that flight crew operate aircraft of different types. Basically an operator shall not assign a pilot-in-command or a co-pilot to operate at the flight controls of a type of airplane during take-off and landing unless that pilot has operated the flight controls during at least three take-offs and landings within the preceding 90 days on the same type of airplane or in a flight simulator. Also, flight crew members are familiarized with the significant differences in equipment and/or procedures between concurrently operated types. An operator shall ensure that piloting technique and the ability to execute emergency procedures is checked in such a way as to demonstrate the pilot's competence on each type or variant of a type of airplane. Proficiency check shall be performed periodically. When an operator schedules flight crew on different types of airplanes with similar characteristics in terms of operating procedures, systems and handling, the State shall decide the requirements for each type of airplane can be combined. In conclusion, it is necessary for flight crew members to remain concurrently qualified to operate multiple types. The operator shall have a program to include, as a minimum, required differences training between types and qualification to maintain currency on each type. If the Operator utilizes flight crew members to concurrently operate aircraft of different types, the operator shall have qualification processes approved or accepted by the State. If applicable, the qualification curriculum as defined in the operator's Advanced Qualification Program could be applied. Flight crew members are familiarized with the significant differences in equipment and/or procedures between concurrently operated types. The difference among different types of airpcrafts decrease and standards for these airpcrafts can be applied increasingly because function and performance have been improved by aircraft manufacture company in accordance to basic aircraft system in terms of developing new aircrafts for flight standard procedure and safety of flight. Also, it becomes more necessary for flight crews to control multi aircraft types due to various aviation business and activation of leisure business. Nevertheless, in terms of flight crew training and qualification program, there are no regulations in Korea to be applied to new aircraft types differently in accordance with different levels. In addition, it has no choice different programs based on different levels because there are not provisions to restrict or limit and specific standards to operate at or more than two aircraft types for flight safety. Therefore the aviation authority introduce Flight Standardization and/or Operational Evaluation Board in order to analysis differences among aircraft types. In addition to that, the aviation authority should also improve standard flight evaluation and qualification system among different aircraft types for flight crews to apply reasonable training and qualification efficiently. For all the issue mentioned above, I have studied the ICAO SARPs and some state's regulation concerning operating aircraft of different types(Mixed-fleet flying), and suggested some proposals on the different aircraft type operation as an example of comprehensive problem solving. I hope that this paper is 1) to help understanding about the international issue, 2) to help the improvement of korean aviation regulations, 3) to help compliance with international standards and to contribute to the promotion of aviation safety, in addition.

Optimization of Multiclass Support Vector Machine using Genetic Algorithm: Application to the Prediction of Corporate Credit Rating (유전자 알고리즘을 이용한 다분류 SVM의 최적화: 기업신용등급 예측에의 응용)

  • Ahn, Hyunchul
    • Information Systems Review
    • /
    • v.16 no.3
    • /
    • pp.161-177
    • /
    • 2014
  • Corporate credit rating assessment consists of complicated processes in which various factors describing a company are taken into consideration. Such assessment is known to be very expensive since domain experts should be employed to assess the ratings. As a result, the data-driven corporate credit rating prediction using statistical and artificial intelligence (AI) techniques has received considerable attention from researchers and practitioners. In particular, statistical methods such as multiple discriminant analysis (MDA) and multinomial logistic regression analysis (MLOGIT), and AI methods including case-based reasoning (CBR), artificial neural network (ANN), and multiclass support vector machine (MSVM) have been applied to corporate credit rating.2) Among them, MSVM has recently become popular because of its robustness and high prediction accuracy. In this study, we propose a novel optimized MSVM model, and appy it to corporate credit rating prediction in order to enhance the accuracy. Our model, named 'GAMSVM (Genetic Algorithm-optimized Multiclass Support Vector Machine),' is designed to simultaneously optimize the kernel parameters and the feature subset selection. Prior studies like Lorena and de Carvalho (2008), and Chatterjee (2013) show that proper kernel parameters may improve the performance of MSVMs. Also, the results from the studies such as Shieh and Yang (2008) and Chatterjee (2013) imply that appropriate feature selection may lead to higher prediction accuracy. Based on these prior studies, we propose to apply GAMSVM to corporate credit rating prediction. As a tool for optimizing the kernel parameters and the feature subset selection, we suggest genetic algorithm (GA). GA is known as an efficient and effective search method that attempts to simulate the biological evolution phenomenon. By applying genetic operations such as selection, crossover, and mutation, it is designed to gradually improve the search results. Especially, mutation operator prevents GA from falling into the local optima, thus we can find the globally optimal or near-optimal solution using it. GA has popularly been applied to search optimal parameters or feature subset selections of AI techniques including MSVM. With these reasons, we also adopt GA as an optimization tool. To empirically validate the usefulness of GAMSVM, we applied it to a real-world case of credit rating in Korea. Our application is in bond rating, which is the most frequently studied area of credit rating for specific debt issues or other financial obligations. The experimental dataset was collected from a large credit rating company in South Korea. It contained 39 financial ratios of 1,295 companies in the manufacturing industry, and their credit ratings. Using various statistical methods including the one-way ANOVA and the stepwise MDA, we selected 14 financial ratios as the candidate independent variables. The dependent variable, i.e. credit rating, was labeled as four classes: 1(A1); 2(A2); 3(A3); 4(B and C). 80 percent of total data for each class was used for training, and remaining 20 percent was used for validation. And, to overcome small sample size, we applied five-fold cross validation to our dataset. In order to examine the competitiveness of the proposed model, we also experimented several comparative models including MDA, MLOGIT, CBR, ANN and MSVM. In case of MSVM, we adopted One-Against-One (OAO) and DAGSVM (Directed Acyclic Graph SVM) approaches because they are known to be the most accurate approaches among various MSVM approaches. GAMSVM was implemented using LIBSVM-an open-source software, and Evolver 5.5-a commercial software enables GA. Other comparative models were experimented using various statistical and AI packages such as SPSS for Windows, Neuroshell, and Microsoft Excel VBA (Visual Basic for Applications). Experimental results showed that the proposed model-GAMSVM-outperformed all the competitive models. In addition, the model was found to use less independent variables, but to show higher accuracy. In our experiments, five variables such as X7 (total debt), X9 (sales per employee), X13 (years after founded), X15 (accumulated earning to total asset), and X39 (the index related to the cash flows from operating activity) were found to be the most important factors in predicting the corporate credit ratings. However, the values of the finally selected kernel parameters were found to be almost same among the data subsets. To examine whether the predictive performance of GAMSVM was significantly greater than those of other models, we used the McNemar test. As a result, we found that GAMSVM was better than MDA, MLOGIT, CBR, and ANN at the 1% significance level, and better than OAO and DAGSVM at the 5% significance level.

Ensemble of Nested Dichotomies for Activity Recognition Using Accelerometer Data on Smartphone (Ensemble of Nested Dichotomies 기법을 이용한 스마트폰 가속도 센서 데이터 기반의 동작 인지)

  • Ha, Eu Tteum;Kim, Jeongmin;Ryu, Kwang Ryel
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.4
    • /
    • pp.123-132
    • /
    • 2013
  • As the smartphones are equipped with various sensors such as the accelerometer, GPS, gravity sensor, gyros, ambient light sensor, proximity sensor, and so on, there have been many research works on making use of these sensors to create valuable applications. Human activity recognition is one such application that is motivated by various welfare applications such as the support for the elderly, measurement of calorie consumption, analysis of lifestyles, analysis of exercise patterns, and so on. One of the challenges faced when using the smartphone sensors for activity recognition is that the number of sensors used should be minimized to save the battery power. When the number of sensors used are restricted, it is difficult to realize a highly accurate activity recognizer or a classifier because it is hard to distinguish between subtly different activities relying on only limited information. The difficulty gets especially severe when the number of different activity classes to be distinguished is very large. In this paper, we show that a fairly accurate classifier can be built that can distinguish ten different activities by using only a single sensor data, i.e., the smartphone accelerometer data. The approach that we take to dealing with this ten-class problem is to use the ensemble of nested dichotomy (END) method that transforms a multi-class problem into multiple two-class problems. END builds a committee of binary classifiers in a nested fashion using a binary tree. At the root of the binary tree, the set of all the classes are split into two subsets of classes by using a binary classifier. At a child node of the tree, a subset of classes is again split into two smaller subsets by using another binary classifier. Continuing in this way, we can obtain a binary tree where each leaf node contains a single class. This binary tree can be viewed as a nested dichotomy that can make multi-class predictions. Depending on how a set of classes are split into two subsets at each node, the final tree that we obtain can be different. Since there can be some classes that are correlated, a particular tree may perform better than the others. However, we can hardly identify the best tree without deep domain knowledge. The END method copes with this problem by building multiple dichotomy trees randomly during learning, and then combining the predictions made by each tree during classification. The END method is generally known to perform well even when the base learner is unable to model complex decision boundaries As the base classifier at each node of the dichotomy, we have used another ensemble classifier called the random forest. A random forest is built by repeatedly generating a decision tree each time with a different random subset of features using a bootstrap sample. By combining bagging with random feature subset selection, a random forest enjoys the advantage of having more diverse ensemble members than a simple bagging. As an overall result, our ensemble of nested dichotomy can actually be seen as a committee of committees of decision trees that can deal with a multi-class problem with high accuracy. The ten classes of activities that we distinguish in this paper are 'Sitting', 'Standing', 'Walking', 'Running', 'Walking Uphill', 'Walking Downhill', 'Running Uphill', 'Running Downhill', 'Falling', and 'Hobbling'. The features used for classifying these activities include not only the magnitude of acceleration vector at each time point but also the maximum, the minimum, and the standard deviation of vector magnitude within a time window of the last 2 seconds, etc. For experiments to compare the performance of END with those of other methods, the accelerometer data has been collected at every 0.1 second for 2 minutes for each activity from 5 volunteers. Among these 5,900 ($=5{\times}(60{\times}2-2)/0.1$) data collected for each activity (the data for the first 2 seconds are trashed because they do not have time window data), 4,700 have been used for training and the rest for testing. Although 'Walking Uphill' is often confused with some other similar activities, END has been found to classify all of the ten activities with a fairly high accuracy of 98.4%. On the other hand, the accuracies achieved by a decision tree, a k-nearest neighbor, and a one-versus-rest support vector machine have been observed as 97.6%, 96.5%, and 97.6%, respectively.