Search | Korea Science

Application of Support Vector Regression for Improving the Performance of the Emotion Prediction Model (감정예측모형의 성과개선을 위한 Support Vector Regression 응용)

Kim, Seongjin;Ryoo, Eunchung;Jung, Min Kyu;Kim, Jae Kyeong;Ahn, Hyunchul
- Journal of Intelligence and Information Systems
- /
- v.18 no.3
- /
- pp.185-202
- /
- 2012
.Since the value of information has been realized in the information society, the usage and collection of information has become important. A facial expression that contains thousands of information as an artistic painting can be described in thousands of words. Followed by the idea, there has recently been a number of attempts to provide customers and companies with an intelligent service, which enables the perception of human emotions through one's facial expressions. For example, MIT Media Lab, the leading organization in this research area, has developed the human emotion prediction model, and has applied their studies to the commercial business. In the academic area, a number of the conventional methods such as Multiple Regression Analysis (MRA) or Artificial Neural Networks (ANN) have been applied to predict human emotion in prior studies. However, MRA is generally criticized because of its low prediction accuracy. This is inevitable since MRA can only explain the linear relationship between the dependent variables and the independent variable. To mitigate the limitations of MRA, some studies like Jung and Kim (2012) have used ANN as the alternative, and they reported that ANN generated more accurate prediction than the statistical methods like MRA. However, it has also been criticized due to over fitting and the difficulty of the network design (e.g. setting the number of the layers and the number of the nodes in the hidden layers). Under this background, we propose a novel model using Support Vector Regression (SVR) in order to increase the prediction accuracy. SVR is an extensive version of Support Vector Machine (SVM) designated to solve the regression problems. The model produced by SVR only depends on a subset of the training data, because the cost function for building the model ignores any training data that is close (within a threshold ${\varepsilon}$) to the model prediction. Using SVR, we tried to build a model that can measure the level of arousal and valence from the facial features. To validate the usefulness of the proposed model, we collected the data of facial reactions when providing appropriate visual stimulating contents, and extracted the features from the data. Next, the steps of the preprocessing were taken to choose statistically significant variables. In total, 297 cases were used for the experiment. As the comparative models, we also applied MRA and ANN to the same data set. For SVR, we adopted '${\varepsilon}$-insensitive loss function', and 'grid search' technique to find the optimal values of the parameters like C, d, ${\sigma}^2$, and ${\varepsilon}$. In the case of ANN, we adopted a standard three-layer backpropagation network, which has a single hidden layer. The learning rate and momentum rate of ANN were set to 10%, and we used sigmoid function as the transfer function of hidden and output nodes. We performed the experiments repeatedly by varying the number of nodes in the hidden layer to n/2, n, 3n/2, and 2n, where n is the number of the input variables. The stopping condition for ANN was set to 50,000 learning events. And, we used MAE (Mean Absolute Error) as the measure for performance comparison. From the experiment, we found that SVR achieved the highest prediction accuracy for the hold-out data set compared to MRA and ANN. Regardless of the target variables (the level of arousal, or the level of positive / negative valence), SVR showed the best performance for the hold-out data set. ANN also outperformed MRA, however, it showed the considerably lower prediction accuracy than SVR for both target variables. The findings of our research are expected to be useful to the researchers or practitioners who are willing to build the models for recognizing human emotions.
https://doi.org/10.13088/jiis.2012.18.3.185 인용 PDF KSCI

Prediction of Air Temperature and Relative Humidity in Greenhouse via a Multilayer Perceptron Using Environmental Factors (환경요인을 이용한 다층 퍼셉트론 기반 온실 내 기온 및 상대습도 예측)

Choi, Hayoung;Moon, Taewon;Jung, Dae Ho;Son, Jung Eek
- Journal of Bio-Environment Control
- /
- v.28 no.2
- /
- pp.95-103
- /
- 2019
Temperature and relative humidity are important factors in crop cultivation and should be properly controlled for improving crop yield and quality. In order to control the environment accurately, we need to predict how the environment will change in the future. The objective of this study was to predict air temperature and relative humidity at a future time by using a multilayer perceptron (MLP). The data required to train MLP was collected every 10 min from Oct. 1, 2016 to Feb. 28, 2018 in an eight-span greenhouse ($1,032m^2$) cultivating mango (Mangifera indica cv. Irwin). The inputs for the MLP were greenhouse inside and outside environment data, and set-up and operating values of environment control devices. By using these data, the MLP was trained to predict the air temperature and relative humidity at a future time of 10 to 120 min. Considering typical four seasons in Korea, three-day data of the each season were compared as test data. The MLP was optimized with four hidden layers and 128 nodes for air temperature ($R^2=0.988$) and with four hidden layers and 64 nodes for relative humidity ($R^2=0.990$). Due to the characteristics of MLP, the accuracy decreased as the prediction time became longer. However, air temperature and relative humidity were properly predicted regardless of the environmental changes varied from season to season. For specific data such as spray irrigation, however, the numbers of trained data were too small, resulting in poor predictive accuracy. In this study, air temperature and relative humidity were appropriately predicted through optimization of MLP, but were limited to the experimental greenhouse. Therefore, it is necessary to collect more data from greenhouses at various places and modify the structure of neural network for generalization.
https://doi.org/10.12791/KSBEC.2019.28.2.95 인용 PDF KSCI

A Study on the Revitalization of Tourism Industry through Big Data Analysis (한국관광 실태조사 빅 데이터 분석을 통한 관광산업 활성화 방안 연구)

Lee, Jungmi;Liu, Meina;Lim, Gyoo Gun
- Journal of Intelligence and Information Systems
- /
- v.24 no.2
- /
- pp.149-169
- /
- 2018
Korea is currently accumulating a large amount of data in public institutions based on the public data open policy and the "Government 3.0". Especially, a lot of data is accumulated in the tourism field. However, the academic discussions utilizing the tourism data are still limited. Moreover, the openness of the data of restaurants, hotels, and online tourism information, and how to use SNS Big Data in tourism are still limited. Therefore, utilization through tourism big data analysis is still low. In this paper, we tried to analyze influencing factors on foreign tourists' satisfaction in Korea through numerical data using data mining technique and R programming technique. In this study, we tried to find ways to revitalize the tourism industry by analyzing about 36,000 big data of the "Survey on the actual situation of foreign tourists from 2013 to 2015" surveyed by the Korea Culture & Tourism Research Institute. To do this, we analyzed the factors that have high influence on the 'Satisfaction', 'Revisit intention', and 'Recommendation' variables of foreign tourists. Furthermore, we analyzed the practical influences of the variables that are mentioned above. As a procedure of this study, we first integrated survey data of foreign tourists conducted by Korea Culture & Tourism Research Institute, which is stored in the tourist information system from 2013 to 2015, and eliminate unnecessary variables that are inconsistent with the research purpose among the integrated data. Some variables were modified to improve the accuracy of the analysis. And we analyzed the factors affecting the dependent variables by using data-mining methods: decision tree(C5.0, CART, CHAID, QUEST), artificial neural network, and logistic regression analysis of SPSS IBM Modeler 16.0. The seven variables that have the greatest effect on each dependent variable were derived. As a result of data analysis, it was found that seven major variables influencing 'overall satisfaction' were sightseeing spot attraction, food satisfaction, accommodation satisfaction, traffic satisfaction, guide service satisfaction, number of visiting places, and country. Variables that had a great influence appeared food satisfaction and sightseeing spot attraction. The seven variables that had the greatest influence on 'revisit intention' were the country, travel motivation, activity, food satisfaction, best activity, guide service satisfaction and sightseeing spot attraction. The most influential variables were food satisfaction and travel motivation for Korean style. Lastly, the seven variables that have the greatest influence on the 'recommendation intention' were the country, sightseeing spot attraction, number of visiting places, food satisfaction, activity, tour guide service satisfaction and cost. And then the variables that had the greatest influence were the country, sightseeing spot attraction, and food satisfaction. In addition, in order to grasp the influence of each independent variables more deeply, we used R programming to identify the influence of independent variables. As a result, it was found that the food satisfaction and sightseeing spot attraction were higher than other variables in overall satisfaction and had a greater effect than other influential variables. Revisit intention had a higher ${\beta}$ value in the travel motive as the purpose of Korean Wave than other variables. It will be necessary to have a policy that will lead to a substantial revisit of tourists by enhancing tourist attractions for the purpose of Korean Wave. Lastly, the recommendation had the same result of satisfaction as the sightseeing spot attraction and food satisfaction have higher ${\beta}$ value than other variables. From this analysis, we found that 'food satisfaction' and 'sightseeing spot attraction' variables were the common factors to influence three dependent variables that are mentioned above('Overall satisfaction', 'Revisit intention' and 'Recommendation'), and that those factors affected the satisfaction of travel in Korea significantly. The purpose of this study is to examine how to activate foreign tourists in Korea through big data analysis. It is expected to be used as basic data for analyzing tourism data and establishing effective tourism policy. It is expected to be used as a material to establish an activation plan that can contribute to tourism development in Korea in the future.
https://doi.org/10.13088/jiis.2018.24.2.149 인용 PDF KSCI

The Prediction of Export Credit Guarantee Accident using Machine Learning (기계학습을 이용한 수출신용보증 사고예측)

Cho, Jaeyoung;Joo, Jihwan;Han, Ingoo
- Journal of Intelligence and Information Systems
- /
- v.27 no.1
- /
- pp.83-102
- /
- 2021
The government recently announced various policies for developing big-data and artificial intelligence fields to provide a great opportunity to the public with respect to disclosure of high-quality data within public institutions. KSURE(Korea Trade Insurance Corporation) is a major public institution for financial policy in Korea, and thus the company is strongly committed to backing export companies with various systems. Nevertheless, there are still fewer cases of realized business model based on big-data analyses. In this situation, this paper aims to develop a new business model which can be applied to an ex-ante prediction for the likelihood of the insurance accident of credit guarantee. We utilize internal data from KSURE which supports export companies in Korea and apply machine learning models. Then, we conduct performance comparison among the predictive models including Logistic Regression, Random Forest, XGBoost, LightGBM, and DNN(Deep Neural Network). For decades, many researchers have tried to find better models which can help to predict bankruptcy since the ex-ante prediction is crucial for corporate managers, investors, creditors, and other stakeholders. The development of the prediction for financial distress or bankruptcy was originated from Smith(1930), Fitzpatrick(1932), or Merwin(1942). One of the most famous models is the Altman's Z-score model(Altman, 1968) which was based on the multiple discriminant analysis. This model is widely used in both research and practice by this time. The author suggests the score model that utilizes five key financial ratios to predict the probability of bankruptcy in the next two years. Ohlson(1980) introduces logit model to complement some limitations of previous models. Furthermore, Elmer and Borowski(1988) develop and examine a rule-based, automated system which conducts the financial analysis of savings and loans. Since the 1980s, researchers in Korea have started to examine analyses on the prediction of financial distress or bankruptcy. Kim(1987) analyzes financial ratios and develops the prediction model. Also, Han et al.(1995, 1996, 1997, 2003, 2005, 2006) construct the prediction model using various techniques including artificial neural network. Yang(1996) introduces multiple discriminant analysis and logit model. Besides, Kim and Kim(2001) utilize artificial neural network techniques for ex-ante prediction of insolvent enterprises. After that, many scholars have been trying to predict financial distress or bankruptcy more precisely based on diverse models such as Random Forest or SVM. One major distinction of our research from the previous research is that we focus on examining the predicted probability of default for each sample case, not only on investigating the classification accuracy of each model for the entire sample. Most predictive models in this paper show that the level of the accuracy of classification is about 70% based on the entire sample. To be specific, LightGBM model shows the highest accuracy of 71.1% and Logit model indicates the lowest accuracy of 69%. However, we confirm that there are open to multiple interpretations. In the context of the business, we have to put more emphasis on efforts to minimize type 2 error which causes more harmful operating losses for the guaranty company. Thus, we also compare the classification accuracy by splitting predicted probability of the default into ten equal intervals. When we examine the classification accuracy for each interval, Logit model has the highest accuracy of 100% for 0~10% of the predicted probability of the default, however, Logit model has a relatively lower accuracy of 61.5% for 90~100% of the predicted probability of the default. On the other hand, Random Forest, XGBoost, LightGBM, and DNN indicate more desirable results since they indicate a higher level of accuracy for both 0~10% and 90~100% of the predicted probability of the default but have a lower level of accuracy around 50% of the predicted probability of the default. When it comes to the distribution of samples for each predicted probability of the default, both LightGBM and XGBoost models have a relatively large number of samples for both 0~10% and 90~100% of the predicted probability of the default. Although Random Forest model has an advantage with regard to the perspective of classification accuracy with small number of cases, LightGBM or XGBoost could become a more desirable model since they classify large number of cases into the two extreme intervals of the predicted probability of the default, even allowing for their relatively low classification accuracy. Considering the importance of type 2 error and total prediction accuracy, XGBoost and DNN show superior performance. Next, Random Forest and LightGBM show good results, but logistic regression shows the worst performance. However, each predictive model has a comparative advantage in terms of various evaluation standards. For instance, Random Forest model shows almost 100% accuracy for samples which are expected to have a high level of the probability of default. Collectively, we can construct more comprehensive ensemble models which contain multiple classification machine learning models and conduct majority voting for maximizing its overall performance.
https://doi.org/10.13088/jiis.2021.27.1.083 인용 PDF KSCI

A Study on Optimum Ventilation System in the Deep Coal Mine (심부 석탄광산의 환기시스템 최적화 연구)

Kwon, Joon Uk;Kim, Sun Myung;Kim, Yun Kwang;Jang, Yun Ho
- Tunnel and Underground Space
- /
- v.25 no.2
- /
- pp.186-198
- /
- 2015
This paper aims for the ultimate goal to optimize the work place environment through assuring the optimal required ventilation rate based on the analysis of the airflow. The working environment is deteriorated due to a rise in temperature of a coal mine caused by increase of its depth and carriage tunnels. To improve the environment, the ventilation evaluation on J coal mine is carried out and the effect of a length of the tunnel on the temperature to enhance the ventilation efficiency in the subsurface is numerically analyzed. The analysis shows that J coal mine needs $17,831m^3/min$ for in-flow ventilation rate but the total input air flowrate is $16,474m^3/min$, $1,357m^3/min$ of in-flow ventilation rate shortage. The temperatures were predicted on the two developed models of J mine, and VnetPC that is a numerical program for the flowrate prediction. The result of the simulation notices the temperature in the case of developing all 4 areas of -425ML as a first model is predicted 29.30 at the main gangway 9X of C section and in the case of developing 3 areas of -425ML excepting A area as a second model, it is predicted 27.45 Celsius degrees.
https://doi.org/10.7474/TUS.2015.25.2.186 인용 PDF KSCI

Shape Optimization of Three-Way Reversing Valve for Cavitation Reduction (3 방향 절환밸브의 공동현상 저감을 위한 형상최적화)

Lee, Myeong Gon;Lim, Cha Suk;Han, Seung Ho
- Transactions of the Korean Society of Mechanical Engineers A
- /
- v.39 no.11
- /
- pp.1123-1129
- /
- 2015
A pair of two-way valves typically is used in automotive washing machines, where the water flow direction is frequently reversed and highly pressurized clean water is sprayed to remove the oil and dirt remaining on machined engine and transmission blocks. Although this valve system has been widely used because of its competitive price, its application is sometimes restricted by surging effects, such as pressure ripples occurring in rapid changes in water flow caused by inaccurate valve control. As an alternative, one three-way reversing valve can replace the valve system because it provides rapid and accurate changes to the water flow direction without any precise control device. However, a cavitation effect occurs because of the complicated bottom plug shape of the valve. In this study, the cavitation index and percent of cavitation (POC) were introduced to numerically evaluate fluid flows via computational fluid dynamics (CFD) analysis. To reduce the cavitation effect generated by the bottom plug, the optimal shape design was carried out through a parametric study, in which a simple computer-aided engineering (CAE) model was applied to avoid time-consuming CFD analysis and difficulties in achieving convergence. The optimal shape design process using full factorial design of experiments (DOEs) and an artificial neural network meta-model yielded the optimal waist and tail length of the bottom plug with a POC value of less than 30%, which meets the requirement of no cavitation occurrence. The optimal waist length, tail length and POC value were found to 6.42 mm, 6.96 mm and 27%, respectively.
https://doi.org/10.3795/KSME-A.2015.39.11.1123 인용 PDF KSCI

Vulnerability Assessment of the Climate Change on the Water Environment of Juam Reservoir (기후변화에 따른 주암호 수환경 취약성 평가)

Yoon, Sung Wan;Chung, Se Woong;Park, Hyung Seok
- Proceedings of the Korea Water Resources Association Conference
- /
- 2015.05a
- /
- pp.519-519
- /
- 2015
2007년 발간된 IPCC의 4차 평가보고서에서 자연재해, 환경, 해양, 농업, 생태계, 보건 등 다양한 부분에 미치는 기후변화의 영향에 대한 과학적 근거들이 제시되면서 기후변화는 현세기 범지구적인 화두로 대두되고 있다. 또한, 기후변화에 의한 지구 온난화는 대규모의 수문순환 과정에서의 변화들과 연관되어 담수자원은 기후변화에 대단히 취약하며 미래로 갈수록 악영향을 받을 것으로 6차 기술보고서에서 제시하고 있다. 특히 우리나라는 지구온난화가 전 지구적인 평균보다 급속하게 진행될 가능성이 높기 때문에 기후변화에 대한 담수자원 취약성이 더욱 클 것으로 예상된다. 따라서 지표수에 용수의존도가 높은 우리나라의 댐 저수지를 대상으로 기후변화에 따른 수환경 변화의 정확한 분석과 취약성 평가는 필수적이다. 본 연구에서는 SRES A1B 시나리오를 적용하여 기후변화가 주암호 저수지의 수환경 변화에 미치는 영향을 분석하였다. 지역스케일의 미래 기후시나리오 생산을 위해 인공신경망(Artificial Neural Network.,ANN)기법을 적용하여 예측인자(강우, 상대습도, 최고온도, 최저온도)에 대해 강우-유출모형에 적용이 가능한 지역스케일로 통계적 상세화를 수행하였으며, 이를 유역모델에 적용하여 저수지 유입부의 유출량 및 부하량을 예측하였다. 유역 모델의 결과를 토대로 저수지 운영모델에 저수지 유입부의 유출량을 적용하여 미래 기간의 방류량을 산정하였으며, 최종적으로 저수지 모델에 유입량, 유입부하량 및 방류량을 적용하여 저수지 내 오염 및 영양물질 순환 및 분포 예측을 통해서 기후변화가 저수지 수환경에 미치는 영향을 평가하였다. 기후변화 시나리오에 따른 상세기 후전망을 위해서 기후인자의 미래분석 기간은 (I)단계 구간(2011~2040년), (II)단계 구간(2041~2070년), (III) 단계 구간(2071~2100년)의 3개 구간으로 설정하여 수행하였으며, Baseline인 1991~2010년까지의 실측값과 모의 값을 비교하여 검증하였다. 강우량의 경우 Baseline 대비 미래로 갈수록 증가하는 것으로 전망되었으며, 2011년 대비 2100년에서 연강수량 6.4% 증가한 반면, 일최대강수량이 7.0% 증가하는 것으로 나타나 미래로 갈수록 집중호우의 발생가능성이 커질 것으로 예측되었다. 유역의 수문 수질변화 전망도 강수량 증가의 영향으로 주암댐으로 유입하는 총 유량이 Baseline 대비 증가 하였으며, 유사량 및 오염부하량도 증가하는 것으로 나타났다. 저수지 수환경 변화 예측결과 유입량이 증가함에 따라서 연평균 체류시간이 감소하였으며, 기온 및 유입수온 상승의 영향으로 (I)단계 구간대비 미래로 갈수록 상층 및 심층의 수온이 상승하는 것으로 나타났다. 연중 수온성층기간 역시 증가하는 것으로 나타났으며, 남조류는 (I)단계 구간 대비 (III)단계 구간으로 갈수록 출현시기가 빨라지며 농도 역시 증가하였다. 또한 풍수년, 평수년에 비해 갈수년에 남조류의 연평균농도 상승폭과 최고농도가 크게 나타나 미래로 갈수록 댐 유입량이 적은 해에 남조류로 인한 피해 발생 가능성이 높아질 것으로 예상된다.
PDF

A Study on Deep Learning-based Pedestrian Detection and Alarm System (딥러닝 기반의 보행자 탐지 및 경보 시스템 연구)

Kim, Jeong-Hwan;Shin, Yong-Hyeon
- The Journal of The Korea Institute of Intelligent Transport Systems
- /
- v.18 no.4
- /
- pp.58-70
- /
- 2019
In the case of a pedestrian traffic accident, it has a large-scale danger directly connected by a fatal accident at the time of the accident. The domestic ITS is not used for intelligent risk classification because it is used only for collecting traffic information despite of the construction of good quality traffic infrastructure. The CNN based pedestrian detection classification model, which is a major component of the proposed system, is implemented on an embedded system assuming that it is installed and operated in a restricted environment. A new model was created by improving YOLO's artificial neural network, and the real-time detection speed result of average accuracy 86.29% and 21.1 fps was shown with 20,000 iterative learning. And we constructed a protocol interworking scenario and implementation of a system that can connect with the ITS. If a pedestrian accident prevention system connected with ITS will be implemented through this study, it will help to reduce the cost of constructing a new infrastructure and reduce the incidence of traffic accidents for pedestrians, and we can also reduce the cost for system monitoring.
https://doi.org/10.12815/kits.2019.18.4.58 인용 PDF KSCI

Predicting Corporate Bankruptcy using Simulated Annealing-based Random Fores (시뮬레이티드 어니일링 기반의 랜덤 포레스트를 이용한 기업부도예측)

Park, Hoyeon;Kim, Kyoung-jae
- Journal of Intelligence and Information Systems
- /
- v.24 no.4
- /
- pp.155-170
- /
- 2018
Predicting a company's financial bankruptcy is traditionally one of the most crucial forecasting problems in business analytics. In previous studies, prediction models have been proposed by applying or combining statistical and machine learning-based techniques. In this paper, we propose a novel intelligent prediction model based on the simulated annealing which is one of the well-known optimization techniques. The simulated annealing is known to have comparable optimization performance to the genetic algorithms. Nevertheless, since there has been little research on the prediction and classification of business decision-making problems using the simulated annealing, it is meaningful to confirm the usefulness of the proposed model in business analytics. In this study, we use the combined model of simulated annealing and machine learning to select the input features of the bankruptcy prediction model. Typical types of combining optimization and machine learning techniques are feature selection, feature weighting, and instance selection. This study proposes a combining model for feature selection, which has been studied the most. In order to confirm the superiority of the proposed model in this study, we apply the real-world financial data of the Korean companies and analyze the results. The results show that the predictive accuracy of the proposed model is better than that of the naïve model. Notably, the performance is significantly improved as compared with the traditional decision tree, random forests, artificial neural network, SVM, and logistic regression analysis.
https://doi.org/10.13088/jiis.2018.24.4.155 인용 PDF KSCI HTML

Forecasting daily peak load by time series model with temperature and special days effect (기온과 특수일 효과를 고려하여 시계열 모형을 활용한 일별 최대 전력 수요 예측 연구)

Lee, Jin Young;Kim, Sahm
- The Korean Journal of Applied Statistics
- /
- v.32 no.1
- /
- pp.161-171
- /
- 2019
Varied methods have been researched continuously because the past as the daily maximum electricity demand expectation has been a crucial task in the nation's electrical supply and demand. Forecasting the daily peak electricity demand accurately can prepare the daily operating program about the generating unit, and contribute the reduction of the consumption of the unnecessary energy source through efficient operating facilities. This method also has the advantage that can prepare anticipatively in the reserve margin reduced problem due to the power consumption superabundant by heating and air conditioning that can estimate the daily peak load. This paper researched a model that can forecast the next day's daily peak load when considering the influence of temperature and weekday, weekend, and holidays in the Seasonal ARIMA, TBATS, Seasonal Reg-ARIMA, and NNETAR model. The results of the forecasting performance test on the model of this paper for a Seasonal Reg-ARIMA model and NNETAR model that can consider the day of the week, and temperature showed better forecasting performance than a model that cannot consider these factors. The forecasting performance of the NNETAR model that utilized the artificial neural network was most outstanding.
https://doi.org/10.5351/KJAS.2019.32.1.161 인용 PDF KSCI HTML

Search Result 1,176, Processing Time 0.03 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)