• 제목/요약/키워드: basic variable

검색결과 1,054건 처리시간 0.026초

A Study on the Revitalization of Tourism Industry through Big Data Analysis (한국관광 실태조사 빅 데이터 분석을 통한 관광산업 활성화 방안 연구)

  • Lee, Jungmi;Liu, Meina;Lim, Gyoo Gun
    • Journal of Intelligence and Information Systems
    • /
    • 제24권2호
    • /
    • pp.149-169
    • /
    • 2018
  • Korea is currently accumulating a large amount of data in public institutions based on the public data open policy and the "Government 3.0". Especially, a lot of data is accumulated in the tourism field. However, the academic discussions utilizing the tourism data are still limited. Moreover, the openness of the data of restaurants, hotels, and online tourism information, and how to use SNS Big Data in tourism are still limited. Therefore, utilization through tourism big data analysis is still low. In this paper, we tried to analyze influencing factors on foreign tourists' satisfaction in Korea through numerical data using data mining technique and R programming technique. In this study, we tried to find ways to revitalize the tourism industry by analyzing about 36,000 big data of the "Survey on the actual situation of foreign tourists from 2013 to 2015" surveyed by the Korea Culture & Tourism Research Institute. To do this, we analyzed the factors that have high influence on the 'Satisfaction', 'Revisit intention', and 'Recommendation' variables of foreign tourists. Furthermore, we analyzed the practical influences of the variables that are mentioned above. As a procedure of this study, we first integrated survey data of foreign tourists conducted by Korea Culture & Tourism Research Institute, which is stored in the tourist information system from 2013 to 2015, and eliminate unnecessary variables that are inconsistent with the research purpose among the integrated data. Some variables were modified to improve the accuracy of the analysis. And we analyzed the factors affecting the dependent variables by using data-mining methods: decision tree(C5.0, CART, CHAID, QUEST), artificial neural network, and logistic regression analysis of SPSS IBM Modeler 16.0. The seven variables that have the greatest effect on each dependent variable were derived. As a result of data analysis, it was found that seven major variables influencing 'overall satisfaction' were sightseeing spot attraction, food satisfaction, accommodation satisfaction, traffic satisfaction, guide service satisfaction, number of visiting places, and country. Variables that had a great influence appeared food satisfaction and sightseeing spot attraction. The seven variables that had the greatest influence on 'revisit intention' were the country, travel motivation, activity, food satisfaction, best activity, guide service satisfaction and sightseeing spot attraction. The most influential variables were food satisfaction and travel motivation for Korean style. Lastly, the seven variables that have the greatest influence on the 'recommendation intention' were the country, sightseeing spot attraction, number of visiting places, food satisfaction, activity, tour guide service satisfaction and cost. And then the variables that had the greatest influence were the country, sightseeing spot attraction, and food satisfaction. In addition, in order to grasp the influence of each independent variables more deeply, we used R programming to identify the influence of independent variables. As a result, it was found that the food satisfaction and sightseeing spot attraction were higher than other variables in overall satisfaction and had a greater effect than other influential variables. Revisit intention had a higher ${\beta}$ value in the travel motive as the purpose of Korean Wave than other variables. It will be necessary to have a policy that will lead to a substantial revisit of tourists by enhancing tourist attractions for the purpose of Korean Wave. Lastly, the recommendation had the same result of satisfaction as the sightseeing spot attraction and food satisfaction have higher ${\beta}$ value than other variables. From this analysis, we found that 'food satisfaction' and 'sightseeing spot attraction' variables were the common factors to influence three dependent variables that are mentioned above('Overall satisfaction', 'Revisit intention' and 'Recommendation'), and that those factors affected the satisfaction of travel in Korea significantly. The purpose of this study is to examine how to activate foreign tourists in Korea through big data analysis. It is expected to be used as basic data for analyzing tourism data and establishing effective tourism policy. It is expected to be used as a material to establish an activation plan that can contribute to tourism development in Korea in the future.

Optimization of Multiclass Support Vector Machine using Genetic Algorithm: Application to the Prediction of Corporate Credit Rating (유전자 알고리즘을 이용한 다분류 SVM의 최적화: 기업신용등급 예측에의 응용)

  • Ahn, Hyunchul
    • Information Systems Review
    • /
    • 제16권3호
    • /
    • pp.161-177
    • /
    • 2014
  • Corporate credit rating assessment consists of complicated processes in which various factors describing a company are taken into consideration. Such assessment is known to be very expensive since domain experts should be employed to assess the ratings. As a result, the data-driven corporate credit rating prediction using statistical and artificial intelligence (AI) techniques has received considerable attention from researchers and practitioners. In particular, statistical methods such as multiple discriminant analysis (MDA) and multinomial logistic regression analysis (MLOGIT), and AI methods including case-based reasoning (CBR), artificial neural network (ANN), and multiclass support vector machine (MSVM) have been applied to corporate credit rating.2) Among them, MSVM has recently become popular because of its robustness and high prediction accuracy. In this study, we propose a novel optimized MSVM model, and appy it to corporate credit rating prediction in order to enhance the accuracy. Our model, named 'GAMSVM (Genetic Algorithm-optimized Multiclass Support Vector Machine),' is designed to simultaneously optimize the kernel parameters and the feature subset selection. Prior studies like Lorena and de Carvalho (2008), and Chatterjee (2013) show that proper kernel parameters may improve the performance of MSVMs. Also, the results from the studies such as Shieh and Yang (2008) and Chatterjee (2013) imply that appropriate feature selection may lead to higher prediction accuracy. Based on these prior studies, we propose to apply GAMSVM to corporate credit rating prediction. As a tool for optimizing the kernel parameters and the feature subset selection, we suggest genetic algorithm (GA). GA is known as an efficient and effective search method that attempts to simulate the biological evolution phenomenon. By applying genetic operations such as selection, crossover, and mutation, it is designed to gradually improve the search results. Especially, mutation operator prevents GA from falling into the local optima, thus we can find the globally optimal or near-optimal solution using it. GA has popularly been applied to search optimal parameters or feature subset selections of AI techniques including MSVM. With these reasons, we also adopt GA as an optimization tool. To empirically validate the usefulness of GAMSVM, we applied it to a real-world case of credit rating in Korea. Our application is in bond rating, which is the most frequently studied area of credit rating for specific debt issues or other financial obligations. The experimental dataset was collected from a large credit rating company in South Korea. It contained 39 financial ratios of 1,295 companies in the manufacturing industry, and their credit ratings. Using various statistical methods including the one-way ANOVA and the stepwise MDA, we selected 14 financial ratios as the candidate independent variables. The dependent variable, i.e. credit rating, was labeled as four classes: 1(A1); 2(A2); 3(A3); 4(B and C). 80 percent of total data for each class was used for training, and remaining 20 percent was used for validation. And, to overcome small sample size, we applied five-fold cross validation to our dataset. In order to examine the competitiveness of the proposed model, we also experimented several comparative models including MDA, MLOGIT, CBR, ANN and MSVM. In case of MSVM, we adopted One-Against-One (OAO) and DAGSVM (Directed Acyclic Graph SVM) approaches because they are known to be the most accurate approaches among various MSVM approaches. GAMSVM was implemented using LIBSVM-an open-source software, and Evolver 5.5-a commercial software enables GA. Other comparative models were experimented using various statistical and AI packages such as SPSS for Windows, Neuroshell, and Microsoft Excel VBA (Visual Basic for Applications). Experimental results showed that the proposed model-GAMSVM-outperformed all the competitive models. In addition, the model was found to use less independent variables, but to show higher accuracy. In our experiments, five variables such as X7 (total debt), X9 (sales per employee), X13 (years after founded), X15 (accumulated earning to total asset), and X39 (the index related to the cash flows from operating activity) were found to be the most important factors in predicting the corporate credit ratings. However, the values of the finally selected kernel parameters were found to be almost same among the data subsets. To examine whether the predictive performance of GAMSVM was significantly greater than those of other models, we used the McNemar test. As a result, we found that GAMSVM was better than MDA, MLOGIT, CBR, and ANN at the 1% significance level, and better than OAO and DAGSVM at the 5% significance level.

Correlation Between the Parameters of Radiosensitivity in Human Cancer Cell Lines (인체 암세포주에서 방사선감수성의 지표간의 상호관계)

  • Park, Woo-Yoon;Kim, Won-Dong;Min, Kyung-Soo
    • Radiation Oncology Journal
    • /
    • 제16권2호
    • /
    • pp.99-106
    • /
    • 1998
  • Purpose : We conducted clonogenic assay using human cancer cell lines (MKN-45, PC-14, Y-79, HeLa) to investigate a correlation between the parameters of radiosensitivity. Materials and Methods : Human cancer cell lines were irradiated with single doses of 1, 2, 3, 5, 7 and 10Gy for the study of radiosensitivity and subrethal damage repair capacity was assessed with two fractions of 5Gy separated with a time interval of 0, 1, 2, 3, 4, 6 and 24 hours. Surviving fraction was assessed with clonogenic assay using $Sperman-H\"{a}rbor$ method and mathematical analysis of survival curves was done with linear-quadratic (LQ) , multitarget-single hit(MS) model and mean inactivation dose$(\v{D})$. Results : Surviving fractions at 2Gy(SF2) were variable among the cell lines, ranged from 0.174 to 0.85 The SF2 of Y-79 was lowest and that of PC-14 was highest(p<0.05, t-test). LQ model analysis showed that the values of $\alpha$ for Y-79, MKN-45, HeLa and PC-14 were 0.603, 0.356, 0.275 and 0.102 respectively, and those of $\beta$ were 0.005, 0.016, 0.025 and 0.027 respectively. Fitting to MS model showed that the values of Do for Y-79. MKN-45, HeLa and PC-14 were 1.59. 1.84. 1.88 and 2.52 respectively, and those of n were 0.97, 1.46, 1.52 and 1 69 respectively. The $\v{D}s$ calculated by Gauss-Laguerre method were 1.62, 2.37, 2,01 and 3.95 respectively So the SF2 was significantly correlated with $\alpha$, Do and $\v{D}$. Their Pearson correlation coefficiencics were -0.953 and 0,993. 0.999 respectively(p<0.05). Sublethal damage repair was saturated around 4 hours and recovery ratios (RR) at plateau phase ranged from 2 to 3.79. But RR was not correlated with SF2, ${\alpha}$, ${\beta}$, Do, $\v{D}$. Conclusion : The intrinsic radiosensitivity was very different among the tested human cell lines. Y-79 was the most sensitive and PC-l4 was the least sensitive. SF2 was well correlated with ${\alpha}$, Do, and $\v{D}$. RR was high for MKN-45 and HeLa but had nothing to do with radiosensitivity parameters. These basic parameters can be used as baseline data for various in vitro radiobiological experiments.

  • PDF

Depth Control and Sweeping Depth Stability of the Midwater Trawl (중층트롤의 깊이바꿈과 소해심도의 안정성)

  • 장지원
    • Journal of the Korean Society of Fisheries and Ocean Technology
    • /
    • 제9권1호
    • /
    • pp.1-18
    • /
    • 1973
  • For regulating the depth of midwater trawl nets towed at the optimum constant speed, the changes in the shape of warps caused by adding a weight on an arbitrary point of the warp of catenary shape is studied. The shape of a warp may be approximated by a catenary. The resultant inferences under this assumption were experimented. Accordingly feasibilities for the application of the result of this study to the midwater trawl nets were also discussed. A series of experiments for basic midwater trawl gear models in water tank and a couple of experiments of a commercial scale gears at sea which involve the properly designed depth control devices having a variable attitude horizontal wing were carried out. The results are summarized as follows: 1. According to the dimension analysis the depth y of a midwater trawl net is introduced by $$y=kLf(\frac{W_r}{R_r},\;\frac{W_o}{R_o},\;\frac{W_n}{R_n})$$) where k is a constant, L the warp length, f the function, and $W_r,\;W_o$ and $W_n$ the apparent weights of warp, otter board and the net, respectively, 2. When a boat is towing a body of apparent weight $W_n$ and its drag $D_n$ by means of a warp whose length L and apparent weight $W_r$ per unit length, the depth y of the body is given by the following equation, provided that the shape of a warp is a catenary and drag of the warp is neglected in comparison with the drag of the body: $$y=\frac{1}{W_r}\{\sqrt{{D_n^2}+{(W_n+W_rL)^2}}-\sqrt{{D_n^2+W_n}^2\}$$ 3. The changes ${\Delta}y$ of the depth of the midwater trawl net caused by changing the warp length or adding a weight ${\Delta}W_n$_n to the net, are given by the following equations: $${\Delta}y{\approx}\frac{W_n+W_{r}L}{\sqrt{D_n^2+(W_n+W_{r}L)^2}}{\Delta}L$$ $${\Delta}y{\approx}\frac{1}{W_r}\{\frac{W_n+W_rL}{\sqrt{D_n^2+(W_n+W_{r}L)^2}}-{\frac{W_n}{\sqrt{D_n^2+W_n^2}}\}{\Delta}W_n$$ 4. A change ${\Delta}y$ of the depth of the midwater trawl net by adding a weight $W_s$ to an arbitrary point of the warp takes an equation of the form $${\Delta}y=\frac{1}{W_r}\{(T_{ur}'-T_{ur})-T_u'-T_u)\}$$ Where $$T_{ur}^l=\sqrt{T_u^2+(W_s+W_{r}L)^2+2T_u(W_s+W_{r}L)sin{\theta}_u$$ $$T_{ur}=\sqrt{T_u^2+(W_{r}L)^2+2T_uW_{r}L\;sin{\theta}_u$$ $$T_{u}^l=\sqrt{T_u^2+W_s^2+2T_uW_{s}\;sin{\theta}_u$$ and $T_u$ represents the tension at the point on the warp, ${\theta}_u$ the angle between the direction of $T_u$ and horizontal axis, $T_u^2$ the tension at that point when a weights $W_s$ adds to the point where $T_u$ is acted on. 5. If otter boards were constructed lighter and adequate weights were added at their bottom to stabilize them, even they were the same shapes as those of bottom trawls, they were definitely applicable to the midwater trawl gears as the result of the experiments. 6. As the results of water tank tests the relationship between net height of H cm velocity of v m/sec, and that between hydrodynamic resistance of R kg and the velocity of a model net as shown in figure 6 are respectively given by $$H=8+\frac{10}{0.4+v}$$ $$R=3+9v^2$$ 7. It was found that the cross-wing type depth control devices were more stable in operation than that of the H-wing type as the results of the experiments at sea. 8. The hydrodynamic resistance of the net gear in midwater trawling is so large, and regarded as nearly the drag, that sweeping depth of the gear was very stable in spite of types of the depth control devices. 9. An area of the horizontal wing of the H-wing type depth control device was $1.2{\times}2.4m^2$. A midwater trawl net of 2 ton hydrodynamic resistance was connected to the devices and towed with the velocity of 2.3 kts. Under these conditions the depth change of about 20m of the trawl net was obtained by controlling an angle or attack of $30^{\circ}$.

  • PDF