• Title/Summary/Keyword: Vector data model

The Analysis on the Relationship between Firms' Exposures to SNS and Stock Prices in Korea (기업의 SNS 노출과 주식 수익률간의 관계 분석)

  • Kim, Taehwan;Jung, Woo-Jin;Lee, Sang-Yong Tom
    • Asia pacific journal of information systems
    • /
    • v.24 no.2
    • /
    • pp.233-253
    • /
    • 2014
  • Can the stock market really be predicted? Stock market prediction has attracted attention from many fields, including business, economics, statistics, and mathematics. Early research on stock market prediction was based on random walk theory (RWT) and the efficient market hypothesis (EMH). According to the EMH, stock markets are driven largely by new information rather than by present and past prices. Because new information is unpredictable, stock prices follow a random walk. Despite these theories, Schumaker [2010] asserted that people keep trying to predict the stock market using artificial intelligence, statistical estimates, and mathematical models. Mathematical approaches include percolation methods, log-periodic oscillations, and wavelet transforms to model future prices. Examples of artificial intelligence approaches that deal with optimization and machine learning are genetic algorithms, support vector machines (SVM), and neural networks. Statistical approaches typically predict the future from past stock market data. Recently, financial engineers have started to predict stock price movement patterns using SNS data. SNS is a place where people's opinions and ideas flow freely and affect others' beliefs. Through word-of-mouth in SNS, people share product usage experiences, subjective feelings, and the accompanying sentiment or mood with others. An increasing number of empirical analyses of sentiment and mood are based on textual collections of public user-generated data on the web. Opinion mining is a domain of data mining that extracts public opinions expressed in SNS. There have been many studies on opinion mining from Web sources such as product reviews, forum posts, and blogs. In relation to this literature, we try to understand the effects of firms' SNS exposure on stock prices in Korea. Similarly to Bollen et al. 
[2011], we empirically analyze the impact of SNS exposure on stock return rates. We use Social Metrics by Daum Soft, an SNS big data analysis company in Korea. Social Metrics provides trends and public opinions from Twitter and blogs using natural language processing and analysis tools. It collects sentences circulated on Twitter in real time, breaks them down into word units, and extracts keywords. In this study, we classify firms' SNS exposures into two groups: positive and negative. To test the correlation and causation between SNS exposure and stock price returns, we first collect the stock prices of 252 firms and the KRX100 index on the Korea Stock Exchange (KRX) from May 25, 2012 to September 1, 2012. We also gather public attitudes (positive, negative) toward these firms from Social Metrics over the same period. We conduct a regression analysis between stock prices and the number of SNS exposures. Having checked the correlation between the two variables, we perform a Granger causality test to determine the direction of causation between them. The results show that the total number of SNS exposures is positively related to stock market returns. The number of positive mentions also has a positive relationship with stock market returns. In contrast, the number of negative mentions has a negative relationship with stock market returns, but this relationship is not statistically significant. This means that the impact of positive mentions is statistically larger than that of negative mentions. We also investigate whether the impacts are moderated by industry type and firm size. We find that the impact of SNS exposure is larger for IT firms than for non-IT firms, and larger for small firms than for large firms. The Granger causality test shows that changes in stock price returns are caused by SNS exposure, while causation in the opposite direction is not significant. 
Therefore, the relationship between SNS exposure and stock prices shows unidirectional causality. The more a firm is exposed in SNS, the more likely its stock price is to increase, while stock price changes may not cause more SNS mentions.

Estimation of Mean Surface Current and Current Variability in the East Sea using Surface Drifter Data from 1991 to 2017 (1991년부터 2017년까지 표층 뜰개 자료를 이용하여 계산한 동해의 평균 표층 해류와 해류 변동성)

  • PARK, JU-EUN;KIM, SOO-YUN;CHOI, BYOUNG-JU;BYUN, DO-SEONG
    • The Sea:JOURNAL OF THE KOREAN SOCIETY OF OCEANOGRAPHY
    • /
    • v.24 no.2
    • /
    • pp.208-225
    • /
    • 2019
  • To understand the mean surface circulation and surface currents in the East Sea, trajectories of surface drifters that passed through the East Sea from 1991 to 2017 were analyzed. From the drifter trajectory data, the main paths of surface ocean currents were grouped and the variation in each main current path was investigated. The East Korea Warm Current (EKWC), heading northward, separates from the coast at $36{\sim}38^{\circ}N$ and flows to the northeast until $131^{\circ}E$. In the middle of the East Sea (from $131^{\circ}E$ to $137^{\circ}E$), the average latitude of the eastward-flowing currents ranges from $36^{\circ}N$ to $40^{\circ}N$, and the currents meander with large amplitude. When the average latitude of the surface drifter paths was north (south) of $37.5^{\circ}N$, the meandering amplitude was about 50 (100) km. The most frequent route of surface drifters in the middle of the East Sea was the path along $37.5{\sim}38.5^{\circ}N$. The surface drifters deployed off the coast of Vladivostok in the northern East Sea moved southwest along the coast, separated from the coast, and flowed southeastward along the cyclonic circulation around the Japan Basin. The drifters then moved east along $39{\sim}40^{\circ}N$. The mean surface current vector and mean speed were calculated in each grid cell with $0.25^{\circ}$ grid spacing using the velocity data of the surface drifters that passed through each cell. The current variance ellipses were calculated with $0.5^{\circ}$ grid spacing. Because the path of the EKWC changes every year in the western part of the Ulleung Basin, and the current paths in the Yamato Basin keep changing with many eddies, the current variance ellipses are relatively large in these regions. We present a schematic map of the East Sea surface current based on the surface drifter data. 
The significance of this study is that the surface ocean circulation of the East Sea, which has been mainly studied by numerical model simulations and the sea surface height data obtained from satellite altimeters, was analyzed based on in-situ Lagrangian observational current data.
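
The gridded averaging described in this abstract (mean current vector and mean speed per $0.25^{\circ}$ cell) can be sketched roughly as follows. This is only an illustration under assumed inputs: the function name `grid_mean_currents`, the `(lon, lat, u, v)` sample layout, and the toy values are hypothetical and are not taken from the paper.

```python
import math
from collections import defaultdict


def grid_mean_currents(samples, spacing=0.25):
    """Average drifter velocities inside lon/lat cells of `spacing` degrees.

    samples: iterable of (lon, lat, u, v), where u is the eastward and v the
    northward velocity component (hypothetical layout, any consistent unit).
    Returns {(lon_index, lat_index): (mean_u, mean_v, mean_speed, count)}.
    """
    sums = defaultdict(lambda: [0.0, 0.0, 0.0, 0])
    for lon, lat, u, v in samples:
        # Cell index = integer number of grid steps from 0 degrees.
        cell = (math.floor(lon / spacing), math.floor(lat / spacing))
        acc = sums[cell]
        acc[0] += u                    # sum of eastward components
        acc[1] += v                    # sum of northward components
        acc[2] += math.hypot(u, v)     # sum of speeds (vector magnitudes)
        acc[3] += 1                    # number of samples in the cell
    return {
        cell: (su / n, sv / n, ss / n, n)
        for cell, (su, sv, ss, n) in sums.items()
    }


# Toy samples (made up): two fall in the same 0.25-degree cell.
toy = [
    (131.10, 37.60, 0.2, 0.0),
    (131.20, 37.70, 0.0, 0.2),
    (135.00, 39.50, -0.1, 0.1),
]
means = grid_mean_currents(toy)
```

Note that the magnitude of the mean vector and the mean speed generally differ: in the toy cell above the mean vector is (0.1, 0.1) while the mean speed is 0.2, which is why the two quantities are reported separately.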

The Macroeconomic Impacts of Korean Elections and Their Future Consequences (선거(選擧)의 거시경제적(巨視經濟的) 충격(衝擊)과 파급효과(波及效果))

  • Shim, Sang-dal;Lee, Hang-yong
    • KDI Journal of Economic Policy
    • /
    • v.14 no.1
    • /
    • pp.147-165
    • /
    • 1992
  • This paper analyzes the macroeconomic effects of elections on the Korean economy and their future ramifications. It measures the shocks to the Korean economy caused by elections by averaging the sample forecast errors from four major elections held in the 1980s. The seven-variable Bayesian Vector Autoregression model, which includes the Monetary Base, Industrial Production, Consumption, Consumer Price, Exports, and Investment, is based on quarterly time series data starting from 1970 and is updated every quarter before forecasts are made for the next quarter. Because of this updating of coefficients, which reflects in part the rapid structural changes of the Korean economy, this study can capture the shock effect of elections, which is not possible when using election dummies with a fixed-coefficient model. In past elections, especially those held in the 1980s, $M_2$ did not show any particular movement, but currency and base money increased during the quarter in which the election was held, and the increment was partly recalled in the next quarter. Interest rates, as measured by corporate bond yields, fell during the election quarter and then rose in the following quarter, which is somewhat contrary to the general concern that interest rates will increase during election periods. Manufacturing employment fell in the election quarter because workers turned into campaigners. This decline in employment, combined with the voting holiday, produces a sizeable decline in industrial production during the election quarter, but production catches up in the next quarter and sometimes more than offsets the disruption caused during the election quarter. 
The major shocks to prices occur in the quarter before the election, reflecting the expectational effect and the relaxation of government price controls before the election. We simulate the impulse responses of the VAR model, imposing on each election to be held in 1992 the same shocks that were measured in past elections and assuming that the 1992 elections will affect the economy in the same manner as the 1980s elections. Under these assumptions, 1992 is expected to see a sizeable increase in the monetary base due to the elections, and upward pressure on prices will be amplified substantially. On the other hand, the increase in consumption due to the elections is expected to be relatively small, and production will not decrease. Despite increased liquidity, the large portion of liquidity in circulation being used as election funds will distort the flow of funds and aggravate the fund shortage, causing investment in plant and equipment and construction activity to stagnate. These effects will be greatly amplified if elections for the heads of local governments are held this year. If mayoral and gubernatorial elections are held after National Assembly elections, their effect on prices and investment will be approximately double what it would have been had only congressional and presidential elections been held. Even when mayoral and gubernatorial elections are held at the same time as congressional elections, the elections of local government heads are shown to add substantial effects to the economy for the year. The above results are based on the assumption that this year's elections will shock the economy in the same manner as past elections. However, elections in consecutive quarters do not give the economy a chance to pause and recuperate from past elections. This year's elections may have greater effects on prices and production than shown in the model's simulations because campaigners' return to industry may be delayed. 
Therefore, we may not see a rapid recall of money after elections. In view of the surge in the monetary base and price escalation in the periods before and after elections, economic management in 1992 should place its first priority on controlling the monetary aggregate, in particular, stabilizing the growth of the monetary base.

Bankruptcy Forecasting Model using AdaBoost: A Focus on Construction Companies (적응형 부스팅을 이용한 파산 예측 모형: 건설업을 중심으로)

  • Heo, Junyoung;Yang, Jin Yong
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.1
    • /
    • pp.35-48
    • /
    • 2014
  • According to the 2013 construction market outlook report, the liquidation of construction companies is expected to continue due to the ongoing residential construction recession. Bankruptcies of construction companies have a greater social impact than those in other industries. However, because of differences in capital structure and debt-to-equity ratio, it is more difficult to forecast the bankruptcies of construction companies than those of companies in other industries. The construction industry operates on greater leverage, with high debt-to-equity ratios and project cash flows concentrated in the second half. The economic cycle greatly influences construction companies, so downturns tend to rapidly increase their bankruptcy rates. High leverage, coupled with increased bankruptcy rates, could place greater burdens on the banks that provide loans to construction companies. Nevertheless, bankruptcy prediction models have concentrated mainly on financial institutions, and construction-specific studies are rare. Bankruptcy prediction models based on corporate finance data have been studied for some time in various ways. However, these models are intended for companies in general and may not be appropriate for forecasting bankruptcies of construction companies, which typically have high liquidity risks. The construction industry is capital-intensive, operates on long timelines with large-scale investment projects, and has comparatively longer payback periods than other industries. With its unique capital structure, it can be difficult to apply a model used to judge the financial risk of companies in general to those in the construction industry. Diverse studies of bankruptcy forecasting models based on a company's financial statements have been conducted for many years. 
The Altman Z-score, first published in 1968, is commonly used as a bankruptcy forecasting model. It forecasts the likelihood of a company going bankrupt using a simple formula, classifying the results into three categories and evaluating the corporate status as dangerous, moderate, or safe. A company in the "dangerous" category has a high likelihood of bankruptcy within two years, while those in the "safe" category have a low likelihood of bankruptcy. For companies in the "moderate" category, it is difficult to forecast the risk. Many of the construction firms in this study fell into the "moderate" category, which made it difficult to forecast their risk. With the development of machine learning, recent studies of corporate bankruptcy forecasting have adopted this technology. Pattern recognition, a representative application area of machine learning, is applied to forecasting corporate bankruptcy: patterns are analyzed based on a company's financial information and then judged as belonging to either the bankruptcy risk group or the safe group. The representative machine learning models previously used in bankruptcy forecasting are Artificial Neural Networks, Adaptive Boosting (AdaBoost), and the Support Vector Machine (SVM). There are also many hybrid studies combining these models. 
Existing studies using the traditional Z-score technique or machine-learning bankruptcy prediction focus on companies in non-specific industries, so the industry-specific characteristics of companies are not considered. In this paper, we confirm that adaptive boosting (AdaBoost) is the most appropriate forecasting model for construction companies when they are grouped by company size. We classified construction companies into three groups (large, medium, and small) based on each company's capital, and analyzed the predictive ability of AdaBoost for each group. The experimental results showed that AdaBoost has more predictive ability than the other models, especially for the group of large companies with capital of more than 50 billion won.
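
The three-way Z-score classification mentioned in this abstract can be illustrated with a short sketch. The coefficients and the 1.81 / 2.99 cutoffs below are the commonly cited values of the original 1968 Z-score for publicly traded manufacturers; they are an assumption here, since the abstract does not state which variant or thresholds the authors used. The function names and the sample figures are hypothetical.

```python
def altman_z(working_capital, retained_earnings, ebit, market_equity,
             sales, total_assets, total_liabilities):
    """Commonly cited 1968 Altman Z-score (assumed variant, see lead-in)."""
    x1 = working_capital / total_assets       # liquidity
    x2 = retained_earnings / total_assets     # cumulative profitability
    x3 = ebit / total_assets                  # operating profitability
    x4 = market_equity / total_liabilities    # leverage
    x5 = sales / total_assets                 # asset turnover
    return 1.2 * x1 + 1.4 * x2 + 3.3 * x3 + 0.6 * x4 + 1.0 * x5


def z_zone(z, low=1.81, high=2.99):
    """Map a Z-score to the three categories named in the abstract."""
    if z < low:
        return "dangerous"
    if z > high:
        return "safe"
    return "moderate"


# Made-up balance sheet figures, for illustration only.
z = altman_z(working_capital=10, retained_earnings=20, ebit=10,
             market_equity=50, sales=100, total_assets=100,
             total_liabilities=50)
print(round(z, 2), z_zone(z))
```

A score that lands between the two cutoffs is the "moderate" case the abstract describes as hard to forecast, which is the gap the machine learning models are meant to address.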

A Study of 'Emotion Trigger' by Text Mining Techniques (텍스트 마이닝을 이용한 감정 유발 요인 'Emotion Trigger'에 관한 연구)

  • An, Juyoung;Bae, Junghwan;Han, Namgi;Song, Min
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.2
    • /
    • pp.69-92
    • /
    • 2015
  • The explosion of social media data has led researchers to apply text-mining techniques to analyze big social media data in a more rigorous manner. Although social media text analysis algorithms have improved, previous approaches to social media text analysis have some limitations. In sentiment analysis of social media written in Korean, there are two typical approaches. One is the linguistic approach using machine learning, which is the most common. Some studies have added grammatical factors to the feature sets used for training classification models. The other approach applies semantic analysis to sentiment analysis, but it has mainly been applied to English texts. To overcome these limitations, this study applies the Word2Vec algorithm, an extension of neural network algorithms, to deal with the more extensive semantic features that were underestimated in existing sentiment analysis. The result of applying the Word2Vec algorithm is compared with the result of co-occurrence analysis to identify the difference between the two approaches. The results show that related words expressing some emotion about the keyword are three times as numerous among the words extracted by Word2Vec as among those extracted by co-occurrence analysis. The difference between the two results comes from Word2Vec's vectorization of semantic features. It is therefore possible to say that the Word2Vec algorithm can catch hidden related words that have not been found by traditional analysis. In addition, Part-Of-Speech (POS) tagging for Korean is used to detect adjectives as "emotional words" in Korean. The emotional words extracted from the text are then converted into word vectors by the Word2Vec algorithm to find related words. Among these related words, nouns are selected because each of them may have a causal relationship with the "emotional word" in the sentence. 
The process of extracting these trigger factors of emotional words is named "Emotion Trigger" in this study. As a case study, the datasets were collected by searching with three keywords: professor, prosecutor, and doctor, because these keywords carry rich public emotion and opinion. Preliminary data collection was conducted to select secondary keywords for data gathering. The secondary keywords used to gather the data for the actual analysis are as follows: Professor (sexual assault, misappropriation of research money, recruitment irregularities, polifessor), Doctor (Shin hae-chul sky hospital, drinking and plastic surgery, rebate), Prosecutor (lewd behavior, sponsor). The text data comprise about 100,000 items (Professor: 25720, Doctor: 35110, Prosecutor: 43225), gathered from news, blogs, and Twitter to reflect various levels of public emotion in the text data analysis. Gephi (http://gephi.github.io) was used for visualization, and all programs used for text processing and analysis were written in Java. The contributions of this study are as follows. First, different approaches to sentiment analysis are integrated to overcome the limitations of existing approaches. Second, Emotion Trigger can detect hidden connections to public emotion that existing methods cannot detect. Finally, the approach used in this study can be generalized regardless of the type of text data. The limitation of this study is that it is hard to say whether the words extracted by Emotion Trigger processing have a significant causal relationship with the emotional word in a sentence. Future work will clarify the causal relationship between emotional words and the words extracted by Emotion Trigger by comparing them with manually tagged relationships. Furthermore, the text data used in Emotion Trigger are from Twitter, so the data have a number of distinct features that we did not deal with in this study. 
These features will be considered in further study.

Improved Sentence Boundary Detection Method for Web Documents (웹 문서를 위한 개선된 문장경계인식 방법)

  • Lee, Chung-Hee;Jang, Myung-Gil;Seo, Young-Hoon
    • Journal of KIISE:Software and Applications
    • /
    • v.37 no.6
    • /
    • pp.455-463
    • /
    • 2010
  • In this paper, we present an approach to sentence boundary detection for web documents that builds on statistics-based methods and uses rule-based correction. The proposed system uses a classification model learned offline from a training set of human-labeled web documents. Web documents have many word-spacing errors and frequently lack the punctuation marks that indicate the end of a sentence. As sentence boundary candidates, the proposed method considers every Ending Eomi as well as punctuation marks. We optimize engine performance by selecting the best features, the best training data, and the best classification algorithm. For evaluation, we made two test sets: Set1, consisting of articles and blog documents, and Set2, consisting of web community documents. We use the F-measure to compare results. Detecting only periods as sentence boundaries, our basis engine scored 96.5% on Set1 and 56.7% on Set2. We improved our basis engine by adapting the features and the boundary search algorithm. For the final evaluation, we compared our adaptation engine with our basis engine on Set2. The adaptation engine improved on the basis engine by 39.6%. These results demonstrate the effectiveness of the proposed method for sentence boundary detection.
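
As a rough illustration of the F-measure used in this evaluation, the sketch below scores predicted sentence boundaries against gold boundaries. It assumes boundaries are represented as character offsets and that a prediction counts only on an exact match; the abstract does not describe the matching rule, so both are assumptions, and the function name and offsets are hypothetical.

```python
def boundary_f_measure(predicted, gold, beta=1.0):
    """Precision, recall, and F-measure over sets of boundary offsets."""
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)   # boundaries found at the right place
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision == 0.0 and recall == 0.0:
        return precision, recall, 0.0
    b2 = beta * beta
    f = (1 + b2) * precision * recall / (b2 * precision + recall)
    return precision, recall, f


# Made-up offsets: 3 of 4 predictions are correct, 3 of 5 gold boundaries found.
p, r, f = boundary_f_measure(predicted=[10, 25, 40, 58],
                             gold=[10, 25, 47, 58, 70])
```

With `beta=1.0` this is the usual F1 score, the harmonic mean of precision and recall. A detector that only looks for periods would tend to lose recall on text where sentences end without punctuation, which is consistent with the lower Set2 score reported for the basis engine.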

Impact Analysis of Economic Fluctuation of Saudi Arabia on Korean Overseas Construction Business (사우디아라비아의 경제변동이 한국의 해외건설 수주에 미치는 영향분석)

  • Jeon, Jae-Keun;Lee, Suk-Won;Kim, Jae-Jun
    • Korean Journal of Construction Engineering and Management
    • /
    • v.17 no.2
    • /
    • pp.39-48
    • /
    • 2016
  • According to order reports for the past 10 years, the Middle East accounted for 54.05% of total overseas construction orders, and industrial facilities accounted for 68.09%. Within the Middle East, based on data collected in 2014, the country with the largest share of overseas construction business is Saudi Arabia with 34.10%, and industrial facilities occupy the largest share there with 56%. The overseas construction business is suffering from a reduction in orders in the Middle East because of the recent fall in oil prices. From this point of view, it is necessary to consider economic fluctuations when diversifying construction types and orders. Focusing on Saudi Arabia, this study analyzed how economic fluctuations in host countries can affect overseas construction orders. The results demonstrate that orders for most construction types depend on GDP and investment funds, and that industrial facilities can be substituted with architecture and civil engineering. This work is expected to serve as a basis for maintaining order volumes and diversifying construction types.

Effects of the Trade Insurance and Exchange Risk on Export: The Experience of Korea (무역보험과 환위험이 수출에 미치는 영향)

  • Kim, Chang-Beom
    • International Commerce and Information Review
    • /
    • v.13 no.3
    • /
    • pp.77-95
    • /
    • 2011
  • This paper investigates the relationship between exports and economic variables such as trade insurance, world economic activity, relative price, unemployment rate, and exchange rate volatility, using monthly data. I employ the Johansen cointegration methodology, since the model must be stationary to avoid spurious results. The results indicate that there is a long-run relationship between exports and these variables. The empirical analysis of the cointegrating vector using CCR, DOLS, and FMOLS reveals that increases in trade insurance have a positive relation with exports and increases in exchange rate volatility have a negative relation. In particular, Monte Carlo simulations have shown the DOLS estimator to be superior in small samples compared with a number of alternative estimators, and it can both accommodate higher orders of integration and account for possible simultaneity among the regressors of a potential system. This paper also applies impulse-response functions to obtain additional information on how exports respond to shocks in the variables. The results indicate that exports respond positively to trade insurance shocks and that this response decays quickly compared with the response to exchange rate volatility. Consequently, trade insurance plays the role of a trade policy for export promotion in Korea, whereas an increase in exchange risk results in a reduction of exports. Therefore, support for trade insurance should be expanded and the foreign exchange market should be stabilized to promote exports.

Estimation of sea surface wind using Radarsat-1 SAR (RADARSAT-1 SAR자료를 이용한 해상풍 추정)

  • Yoon, Hong-Joo;Cho, Han-Keun;Kang, Heung-Soon
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference
    • /
    • 2007.06a
    • /
    • pp.227-230
    • /
    • 2007
  • Because SAR uses microwaves, it can observe the ocean day and night regardless of bad weather. SAR images of the sea surface contain a great deal of information on atmospheric phenomena related to the surface wind vector, and the wind speed information extracted from SAR images is used in various ways. To estimate sea surface wind from SAR images, wind direction data and the sigma nought value are fed into the CMOD model, which extracts wind information. The wind spectrum extracted from SAR images always shows two opposite peaks $180^{\circ}$ apart because a 2D-FFT is applied. This ambiguity should be resolved using the position of land, a known wind direction, or a numerical model. Previously, we converted to sigma nought after extracting the Digital Number from RadarSat-1 SAR using ENVI4.0, which took a long time because every step was manual. We therefore wrote MATLAB code to perform the conversion to sigma nought. We are now extracting wind direction from sigma nought. Determining the wind direction needs further study because of the $180^{\circ}$ ambiguity.
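
One simple way to resolve the $180^{\circ}$ ambiguity described here, when a reference direction is available (for example from a numerical model), is to keep whichever of the two candidate directions lies closer to the reference. The sketch below illustrates that idea only; it is not the authors' procedure, which the abstract leaves for further study, and the function names and angles are hypothetical.

```python
def angular_difference(a, b):
    """Smallest absolute difference between two directions, in degrees (0..180)."""
    d = (a - b) % 360.0
    return min(d, 360.0 - d)


def resolve_ambiguity(sar_direction, reference_direction):
    """Pick the SAR-derived direction or its opposite, whichever is nearer
    to the reference direction (all angles in degrees)."""
    candidates = (sar_direction % 360.0, (sar_direction + 180.0) % 360.0)
    return min(candidates,
               key=lambda c: angular_difference(c, reference_direction))


# The spectrum gives 40 degrees (or, equivalently, 220); a model suggests about 200.
chosen = resolve_ambiguity(40.0, 200.0)
```

If the reference is exactly perpendicular to the SAR-derived axis, both candidates are equally far away and this sketch simply returns the first one, so such cases would need another source of information such as the position of land.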

Experimental Validation of Isogeometric Optimal Design (아이소-지오메트릭 형상 최적설계의 실험적 검증)

  • Choi, Myung-Jin;Yoon, Min-Ho;Cho, Seonho
    • Journal of the Computational Structural Engineering Institute of Korea
    • /
    • v.27 no.5
    • /
    • pp.345-352
    • /
    • 2014
  • In this paper, the CAD data for the optimal shape design obtained by isogeometric shape optimization are used directly to fabricate a specimen with a 3D printer for experimental validation. In a conventional finite element method, the geometric approximation inherent in the mesh leads to accuracy issues in response analysis and design sensitivity analysis. Furthermore, finite element based shape optimization requires subsequent communication with the CAD description during the design optimization process, which results in a loss of optimal design information. The isogeometric analysis method employs the same NURBS basis functions and control points used in CAD systems, which makes it possible to use exact geometric properties such as normal vectors and curvature in the response analysis and design sensitivity analysis procedures. It also vastly simplifies the design modification of complex geometries, without communication with the CAD description of the geometry during the design optimization process. Therefore, the optimal design and material volume information is exactly reflected in the fabricated specimen for experimental validation. Through design optimization examples of an elasticity problem, it is experimentally shown that the optimal design has higher stiffness than the initial design. The experimental results also match the numerical results very well. Using a non-contact optical 3D deformation measuring system for strain distribution, it is shown that the stress concentration is significantly alleviated in the optimal design compared with the initial design.
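
To illustrate what "exact geometry" means for NURBS, the sketch below evaluates a rational quadratic curve with the Cox-de Boor recursion and checks that it reproduces a quarter of the unit circle, something a polynomial finite element mesh can only approximate. The control points, weights, and knot vector are the standard textbook quarter-circle example, not data from the paper, and the function names are hypothetical.

```python
import math


def bspline_basis(i, p, u, knots):
    """Cox-de Boor recursion: i-th B-spline basis function of degree p at u."""
    if p == 0:
        if knots[i] <= u < knots[i + 1]:
            return 1.0
        # Close the last non-empty span so the curve's end point is included.
        if u == knots[-1] and knots[i] < knots[i + 1] == knots[-1]:
            return 1.0
        return 0.0
    left = right = 0.0
    d1 = knots[i + p] - knots[i]
    if d1 > 0:
        left = (u - knots[i]) / d1 * bspline_basis(i, p - 1, u, knots)
    d2 = knots[i + p + 1] - knots[i + 1]
    if d2 > 0:
        right = (knots[i + p + 1] - u) / d2 * bspline_basis(i + 1, p - 1, u, knots)
    return left + right


def nurbs_point(u, degree, knots, control_points, weights):
    """Evaluate a NURBS curve as the weighted rational sum of control points."""
    basis = [bspline_basis(i, degree, u, knots)
             for i in range(len(control_points))]
    denom = sum(n * w for n, w in zip(basis, weights))
    dim = len(control_points[0])
    return tuple(
        sum(n * w * pt[k] for n, w, pt in zip(basis, weights, control_points)) / denom
        for k in range(dim)
    )


# Textbook quarter circle: degree 2, three control points, middle weight sqrt(2)/2.
KNOTS = [0.0, 0.0, 0.0, 1.0, 1.0, 1.0]
POINTS = [(1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
WEIGHTS = [1.0, math.sqrt(2.0) / 2.0, 1.0]

x, y = nurbs_point(0.5, 2, KNOTS, POINTS, WEIGHTS)
radius = math.hypot(x, y)   # equals 1 up to floating-point rounding
```

Because every evaluated point lies on the circle up to floating-point rounding, quantities such as the normal vector and curvature can be taken from the same representation, which is the property the abstract relies on.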