• Title/Summary/Keyword: Stock Market Forecasting

Search Result 79, Processing Time 0.029 seconds

A Comparative Study between Stock Price Prediction Models Using Sentiment Analysis and Machine Learning Based on SNS and News Articles (SNS와 뉴스기사의 감성분석과 기계학습을 이용한 주가예측 모형 비교 연구)

  • Kim, Dongyoung;Park, Jeawon;Choi, Jaehyun
    • Journal of Information Technology Services
    • /
    • v.13 no.3
    • /
    • pp.221-233
    • /
    • 2014
  • Because people's interest of the stock market has been increased with the development of economy, a lot of studies have been going to predict fluctuation of stock prices. Latterly many studies have been made using scientific and technological method among the various forecasting method, and also data using for study are becoming diverse. So, in this paper we propose stock prices prediction models using sentiment analysis and machine learning based on news articles and SNS data to improve the accuracy of prediction of stock prices. Stock prices prediction models that we propose are generated through the four-step process that contain data collection, sentiment dictionary construction, sentiment analysis, and machine learning. The data have been collected to target newspapers related to economy in the case of news article and to target twitter in the case of SNS data. Sentiment dictionary was built using news articles among the collected data, and we utilize it to process sentiment analysis. In machine learning phase, we generate prediction models using various techniques of classification and the data that was made through sentiment analysis. After generating prediction models, we conducted 10-fold cross-validation to measure the performance of they. The experimental result showed that accuracy is over 80% in a number of ways and F1 score is closer to 0.8. The result can be seen as significantly enhanced result compared with conventional researches utilizing opinion mining or data mining techniques.

Analysis of Global Food Market and Food-Energy Price Links: Based on System Dynamics Approach

  • Kim, Gyu-Rim
    • Korean System Dynamics Review
    • /
    • v.10 no.3
    • /
    • pp.105-124
    • /
    • 2009
  • The situation of the global food markets has been being rapidly restructured and entering on a new phase by new dynamic and driving forces. The factors such as economic growth and income increase, high energy price, globalization, urbanization, and global climate change are transforming patterns of food consumption, production, and markets. The prices and markets of world food and energy are getting increasingly linked each other. Food and fuel are the global dilemma issues associated with the risk of diverting farmland or of consuming cereals for biofuel production in detriment of the cereals supply to the global food markets. An estimated 100 million tons of grain per year are being redirected from food to fuel. Therefore, the objectives of this study are as follows: Firstly, the study examines situations of the world food and energy resources, analyzes the trends of prices of the crude oil and biofuel, and formulates the food-energy links mechanism. Secondly, the study builds a simulation model, based on system dynamics approach, for not only analyzing the global cereals market and energy market but also forecasting the global production, consumption, and stock of those markets by 2030 in the future. The model of this study consists of four sectors, i.e., world population dynamics sector, global food market dynamics sector, global energy market dynamics sector, scenario sector of world economic growth and oil price.

  • PDF

Volatility spillover between the Korean KOSPI and the Hong Kong HSI stock markets

  • Baek, Eun-Ah;Oh, Man-Suk
    • Communications for Statistical Applications and Methods
    • /
    • v.23 no.3
    • /
    • pp.203-213
    • /
    • 2016
  • We investigate volatility spillover aspects of realized volatilities (RVs) for the log returns of the Korea Composite Stock Price Index (KOSPI) and the Hang Seng Index (HSI) from 2009-2013. For all RVs, significant long memories and asymmetries are identified. For a model selection, we consider three commonly used time series models as well as three models that incorporate long memory and asymmetry. Taking into account of goodness-of-fit and forecasting ability, Leverage heteroskedastic autoregressive realized volatility (LHAR) model is selected for the given data. The LHAR model finds significant decompositions of the spillover effect from the HSI to the KOSPI into moderate negative daily spillover, positive weekly spillover and positive monthly spillover, and from the KOSPI to the HSI into substantial negative weekly spillover and positive monthly spillover. An interesting result from the analysis is that the daily volatility spillover from the HSI to the KOSPI is significant versus the insignificant daily volatility spillover of the KOSPI to HSI. The daily volatility in Hong Kong affects next day volatility in Korea but the daily volatility in Korea does not affect next day volatility in Hong Kong.

Development and Application of Risk Recovery Index using Machine Learning Algorithms (기계학습알고리즘을 이용한 위험회복지수의 개발과 활용)

  • Kim, Sun Woong
    • Journal of Information Technology Applications and Management
    • /
    • v.23 no.4
    • /
    • pp.25-39
    • /
    • 2016
  • Asset prices decline sharply and stock markets collapse when financial crisis happens. Recently we have encountered more frequent financial crises than ever. 1998 currency crisis and 2008 global financial crisis triggered academic researches on early warning systems that aim to detect the symptom of financial crisis in advance. This study proposes a risk recovery index for detection of good opportunities from financial market instability. We use SVM classifier algorithms to separate recovery period from unstable financial market data. Input variables are KOSPI index and V-KOSPI200 index. Our SVM algorithms show highly accurate forecasting results on testing data as well as training data. Risk recovery index is derived from our SVM-trained outputs. We develop a trading system that utilizes the suggested risk recovery index. The trading result records very high profit, that is, its annual return runs to 121%.

A Study on the Prediction Model of Stock Price Index Trend based on GA-MSVM that Simultaneously Optimizes Feature and Instance Selection (입력변수 및 학습사례 선정을 동시에 최적화하는 GA-MSVM 기반 주가지수 추세 예측 모형에 관한 연구)

  • Lee, Jong-sik;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.4
    • /
    • pp.147-168
    • /
    • 2017
  • There have been many studies on accurate stock market forecasting in academia for a long time, and now there are also various forecasting models using various techniques. Recently, many attempts have been made to predict the stock index using various machine learning methods including Deep Learning. Although the fundamental analysis and the technical analysis method are used for the analysis of the traditional stock investment transaction, the technical analysis method is more useful for the application of the short-term transaction prediction or statistical and mathematical techniques. Most of the studies that have been conducted using these technical indicators have studied the model of predicting stock prices by binary classification - rising or falling - of stock market fluctuations in the future market (usually next trading day). However, it is also true that this binary classification has many unfavorable aspects in predicting trends, identifying trading signals, or signaling portfolio rebalancing. In this study, we try to predict the stock index by expanding the stock index trend (upward trend, boxed, downward trend) to the multiple classification system in the existing binary index method. In order to solve this multi-classification problem, a technique such as Multinomial Logistic Regression Analysis (MLOGIT), Multiple Discriminant Analysis (MDA) or Artificial Neural Networks (ANN) we propose an optimization model using Genetic Algorithm as a wrapper for improving the performance of this model using Multi-classification Support Vector Machines (MSVM), which has proved to be superior in prediction performance. In particular, the proposed model named GA-MSVM is designed to maximize model performance by optimizing not only the kernel function parameters of MSVM, but also the optimal selection of input variables (feature selection) as well as instance selection. In order to verify the performance of the proposed model, we applied the proposed method to the real data. The results show that the proposed method is more effective than the conventional multivariate SVM, which has been known to show the best prediction performance up to now, as well as existing artificial intelligence / data mining techniques such as MDA, MLOGIT, CBR, and it is confirmed that the prediction performance is better than this. Especially, it has been confirmed that the 'instance selection' plays a very important role in predicting the stock index trend, and it is confirmed that the improvement effect of the model is more important than other factors. To verify the usefulness of GA-MSVM, we applied it to Korea's real KOSPI200 stock index trend forecast. Our research is primarily aimed at predicting trend segments to capture signal acquisition or short-term trend transition points. The experimental data set includes technical indicators such as the price and volatility index (2004 ~ 2017) and macroeconomic data (interest rate, exchange rate, S&P 500, etc.) of KOSPI200 stock index in Korea. Using a variety of statistical methods including one-way ANOVA and stepwise MDA, 15 indicators were selected as candidate independent variables. The dependent variable, trend classification, was classified into three states: 1 (upward trend), 0 (boxed), and -1 (downward trend). 70% of the total data for each class was used for training and the remaining 30% was used for verifying. To verify the performance of the proposed model, several comparative model experiments such as MDA, MLOGIT, CBR, ANN and MSVM were conducted. MSVM has adopted the One-Against-One (OAO) approach, which is known as the most accurate approach among the various MSVM approaches. Although there are some limitations, the final experimental results demonstrate that the proposed model, GA-MSVM, performs at a significantly higher level than all comparative models.

Information Spillover Effects among the Stock Markets of China, Taiwan and Hongkon (국제주식시장의 정보전이효과에 관한 연구 : 중국, 대만, 홍콩을 중심으로)

  • Yoon, Seong-Min;Su, Qian;Kang, Sang Hoon
    • International Area Studies Review
    • /
    • v.14 no.3
    • /
    • pp.62-84
    • /
    • 2010
  • Accurate forecasting of volatility is of considerable interest in financial volatility research, particularly in regard to portfolio allocation, option pricing and risk management because volatility is equal to market risk. So, we attempted to delineate a model with good ability to forecast and identified stylized features of volatility, with a focus on volatility persistence or long memory in the Australian futures market. In this context, we assessed the long-memory property in the volatility of index futures contracts using three conditional volatility models, namely the GARCH, IGARCH and FIGARCH models. We found that the FIGARCH model better captures the long-memory property than do the GARCH and IGARCH models. Additionally, we found that the FIGARCH model provides superior performance in one-day-ahead volatility forecasts. As discussed in this paper, the FIGARCH model should prove a useful technique in forecasting the long-memory volatility in the Australian index futures market.

A Study on Commodity Asset Investment Model Based on Machine Learning Technique (기계학습을 활용한 상품자산 투자모델에 관한 연구)

  • Song, Jin Ho;Choi, Heung Sik;Kim, Sun Woong
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.4
    • /
    • pp.127-146
    • /
    • 2017
  • Services using artificial intelligence have begun to emerge in daily life. Artificial intelligence is applied to products in consumer electronics and communications such as artificial intelligence refrigerators and speakers. In the financial sector, using Kensho's artificial intelligence technology, the process of the stock trading system in Goldman Sachs was improved. For example, two stock traders could handle the work of 600 stock traders and the analytical work for 15 people for 4weeks could be processed in 5 minutes. Especially, big data analysis through machine learning among artificial intelligence fields is actively applied throughout the financial industry. The stock market analysis and investment modeling through machine learning theory are also actively studied. The limits of linearity problem existing in financial time series studies are overcome by using machine learning theory such as artificial intelligence prediction model. The study of quantitative financial data based on the past stock market-related numerical data is widely performed using artificial intelligence to forecast future movements of stock price or indices. Various other studies have been conducted to predict the future direction of the market or the stock price of companies by learning based on a large amount of text data such as various news and comments related to the stock market. Investing on commodity asset, one of alternative assets, is usually used for enhancing the stability and safety of traditional stock and bond asset portfolio. There are relatively few researches on the investment model about commodity asset than mainstream assets like equity and bond. Recently machine learning techniques are widely applied on financial world, especially on stock and bond investment model and it makes better trading model on this field and makes the change on the whole financial area. In this study we made investment model using Support Vector Machine among the machine learning models. There are some researches on commodity asset focusing on the price prediction of the specific commodity but it is hard to find the researches about investment model of commodity as asset allocation using machine learning model. We propose a method of forecasting four major commodity indices, portfolio made of commodity futures, and individual commodity futures, using SVM model. The four major commodity indices are Goldman Sachs Commodity Index(GSCI), Dow Jones UBS Commodity Index(DJUI), Thomson Reuters/Core Commodity CRB Index(TRCI), and Rogers International Commodity Index(RI). We selected each two individual futures among three sectors as energy, agriculture, and metals that are actively traded on CME market and have enough liquidity. They are Crude Oil, Natural Gas, Corn, Wheat, Gold and Silver Futures. We made the equally weighted portfolio with six commodity futures for comparing with other commodity indices. We set the 19 macroeconomic indicators including stock market indices, exports & imports trade data, labor market data, and composite leading indicators as the input data of the model because commodity asset is very closely related with the macroeconomic activities. They are 14 US economic indicators, two Chinese economic indicators and two Korean economic indicators. Data period is from January 1990 to May 2017. We set the former 195 monthly data as training data and the latter 125 monthly data as test data. In this study, we verified that the performance of the equally weighted commodity futures portfolio rebalanced by the SVM model is better than that of other commodity indices. The prediction accuracy of the model for the commodity indices does not exceed 50% regardless of the SVM kernel function. On the other hand, the prediction accuracy of equally weighted commodity futures portfolio is 53%. The prediction accuracy of the individual commodity futures model is better than that of commodity indices model especially in agriculture and metal sectors. The individual commodity futures portfolio excluding the energy sector has outperformed the three sectors covered by individual commodity futures portfolio. In order to verify the validity of the model, it is judged that the analysis results should be similar despite variations in data period. So we also examined the odd numbered year data as training data and the even numbered year data as test data and we confirmed that the analysis results are similar. As a result, when we allocate commodity assets to traditional portfolio composed of stock, bond, and cash, we can get more effective investment performance not by investing commodity indices but by investing commodity futures. Especially we can get better performance by rebalanced commodity futures portfolio designed by SVM model.

Forecasting Volatility of Stocks Return: A Smooth Transition Combining Forecasts

  • HO, Jen Sim;CHOO, Wei Chong;LAU, Wei Theng;YEE, Choy Leng;ZHANG, Yuruixian;WAN, Cheong Kin
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.9 no.10
    • /
    • pp.1-13
    • /
    • 2022
  • This paper empirically explores the predicting ability of the newly proposed smooth transition (ST) time-varying combining forecast methods. The proposed method allows the "weight" of combining forecasts to change gradually over time through its unique feature of transition variables. Stock market returns from 7 countries were applied to Ad Hoc models, the well-known Generalized Autoregressive Conditional Heteroskedasticity (GARCH) family models, and the Smooth Transition Exponential Smoothing (STES) models. Of the individual models, GJRGARCH and STES-E&AE emerged as the best models and thereby were chosen for constructing the combined forecast models where a total of nine ST combining methods were developed. The robustness of the ST combining forecasts is also validated by the Diebold-Mariano (DM) test. The post-sample forecasting performance shows that ST combining forecast methods outperformed all the individual models and fixed weight combining models. This study contributes in two ways: 1) the ST combining methods statistically outperformed all the individual forecast methods and the existing traditional combining methods using simple averaging and Bates & Granger method. 2) trading volume as a transition variable in ST methods was superior to other individual models as well as the ST models with single sign or size of past shocks as transition variables.

DR-LSTM: Dimension reduction based deep learning approach to predict stock price

  • Ah-ram Lee;Jae Youn Ahn;Ji Eun Choi;Kyongwon Kim
    • Communications for Statistical Applications and Methods
    • /
    • v.31 no.2
    • /
    • pp.213-234
    • /
    • 2024
  • In recent decades, increasing research attention has been directed toward predicting the price of stocks in financial markets using deep learning methods. For instance, recurrent neural network (RNN) is known to be competitive for datasets with time-series data. Long short term memory (LSTM) further improves RNN by providing an alternative approach to the gradient loss problem. LSTM has its own advantage in predictive accuracy by retaining memory for a longer time. In this paper, we combine both supervised and unsupervised dimension reduction methods with LSTM to enhance the forecasting performance and refer to this as a dimension reduction based LSTM (DR-LSTM) approach. For a supervised dimension reduction method, we use methods such as sliced inverse regression (SIR), sparse SIR, and kernel SIR. Furthermore, principal component analysis (PCA), sparse PCA, and kernel PCA are used as unsupervised dimension reduction methods. Using datasets of real stock market index (S&P 500, STOXX Europe 600, and KOSPI), we present a comparative study on predictive accuracy between six DR-LSTM methods and time series modeling.

Analysis of Trading Performance on Intelligent Trading System for Directional Trading (방향성매매를 위한 지능형 매매시스템의 투자성과분석)

  • Choi, Heung-Sik;Kim, Sun-Woong;Park, Sung-Cheol
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.3
    • /
    • pp.187-201
    • /
    • 2011
  • KOSPI200 index is the Korean stock price index consisting of actively traded 200 stocks in the Korean stock market. Its base value of 100 was set on January 3, 1990. The Korea Exchange (KRX) developed derivatives markets on the KOSPI200 index. KOSPI200 index futures market, introduced in 1996, has become one of the most actively traded indexes markets in the world. Traders can make profit by entering a long position on the KOSPI200 index futures contract if the KOSPI200 index will rise in the future. Likewise, they can make profit by entering a short position if the KOSPI200 index will decline in the future. Basically, KOSPI200 index futures trading is a short-term zero-sum game and therefore most futures traders are using technical indicators. Advanced traders make stable profits by using system trading technique, also known as algorithm trading. Algorithm trading uses computer programs for receiving real-time stock market data, analyzing stock price movements with various technical indicators and automatically entering trading orders such as timing, price or quantity of the order without any human intervention. Recent studies have shown the usefulness of artificial intelligent systems in forecasting stock prices or investment risk. KOSPI200 index data is numerical time-series data which is a sequence of data points measured at successive uniform time intervals such as minute, day, week or month. KOSPI200 index futures traders use technical analysis to find out some patterns on the time-series chart. Although there are many technical indicators, their results indicate the market states among bull, bear and flat. Most strategies based on technical analysis are divided into trend following strategy and non-trend following strategy. Both strategies decide the market states based on the patterns of the KOSPI200 index time-series data. This goes well with Markov model (MM). Everybody knows that the next price is upper or lower than the last price or similar to the last price, and knows that the next price is influenced by the last price. However, nobody knows the exact status of the next price whether it goes up or down or flat. So, hidden Markov model (HMM) is better fitted than MM. HMM is divided into discrete HMM (DHMM) and continuous HMM (CHMM). The only difference between DHMM and CHMM is in their representation of state probabilities. DHMM uses discrete probability density function and CHMM uses continuous probability density function such as Gaussian Mixture Model. KOSPI200 index values are real number and these follow a continuous probability density function, so CHMM is proper than DHMM for the KOSPI200 index. In this paper, we present an artificial intelligent trading system based on CHMM for the KOSPI200 index futures system traders. Traders have experienced on technical trading for the KOSPI200 index futures market ever since the introduction of the KOSPI200 index futures market. They have applied many strategies to make profit in trading the KOSPI200 index futures. Some strategies are based on technical indicators such as moving averages or stochastics, and others are based on candlestick patterns such as three outside up, three outside down, harami or doji star. We show a trading system of moving average cross strategy based on CHMM, and we compare it to a traditional algorithmic trading system. We set the parameter values of moving averages at common values used by market practitioners. Empirical results are presented to compare the simulation performance with the traditional algorithmic trading system using long-term daily KOSPI200 index data of more than 20 years. Our suggested trading system shows higher trading performance than naive system trading.