• Title/Summary/Keyword: Forecast error

Search Result 414, Processing Time 0.022 seconds

A Study of the Nonlinear Characteristics Improvement for a Electronic Scale using Multiple Regression Analysis (다항식 회귀분석을 이용한 전자저울의 비선형 특성 개선 연구)

  • Chae, Gyoo-Soo
    • Journal of Convergence for Information Technology
    • /
    • v.9 no.6
    • /
    • pp.1-6
    • /
    • 2019
  • In this study, the development of a weight estimation model of electronic scale with nonlinear characteristics is presented using polynomial regression analysis. The output voltage of the load cell was measured directly using the reference mass. And a polynomial regression model was obtained using the matrix and curve fitting function of MS Office Excel. The weight was measured in 100g units using a load cell electronic scale measuring up to 5kg and the polynomial regression model was obtained. The error was calculated for simple($1^{st}$), $2^{nd}$ and $3^{rd}$ order polynomial regression. To analyze the suitability of the regression function for each model, the coefficient of determination was presented to indicate the correlation between the estimated mass and the measured data. Using the third order polynomial model proposed here, a very accurate model was obtained with a standard deviation of 10g and the determinant coefficient of 1.0. Based on the theory of multi regression model presented here, it can be used in various statistical researches such as weather forecast, new drug development and economic indicators analysis using logistic regression analysis, which has been widely used in artificial intelligence fields.

Estimating the Demand Function for Industrial Natural Gas Use in Korea : A Cross-sectional Analysis (횡단면 분석을 활용한 한국 산업용 도시가스 수요함수 추정)

  • Lee, Bok-Hee;Lee, Hye-Jeong;Yoo, Seung-Hoon;Huh, Sung-Yoon
    • Journal of the Korean Institute of Gas
    • /
    • v.24 no.6
    • /
    • pp.34-46
    • /
    • 2020
  • In order to supply stable natural gas in the future, it is necessary to forecast the demand in advance and secure the quantity of supply. In this paper, we propose a method of estimating the demand function of industrial natural gas, which is the core of the increase of domestic natural gas demand in the future. The cross-sectional data of 304 domestic industries were used to estimate the demand function of the industrial natural gas, and the effect of industry specific characteristics such as capital investment, manufacturing cost. Finally, the least absolute deviation estimation method which is robust to outliers and does not assume the homogeneity of the error term and the normality, And the results were derived. In addition, the economic value of industrial city gas was estimated using the price elasticity of industrial city gas. Therefore, it can be seen that the continuous expansion and supply of city gas to the industrial sector is beneficial at the national level, and the government needs to promote expansion through the industrial city gas support policy.

The Inter-correlation Analysis between Oil Prices and Dry Bulk Freight Rates (유가와 벌크선 운임의 상관관계 분석에 관한 연구)

  • Ahn, Byoung-Churl;Lee, Kee-Hwan;Kim, Myoung-Hee
    • Journal of Navigation and Port Research
    • /
    • v.46 no.3
    • /
    • pp.289-296
    • /
    • 2022
  • The purpose of this study was to investigate the inter-correlation between crude oil prices and Dry Bulk Freight rates. Eco-friendly shipping fuels has being actively developed to reduce carbon emission. However, carbon neutrality will take longer than anticipated in terms of the present development process. Because of OVID-19 and the Russian invasion of Ukraine, crude oil price fluctuation has been exacerbated. So we must examine the impact on Dry Bulk Freight rates the oil prices have had, because oil prices play a major role in shipping fuels. By using the VAR (Vector Autoregressive) model with monthly data of crude oil prices (Brent, Dubai and WTI) and Dry Bulk Freight rates (BDI, BCI and (BP I) 2008.10~2022.02, the empirical analysis documents that the oil prices have an impact on Dry bulk Freight rates. From the analysis of the forecast error variance decomposition, WTI has the largest explanatory relationship with the BDI and Dubai ranks seoond, Brent ranks third. In conclusion, WTI and Dubai have the largest impact on the BDI, while there are some differences according to the ship-type.

Development of Near Infrared Spectroscopy(NIRS) Equation of Crude Protein in Wheat Germplasm

  • Hyemyeong Yoon;Myung-Chul Lee;Yumi Choi;Myong-Jae Shin;Sejong Oh
    • Proceedings of the Plant Resources Society of Korea Conference
    • /
    • 2020.08a
    • /
    • pp.100-100
    • /
    • 2020
  • Wheat is mainly composed of carbohydrate but it contains a moderate amount of protein, which gives a very useful characteristics to flour food such as the unique elasticity and stickiness of the dough. We developed a calibration equation for analyzing crude protein content using Near Infrared Spectroscopy to quick analyze the crude protein content of wheat germplasm stored in the National Agrobiodiversity Center, RDA, Korea. The 1,798 wheat germplasms were used to draw up the calibration formula. The crude protein's interval distribution of 1,798 wheat germplasms used for the calibration was 7.04-20.84%, the average content was 13.2%, and standard deviation was 2.6%. The germplasms distribution was composed of a suitable group for the preparation of the calibration formula because the content distribution was a normal, excluding the 13.0-15.5% content section. In order to verify the applicability of the NIRS prediction model, we measured the crude protein content of the 300 wheat germplasms that were not used for the calibration using both Kjeldahl analysis and NIR spectrum. The analysis value calculated using each method were statistically processed, and the test results and statistical indicators of the predictive model were compared. As a result, The R2 value of the optimized NIRS prediction model was 0.997, and the Standard error of Calibration value(SEC) was 0.132, and slope value was 1.000. With prediction model selection, compared to Kjeldahl method, R2 values were 0.994(Kjeldahl), 0.998(NIRS), and the SEC value were 0.191 and 0.132, respectively, comparing the statistical indices of the forecast model. And slope value were 1.013, 1.000, respectively. The analysis of crude protein content by the NIRS predictive model developed by each statistical index showing similar figures is judged to show a high degree of correlation with the Kjeldahl analysis. The proven calibration equation will be used to measure the crude protein content of wheat germplasms held by the National Agrobiodiversity Center, and by dividing the wheat germplasms by their use according to the crude protein content, it will provide useful information to relevant researchers.

  • PDF

The Prediction of Purchase Amount of Customers Using Support Vector Regression with Separated Learning Method (Support Vector Regression에서 분리학습을 이용한 고객의 구매액 예측모형)

  • Hong, Tae-Ho;Kim, Eun-Mi
    • Journal of Intelligence and Information Systems
    • /
    • v.16 no.4
    • /
    • pp.213-225
    • /
    • 2010
  • Data mining has empowered the managers who are charge of the tasks in their company to present personalized and differentiated marketing programs to their customers with the rapid growth of information technology. Most studies on customer' response have focused on predicting whether they would respond or not for their marketing promotion as marketing managers have been eager to identify who would respond to their marketing promotion. So many studies utilizing data mining have tried to resolve the binary decision problems such as bankruptcy prediction, network intrusion detection, and fraud detection in credit card usages. The prediction of customer's response has been studied with similar methods mentioned above because the prediction of customer's response is a kind of dichotomous decision problem. In addition, a number of competitive data mining techniques such as neural networks, SVM(support vector machine), decision trees, logit, and genetic algorithms have been applied to the prediction of customer's response for marketing promotion. The marketing managers also have tried to classify their customers with quantitative measures such as recency, frequency, and monetary acquired from their transaction database. The measures mean that their customers came to purchase in recent or old days, how frequent in a period, and how much they spent once. Using segmented customers we proposed an approach that could enable to differentiate customers in the same rating among the segmented customers. Our approach employed support vector regression to forecast the purchase amount of customers for each customer rating. Our study used the sample that included 41,924 customers extracted from DMEF04 Data Set, who purchased at least once in the last two years. We classified customers from first rating to fifth rating based on the purchase amount after giving a marketing promotion. Here, we divided customers into first rating who has a large amount of purchase and fifth rating who are non-respondents for the promotion. Our proposed model forecasted the purchase amount of the customers in the same rating and the marketing managers could make a differentiated and personalized marketing program for each customer even though they were belong to the same rating. In addition, we proposed more efficient learning method by separating the learning samples. We employed two learning methods to compare the performance of proposed learning method with general learning method for SVRs. LMW (Learning Method using Whole data for purchasing customers) is a general learning method for forecasting the purchase amount of customers. And we proposed a method, LMS (Learning Method using Separated data for classification purchasing customers), that makes four different SVR models for each class of customers. To evaluate the performance of models, we calculated MAE (Mean Absolute Error) and MAPE (Mean Absolute Percent Error) for each model to predict the purchase amount of customers. In LMW, the overall performance was 0.670 MAPE and the best performance showed 0.327 MAPE. Generally, the performances of the proposed LMS model were analyzed as more superior compared to the performance of the LMW model. In LMS, we found that the best performance was 0.275 MAPE. The performance of LMS was higher than LMW in each class of customers. After comparing the performance of our proposed method LMS to LMW, our proposed model had more significant performance for forecasting the purchase amount of customers in each class. In addition, our approach will be useful for marketing managers when they need to customers for their promotion. Even if customers were belonging to same class, marketing managers could offer customers a differentiated and personalized marketing promotion.

An Intelligent Decision Support System for Selecting Promising Technologies for R&D based on Time-series Patent Analysis (R&D 기술 선정을 위한 시계열 특허 분석 기반 지능형 의사결정지원시스템)

  • Lee, Choongseok;Lee, Suk Joo;Choi, Byounggu
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.3
    • /
    • pp.79-96
    • /
    • 2012
  • As the pace of competition dramatically accelerates and the complexity of change grows, a variety of research have been conducted to improve firms' short-term performance and to enhance firms' long-term survival. In particular, researchers and practitioners have paid their attention to identify promising technologies that lead competitive advantage to a firm. Discovery of promising technology depends on how a firm evaluates the value of technologies, thus many evaluating methods have been proposed. Experts' opinion based approaches have been widely accepted to predict the value of technologies. Whereas this approach provides in-depth analysis and ensures validity of analysis results, it is usually cost-and time-ineffective and is limited to qualitative evaluation. Considerable studies attempt to forecast the value of technology by using patent information to overcome the limitation of experts' opinion based approach. Patent based technology evaluation has served as a valuable assessment approach of the technological forecasting because it contains a full and practical description of technology with uniform structure. Furthermore, it provides information that is not divulged in any other sources. Although patent information based approach has contributed to our understanding of prediction of promising technologies, it has some limitations because prediction has been made based on the past patent information, and the interpretations of patent analyses are not consistent. In order to fill this gap, this study proposes a technology forecasting methodology by integrating patent information approach and artificial intelligence method. The methodology consists of three modules : evaluation of technologies promising, implementation of technologies value prediction model, and recommendation of promising technologies. In the first module, technologies promising is evaluated from three different and complementary dimensions; impact, fusion, and diffusion perspectives. The impact of technologies refers to their influence on future technologies development and improvement, and is also clearly associated with their monetary value. The fusion of technologies denotes the extent to which a technology fuses different technologies, and represents the breadth of search underlying the technology. The fusion of technologies can be calculated based on technology or patent, thus this study measures two types of fusion index; fusion index per technology and fusion index per patent. Finally, the diffusion of technologies denotes their degree of applicability across scientific and technological fields. In the same vein, diffusion index per technology and diffusion index per patent are considered respectively. In the second module, technologies value prediction model is implemented using artificial intelligence method. This studies use the values of five indexes (i.e., impact index, fusion index per technology, fusion index per patent, diffusion index per technology and diffusion index per patent) at different time (e.g., t-n, t-n-1, t-n-2, ${\cdots}$) as input variables. The out variables are values of five indexes at time t, which is used for learning. The learning method adopted in this study is backpropagation algorithm. In the third module, this study recommends final promising technologies based on analytic hierarchy process. AHP provides relative importance of each index, leading to final promising index for technology. Applicability of the proposed methodology is tested by using U.S. patents in international patent class G06F (i.e., electronic digital data processing) from 2000 to 2008. The results show that mean absolute error value for prediction produced by the proposed methodology is lower than the value produced by multiple regression analysis in cases of fusion indexes. However, mean absolute error value of the proposed methodology is slightly higher than the value of multiple regression analysis. These unexpected results may be explained, in part, by small number of patents. Since this study only uses patent data in class G06F, number of sample patent data is relatively small, leading to incomplete learning to satisfy complex artificial intelligence structure. In addition, fusion index per technology and impact index are found to be important criteria to predict promising technology. This study attempts to extend the existing knowledge by proposing a new methodology for prediction technology value by integrating patent information analysis and artificial intelligence network. It helps managers who want to technology develop planning and policy maker who want to implement technology policy by providing quantitative prediction methodology. In addition, this study could help other researchers by proving a deeper understanding of the complex technological forecasting field.

Fund Flow and Market Risk (펀드플로우와 시장위험)

  • Chung, Hyo-Youn;Park, Jong-Won
    • The Korean Journal of Financial Management
    • /
    • v.27 no.2
    • /
    • pp.169-204
    • /
    • 2010
  • This paper examines the dynamic relationship between fund flow and market risk at the aggregate level and explores whether sudden sharp changes in fund flow (fund run) can cause a systemic risk in the Korean financial markets. We use daily and weekly data and regression and VAR analysis. Main results of the paper are as follows: First, in the stock market, a concurrent and a lagged unexpected fund flows have a positive relationship with market volatility. A positive shock in fund flow predicts an increase in stock market volatility. In the bond market, an unexpected fund flow has a negative relationship with the default risk premium, but a positive relationship with the term premium. And an unexpected fund flow of the money market fund has a negative relationship with the liquidy risk, but the explanatory power is very low. Second, for examining whether changes in fund flow induce a systemic risk, we construct a spillover index based on the forecast error variance decomposition of VAR model. A spillover index represents that how much the shock in fund flow can explain the change of market risk in a market. In general, explanatory powers from spillover indexes are so fluctuant and low. In the stock market, the impact of shocks in fund flow on market risk is relatively high and persistent during the period from the end of 2007 to 2008, which is the subprime-mortgage crisis period. In bond market, since the end of 2008, the impact of shocks in fund flow spreads to default risk continually, while in the money market, such a systematic effect doesn't take place. The persistent patterns of spillover effect appearing around a certain period in the stock market and the bond market suggest that the shock to the unexpected fund flow may increase the market risk and can be a cause of systemic risk in the financial markets. However, summarizing the results of regression and VAR model analysis, and considering the very low explanatory power of spillover index analysis, we can conclude that changes in fund flow have a very limited power in explaining changes in market risk and it is not very likely to induce the systemic risk by a fund run in the Korean financial markets.

  • PDF

Development of Market Growth Pattern Map Based on Growth Model and Self-organizing Map Algorithm: Focusing on ICT products (자기조직화 지도를 활용한 성장모형 기반의 시장 성장패턴 지도 구축: ICT제품을 중심으로)

  • Park, Do-Hyung;Chung, Jaekwon;Chung, Yeo Jin;Lee, Dongwon
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.4
    • /
    • pp.1-23
    • /
    • 2014
  • Market forecasting aims to estimate the sales volume of a product or service that is sold to consumers for a specific selling period. From the perspective of the enterprise, accurate market forecasting assists in determining the timing of new product introduction, product design, and establishing production plans and marketing strategies that enable a more efficient decision-making process. Moreover, accurate market forecasting enables governments to efficiently establish a national budget organization. This study aims to generate a market growth curve for ICT (information and communication technology) goods using past time series data; categorize products showing similar growth patterns; understand markets in the industry; and forecast the future outlook of such products. This study suggests the useful and meaningful process (or methodology) to identify the market growth pattern with quantitative growth model and data mining algorithm. The study employs the following methodology. At the first stage, past time series data are collected based on the target products or services of categorized industry. The data, such as the volume of sales and domestic consumption for a specific product or service, are collected from the relevant government ministry, the National Statistical Office, and other relevant government organizations. For collected data that may not be analyzed due to the lack of past data and the alteration of code names, data pre-processing work should be performed. At the second stage of this process, an optimal model for market forecasting should be selected. This model can be varied on the basis of the characteristics of each categorized industry. As this study is focused on the ICT industry, which has more frequent new technology appearances resulting in changes of the market structure, Logistic model, Gompertz model, and Bass model are selected. A hybrid model that combines different models can also be considered. The hybrid model considered for use in this study analyzes the size of the market potential through the Logistic and Gompertz models, and then the figures are used for the Bass model. The third stage of this process is to evaluate which model most accurately explains the data. In order to do this, the parameter should be estimated on the basis of the collected past time series data to generate the models' predictive value and calculate the root-mean squared error (RMSE). The model that shows the lowest average RMSE value for every product type is considered as the best model. At the fourth stage of this process, based on the estimated parameter value generated by the best model, a market growth pattern map is constructed with self-organizing map algorithm. A self-organizing map is learning with market pattern parameters for all products or services as input data, and the products or services are organized into an $N{\times}N$ map. The number of clusters increase from 2 to M, depending on the characteristics of the nodes on the map. The clusters are divided into zones, and the clusters with the ability to provide the most meaningful explanation are selected. Based on the final selection of clusters, the boundaries between the nodes are selected and, ultimately, the market growth pattern map is completed. The last step is to determine the final characteristics of the clusters as well as the market growth curve. The average of the market growth pattern parameters in the clusters is taken to be a representative figure. Using this figure, a growth curve is drawn for each cluster, and their characteristics are analyzed. Also, taking into consideration the product types in each cluster, their characteristics can be qualitatively generated. We expect that the process and system that this paper suggests can be used as a tool for forecasting demand in the ICT and other industries.

A Hybrid Forecasting Framework based on Case-based Reasoning and Artificial Neural Network (사례기반 추론기법과 인공신경망을 이용한 서비스 수요예측 프레임워크)

  • Hwang, Yousub
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.4
    • /
    • pp.43-57
    • /
    • 2012
  • To enhance the competitive advantage in a constantly changing business environment, an enterprise management must make the right decision in many business activities based on both internal and external information. Thus, providing accurate information plays a prominent role in management's decision making. Intuitively, historical data can provide a feasible estimate through the forecasting models. Therefore, if the service department can estimate the service quantity for the next period, the service department can then effectively control the inventory of service related resources such as human, parts, and other facilities. In addition, the production department can make load map for improving its product quality. Therefore, obtaining an accurate service forecast most likely appears to be critical to manufacturing companies. Numerous investigations addressing this problem have generally employed statistical methods, such as regression or autoregressive and moving average simulation. However, these methods are only efficient for data with are seasonal or cyclical. If the data are influenced by the special characteristics of product, they are not feasible. In our research, we propose a forecasting framework that predicts service demand of manufacturing organization by combining Case-based reasoning (CBR) and leveraging an unsupervised artificial neural network based clustering analysis (i.e., Self-Organizing Maps; SOM). We believe that this is one of the first attempts at applying unsupervised artificial neural network-based machine-learning techniques in the service forecasting domain. Our proposed approach has several appealing features : (1) We applied CBR and SOM in a new forecasting domain such as service demand forecasting. (2) We proposed our combined approach between CBR and SOM in order to overcome limitations of traditional statistical forecasting methods and We have developed a service forecasting tool based on the proposed approach using an unsupervised artificial neural network and Case-based reasoning. In this research, we conducted an empirical study on a real digital TV manufacturer (i.e., Company A). In addition, we have empirically evaluated the proposed approach and tool using real sales and service related data from digital TV manufacturer. In our empirical experiments, we intend to explore the performance of our proposed service forecasting framework when compared to the performances predicted by other two service forecasting methods; one is traditional CBR based forecasting model and the other is the existing service forecasting model used by Company A. We ran each service forecasting 144 times; each time, input data were randomly sampled for each service forecasting framework. To evaluate accuracy of forecasting results, we used Mean Absolute Percentage Error (MAPE) as primary performance measure in our experiments. We conducted one-way ANOVA test with the 144 measurements of MAPE for three different service forecasting approaches. For example, the F-ratio of MAPE for three different service forecasting approaches is 67.25 and the p-value is 0.000. This means that the difference between the MAPE of the three different service forecasting approaches is significant at the level of 0.000. Since there is a significant difference among the different service forecasting approaches, we conducted Tukey's HSD post hoc test to determine exactly which means of MAPE are significantly different from which other ones. In terms of MAPE, Tukey's HSD post hoc test grouped the three different service forecasting approaches into three different subsets in the following order: our proposed approach > traditional CBR-based service forecasting approach > the existing forecasting approach used by Company A. Consequently, our empirical experiments show that our proposed approach outperformed the traditional CBR based forecasting model and the existing service forecasting model used by Company A. The rest of this paper is organized as follows. Section 2 provides some research background information such as summary of CBR and SOM. Section 3 presents a hybrid service forecasting framework based on Case-based Reasoning and Self-Organizing Maps, while the empirical evaluation results are summarized in Section 4. Conclusion and future research directions are finally discussed in Section 5.

Downscaling of Sunshine Duration for a Complex Terrain Based on the Shaded Relief Image and the Sky Condition (하늘상태와 음영기복도에 근거한 복잡지형의 일조시간 분포 상세화)

  • Kim, Seung-Ho;Yun, Jin I.
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.18 no.4
    • /
    • pp.233-241
    • /
    • 2016
  • Experiments were carried out to quantify the topographic effects on attenuation of sunshine in complex terrain and the results are expected to help convert the coarse resolution sunshine duration information provided by the Korea Meteorological Administration (KMA) into a detailed map reflecting the terrain characteristics of mountainous watershed. Hourly shaded relief images for one year, each pixel consisting of 0 to 255 brightness value, were constructed by applying techniques of shadow modeling and skyline analysis to the 3m resolution digital elevation model for an experimental watershed on the southern slope of Mt. Jiri in Korea. By using a bimetal sunshine recorder, sunshine duration was measured at three points with different terrain conditions in the watershed from May 15, 2015 to May 14, 2016. The brightness values of the 3 corresponding pixel points on the shaded relief map were extracted and regressed to the measured sunshine duration, resulting in a brightness-sunshine duration response curve for a clear day. We devised a method to calibrate this curve equation according to sky condition categorized by cloud amount and used it to derive an empirical model for estimating sunshine duration over a complex terrain. When the performance of this model was compared with a conventional scheme for estimating sunshine duration over a horizontal plane, the estimation bias was improved remarkably and the root mean square error for daily sunshine hour was 1.7hr, which is a reduction by 37% from the conventional method. In order to apply this model to a given area, the clear-sky sunshine duration of each pixel should be produced on hourly intervals first, by driving the curve equation with the hourly shaded relief image of the area. Next, the cloud effect is corrected by 3-hourly 'sky condition' of the KMA digital forecast products. Finally, daily sunshine hour can be obtained by accumulating the hourly sunshine duration. A detailed sunshine duration distribution of 3m horizontal resolution was obtained by applying this procedure to the experimental watershed.