• Title/Summary/Keyword: ARIMA Model

Search Result 366, Processing Time 0.037 seconds

Construction of integrated DB for domestic water-cycle system and short-term prediction model (생활용수 물순환 계통 통합 DB 및 단기예측모형 구축)

  • Seungyeon Lee;Sangeun Lee
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2023.05a
    • /
    • pp.362-362
    • /
    • 2023
  • 한정된 수자원의 이용 및 관리로 매년 물 부족과 물 배분 의사결정 문제가 발생하고 있다. 50년간(1965~2014년) 수자원의 총량은 약 1.2배 증가한 반면 인구수 약 1.8배, 생·공·농업용수의 수요는 약 5배가 증가(국회입법조사처, 2018) 했을 뿐 아니라, 기후변화의 영향으로 인한 강수량의 변화와 지역별 편차가 커져 지속가능한 물관리 필요성이 증대되고 있다. 따라서 효율적인 물관리를 위해서는 관리부처가 분절되어 있는 물순환 계통의 데이터를 통합하는 것이 우선시되어야 하고 이를 통해 물순환 모니터링/평가/예측 기술을 개발할 수 있다. 본 연구에서는 생활용수 물순환 계통 통합 DB를 정의 및 구축하였다. 도시의 관점에서 물순환 시스템을 순차적으로 물 유입(수원~취수장)/전달(정수장~급수지역)/유출(하(폐)수처리장~방류구)의 개념으로 설정하고 DB정의서를 마련하였다. 연구대상지는 가뭄이 장기화가 되고 있는 전라남도중 물순환 계통이 비교적 단순한 네트워크로 형성되어 있는 함평군 도시지역으로 선정하였다. 연구 기간은 총 5년(2017년 1월 1일~2021년 12월 31일)이고 일 단위 실계측자료 위주의 원자료를 구축하였다. 이를 이상치 탐지, 제거, 대체의 과정을 거쳐 품질 보정하고 정제된 시계열 자료에 대한 특성 분석을 하였다. 그 결과, 물순환 계통 내 주요 지점 간의 상관관계 및 지연시간을 통한 물흐름의 시계열적 특성을 파악할 수 있었으며 모형의 적합도를 판단하는 데 활용되는 통계량과 유의미하지 않은 잔차의 자기상관성을 볼 때 물 유입-전달-유출의 단기 예측을 위한 ARIMA(Auto-regressive Integrated Moving Average) 모형의 구축도 가능할 것으로 판단되었다. 다만 여름철 발생하는 방류량의 첨두값을 설명하기 위해서는 강우에 의한 불명수 발생으로 증가하는 방류량을 묘사할 수있어야 하므로 향후에는 물순환계통 외 해당 지역의 불명수(강우 효과)도 하수 방류량의 주요 입력 요인으로 추가 검토할 필요가 있다.

  • PDF

Water Supply forecast Using Multiple ARMA Model Based on the Analysis of Water Consumption Mode with Wavelet Transform. (Wavelet Transform을 이용한 물수요량의 특성분석 및 다원 ARMA모형을 통한 물수요량예측)

  • Jo, Yong-Jun;Kim, Jong-Mun
    • Journal of Korea Water Resources Association
    • /
    • v.31 no.3
    • /
    • pp.317-326
    • /
    • 1998
  • Water consumption characteristics on the northern part of Seoul were analyzed using wavelet transform with a base function of Coiflets 5. It turns out that long term evolution mode detected at 212 scale in 1995 was in a shape of hyperbolic tangent over the entire period due to the development of Sanggae resident site. Furthermore, there was seasonal water demand having something to do with economic cycle which reached its peak at the ends of June and December. The amount of this additional consumption was about $1,700\;\textrm{cm}^3/hr$ on June and $500\;\textrm{cm}^3/hr$ on December. It was also shown that the periods of energy containing sinusoidal component were 3.13 day, 33.33 hr, 23.98 hr and 12 hr, respectively, and the amplitude of 23.98 hr component was the most humongous. The components of relatively short frequency detected at $2^i$[i = 1,2,…12] scale were following Gaussian PDF. The most reliable predictive models are multiple AR[32,16,23] and ARMA[20, 16, 10, 23] which the input of temperature from the view point of minimized predictive error, mutual independence or residuals and the availableness of reliable meteorological data. The predicted values of water supply were quite consistent with the measured data which cast a possibility of the deployment of the predictive model developed in this study for the optimal management of water supply facilities.

  • PDF

Trend Analysis and Prediction of the Number of Births and the Number of Outpatients using Time Series Analysis (시계열 분석을 통한 출생아 수와 소아치과 내원 환자 수 추세 분석 및 예측)

  • Hwayeon, An;Seonmi, Kim;Namki, Choi
    • Journal of the korean academy of Pediatric Dentistry
    • /
    • v.49 no.3
    • /
    • pp.274-284
    • /
    • 2022
  • The purpose of this study was to analyze the trend of the number of births in Gwangju and the number of outpatients in Pediatric Dentistry at Chonnam National University Dental Hospital over the past 10 years (2010 - 2019) and predict the next year using time series analysis. The number of births showed an unstable downward trend with monthly variations, with the highest in January and the lowest in December. The average number of births in 2020 was predicted to be 682 (595 to 782, 95% CI), and the actual number of births was an average of 610. The number of outpatients was relatively stable, showing a month-to-month variation, with highest in August and the lowest in June. The average number of patients in 2020 was predicted to be 603 (505 to 701, 95% CI), and the average number of actual visits was 587. Despite the decrease in the number of births, the number of outpatients was expected to increase somewhat. Due to the special situation of COVID-19, the actual number of births and patients was to be slightly lower than the predicted values, but it was that they were within the predicted confidence interval. Time series analysis can be used as a basic tool to prepare for the low fertility era in the field of pediatric dentistry.

The Effect of Data Size on the k-NN Predictability: Application to Samsung Electronics Stock Market Prediction (데이터 크기에 따른 k-NN의 예측력 연구: 삼성전자주가를 사례로)

  • Chun, Se-Hak
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.3
    • /
    • pp.239-251
    • /
    • 2019
  • Statistical methods such as moving averages, Kalman filtering, exponential smoothing, regression analysis, and ARIMA (autoregressive integrated moving average) have been used for stock market predictions. However, these statistical methods have not produced superior performances. In recent years, machine learning techniques have been widely used in stock market predictions, including artificial neural network, SVM, and genetic algorithm. In particular, a case-based reasoning method, known as k-nearest neighbor is also widely used for stock price prediction. Case based reasoning retrieves several similar cases from previous cases when a new problem occurs, and combines the class labels of similar cases to create a classification for the new problem. However, case based reasoning has some problems. First, case based reasoning has a tendency to search for a fixed number of neighbors in the observation space and always selects the same number of neighbors rather than the best similar neighbors for the target case. So, case based reasoning may have to take into account more cases even when there are fewer cases applicable depending on the subject. Second, case based reasoning may select neighbors that are far away from the target case. Thus, case based reasoning does not guarantee an optimal pseudo-neighborhood for various target cases, and the predictability can be degraded due to a deviation from the desired similar neighbor. This paper examines how the size of learning data affects stock price predictability through k-nearest neighbor and compares the predictability of k-nearest neighbor with the random walk model according to the size of the learning data and the number of neighbors. In this study, Samsung electronics stock prices were predicted by dividing the learning dataset into two types. For the prediction of next day's closing price, we used four variables: opening value, daily high, daily low, and daily close. In the first experiment, data from January 1, 2000 to December 31, 2017 were used for the learning process. In the second experiment, data from January 1, 2015 to December 31, 2017 were used for the learning process. The test data is from January 1, 2018 to August 31, 2018 for both experiments. We compared the performance of k-NN with the random walk model using the two learning dataset. The mean absolute percentage error (MAPE) was 1.3497 for the random walk model and 1.3570 for the k-NN for the first experiment when the learning data was small. However, the mean absolute percentage error (MAPE) for the random walk model was 1.3497 and the k-NN was 1.2928 for the second experiment when the learning data was large. These results show that the prediction power when more learning data are used is higher than when less learning data are used. Also, this paper shows that k-NN generally produces a better predictive power than random walk model for larger learning datasets and does not when the learning dataset is relatively small. Future studies need to consider macroeconomic variables related to stock price forecasting including opening price, low price, high price, and closing price. Also, to produce better results, it is recommended that the k-nearest neighbor needs to find nearest neighbors using the second step filtering method considering fundamental economic variables as well as a sufficient amount of learning data.

Labor market forecasts for Information and communication construction business (정보통신공사업 인력수급차 분석 및 전망)

  • Kwak, Jeong Ho;Kwun, Tae Hee;Oh, Dong-Suk;Kim, Jung-Woo
    • Journal of Internet Computing and Services
    • /
    • v.16 no.2
    • /
    • pp.99-107
    • /
    • 2015
  • In this era of smart convergent environment wherein all industries are converged on ICT infrastructure and industries and cultures come together, the information and communication construction business is becoming more important. For the information and communication construction business to continue growing, it is very important to ensure that technical manpower is stably supplied. To date, however, there has been no theoretically methodical analysis of manpower supply and demand in the information and communications construction business. The need for the analysis of manpower supply and demand has become even more important after the government announced the road map for the development of construction business in December 2014 to seek measures to strengthen the human resources capacity based on the mid- to long-term manpower supply and demand analysis. As such, this study developed the manpower supply and demand forecast model for the information and communications construction business and presented the result of manpower supply and demand analysis. The analysis suggested that an overdemand situation would arise since the number of graduates of technical colleges decreased beginning 2007 because of fewer students entering technical colleges and due to the restructuring and reform of departments. In conclusion, it cited the need for the reeducation of existing manpower, continuous upgrading of professional development in the information and communications construction business, and provision of various policy incentives.

Road Accident Trends Analysis with Time Series Models for Various Road Types (도로종류별 교통사고 추세분석 및 시제열 분석모형 개발)

  • Han, Sang-Jin;Kim, Kewn-Jung
    • International Journal of Highway Engineering
    • /
    • v.9 no.3
    • /
    • pp.1-12
    • /
    • 2007
  • Roads in Korea can be classified into four types according to their responsible authorities. For example, Motorway is constructed, managed, and operated by the Korea Highway Corporation. Ministry of Construction and Transportation is in charge of National Highway, and Province Roads are run by each province government. Urban/county Roads are run by corresponding local government. This study analyses the trends of road accidents for each road type. For this purpose, the numbers of accidents, fatalities, and injuries are compared for each road type for last 15 years. The result shows that Urban/County Roads are the most dangerous, while Motorways are the safest, when we simply compare the numbers of accidents, fatalities, and injuries. However, when we compare these numbers by dividing by total road length, National Highway becomes the most dangerous while Province Roads becomes the safest. In the case of road accidents, fatalities, and injuries per vehicle km, which is known as the most objective comparison measure, it turns out that National Highway is the most dangerous roads again. This study also developed time series models to estimate trends of fatalities for each road type. These models will be useful when we set up or evaluate targets of national road safety.

  • PDF