• Title/Summary/Keyword: Time-series data prediction

Comparison of Prediction Accuracy Between Classification and Convolution Algorithm in Fault Diagnosis of Rotatory Machines at Varying Speed (회전수가 변하는 기기의 고장진단에 있어서 특성 기반 분류와 합성곱 기반 알고리즘의 예측 정확도 비교)

  • Moon, Ki-Yeong;Kim, Hyung-Jin;Hwang, Se-Yun;Lee, Jang Hyun
    • Journal of Navigation and Port Research
    • /
    • v.46 no.3
    • /
    • pp.280-288
    • /
    • 2022
  • This study examined the diagnosis of abnormalities and faults in equipment whose rotational speed changes even during regular operation. Its purpose was to suggest a procedure for properly applying machine learning to time series data that become non-stationary as the rotational speed changes. Anomaly and fault diagnosis was performed using machine learning: k-Nearest Neighbor (k-NN), Support Vector Machine (SVM), and Random Forest. To compare the diagnostic accuracy, an autoencoder was used for anomaly detection and a convolution-based Conv1D model was additionally used for fault diagnosis. Feature vectors comprising statistical and frequency attributes were extracted, and normalization and dimensionality reduction were applied to them. Changes in the diagnostic accuracy of machine learning according to feature selection, normalization, and dimensionality reduction are explained. The hyperparameter optimization process and the layered structure are also described for each algorithm. Finally, the results show that machine learning can accurately diagnose the failure of a variable-rotation machine given appropriate feature treatment, even though convolution algorithms have been widely applied to this problem.
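
The feature-based pipeline described in this abstract (feature extraction, normalization, then a classifier such as k-NN) could be sketched roughly as follows. This is a minimal illustration only: the signals are synthetic, the four statistical features and the function names are assumptions, and the frequency attributes and dimensionality reduction used in the paper are omitted.

```python
import math
from collections import Counter


def extract_features(signal):
    """Return a small statistical feature vector: mean, RMS, peak-to-peak, kurtosis."""
    n = len(signal)
    mean = sum(signal) / n
    var = sum((x - mean) ** 2 for x in signal) / n
    rms = math.sqrt(sum(x * x for x in signal) / n)
    peak_to_peak = max(signal) - min(signal)
    kurtosis = (sum((x - mean) ** 4 for x in signal) / n) / var ** 2 if var > 0 else 0.0
    return [mean, rms, peak_to_peak, kurtosis]


def fit_min_max(rows):
    """Learn per-feature minimum and maximum from the training feature vectors."""
    columns = list(zip(*rows))
    return [min(c) for c in columns], [max(c) for c in columns]


def apply_min_max(row, lows, highs):
    """Scale one feature vector with the training minima and maxima."""
    return [(v - lo) / (hi - lo) if hi > lo else 0.0 for v, lo, hi in zip(row, lows, highs)]


def knn_predict(train_x, train_y, query, k=3):
    """Majority vote among the k training vectors closest to the query."""
    nearest = sorted(zip(train_x, train_y), key=lambda pair: math.dist(pair[0], query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def make_signal(amplitude, spike, n=200):
    """Synthetic vibration: a sine wave plus a periodic impulse (hypothetical fault signature)."""
    return [amplitude * math.sin(2 * math.pi * 5 * i / n) + (spike if i % 50 == 0 else 0.0)
            for i in range(n)]


train_signals = [make_signal(1.0, 0.0), make_signal(1.1, 0.0),
                 make_signal(1.0, 4.0), make_signal(1.2, 5.0)]
train_labels = ["normal", "normal", "fault", "fault"]

raw = [extract_features(s) for s in train_signals]
lows, highs = fit_min_max(raw)
train_x = [apply_min_max(r, lows, highs) for r in raw]

query = apply_min_max(extract_features(make_signal(1.05, 4.5)), lows, highs)
print(knn_predict(train_x, train_labels, query))
```

The scaling parameters are learned from the training set only and then reused for the query, which avoids leaking information from unseen data into the normalization step.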

The Analysis of Future Land Use Change Impact on Hydrology and Water Quality Using SWAT Model (SWAT 모형을 이용한 미래 토지이용변화가 수문 - 수질에 미치는 영향 분석)

  • Park, Jong-Yoon;Lee, Mi Seon;Lee, Yong Jun;Kim, Seong Joon
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.28 no.2B
    • /
    • pp.187-197
    • /
    • 2008
  • This study assesses the impact of future land use change on hydrology and water quality in the Gyungan-cheon watershed ($255.44\,\mathrm{km}^2$) using the SWAT (Soil and Water Assessment Tool) model. Using five past Landsat TM (1987, 1991, 1996, 2004) and ETM+ (2001) satellite images, a time series of land use maps was prepared, and the future land uses (2030, 2060, 2090) were predicted using the CA-Markov technique. Four years of streamflow and water quality data (SS, T-N, T-P), a DEM (Digital Elevation Model), the stream network, and soil information (1:25,000) were prepared. The model was calibrated for 2 years (1999 and 2000) and verified for 2 years (2001 and 2002), with an average Nash-Sutcliffe model efficiency of 0.59 for streamflow and determination coefficients of 0.88, 0.72 and 0.68 for Sediment, T-N (Total Nitrogen) and T-P (Total Phosphorus), respectively. Relative to 2004 values, the predictions for 2030, 2060 and 2090 showed total runoff increasing by 1.4%, 2.0% and 2.7% for increases of 0.6, 0.8 and 1.1 in the watershed-averaged CN value. Relative to 2004 values, future Sediment, T-N and T-P increased by 51.4%, 5.0% and 11.7% in 2030, by 70.5%, 8.5% and 16.7% in 2060, and by 74.9%, 10.9% and 19.9% in 2090.
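
The two goodness-of-fit measures reported in this abstract can be computed as in the sketch below. The function names are illustrative, and taking the determination coefficient as the squared Pearson correlation is an assumption, since the abstract does not state which definition was used.

```python
def nash_sutcliffe(observed, simulated):
    """Nash-Sutcliffe efficiency: 1 is a perfect fit, 0 matches the observed mean."""
    mean_obs = sum(observed) / len(observed)
    residual = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    spread = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - residual / spread


def determination_coefficient(observed, simulated):
    """Squared Pearson correlation between observed and simulated series."""
    n = len(observed)
    mean_o = sum(observed) / n
    mean_s = sum(simulated) / n
    cov = sum((o - mean_o) * (s - mean_s) for o, s in zip(observed, simulated))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_s = sum((s - mean_s) ** 2 for s in simulated)
    return cov * cov / (var_o * var_s)


# Made-up streamflow values, for illustration only.
observed = [1.0, 2.0, 3.0, 4.0, 5.0]
simulated = [1.2, 1.8, 3.3, 3.9, 4.6]
print(nash_sutcliffe(observed, simulated))
print(determination_coefficient(observed, simulated))
```

Unlike the determination coefficient, the Nash-Sutcliffe efficiency penalizes systematic bias, so a simulation that is perfectly correlated with the observations but offset from them still scores below 1.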

Assessment of Future Climate and Land Use Change on Hydrology and Stream Water Quality of Anseongcheon Watershed Using SWAT Model (II) (SWAT 모형을 이용한 미래 기후변화 및 토지이용 변화에 따른 안성천 유역 수문 - 수질 변화 분석 (II))

  • Lee, Yong Jun;An, So Ra;Kang, Boosik;Kim, Seong Joon
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.28 no.6B
    • /
    • pp.665-673
    • /
    • 2008
  • This study assesses the potential impact of future climate and land use change on the streamflow and stream water quality of the study watershed, using the model parameters established in part (I). The CCCma (Canadian Centre for Climate Modelling and Analysis) CGCM2 (Canadian Global Coupled Model) outputs based on the IPCC SRES (Special Report on Emissions Scenarios) A2 and B2 scenarios were adopted for the future climate condition, and the data were downscaled with the Stochastic Spatio-Temporal Random Cascade Model technique. The future land use condition was predicted using a modified CA-Markov (Cellular Automata-Markov chain) technique with a past time series of Landsat satellite images. The model was applied to future extreme precipitation cases around 2030, 2060 and 2090. The predicted results showed that the runoff ratio increased by 8% relative to the 2005 precipitation (1160.1 mm) and runoff ratio (65%). Accordingly, Sediment, T-N and T-P also increased by 120%, 16% and 10%, respectively, for the case of a 50% precipitation increase. This research is meaningful in that it provides a methodological procedure for evaluating the effects of potential future climate and land use changes on watershed hydrology and stream water quality. The model results are expected to support advance planning for healthy and sustainable watershed management and for climate change countermeasures.
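
As a rough illustration of the Markov chain half of the CA-Markov technique, the sketch below projects land-use area shares forward with a transition matrix. The class names, shares and transition probabilities are entirely hypothetical, and the cellular automata step that allocates the changes spatially is not shown.

```python
def project_shares(shares, transition, steps):
    """Apply a land-use transition matrix to area shares for a number of time steps."""
    classes = list(shares)
    current = dict(shares)
    for _ in range(steps):
        current = {
            dst: sum(current[src] * transition[src][dst] for src in classes)
            for dst in classes
        }
    return current


# Hypothetical probabilities of changing class over one time step (each row sums to 1).
HYPOTHETICAL_TRANSITION = {
    "forest": {"forest": 0.95, "agriculture": 0.02, "urban": 0.03},
    "agriculture": {"forest": 0.02, "agriculture": 0.90, "urban": 0.08},
    "urban": {"forest": 0.00, "agriculture": 0.00, "urban": 1.00},
}

# Hypothetical present-day area shares.
current_shares = {"forest": 0.5, "agriculture": 0.4, "urban": 0.1}

for step in (1, 2, 3):
    print(step, project_shares(current_shares, HYPOTHETICAL_TRANSITION, step))
```

In practice the transition probabilities would be estimated by comparing classified satellite images from two dates, which is where the past Landsat time series enters.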

Prediction of Water Storage Rate for Agricultural Reservoirs Using Univariate and Multivariate LSTM Models (단변량 및 다변량 LSTM을 이용한 농업용 저수지의 저수율 예측)

  • Sunguk Joh;Yangwon Lee
    • Korean Journal of Remote Sensing
    • /
    • v.39 no.5_4
    • /
    • pp.1125-1134
    • /
    • 2023
  • Of the roughly 17,000 reservoirs in Korea, 13,600 small agricultural reservoirs have no hydrological measurement facilities, making it difficult to predict their water storage volume and to operate them appropriately. This paper examined univariate and multivariate long short-term memory (LSTM) modeling to predict the storage rate of agricultural reservoirs using remote sensing and artificial intelligence. The univariate LSTM model used only the water storage rate as an explanatory variable, and the multivariate LSTM model added n-day accumulated precipitation and day of year (DOY) as explanatory variables. The models were trained on eight years of data (2013 to 2020) for Idong Reservoir, and their predictions of the daily water storage rate in 2021 were validated for accuracy assessment. The univariate model showed a root-mean-square error (RMSE) of 1.04%, 2.52%, and 4.18% for the one-, three-, and five-day predictions. The multivariate model showed an RMSE of 0.98%, 1.95%, and 2.76% for the one-, three-, and five-day predictions. In addition to the time-series storage rate, DOY and the daily and 5-day cumulative precipitation variables were more significant than the others for the daily model, which suggests that the temporal range over which precipitation affects the daily water storage rate is approximately five days.
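
The multivariate input described in this abstract (storage rate, n-day accumulated precipitation, DOY) could be arranged into supervised samples along the following lines. This is only a sketch of the data preparation: the window length, the variable names and the toy values are assumptions, and the LSTM itself is not shown.

```python
def accumulated_precipitation(precip, n_days):
    """Sum of precipitation over the current day and the preceding n_days - 1 days."""
    return [sum(precip[max(0, i - n_days + 1): i + 1]) for i in range(len(precip))]


def build_samples(storage, precip, doy, window=7, horizon=1, accum_days=5):
    """Return (input_window, target) pairs for a horizon-day-ahead forecast.

    Each input window is a list of [storage rate, accumulated precipitation, DOY]
    rows; the target is the storage rate `horizon` days after the window ends.
    """
    accum = accumulated_precipitation(precip, accum_days)
    samples = []
    for end in range(window, len(storage) - horizon + 1):
        steps = [[storage[t], accum[t], doy[t]] for t in range(end - window, end)]
        samples.append((steps, storage[end + horizon - 1]))
    return samples


# Toy data for ten consecutive days (made-up values).
storage = [80.0, 79.5, 79.0, 81.0, 83.5, 83.0, 82.4, 82.0, 81.5, 81.1]
precip = [0.0, 0.0, 12.0, 30.0, 0.0, 0.0, 0.0, 2.0, 0.0, 0.0]
doy = list(range(150, 160))

for steps, target in build_samples(storage, precip, doy, window=3, horizon=1, accum_days=5):
    print(steps[-1], "->", target)
```

Dropping the precipitation and DOY columns from each row gives the univariate variant, in which the storage rate is the only explanatory variable.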

Predicting the Direction of the Stock Index by Using a Domain-Specific Sentiment Dictionary (주가지수 방향성 예측을 위한 주제지향 감성사전 구축 방안)

  • Yu, Eunji;Kim, Yoosin;Kim, Namgyu;Jeong, Seung Ryul
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.1
    • /
    • pp.95-110
    • /
    • 2013
  • Recently, the amount of unstructured data being generated through a variety of social media has been increasing rapidly, resulting in an increasing need to collect, store, search for, analyze, and visualize this data. This kind of data cannot be handled appropriately by the traditional methodologies usually used for analyzing structured data because of its vast volume and unstructured nature. In this situation, many attempts are being made to analyze unstructured data such as text files and log files through various commercial or noncommercial analytical tools. Among the various contemporary issues dealt with in the literature on unstructured text data analysis, the concepts and techniques of opinion mining have been attracting much attention from pioneering researchers and business practitioners. Opinion mining, or sentiment analysis, refers to a series of processes that analyze participants' opinions, sentiments, evaluations, attitudes, and emotions about selected products, services, organizations, social issues, and so on. In other words, many attempts based on various opinion mining techniques are being made to resolve complicated issues that could not otherwise have been solved by existing traditional approaches. One of the most representative attempts using the opinion mining technique may be the recent research that proposed an intelligent model for predicting the direction of the stock index. This model works mainly on the basis of opinions extracted from an overwhelming number of economic news reports. News content published on various media is a typical example of unstructured text data. Every day, a large volume of new content is created, digitalized, and subsequently distributed to us via online or offline channels. Many studies have revealed that we make better decisions on political, economic, and social issues by analyzing news and other related information. 
In this sense, we expect to predict the fluctuation of stock markets partly by analyzing the relationship between economic news reports and the pattern of stock prices. So far, in the literature on opinion mining, most studies including ours have utilized a sentiment dictionary to elicit sentiment polarity or sentiment value from a large number of documents. A sentiment dictionary consists of pairs of selected words and their sentiment values. Sentiment classifiers refer to the dictionary to formulate the sentiment polarity of words, of sentences in a document, and of the whole document. However, most traditional approaches share a limitation in that they do not consider the flexibility of sentiment polarity; that is, the sentiment polarity or sentiment value of a word is fixed and cannot be changed in a traditional sentiment dictionary. In the real world, however, the sentiment polarity of a word can vary depending on the time, situation, and purpose of the analysis. It can also be contradictory in nature. The flexibility of sentiment polarity motivated us to conduct this study. In this paper, we argue that sentiment polarity should be assigned not merely on the basis of the inherent meaning of a word but on the basis of its ad hoc meaning within a particular context. To implement our idea, we present an intelligent investment decision-support model based on opinion mining that scrapes and parses massive volumes of economic news on the web, tags sentiment words, classifies the sentiment polarity of the news, and finally predicts the direction of the next day's stock index. In addition, we applied a domain-specific sentiment dictionary instead of a general-purpose one to classify each piece of news as either positive or negative. For the purpose of performance evaluation, we performed intensive experiments and investigated the prediction accuracy of our model. 
For the experiments to predict the direction of the stock index, we gathered and analyzed 1,072 articles about stock markets published by "M" and "E" media between July 2011 and September 2011.
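
The dictionary-based pipeline outlined in this abstract (tag sentiment words, classify each news item, then predict the index direction) could be sketched as below. The dictionary entries, the tie-breaking rule and the majority threshold are all hypothetical and do not reproduce the authors' dictionary or decision rule.

```python
import string

# Hypothetical domain-specific dictionary: word -> sentiment value.
STOCK_DICTIONARY = {
    "rally": 1, "surge": 1, "gain": 1,
    "plunge": -1, "slump": -1, "default": -1,
}


def tokenize(text):
    """Lowercase the text and strip punctuation from each whitespace-separated token."""
    return [word.strip(string.punctuation) for word in text.lower().split()]


def classify_article(text, dictionary):
    """Label an article positive or negative from the sum of its word sentiment values.

    Articles with a score of zero are labeled negative here; that is an arbitrary choice.
    """
    score = sum(dictionary.get(token, 0) for token in tokenize(text))
    return "positive" if score > 0 else "negative"


def predict_direction(articles, dictionary, threshold=0.5):
    """Predict 'up' when the share of positive articles exceeds the threshold."""
    positives = sum(classify_article(a, dictionary) == "positive" for a in articles)
    return "up" if positives / len(articles) > threshold else "down"


todays_news = [
    "Shares surge as exporters gain.",
    "Chipmakers rally on strong orders.",
    "Builders slump on default fears.",
]
print(predict_direction(todays_news, STOCK_DICTIONARY))
```

Swapping in a different dictionary for the same articles is the point of the domain-specific approach: the same word can carry a different polarity in stock-market news than in general text.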

A Comparative Study on Failure Prediction Models for Small and Medium Manufacturing Company (중소제조기업의 부실예측모형 비교연구)

  • Hwangbo, Yun;Moon, Jong Geon
    • Asia-Pacific Journal of Business Venturing and Entrepreneurship
    • /
    • v.11 no.3
    • /
    • pp.1-15
    • /
    • 2016
  • This study analyzed the prediction capabilities of a multivariate model, a logistic regression model, and an artificial neural network model based on the financial information of small and medium-sized companies listed on KOSDAQ. A total of 166 firms were sampled for the analysis: 83 companies delisted from 2009 to 2012 and 83 normal companies. The models were trained on 100 companies, 50 delisted and 50 normal, selected at random from the 166. The remaining 66 companies were used to verify the accuracy of the models. Each model was designed by carrying out t-tests on 79 financial ratios for the last 5 years and identifying 9 significant variables. The t-tests showed that financial profitability variables were the major variables for predicting financial risk at an early stage, while financial stability variables and financial cash flow variables were identified as additional significant variables at a later stage of insolvency. When the prediction capabilities of the models were compared, the logistic regression model exhibited the highest accuracy on the training data, while the artificial neural network model provided the most accurate results on the test data. This study differs from previous research in the following ways. First, it considered the time-series aspect, in light of the fact that failure proceeds gradually. Second, while previous studies constructed a multivariate discriminant model ignoring normality, this study reviewed the normality of the independent variables and compared the result with the other models. The policy implication of this study is that the reliability of disclosure documents is important, because according to this paper the symptoms of a firm's failure appear in its financial statements. Therefore, institutional arrangements for restraining moral laxity on the part of accounting firms and their employees should be strengthened.
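
The variable screening step mentioned in this abstract (t-tests across financial ratios) might look roughly like the sketch below. The use of Welch's unequal-variance statistic, the cutoff of 2.0 in place of a proper p-value, and the ratio names and values are all assumptions made for illustration.

```python
import math
import statistics


def welch_t(group_a, group_b):
    """Welch's t statistic for the difference in means of two independent samples."""
    mean_a, mean_b = statistics.mean(group_a), statistics.mean(group_b)
    var_a, var_b = statistics.variance(group_a), statistics.variance(group_b)
    return (mean_a - mean_b) / math.sqrt(var_a / len(group_a) + var_b / len(group_b))


def select_ratios(delisted, normal, cutoff=2.0):
    """Keep the ratios whose absolute t statistic reaches the (rule-of-thumb) cutoff."""
    return [name for name in delisted
            if abs(welch_t(delisted[name], normal[name])) >= cutoff]


# Made-up ratio values for four delisted and four normal firms.
delisted = {
    "return_on_assets": [-0.20, -0.10, -0.15, -0.05],
    "asset_turnover": [1.0, 1.2, 0.9, 1.1],
}
normal = {
    "return_on_assets": [0.05, 0.10, 0.08, 0.12],
    "asset_turnover": [1.1, 0.9, 1.0, 1.2],
}
print(select_ratios(delisted, normal))
```

A full analysis would convert each statistic to a p-value with the appropriate degrees of freedom and would repeat the test for each of the five years before failure.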

Wave Analysis and Spectrum Estimation for the Optimal Design of the Wave Energy Converter in the Hupo Coastal Sea (파력발전장치 설계를 위한후포 연안의 파랑 분석 및 스펙트럼 추정)

  • Kweon, Hyuck-Min;Cho, Hongyeon;Jeong, Weon-Mu
    • Journal of Korean Society of Coastal and Ocean Engineers
    • /
    • v.25 no.3
    • /
    • pp.147-153
    • /
    • 2013
  • Various types of WEC (Wave Energy Converter) exist, and among them the point absorber is the most widely investigated. However, it is difficult to find examples anywhere in the world of systematic analysis of measured data for the design of a point absorber type power buoy. This study investigates the wave load acting on the point absorber type resonance power buoy wave energy extraction system proposed by Kweon et al. (2010). It analyzes the time series spectra of the three-year wave data (2002.05.01~2005.03.29) measured using a pressure-type wave gauge at the seaside of the north breakwater of Hupo harbor, located on the east coast of the Korean peninsula. The analysis showed that monthly variations in wave period and wave height were apparent and that monthly wave power was unevenly distributed over the year. The average wave steepness of the usual wave was 0.01, lower than the wind wave range of 0.02-0.04. The mode of the average wave period is 5.31 sec, while the mode of the wave height for the applicable period is 0.29 m. The occurrence probability of the peak period is of a bi-modal type, with a mode value between 4.47 sec and 6.78 sec. The design wave period can be selected from the above four values of 0.01, 5.31, 4.47, 6.78. About 95% of the measured wave heights are below 1 m. This study found that a resonance power buoy system is necessary in coastal areas with low wave energy and that an optimal design for overcoming the uneven monthly distribution of wave power is a major task in the development of a WEF (Wave Energy Farm). Because the average spectrum of the usual wave could not be expressed by the standard spectrum equation, this study proposes a new spectrum equation with three parameters, which can provide basic data for predicting the power production of a wave power buoy and for the fatigue analysis of the system.
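
As a plausibility check on the steepness figure, wave steepness can be estimated as height divided by wavelength. The sketch below assumes the deep-water linear dispersion relation L = gT²/(2π), which may not hold at the measurement site, and it combines the modal height and period from the abstract, which is not necessarily how the authors computed their average steepness.

```python
import math

G = 9.81  # gravitational acceleration in m/s^2


def deep_water_wavelength(period_s):
    """Wavelength in metres from linear wave theory, assuming deep water."""
    return G * period_s ** 2 / (2 * math.pi)


def wave_steepness(height_m, period_s):
    """Ratio of wave height to deep-water wavelength."""
    return height_m / deep_water_wavelength(period_s)


# Modal wave height and period quoted in the abstract.
print(deep_water_wavelength(5.31))
print(wave_steepness(0.29, 5.31))
```

Under these assumptions the steepness comes out below 0.01, the same order of magnitude as the reported average and well under the 0.02-0.04 range quoted for wind waves.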

Prediction of Seabed Topography Change Due to Construction of Offshore Wind Power Structures in the West-Southern Sea of Korea (서남해에서 해상풍력구조물의 건설에 의한 해저지형의 변화예측)

  • Jeong, Seung Myung;Kwon, Kyung Hwan;Lee, Jong Sup;Park, Il Heum
    • Journal of Korean Society of Coastal and Ocean Engineers
    • /
    • v.31 no.6
    • /
    • pp.423-433
    • /
    • 2019
  • To predict the seabed topography change due to the construction of offshore wind power structures in the west-southern sea of Korea, field observations of tides, tidal currents, suspended sediment concentrations and seabed sediments were carried out simultaneously. These data were used for numerical simulation. In the numerical experiments, the empirical constants for the suspended sediment flux were determined by trial and error. When the concentration distribution factor was 0.1 and the proportional constant was 0.05 in the suspended sediment equilibrium concentration formulae, the calculated suspended sediment concentrations were reasonably similar to the observed ones. For the open boundary conditions of the suspended sediment, it was appropriate to scale the time series of observed suspended sediment concentrations by 11.0 times at the south-east boundary corner, 0.5 times at the south-west, 1.0 times at the west-north, 1.0 times at the north-west and 1.0 times at the north-east. In this case, the depth change around the open boundaries was smooth and not intermittent. With these calibrations, the annual water depth change before and after construction of the offshore wind power structures was shown to be under 1 cm. The reasons were that the numerical model, which used a large-scale grid, could not reproduce local scour, and that the results showed almost no velocity change exceeding ± 2 cm/s because the jacket structures, with a small diameter of about 1 m, were water-permeable. It was therefore natural that the seabed topography in the study area changed only slightly.

Dynamic Equilibrium Position Prediction Model for the Confluence Area of Nakdong River (낙동강 합류부 삼각주의 동적 평형 위치 예측 모델: 감천-낙동강 합류점 중심 분석 연구)

  • Minsik Kim;Haein Shin;Wook-Hyun Nahm;Wonsuck Kim
    • Economic and Environmental Geology
    • /
    • v.56 no.4
    • /
    • pp.435-445
    • /
    • 2023
  • A delta is a depositional landform that forms when sediment transported by a river is deposited in a relatively low-energy environment, such as a lake, a sea, or a main channel. Among these, a delta formed at the confluence of rivers is of great importance in river management and research because it has a significant impact on the hydraulic and sedimentological characteristics of the river. Recently, the equilibrium state of the confluence areas of the Nakdong River has been disrupted by large-scale dredging and the construction of levees. However, owing to the natural recovery of the river, the confluence areas are returning to their pre-dredging natural state through ongoing sedimentation. The time-series data show that the confluence delta has been growing steadily since the dredging, but that once it reaches a certain size it alternates between growth and retreat, and its overall size does not change significantly. In this study, we developed a model to explain the sedimentation-erosion processes in the confluence area based on the assumption that the confluence delta reaches a dynamic equilibrium. The model is based on two fundamental principles: sedimentation due to supply from the tributary and erosion due to the main channel. The erosion coefficient representing the Nakdong River confluence areas was obtained using data from the tributaries of the Nakdong River. Sensitivity analyses were conducted using the developed model to understand how the confluence delta responds to changes in the sediment and water discharges of the tributary and the main channel, respectively. We then used the annual average discharge of the Nakdong River's tributaries to predict the dynamic equilibrium positions of the confluence deltas. Finally, we conducted a simulation experiment on the development of the Gamcheon-Nakdong River delta using recorded daily discharge. 
The results showed that, although the model is simple, it accurately predicted the dynamic equilibrium positions of the confluence deltas in the Nakdong River, both in areas where a delta had not yet formed and in those where one had already formed, and that it predicted the trend of the response of the Gamcheon-Nakdong River delta. However, the actual retreat of the Gamcheon-Nakdong River delta was not fully captured because of errors and limitations in the simplification process. The insights from this study provide basic information on the sediment supply of the Nakdong River through the confluence areas, and the model can serve as a basis for river maintenance and management.

A prediction study on the number of emergency patients with ASTHMA according to the concentration of air pollutants (대기오염물질 농도에 따른 천식 응급환자 수 예측 연구)

  • Han Joo Lee;Min Kyu Jee;Cheong Won Kim
    • Journal of Service Research and Studies
    • /
    • v.13 no.1
    • /
    • pp.63-75
    • /
    • 2023
  • With the development of industry, interest in air pollutants has increased. Air pollutants affect various fields, such as environmental pollution and global warming, and environmental diseases are one of them. Because of their small molecular size, air pollutants can affect the skin and respiratory tract of the human body. As a result, various studies on air pollutants and environmental diseases have been conducted. Asthma, one such environmental disease, can be life-threatening if symptoms worsen and cause asthma attacks, and adult asthma is difficult to cure once it occurs. Factors that worsen asthma include particulate matter and air pollution. The prevalence of asthma is increasing worldwide. In this paper, we study how air pollutants correlate with the number of emergency room admissions of asthma patients and predict the number of future asthma emergency patients using highly correlated air pollutants. For air pollutants, the concentrations of five pollutants were used: sulfur dioxide (SO2), carbon monoxide (CO), ozone (O3), nitrogen dioxide (NO2), and fine dust (PM10). For environmental diseases, data on the number of asthma patients admitted to the emergency room were used. Both the air pollutant data and the asthma emergency patient counts cover 5 years, from January 1, 2013 to December 31, 2017. Predictions were made using two models, Informer and LTSF-Linear, and MAE, MAPE, and RMSE were used as performance indicators. The results were compared for predictions made with and without the number of emergency patients as an input. This paper presents the air pollutants that improve model performance in predicting the number of asthma emergency patients with the Informer and LTSF-Linear models.
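
The three performance indicators named in this abstract can be computed as in the following sketch. The function names and the patient counts are illustrative only; note that MAPE is undefined when an actual value is zero, which the sketch handles by raising an error.

```python
import math


def mae(actual, predicted):
    """Mean absolute error."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def mape(actual, predicted):
    """Mean absolute percentage error, in percent; actual values must be non-zero."""
    if any(a == 0 for a in actual):
        raise ValueError("MAPE is undefined when an actual value is zero")
    return 100.0 * sum(abs(a - p) / abs(a) for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root-mean-square error."""
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))


# Made-up daily counts of asthma emergency patients.
actual = [100, 200]
predicted = [110, 180]
print(mae(actual, predicted), mape(actual, predicted), rmse(actual, predicted))
```

RMSE weights large misses more heavily than MAE, so reporting both gives a sense of whether errors are spread evenly or concentrated in a few days.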