• Title/Summary/Keyword: 시계열 데이터 분류

Search Result 137, Processing Time 0.026 seconds

Estimation of nonlinear GARCH-M model (비선형 평균 일반화 이분산 자기회귀모형의 추정)

  • Shim, Joo-Yong;Lee, Jang-Taek
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.5
    • /
    • pp.831-839
    • /
    • 2010
  • Least squares support vector machine (LS-SVM) is a kernel trick gaining a lot of popularities in the regression and classification problems. We use LS-SVM to propose a iterative algorithm for a nonlinear generalized autoregressive conditional heteroscedasticity model in the mean (GARCH-M) model to estimate the mean and the conditional volatility of stock market returns. The proposed method combines a weighted LS-SVM for the mean and unweighted LS-SVM for the conditional volatility. In this paper, we show that nonlinear GARCH-M models have a higher performance than the linear GARCH model and the linear GARCH-M model via real data estimations.

Seq2Seq model-based Prognostics and Health Management of Robot Arm (Seq2Seq 모델 기반의 로봇팔 고장예지 기술)

  • Lee, Yeong-Hyeon;Kim, Kyung-Jun;Lee, Seung-Ik;Kim, Dong-Ju
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.12 no.3
    • /
    • pp.242-250
    • /
    • 2019
  • In this paper, we propose a method to predict the failure of industrial robot using Seq2Seq (Sequence to Sequence) model, which is a model for transforming time series data among Artificial Neural Network models. The proposed method uses the data of the joint current and angular value, which can be measured by the robot itself, without additional sensor for fault diagnosis. After preprocessing the measured data for the model to learn, the Seq2Seq model was trained to convert the current to angle. Abnormal degree for fault diagnosis uses RMSE (Root Mean Squared Error) during unit time between predicted angle and actual angle. The performance evaluation of the proposed method was performed using the test data measured under different conditions of normal and defective condition of the robot. When the Abnormal degree exceed the threshold, it was classified as a fault, and the accuracy of the fault diagnosis was 96.67% from the experiment. The proposed method has the merit that it can perform fault prediction without additional sensor, and it has been confirmed from the experiment that high diagnostic performance and efficiency are available without requiring deep expert knowledge of the robot.

Analysis of Relative Settlement Behavior of Retaining Wall Backside Ground Using Clustering (군집분류를 이용한 흙막이 벽체 배면 지반의 상대적 침하거동 분석)

  • Young-Jun Kwack;Heui-Soo Han
    • The Journal of Engineering Geology
    • /
    • v.33 no.1
    • /
    • pp.189-200
    • /
    • 2023
  • As urbanization and industrialization increase development in downtown areas, damage due to ground settlement continues to occur. Building collapse in urban has a high risk of leading to large-scale damage to life and property. However, there has rarely been studied on measurement data analysis methods when uneven loads are applied to the excavated ground and no prior knowledge of the ground. Accordingly, it was attempted to analyze the relative settlement behavior and correlation by processing the time-series surface settlement of construction sites in the urban. In this paper, the average index of difference in settlement and average of relative difference in settlement are defined and calculated, then plotted in the coordinate system to analyze the relative settlement behavior over time. In addition, since there was no prior knowledge of the ground, a standard to classify the clusters was needed, and the observation points were classified into using k-means clustering and Dunn Index. As a result of the analysis, it was confirmed that all the clusters moved to the stable region as the settlement amount converges. The clusters were segmented. Based on the analysis results, it was possible to distinguish between the independent displacement area and same behavior area by analyzing the correlation between measurement points. If possible to analyze the relative settlement behavior between the stations and classify the behavior areas, it can be helpful in settlement and stability management, such as uplift of the surrounding area, prediction of ground failure area, and prevention of activity failure.

A Study on the UAV-based Vegetable Index Comparison for Detection of Pine Wilt Disease Trees (소나무재선충병 피해목 탐지를 위한 UAV기반의 식생지수 비교 연구)

  • Jung, Yoon-Young;Kim, Sang-Wook
    • Journal of Cadastre & Land InformatiX
    • /
    • v.50 no.1
    • /
    • pp.201-214
    • /
    • 2020
  • This study aimed to early detect damaged trees by pine wilt disease using the vegetation indices of UAV images. The location data of 193 pine wilt disease trees were constructed through field surveys and vegetation index analyses of NDVI, GNDVI, NDRE and SAVI were performed using multi-spectral UAV images at the same time. K-Means algorithm was adopted to classify damaged trees and confusion matrix was used to compare and analyze the classification accuracy. The results of the study are summarized as follows. First, the overall accuracy of the classification was analyzed in order of NDVI (88.04%, Kappa coefficient 0.76) > GNDVI (86.01%, Kappa coefficient 0.72) > NDRE (77.35%, Kappa coefficient 0.55) > SAVI (76.84%, Kappa coefficient 0.54) and showed the highest accuracy of NDVI. Second, K-Means unsupervised classification method using NDVI or GNDVI is possible to some extent to find out the damaged trees. In particular, this technique is to help early detection of damaged trees due to its intensive operation, low user intervention and relatively simple analysis process. In the future, it is expected that the utilization of time series images or the application of deep learning techniques will increase the accuracy of classification.

A Time Series Graph based Convolutional Neural Network Model for Effective Input Variable Pattern Learning : Application to the Prediction of Stock Market (효과적인 입력변수 패턴 학습을 위한 시계열 그래프 기반 합성곱 신경망 모형: 주식시장 예측에의 응용)

  • Lee, Mo-Se;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.1
    • /
    • pp.167-181
    • /
    • 2018
  • Over the past decade, deep learning has been in spotlight among various machine learning algorithms. In particular, CNN(Convolutional Neural Network), which is known as the effective solution for recognizing and classifying images or voices, has been popularly applied to classification and prediction problems. In this study, we investigate the way to apply CNN in business problem solving. Specifically, this study propose to apply CNN to stock market prediction, one of the most challenging tasks in the machine learning research. As mentioned, CNN has strength in interpreting images. Thus, the model proposed in this study adopts CNN as the binary classifier that predicts stock market direction (upward or downward) by using time series graphs as its inputs. That is, our proposal is to build a machine learning algorithm that mimics an experts called 'technical analysts' who examine the graph of past price movement, and predict future financial price movements. Our proposed model named 'CNN-FG(Convolutional Neural Network using Fluctuation Graph)' consists of five steps. In the first step, it divides the dataset into the intervals of 5 days. And then, it creates time series graphs for the divided dataset in step 2. The size of the image in which the graph is drawn is $40(pixels){\times}40(pixels)$, and the graph of each independent variable was drawn using different colors. In step 3, the model converts the images into the matrices. Each image is converted into the combination of three matrices in order to express the value of the color using R(red), G(green), and B(blue) scale. In the next step, it splits the dataset of the graph images into training and validation datasets. We used 80% of the total dataset as the training dataset, and the remaining 20% as the validation dataset. And then, CNN classifiers are trained using the images of training dataset in the final step. Regarding the parameters of CNN-FG, we adopted two convolution filters ($5{\times}5{\times}6$ and $5{\times}5{\times}9$) in the convolution layer. In the pooling layer, $2{\times}2$ max pooling filter was used. The numbers of the nodes in two hidden layers were set to, respectively, 900 and 32, and the number of the nodes in the output layer was set to 2(one is for the prediction of upward trend, and the other one is for downward trend). Activation functions for the convolution layer and the hidden layer were set to ReLU(Rectified Linear Unit), and one for the output layer set to Softmax function. To validate our model - CNN-FG, we applied it to the prediction of KOSPI200 for 2,026 days in eight years (from 2009 to 2016). To match the proportions of the two groups in the independent variable (i.e. tomorrow's stock market movement), we selected 1,950 samples by applying random sampling. Finally, we built the training dataset using 80% of the total dataset (1,560 samples), and the validation dataset using 20% (390 samples). The dependent variables of the experimental dataset included twelve technical indicators popularly been used in the previous studies. They include Stochastic %K, Stochastic %D, Momentum, ROC(rate of change), LW %R(Larry William's %R), A/D oscillator(accumulation/distribution oscillator), OSCP(price oscillator), CCI(commodity channel index), and so on. To confirm the superiority of CNN-FG, we compared its prediction accuracy with the ones of other classification models. Experimental results showed that CNN-FG outperforms LOGIT(logistic regression), ANN(artificial neural network), and SVM(support vector machine) with the statistical significance. These empirical results imply that converting time series business data into graphs and building CNN-based classification models using these graphs can be effective from the perspective of prediction accuracy. Thus, this paper sheds a light on how to apply deep learning techniques to the domain of business problem solving.

Adaptive Short-Term Vehicle Speed Prediction Models (적응성 있는 단기간 속도 예측모형 개발에 관한 연구)

  • 조범철
    • Proceedings of the KOR-KST Conference
    • /
    • 1998.10a
    • /
    • pp.265-274
    • /
    • 1998
  • 본 논문은 도로를 주행하는 차량의 지점속도에 대하여 단기간(short-term)으로 예측하는 네 가지의 모형들에 대한 개발 및 결과의 비교하고 평가했다. 사용된 기법들로는 다중회귀분석, 시계열분석(ARIMA), 인공 신경망, 칼만필터링 등이며, 모형의 구출을 위하여 다수의 독립변수 및 입력변수가 요구되는 다중회귀분석과 인공 신경망에서는 연속방정식에서 고려되는 변수들간의 단순상관계수 및 편상관계수의 계산을 통해서 입력변수가 설정이 되었으며, 시계열분석(ARIMA)과 칼만필터링 등 단일 입력 변수만을 요하는 모형에서는 바로 전 시간대와 현재시간대의간격동안 속도의 변화량을 입력변수로 설정하였다. 속도를 비롯해서 교통 데이터는 현장자료를 사용하였는데, 이는 서울의 한강 옆에 위치한 올림픽대로 중 한강대로에 위치한 검지기 3개를 통해서 천호동 방면으로 이동하는 교통류에 대해서 17시간 (00시~17시)동안 수집했다. 17시간 수집했는데 그중에 검지된 속도는 14km/h에서 98km/h까지 변하는 등, 수집된 자료에는 다양한 교통상태가 포함되어 있는데 이는 각 모형들의 정확한 예측력과 적응성을 평가하기 위함이었다. 각 모형은 예측하고자 하는 시점으로부터 1, 5, 10, 15분 후의 속도를 예측하는 것으로 총 4가지의 예측시간간격으로 각각 실험되었다. 결과는 전반적으로 신뢰성 있게 나왔으나 그중에서도 정확성면에서는 인공신경망과 칼만필터링이 우수했고 적응성면에서는 칼만필터리딩 탁월했다. 또한 1분 후의 속도를 예측하는 결과들은 모형들간에 거의 비슷한 정확도를 보여주었는데 이는 입력변수의 설정이 중요한 것임을 보여주는 것이라 판단된다. 있는 기법이다.적으로 세부적 차종분류로 접근한다.의 영향들을 고려함으로써 가로망 설계 과정에서 가로망의 상반된 역할인 이동성과 접근성의 비교가 가능한 보다 현실적인 가로망 설계 모형을 구축하고자 한다. 지금까지 소개된 가로망 설계모형들은 용량변화에 대한 설계변수의 형태에 따라 이산적 가로망 설계 모형과 연속적 가로망 설계모형으로 나뉘어지게 된다. 본 논문의 경우, 계산속도의 향상 측면에서는 연속적 가로망 설계 모형을 도입할 수 있지만, 이때 요구되는 도로용량이 이산적인 변수(차선 수)로 결정되어야만 신호제어 변수를 결정할 수 있기 때문에, 이산적 가로망 설계 모형이 사용된다. 하지만, 이산적 설계모형의 경우 조합최적화 문제이므로 정확한 최적해를 구하기 위해서는 상당한 시간이 소요되며, 경우에 따라서는 국부 최적해에 빠지게 된다. 이러한 문제를 극복하기 위해, 우선 이상적 모형의 근사화, 혹은 조합최적화문제를 위해 개발된 Simulated Annealing기법의 적용, 연속적 모형의 변수를 이산화하는 방법 등 다양한 모형들을 고려해 본 뒤, 적절한 모형을 적용할 것이다. 가로망 설계 모형에서 신호제어를 고려하기 위해서는 주어진 가로망에 대한 통행 배정과정에서 고려되는 통행시간을 링크통행시간과 교차로 지체시간을 동시에 고려해야 하는데, 이러한 문제의 해결을 위해서 최근 활발히 논의되고 있는 교차로에서의 신호제어에 대응하는 통행배정 모형을 도입하여 고려하고자 한다. 이를 위해서 지금까지 연구되어온 Global Solution Approach와 Iterative Approach를 비교, 검토한 뒤 모형에 보다 알맞은 방법을 선택한다. 차량의

  • PDF

연구개발투자의 산업간 파급효과

  • 김정우;이희경
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 1995.09a
    • /
    • pp.144-155
    • /
    • 1995
  • 본 연구는 기술에 대한 대용개념으로 사용되고 있는 연구개발투자의 효과가 산업의 생산성 향상에 얼마만큼 기여하고 있는가에 관한 실증연구로, 그 효 과를 자체 연구개발효과와 파급효과로 나누어 측정하는 데 목적이 있다. 파 급효과의 경우, 중간재의 거래를 통한 체화된 파급효과와 산업간의 기술거리 로 인한 비체화된 파급효과로 나누어 한국 제조산업을 18개로 분류한 후 각 산업의 연구개발스톡을 측정하였으며, 연구개발투자의 체화된 파급효과 측정 을 위하여 산업 연관표를 이용하여 가중치를 계산하였다. 그리고, 비체화된 파급효과 측정을 위하여는 각 산업에서 고용하고 있는 전공별 연구원 수의 자료를 이용 기술거리를 구하였다. 본 연구에서는 각각의 가중치로 구한 연 구개발스톡, 체화된 연구개발 스톡, 그리고 비체화된 연구개발 스톡을 이용 하여 각 독립변수들에 대한 한계생산성을 구하였으며, 분석 방법으로는 단순 회귀분석과 함께 시계열의 효과와 산업간 효과를 고려하는 패널데이터 분석 을 시도하였다. 체화된 파급효과와 비체화된 파급효과 중 하나만을 변수로 포함하는 경우에는 추정치가 유의한 결과를 나타내고 있지만, 두 가지의 변 수를 모두 포함하는 경우에는 보호도 일정하지 않으며 비유의적인 결과를 보였다. 이러한 결과는 다중공선성에 의한 것으로 보인다. 두 가지 파급효과 에 대한 한계생산성 추정치는 기술과 연구개발투자가 외부성을 가지고 있으 며, 기술과 관련된 변수의 도입이 필요함을 시사한다. 또한 이러한 파급효과 의 추정치는 거시차원에서 연구개발 지원의 정당성에 대한 근거를 제시하고 있으며, 기술혁신을 위한 투자의 타당성을 실증적으로 보여주고 있다.사하였다. 이 사례 연구들의 결과는 각 계열사들의 상황에 따라 제시된 외주위탁 전략과 현재의 외주위 탁 전략이 일치할 때 정보 시스템에 대한 사용자 만족도가 보다 높은 것으 로 나타났다. 할 수 있는 효율적인 distributed system를 개발하는 것을 제시하였다. 본 논문은 데이타베이스론의 입장에서 아직 정립되어 있지 않은 분산 환경하에서의 관계형 데이타베이스의 데이타관리의 분류체계를 나름대로 정립하였다는데 그 의의가 있다. 또한 이것의 응용은 현재 분산데이타베이스 구축에 있어 나타나는 기술적인 문제점들을 어느정도 보완할 수 있다는 점에서 그 중요성이 있다.ence of a small(IxEpc),hot(Tex> SOK) core which contains two tempegatlue peaks at -15" east and north of MDS. The column density of HCaN is (1-3):n1014cm-2. Column density at distant position from MD5 is larger than that in the (:entral region. We have deduced that this hot-core has a mass of 10sR1 which i:s about an order of magnitude larger those obtained by previous studies.previous studies.업순서들의 상관관계를 고려하여 보다 개선된 해를 구하기 위한 연구가 요구된다. 또한, 준비작업비용을 발생시키는 작업장의 작업순서결정에 대해서도 연구를 행하여, 보완작업비용과 준비비용을

  • PDF

Analysis of the differences in living population changes and regional responses by COVID-19 outbreak in Seoul (코로나-19에 따른 서울시 생활인구 변화와 동별 반응 차이 분석)

  • Jin, Juhae;Seong, Byeongchan
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.6
    • /
    • pp.697-712
    • /
    • 2020
  • New infectious diseases have broken out repeatedly across the world over the last 20 years; COVID-19 is causing drastic changes and damage to daily lives. Furthermore, as there is no denying that new epidemics will appear in the future, there is a continuous need to develop measures aimed towards responding to economic damage. Against this backdrop, the living population is an important indicator that shows changes in citizens' life patterns. This study analyzes time-based and socio-environmental characteristics by detecting and classifying changes in everyday life caused by COVID-19 from the perspective of the floating population. k-shape Clustering is used to classify living population data of each of the 424 dong's in Seoul measured by the hour; then by applying intervention analysis and One-way ANOVA, each cluster's characteristics and aspects of change in the living population occurring in the aftermath of COVID-19 are scrutinized. In conclusion, this study confirms each cluster's obvious characteristics in changes of population flows before and after the confirmation of coronavirus patients and distinguishes groups that reacted sensitively to the intervention times on the basis of COVID-related incidents from those that did not.

The Relationship between Social Media and Consumer Purchase Decision: Findings from Seoul Sharing Bike (소셜미디어와 소비자 구매 결정과의 관계: 서울 공유 자전거에 대한 시계열 분석을 중심으로)

  • Han, Suhyeon;Jang, Junghwa;Choi, Jeonghye;Chang, Sue Ryung
    • Knowledge Management Research
    • /
    • v.22 no.4
    • /
    • pp.135-155
    • /
    • 2021
  • With the emergence of various types of social media and the diversification of their roles, it has become essential for marketers to understand how different types of social media influence consumers' purchase decisions differently and derive more detailed strategies by social media types. This study classifies social media into two types-expression-focused social media and relationship-focused social media-and investigates the relationship between consumer purchases and social media mentions by type. Using the Seoul bike-sharing data and time-series data for social media mentions, we apply the VAR model with Exogenous Variables (VARX). We find that the increase of product mentions in expression-focused social media positively affects both the number of new customers (customer acquisition) and the number of shared bike rentals, while that in relationship-focused social media negatively affects the number of new customers only. In addition, as new customers increase, the product mentions in both types of social media increase. On the other hand, the number of bike rentals has no significant effect in increasing social media mentions regardless of type. This study contributes to the social media and sharing economy literature and provides managerial implications for establishing sophisticated social media marketing in bike-sharing businesses.

An Anomalous Event Detection System based on Information Theory (엔트로피 기반의 이상징후 탐지 시스템)

  • Han, Chan-Kyu;Choi, Hyoung-Kee
    • Journal of KIISE:Information Networking
    • /
    • v.36 no.3
    • /
    • pp.173-183
    • /
    • 2009
  • We present a real-time monitoring system for detecting anomalous network events using the entropy. The entropy accounts for the effects of disorder in the system. When an abnormal factor arises to agitate the current system the entropy must show an abrupt change. In this paper we deliberately model the Internet to measure the entropy. Packets flowing between these two networks may incur to sustain the current value. In the proposed system we keep track of the value of entropy in time to pinpoint the sudden changes in the value. The time-series data of entropy are transformed into the two-dimensional domains to help visually inspect the activities on the network. We examine the system using network traffic traces containing notorious worms and DoS attacks on the testbed. Furthermore, we compare our proposed system of time series forecasting method, such as EWMA, holt-winters, and PCA in terms of sensitive. The result suggests that our approach be able to detect anomalies with the fairly high accuracy. Our contributions are two folds: (1) highly sensitive detection of anomalies and (2) visualization of network activities to alert anomalies.