• 제목/요약/키워드: Time-series clustering

검색결과 185건 처리시간 0.02초

다학제 분야 학술지의 주제어 동시발생 네트워크를 활용한 기술예측 연구 (A Study on Technology Forecasting based on Co-occurrence Network of Keyword in Multidisciplinary Journals)

  • 김현욱;안상진;정우성
    • 한국경영과학회지
    • /
    • 제40권4호
    • /
    • pp.49-63
    • /
    • 2015
  • Keyword indexed in multidisciplinary journals show trends about science and technology innovation. Nature and Science were selected as multidisciplinary journals for our analysis. In order to reduce the effect of plurality of keyword, stemming algorithm were implemented. After this process, we fitted growth curve of keyword (stem) following bass model, which is a well-known model in diffusion process. Bass model is useful for expressing growth pattern by assuming innovative and imitative activities in innovation spreading. In addition, we construct keyword co-occurrence network and calculate network measures such as centrality indices and local clustering coefficient. Based on network metrics and yearly frequency of keyword, time series analysis was conducted for obtaining statistical causality between these measures. For some cases, local clustering coefficient seems to Granger-cause yearly frequency of keyword. We expect that local clustering coefficient could be a supportive indicator of emerging science and technology.

Prediction of Energy Consumption in a Smart Home Using Coherent Weighted K-Means Clustering ARIMA Model

  • Magdalene, J. Jasmine Christina;Zoraida, B.S.E.
    • International Journal of Computer Science & Network Security
    • /
    • 제22권10호
    • /
    • pp.177-182
    • /
    • 2022
  • Technology is progressing with every passing day and the enormous usage of electricity is becoming a necessity. One of the techniques to enjoy the assistances in a smart home is the efficiency to manage the electric energy. When electric energy is managed in an appropriate way, it drastically saves sufficient power even to be spent during hard time as when hit by natural calamities. To accomplish this, prediction of energy consumption plays a very important role. This proposed prediction model Coherent Weighted K-Means Clustering ARIMA (CWKMCA) enhances the weighted k-means clustering technique by adding weights to the cluster points. Forecasting is done using the ARIMA model based on the centroid of the clusters produced. The dataset for this proposed work is taken from the Pecan Project in Texas, USA. The level of accuracy of this model is compared with the traditional ARIMA model and the Weighted K-Means Clustering ARIMA Model. When predicting,errors such as RMSE, MAPE, AIC and AICC are analysed, the results of this suggested work reveal lower values than the ARIMA and Weighted K-Means Clustering ARIMA models. This model also has a greater loglikelihood, demonstrating that this model outperforms the ARIMA model for time series forecasting.

병렬구조 퍼지시스템을 이용한 태양흑점 시계열 데이터의 예측 (Prediction of Sunspot Number Time Series using the Parallel-Structure Fuzzy Systems)

  • 김민수;정찬수
    • 대한전기학회논문지:시스템및제어부문D
    • /
    • 제54권6호
    • /
    • pp.390-395
    • /
    • 2005
  • Sunspots are dark areas that grow and decay on the lowest level of the sun that is visible from the Earth. Shot-term predictions of solar activity are essential to help plan missions and to design satellites that will survive for their useful lifetimes. This paper presents a parallel-structure fuzzy system(PSFS) for prediction of sunspot number time series. The PSFS consists of a multiple number of component fuzzy systems connected in parallel. Each component fuzzy system in the PSFS predicts future data independently based on its past time series data with different embedding dimension and time delay. An embedding dimension determines the number of inputs of each component fuzzy system and a time delay decides the interval of inputs of the time series. According to the embedding dimension and the time delay, the component fuzzy system takes various input-output pairs. The PSFS determines the final predicted value as an average of all the outputs of the component fuzzy systems in order to reduce error accumulation effect.

A Biclustering Method for Time Series Analysis

  • Lee, Jeong-Hwa;Lee, Young-Rok;Jun, Chi-Hyuck
    • Industrial Engineering and Management Systems
    • /
    • 제9권2호
    • /
    • pp.131-140
    • /
    • 2010
  • Biclustering is a method of finding meaningful subsets of objects and attributes simultaneously, which may not be detected by traditional clustering methods. It is popularly used for the analysis of microarray data representing the expression levels of genes by conditions. Usually, biclustering algorithms do not consider a sequential relation between attributes. For time series data, however, bicluster solutions should keep the time sequence. This paper proposes a new biclustering algorithm for time series data by modifying the plaid model. The proposed algorithm introduces a parameter controlling an interval between two selected time points. Also, the pruning step preventing an over-fitting problem is modified so as to eliminate only starting or ending points. Results from artificial data sets show that the proposed method is more suitable for the extraction of biclusters from time series data sets. Moreover, by using the proposed method, we find some interesting observations from real-world time-course microarray data sets and apartment price data sets in metropolitan areas.

에너지 효율 증대를 위한 에너지 사용량 예측과 에너지 수요이전 모델 연구 (A Study on the Energy Usage Prediction and Energy Demand Shift Model to Increase Energy Efficiency)

  • 김재환;양세모;이강윤
    • 인터넷정보학회논문지
    • /
    • 제24권2호
    • /
    • pp.57-66
    • /
    • 2023
  • 현재, 에너지 효율 향상으로 소비감축을 시행하는 새로운 에너지 시스템이 대두되고 있다. 이에 스마트그리드가 확산되면서 계시별 요금제가 확대되고 있다. 계시별 요금제는 계절별 / 시간별로 요금을 다르게 적용해 사용량에 따라 요금을 내는 요금제이다. 본 연구에서는 에너지 전력 사용량 데이터를 예측하기 위해, 온도/요일/시간/계절 등 외부 요인을 고려하고 시계열 예측 모델인 LSTM을 활용한다. 이러한 에너지 사용량 예측 모델을 기반으로 기기별 사용패턴을 분석하여 전력 에너지를 최대부하시간대에서 경부하시간대로 수요이전 함으로써 에너지 사용요금을 절감한다. 기기별 사용패턴을 분석하기 위해서는 시간대별로 기기의 사용량 패턴을 학습 및 분류하는 clustering 기법을 사용한다. 정리하자면, 본 연구에서는 사용자의 전력 데이터 사용량을 기반으로 사용량과 사용 요금을 예측 및 기기별 사용패턴을 분석하고 분석 기반의 맞춤형 수요이전 서비스를 제공함으로써 사용자에게 요금 절감 효과를 가져다 준다.

Cluster Analysis of Daily Electricity Demand with t-SNE

  • Min, Yunhong
    • 한국컴퓨터정보학회논문지
    • /
    • 제23권5호
    • /
    • pp.9-14
    • /
    • 2018
  • For an efficient management of electricity market and power systems, accurate forecasts for electricity demand are essential. Since there are many factors, either known or unknown, determining the realized loads, it is difficult to forecast the demands with the past time series only. In this paper we perform a cluster analysis on electricity demand data collected from Jan. 2000 to Dec. 2017. Our purpose of clustering on electricity demand data is that each cluster is expected to consist of data whose latent variables are same or similar values. Then, if properly clustered, it is possible to develop an accurate forecasting model for each cluster separately. To validate the feasibility of this approach for building better forecasting models, we clustered data with t-SNE. To apply t-SNE to time series data effectively, we adopt the dynamic time warping as a similarity measure. From the result of experiments, we found that several clusters are well observed and each cluster can be interpreted as a mix of well-known factors such as trends, seasonality and holiday effects and other unknown factors. These findings can motivate the approaches which build forecasting models with respect to each cluster independently.

Reference Map을 이용한 시계열 image data의 자동분류법 (Automatic Classification Method for Time-Series Image Data using Reference Map)

  • 홍선표
    • 한국음향학회지
    • /
    • 제16권2호
    • /
    • pp.58-65
    • /
    • 1997
  • 본 논문에서는 시계열 image data를 안정되고 높은 정확도로 분류할 수 있는 자동분류법을 제안하였다. 제안한 방법은 대상 영역에 관한 분류도가 기존재하던 가, 아니면 최소한 시계열 image data 중 어느 한 image data가 분류되어 있다고 하는 전제조건에 그 기초를 두고 있다. 분류도는 training area를 선정하기 위라여 사용하는 기준주제도로 사용되어진다. 제안한 방법은 1)기준주제도를 사용한 training data의 추출, 2)taining data의 균질성에 의거한 변화화소의 검출, 3)검출된 변화화소에 대한 clustering, 4)training data의 재구성, 5)maximum likelihood classifier와 같은 판별법에 의한 분류 등 5개의 단계로 구성된다. 제안한 방법의 성능을 정량적으로 평가하기 위하여 4개의 시계열 Landsat TM image data를 제안한 방법과 숙련된 operator가 필요한 기존의 방법으로 각각 분류하여 비교 검토하였다. 그 결과, 기존의 방법으로는 숙련된 operator가 필요하고, 분류도를 얻기까지 수일이 소요되는 데 반하여, 제안한 방법으로는 숙련된 operator 없이, 신뢰성 있는 분류도를 수 시간 내에 자동으로 얻을 수 있었다.

  • PDF

금융 및 특수시계열 모형의 조망 (A recent overview on financial and special time series models)

  • 황선영
    • 응용통계연구
    • /
    • 제29권1호
    • /
    • pp.1-12
    • /
    • 2016
  • 금융시계열은 일반 시계열과는 차별적으로 stylized facts로 불리는 특징을 가지고 있다. 이 특징들은 급첨 성질, 비정규분포, 변동성 집중 및 비대칭성을 포함한다. 이러한 특징들을 설명하기 위해서는 기존의 선형 ARMA 모형에서 벗어난 특수한 모형이 필요하게 되었다. 본 논문은 변동성 모형인 GARCH 형태의 모형을 중심으로 특수 금융시계열 모형들을 소개하고 연관된 통계적 이슈들에 대해 가능한 최근 연구를 중심으로 폭 넓게 조망하고 있다.

Building Energy Time Series Data Mining for Behavior Analytics and Forecasting Energy consumption

  • Balachander, K;Paulraj, D
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제15권6호
    • /
    • pp.1957-1980
    • /
    • 2021
  • The significant aim of this research has always been to evaluate the mechanism for efficient and inherently aware usage of vitality in-home devices, thus improving the information of smart metering systems with regard to the usage of selected homes and the time of use. Advances in information processing are commonly used to quantify gigantic building activity data steps to boost the activity efficiency of the building energy systems. Here, some smart data mining models are offered to measure, and predict the time series for energy in order to expose different ephemeral principles for using energy. Such considerations illustrate the use of machines in relation to time, such as day hour, time of day, week, month and year relationships within a family unit, which are key components in gathering and separating the effect of consumers behaviors in the use of energy and their pattern of energy prediction. It is necessary to determine the multiple relations through the usage of different appliances from simultaneous information flows. In comparison, specific relations among interval-based instances where multiple appliances use continue for certain duration are difficult to determine. In order to resolve these difficulties, an unsupervised energy time-series data clustering and a frequent pattern mining study as well as a deep learning technique for estimating energy use were presented. A broad test using true data sets that are rich in smart meter data were conducted. The exact results of the appliance designs that were recognized by the proposed model were filled out by Deep Convolutional Neural Networks (CNN) and Recurrent Neural Networks (LSTM and GRU) at each stage, with consolidated accuracy of 94.79%, 97.99%, 99.61%, for 25%, 50%, and 75%, respectively.

Damage detection of railway bridges using operational vibration data: theory and experimental verifications

  • Azim, Md Riasat;Zhang, Haiyang;Gul, Mustafa
    • Structural Monitoring and Maintenance
    • /
    • 제7권2호
    • /
    • pp.149-166
    • /
    • 2020
  • This paper presents the results of an experimental investigation on a vibration-based damage identification framework for a steel girder type and a truss bridge based on acceleration responses to operational loading. The method relies on sensor clustering-based time-series analysis of the operational acceleration response of the bridge to the passage of a moving vehicle. The results are presented in terms of Damage Features from each sensor, which are obtained by comparing the actual acceleration response from the sensors to the predicted response from the time-series model. The damage in the bridge is detected by observing the change in damage features of the bridge as structural changes occur in the bridge. The relative severity of the damage can also be quantitatively assessed by observing the magnitude of the changes in the damage features. The experimental results show the potential usefulness of the proposed method for future applications on condition assessment of real-life bridge infrastructures.