• Title/Summary/Keyword: 시계열 및 클러스터 분석

Search Result 13, Processing Time 0.041 seconds

정박 중 준해양사고 원인에 대한 빅데이터 분석 연구

  • No, Beom-Seok;Kim, Tae-Hun;Gang, Seok-Yong
    • Proceedings of the Korean Institute of Navigation and Port Research Conference
    • /
    • 2018.05a
    • /
    • pp.144-146
    • /
    • 2018
  • 준해상사고를 줄이기 위하여 준해양사고 등을 분석하여 사고 예방에 활용하였다. 하지만 준해양사고 건수가 많은 대신 주내용이 정성적이기 때문에 다양한 정량적 데이터로 분석하기에는 현실적 어려움이 있었다. 이러 장단점을 고려하여 준해양사고에 대해서 그동안 단순한 내용 검토 방식에서 통계적 분석과 이를 통한 객관적 결과 토출이 가능한 빅데이터 기법를 적용한 연구가 필요하다. 이를 위해 10,000여건의 준해양사고 보고서를 전처리 작업을 통해 통일된 양식으로 정리하였다. 이 데이터를 기반으로 1차로 텍스트마이닝 분석을 통해 정박 중 준해양사고 발생 원인에 대한 주요 키워드를 도출하였다. 주요 키워드에 대해 2차로 시계열 및 클러스터 분석을 통해 발생할 수 있는 준해양 사고 상황에 대한 경향 예측을 도출하였다. 이번 연구에서는 정성적 자료인 준해양사고 보고서를 빅데이터 기법을 활용하여 정량화된 데이터로 전환할 수 있고 이를 통해 통계적 분석이 가능함을 확인하였다. 또한 빅데이터 기법을 통해 차 후 발생할 수 있는 준해양사고 객관적인 경향을 파악함으로써 예방 대책에 대한 정보 제공이 가능함을 확인할 수 있었다.

  • PDF

Evolutionary Computation-based Hybird Clustring Technique for Manufacuring Time Series Data (제조 시계열 데이터를 위한 진화 연산 기반의 하이브리드 클러스터링 기법)

  • Oh, Sanghoun;Ahn, Chang Wook
    • Smart Media Journal
    • /
    • v.10 no.3
    • /
    • pp.23-30
    • /
    • 2021
  • Although the manufacturing time series data clustering technique is an important grouping solution in the field of detecting and improving manufacturing large data-based equipment and process defects, it has a disadvantage of low accuracy when applying the existing static data target clustering technique to time series data. In this paper, an evolutionary computation-based time series cluster analysis approach is presented to improve the coherence of existing clustering techniques. To this end, first, the image shape resulting from the manufacturing process is converted into one-dimensional time series data using linear scanning, and the optimal sub-clusters for hierarchical cluster analysis and split cluster analysis are derived based on the Pearson distance metric as the target of the transformation data. Finally, by using a genetic algorithm, an optimal cluster combination with minimal similarity is derived for the two cluster analysis results. And the performance superiority of the proposed clustering is verified by comparing the performance with the existing clustering technique for the actual manufacturing process image.

A Case Study of Regional Industry Clusters : Clusters Estimate Index and Policy (지역산업클러스터 사례연구 : 클러스터 평가지표와 정책과제)

  • Won, Gu-Hyun
    • Korean Business Review
    • /
    • v.18 no.2
    • /
    • pp.197-223
    • /
    • 2005
  • The industrial cluster policy of 21st century rise to the focus method of regional economic promotion, therefore, the importance of study in cluster identification and mapping as policy task will bring into relief. This paper will confirms the estimate index and policy of industrial clusters with regional industry. The result in this case study, Cluster development should embrace the pursuit of competitive advantage and specialization rather than simply imitate successful clusters in other locations. This requires building on local sources of uniqueness. Government should reinforce and building on existing and emerging clusters rather than attempt to create entirely new ones. This sort of role for government is very different from industrial policy. The aim of cluster policy is to reinforce the development of all clusters. Not all clusters will succeed, but market forces should determine the outcomes. In other words, government should build on market- oriented system and innovative infra. The result of this study is meaning to the development of objectivity estimate index and derivation of cluster-focused policy with a case study of industrial clusters.

  • PDF

An ESDA Tool for Time-series Spatial Association (지역분석을 위한 시계열 공간연관성 탐색도구)

  • Ahn Jae-Seong;Park Key-Ho;Lee Yang-Won
    • Spatial Information Research
    • /
    • v.14 no.1 s.36
    • /
    • pp.163-176
    • /
    • 2006
  • The concept of 'spatial association' explains spatial distribution pattern of geographical phenomenon based on similarity with neighborhoods, as in the Tobler's Law of Geography: 'Everything is related to everything else, but near things are more related than distant things.' In this study, we develop a time-series exploratory analysis tool for discovering temporal patterns of spatial association by combining spatial statistics and geo-visualization, and thus present a possibility to support spatial decision-making process. As for the spatial proximity weight matrix indispensable to measuring global and local spatial association, we employ a variety of flexible weighting schemes using geometric characteristics of areal unit. In addition, we renovate the existing visualization methods for more effective understanding of the procedures and results of time-series analysis on spatial association: for instance, temporal parallel coordinate plot with box plot, animated map for spatial association, and 3D Moran scatterplot. The feasibility of our system is verified by time-series analysis experiments on the spatial association of land price fluctuation rate for all administrative units in Korea, $1995{\sim}2004$.

  • PDF

Development of Poisson cluster generation model considering the climate change effects (기후변화 영향을 고려한 포아송 클러스터 가상강우생성모형 개발 및 검증)

  • Park, Hyunjin;Han, Jaemoon;Kim, Jongho;Kim, Dongkyun
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2015.05a
    • /
    • pp.189-189
    • /
    • 2015
  • 본 연구는 기후변화의 영향을 고려한 포아송 강우생성모형의 일종인 MBLRP(Modified Bartlett-Lewis Rectangular Pulse)를 개발하고, 대한민국 주요 도시에 대해 향후 100년간 강우의 변화를 살펴보았다. 기존 MBLRP 모형에서 기후변화에 따른 강우량 변화를 고려할 수 있도록 GCM 모형의 강우 자료를 활용하였고, GCM 모형으로부터 발생하는 불확실성을 고려하기 위해 IPCC의 RCP(Representative Concentration Pathways) 시나리오를 모의한 16개의 GCM 모형을 사용하였다. 2007년부터 2099년까지의 미래기간을 3개의 시 구간으로 구분하고, 16개 GCM 앙상블을 사용하여 미래기간 동안 대한민국 16개 도시에 대해 1000개의 샘플을 BWA 방법을 이용하여 생성하였다. 제어기간(1973-2005) 대비 미래기간(2007-2099)의 변화율을 나타내는 FOC(factor of change)와 온도의 연별 변화율을 나타내는 SF(scaling factor)의 개념을 결합하여 미래기간에 대한 CF(correction factor)를 산정하였다. 이때 CF는 16개 도시의 연 단위 강우량 변화 비율을 월별로 나타내며, 제어기간의 월 강우 관측치와 CF를 몬테카를로 모의를 실시하여 미래기간의 강우 시나리오를 산정한다. 이를 통해 월 평균 강우량 통계치를 연 단위로 얻을 수 있으며, 월 평균 강우량이 월 평균 분산, 무강우확률, 자기상관계수와 가지는 선형 관계를 통해 강우 통계치를 산출한다. 이와 같은 강우 통계치는 가상강우생성모형인 MBLRP 모형에 입력 자료로 활용되어 월 강우량을 시 단위의 강우 시계열 자료로 생성해낸다. 최종적으로 MBLRP 모형으로 산정된 시 단위 강우 시계열은 기후변화 영향을 고려한 GCMs 앙상블로 생성된 강우 시나리오를 기반으로 산출되기 때문에 향후 수자원 분석에 활용 가능할 것이라 기대된다.

  • PDF

Development of hybrid stochastic model for rainfall generation considering rainfall inter-annual variability (연간 강우 변동성을 고려한 혼합 추계 강우 생성 모형의 개발)

  • Park, Jeong Ha;Kim, Dong Kyun
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2018.05a
    • /
    • pp.11-11
    • /
    • 2018
  • 본 연구에서는 1시간부터 1년 단위의 강우 특성들을 잘 모의하는 혼합 추계 강우 생성 모형을 개발하였다. 본 모형의 가상 강우 생성 과정은 4단계로 이루어진다. 첫 단계에서 Seasonal ARIMA 모형을 통하여 시계열 특성을 반영한 월 강우를 생성한다. 두 번째 단계는 생성된 월 강우에 해당하는 일 단위 이하의 강우 통계치 세트를 생성하는 것이며, 통계치간 상관관계를 통해 평균, 표준편차, 자기상관 계수, 무강우 확률을 생성한다. 생성된 통계치 세트는 세 번째 단계에서 Modified Bartlett-Lewis Rectangular Pulse (MBLRP) 모형의 6개의 매개변수를 보정하는데 사용되며, 마지막으로 MBLRP 매개변수 세트를 통해 가상 강우 시계열을 생성한다. 위 모형을 통해 미국 동부 지역 29개 강우 관측소에 대하여 200년 길이의 가상 강우를 생성하였으며, 그 결과 시 단위부터 연 단위까지 강우의 1차, 2차 통계치 및 무강우 확률을 성공적으로 재현하였다. 또한 기존 MBLRP 모형에 비하여 극한 강우 사상을 재현하는 능력이 향상되었다. 빈도분석 결과를 통하여 MBLRP 모형이 재현기간에 따라 10%에서부터 40%까지 극한 사상을 과소 추정한 반면, 본 모형에서는 20% 이내의 값을 나타내었다.

  • PDF

Investigation of Correlation Between Cognition/Emotion Styles and Judgmental Time-Series Forecasting Using a Self-Organizing Neural Network (자기 조직 신경망에 의한 인지/감성 유형의 시계열 직관 예측과의 상관성 조사)

  • Yoo Hyeon-Joong;Park Hung Kook;Cho Taekyung;Park Jongil
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.42 no.3 s.303
    • /
    • pp.29-38
    • /
    • 2005
  • Although people frequently rely on intuition in managing activities, they rarely use it in developing effective decision-making support systems. In this paper, we investigate and compare the correlations between such characteristics as cognition and emotion characteristics and judgmental time-series forecasting accuracy by using a self-organizing neural network, and eventually aim to help build efficient decision-making atmosphere. The neural network used in this paper employs a self-supervised adaptive algorithm, and the feature of which is that it inherently can use correlation between input vectors by exchanging information between neuron clusters in the self-organizing layer during the training. Our experiments showed that both cognition and emotion characteristics had correlations with judgmental time-series forecasting, and that cognition characteristics had larger correlation than emotion characteristics. We also found that conceptual style had larger correlation than behavioral and analytical styles, and displeasure-sleepiness style had larger correlation than pleasure-arousal style with the forecasting.

An Analysis of Causes of Marine Incidents at sea Using Big Data Technique (빅데이터 기법을 활용한 항해 중 준해양사고 발생원인 분석에 관한 연구)

  • Kang, Suk-Young;Kim, Ki-Sun;Kim, Hong-Beom;Rho, Beom-Seok
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.24 no.4
    • /
    • pp.408-414
    • /
    • 2018
  • Various studies have been conducted to reduce marine accidents. However, research on marine incidents is only marginal. There are many reports of marine incidents, but the main content of existing studies has been qualitative, which makes quantitative analysis difficult. However, quantitative analysis of marine accidents is necessary to reduce marine incidents. The purpose of this paper is to analyze marine incident data quantitatively by applying big data techniques to predict marine incident trends and reduce marine accident. To accomplish this, about 10,000 marine incident reports were prepared in a unified format through pre-processing. Using this preprocessed data, we first derived major keywords for the Marine incidents at sea using text mining techniques. Secondly, time series and cluster analysis were applied to major keywords. Trends for possible marine incidents were predicted. The results confirmed that it is possible to use quantified data and statistical analysis to address this topic. Also, we have confirmed that it is possible to provide information on preventive measures by grasping objective tendencies for marine incidents that may occur in the future through big data techniques.

Development and validation of poisson cluster stochastic rainfall generation web application across South Korea (포아송 클러스터 가상강우생성 웹 어플리케이션 개발 및 검증 - 우리나라에 대해서)

  • Han, Jaemoon;Kim, Dongkyun
    • Journal of Korea Water Resources Association
    • /
    • v.49 no.4
    • /
    • pp.335-346
    • /
    • 2016
  • This study produced the parameter maps of the Modified Bartlett-Lewis Rectangular Pulse (MBLRP) stochastic rainfall generation model across South Korea and developed and validated the web application that automates the process of rainfall generation based on the produced parameter maps. To achieve this purpose, three deferent sets of parameters of the MBLRP model were estimated at 62 ground gage locations in South Korea depending on the distinct purpose of the synthetic rainfall time series to be used in hydrologic modeling (i.e. flood modeling, runoff modeling, and general purpose). The estimated parameters were spatially interpolated using the Ordinary Kriging method to produce the parameter maps across South Korea. Then, a web application has been developed to automate the process of synthetic rainfall generation based on the parameter maps. For validation, the synthetic rainfall time series has been created using the web application and then various rainfall statistics including mean, variance, autocorrelation, probability of zero rainfall, extreme rainfall, extreme flood, and runoff depth were calculated, then these values were compared to the ones based on the observed rainfall time series. The mean, variance, autocorrelation, and probability of zero rainfall of the synthetic rainfall were similar to the ones of the observed rainfall while the extreme rainfall and extreme flood value were smaller than the ones derived from the observed rainfall by the degree of 16%-40%. Lastly, the web application developed in this study automates the entire process of synthetic rainfall generation, so we expect the application to be used in a variety of hydrologic analysis needing rainfall data.

Prediction of Power Consumption for Improving QoS in an Energy Saving Server Cluster Environment (에너지 절감형 서버 클러스터 환경에서 QoS 향상을 위한 소비 전력 예측)

  • Cho, Sungchoul;Kang, Sanha;Moon, Hungsik;Kwak, Hukeun;Chung, Kyusik
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.4 no.2
    • /
    • pp.47-56
    • /
    • 2015
  • In an energy saving server cluster environment, the power modes of servers are controlled according to load situation, that is, by making ON only minimum number of servers needed to handle current load while making the other servers OFF. This algorithm works well under normal circumstances, but does not guarantee QoS under abnormal circumstances such as sharply rising or falling loads. This is because the number of ON servers cannot be increased immediately due to the time delay for servers to turn ON from OFF. In this paper, we propose a new prediction algorithm of the power consumption for improving QoS under not only normal but also abnormal circumstances. The proposed prediction algorithm consists of two parts: prediction based on the conventional time series analysis and prediction adjustment based on trend analysis. We performed experiments using 15 PCs and compared performance for 4 types of conventional time series based prediction methods and their modified methods with our prediction algorithm. Experimental results show that Exponential Smoothing with Trend Adjusted (ESTA) and its modified ESTA (MESTA) proposed in this paper are outperforming among 4 types of prediction methods in terms of normalized QoS and number of good reponses per power consumed, and QoS of MESTA proposed in this paper is 7.5% and 3.3% better than that of conventional ESTA for artificial load pattern and real load pattern, respectively.