• Title/Summary/Keyword: k-mean algorithm

Search Result 1,271, Processing Time 0.03 seconds

The Comparative Study for Software Reliability Models Based on NHPP (NHPP에 기초한 소프트웨어 신뢰도 모형에 대한 비교연구)

  • Gan, Gwang-Hyeon;Kim, Hui-Cheol;Lee, Byeong-Su
    • The KIPS Transactions:PartD
    • /
    • v.8D no.4
    • /
    • pp.393-400
    • /
    • 2001
  • This paper presents a stochastic model for the software failure phenomenon based on a nonhomogeneous Poisson process (NHPP). The failure process is analyzed to develop a suitable mean value function for the NHPP ; expressions are given for several performance measure. Actual software failure data are compared with generalized model by Goel dependent on the constant reflecting the quality of testing. The performance measures and parametric inferences of the new models, Rayleigh and Gumbel distributions, are discussed. The results of the new models are applied to real software failure data and compared with Goel-Okumoto and Yamada, Ohba and Osaki models. Tools of parameter inference was used method of the maximun likelihood estimate and the bisection algorithm for the computing nonlinear root. In this paper, using the sum of the squared errors, model selection was employed. The numerical example by NTDS data was illustrated.

  • PDF

Fuzzy Clustering Method for the Identification of Joint Sets (절리군 분석을 위한 퍼지 클러스터링 기법)

  • 정용복;전석원
    • Tunnel and Underground Space
    • /
    • v.13 no.4
    • /
    • pp.294-303
    • /
    • 2003
  • The structural behaviour of rock mass structure, such as tunnel or slope is critically dependent on the various characteristics of discontinuities. Therefore, it is important to survey and analyze discontinuities correctly for the design and construction of rock mass structure. One inevitable Procedure of discontinuity survey and analysis is joint set identification from a lot of raw directional joint data. The identification procedure is generally done by a graphical method. This type of analysis has some shortcomings such as subjective identification results, inability to use extra information on discontinuity, and so on. In this study, a computer program for joint set identification based on the fuzzy clustering algorithm was implemented and tested using two kinds of joint data. It was confirmed that fuzzy clustering method is effective and valid for joint set identification and estimation of mean direction and degree of clustering of huge joint data through the applications.

Implementation of Intelligence Pulse Wave Detection System (지능형 맥진기 구현)

  • Hong, Y.S.;Yu, J.S.;Chang, S.J.;Sun, S.H.;Lee, W.B.;Nam, D.H.;Yu, M.S.;Choi, M.B.;Lee, S.S.
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.13 no.2
    • /
    • pp.245-254
    • /
    • 2013
  • In oriental medicine, it is possible to classify and treat many diseases using the pulse wave detection system. Other problems may arise. As it is a very subjective way to analyze the pulse wave. One problem of the conventional pulse wave detection system is that the arterial pulse sensor is not located correctly at the radial artery. Threrefore measurement results can differ depending on the measurement position and the measurement procedure. This is mostly due to it's sensitivity to high reproducibility. In order to solve this problem this paper proposes an algorithm to analyze the weak pulse wave symptom and strong pulse wave symptom. It uses the portable pulse wave detection system which includes a Hall Sensor. As a final result, it analyzed the weak pulse wave symptom and strong pulse wave symptom by the SPSS statistics technique. It proves that N time (notch point time) and S Amp (rise waveform size) mean values are significantly different in 95% confidence interval.

Problems in Fuzzy c-means and Its Possible Solutions (Fuzzy c-means의 문제점 및 해결 방안)

  • Heo, Gyeong-Yong;Seo, Jin-Seok;Lee, Im-Geun
    • Journal of the Korea Society of Computer and Information
    • /
    • v.16 no.1
    • /
    • pp.39-46
    • /
    • 2011
  • Clustering is one of the well-known unsupervised learning methods, in which a data set is grouped into some number of homogeneous clusters. There are numerous clustering algorithms available and they have been used in various applications. Fuzzy c-means (FCM), the most well-known partitional clustering algorithm, was established in 1970's and still in use. However, there are some unsolved problems in FCM and variants of FCM are still under development. In this paper, the problems in FCM are first explained and the available solutions are investigated, which is aimed to give researchers some possible ways of future research. Most of the FCM variants try to solve the problems using domain knowledge specific to a given problem. However, in this paper, we try to give general solutions without using any domain knowledge. Although there are more things left than discovered, this paper may be a good starting point for researchers newly entered into a clustering area.

Monitoring and Prediction of Appliances Electricity Usage Using Neural Network (신경회로망을 이용한 가전기기 전기 사용량 모니터링 및 예측)

  • Jung, Kyung-Kwon;Choi, Woo-Seung
    • Journal of the Korea Society of Computer and Information
    • /
    • v.16 no.8
    • /
    • pp.137-146
    • /
    • 2011
  • In order to support increased consumer awareness regarding energy consumption, we present new ways of monitoring and predicting with energy in electric appliances. The proposed system is a design of a common electrical power outlet called smart plug that measures the amount of current passing through current sensor at 0.5 second. To acquire data for training and testing the proposed neural network, weather parameters used include average temperature of day, min and max temperature, humidity, and sunshine hour as input data, and power consumption as target data from smart plug. Using the experimental data for training, the neural network model based on Back-Propagation algorithm was developed. Multi layer perception network was used for nonlinear mapping between the input and the output data. It was observed that the proposed neural network model can predict the power consumption quite well with correlation coefficient was 0.9965, and prediction mean square error was 0.02033.

Panel data analysis with regression trees (회귀나무 모형을 이용한 패널데이터 분석)

  • Chang, Youngjae
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.6
    • /
    • pp.1253-1262
    • /
    • 2014
  • Regression tree is a tree-structured solution in which a simple regression model is fitted to the data in each node made by recursive partitioning of predictor space. There have been many efforts to apply tree algorithms to various regression problems like logistic regression and quantile regression. Recently, algorithms have been expanded to the panel data analysis such as RE-EM algorithm by Sela and Simonoff (2012), and extension of GUIDE by Loh and Zheng (2013). The algorithms are briefly introduced and prediction accuracy of three methods are compared in this paper. In general, RE-EM shows good prediction accuracy with least MSE's in the simulation study. A RE-EM tree fitted to business survey index (BSI) panel data shows that sales BSI is the main factor which affects business entrepreneurs' economic sentiment. The economic sentiment BSI of non-manufacturing industries is higher than that of manufacturing ones among the relatively high sales group.

An Evaluation of Multiple-input Dual-output Run-to-Run Control Scheme for Semiconductor Manufacturing

  • Fan, Shu-Kai-S.;Lin, Yen
    • Industrial Engineering and Management Systems
    • /
    • v.4 no.1
    • /
    • pp.54-67
    • /
    • 2005
  • This paper provides an evaluation of an optimization-based, multiple-input double-output (MIDO) run-to-run (R2R) control scheme for general semiconductor manufacturing processes. The controller in this research, termed adaptive dual response optimizing controller (ADROC), can serve as a process optimizer as well as a recipe regulator between consecutive runs of wafer fabrication. In evaluation, it is assumed that the equipment model could be appropriately described by a pair of second-order polynomial functions in terms of a set of controllable variables. Of practical relevance is to consider a drifting effect in the equipment model since in common semiconductor practice the process tends to drift due to machine aging and tool wearing. We select a typical application of R2R control to chemical mechanical planarization (CMP) in semiconductor manufacturing in this evaluation, and there are five different CMP process scenarios demonstrated, including mean shift, variance increase, and IMA disturbances. For the controller, ADROC, an on-line estimation technique is implemented in a self-tuning (ST) control manner for the adaptation purpose. Subsequently, an ad hoc global optimization algorithm based on the dual response approach, arising from the response surface methodology (RSM) literature, is used to seek the optimum recipe within the acceptability region for the execution of next run. The main components of ADROC are described and its control performance is assessed. It reveals from the evaluation that ADROC can provide excellent control actions for the MIDO R2R situations even though the process exhibits complicated, nonlinear interaction effects between control variables, and the drifting disturbances.

Estimation of spatiotemporal soil moisture distribution for Yongdam-dam watershed using Sentinel-1 C-band Synthetic Aperture Radar images (Sentinel-1 C-band SAR 영상을 이용한 용담댐 유역의 시공간 토양수분 산정)

  • Chung, Jeehun;Lee, Yonggwan;Jang, Wonjin;Kim, Seongjoon
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2020.06a
    • /
    • pp.162-162
    • /
    • 2020
  • 토양수분은 TDR(Time Domain Reflectometry)이나 Tensiometer 등의 장비를 이용하여 측정을 시행하고 있으나, 이를 위해서는 많은 인력과 경제적 자원이 소비될 뿐만 아니라 시공간적으로 측정할 수 있는 범위에 한계가 있다. 지상 관측의 대안으로 MIRAS(Microwave Imaging Radiometer with Aperture Synthesis)나 SMAP(Soil Moisture Active Passive), AMSR2(Advanced Microwave Scanning Radiometer 2) 등의 수동 마이크로파 위성 센서를 이용한 공간 토양수분 관측이 수행되었으나, 낮은 공간 해상도(9~36km)는 지역 규모의 토양수분 분포를 나타내기 충분하지 않고, 높은 불확실성을 내포하고 있다. 본 연구에서는 금강 상류의 용담댐 유역(930.0㎢)을 대상으로 Sentinel-1 C-band SAR(Synthetic Aperture Radar) 영상을 이용한 토지 피복 및 토양 속성을 고려한 10m 해상도의 토양수분 산출을 수행하였다. 용담댐 유역은 산림 79.7%, 논 9.0%, 밭 5.4%, 주거지 2.9%의 토지 피복 비율을 가지며 토양은 사양토(66.6%)와 양토(20.9%)가 우세하다. Sentinel-1 C-band SAR 영상은 SeNtinel Application Platform(SNAP)을 이용하여 전처리 후, 후방산란계수로 변환하였다. 토양수분 알고리즘은 TU-Wien change detection algorithm과 Regression model을 활용하였고, 검증을 위한 실측 토양수분 자료는 한국수자원공사(K-water)에서 제공하는 5년(2014~2018)간의 토양수분 관측자료를 이용하였다. 산출된 토양수분은 결정계수(Coefficient of determination, R2) 및 평균제곱근오차(Root Mean Square Error, RMSE)를 이용하여 실측 토양수분과 비교하였다. Sentinel-1 C-band SAR 영상을 이용한 고해상도의 토양수분 산출은 토지 피복 및 토양 속성을 고려한 지역 규모의 공간 토양수분 분포 및 시간적 변화를 표현 가능할 것으로 판단된다.

  • PDF

Three-dimensional Model Generation for Active Shape Model Algorithm (능동모양모델 알고리듬을 위한 삼차원 모델생성 기법)

  • Lim, Seong-Jae;Jeong, Yong-Yeon;Ho, Yo-Sung
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.43 no.6 s.312
    • /
    • pp.28-35
    • /
    • 2006
  • Statistical models of shape variability based on active shape models (ASMs) have been successfully utilized to perform segmentation and recognition tasks in two-dimensional (2D) images. Three-dimensional (3D) model-based approaches are more promising than 2D approaches since they can bring in more realistic shape constraints for recognizing and delineating the object boundary. For 3D model-based approaches, however, building the 3D shape model from a training set of segmented instances of an object is a major challenge and currently it remains an open problem in building the 3D shape model, one essential step is to generate a point distribution model (PDM). Corresponding landmarks must be selected in all1 training shapes for generating PDM, and manual determination of landmark correspondences is very time-consuming, tedious, and error-prone. In this paper, we propose a novel automatic method for generating 3D statistical shape models. Given a set of training 3D shapes, we generate a 3D model by 1) building the mean shape fro]n the distance transform of the training shapes, 2) utilizing a tetrahedron method for automatically selecting landmarks on the mean shape, and 3) subsequently propagating these landmarks to each training shape via a distance labeling method. In this paper, we investigate the accuracy and compactness of the 3D model for the human liver built from 50 segmented individual CT data sets. The proposed method is very general without such assumptions and can be applied to other data sets.

Development of Market Growth Pattern Map Based on Growth Model and Self-organizing Map Algorithm: Focusing on ICT products (자기조직화 지도를 활용한 성장모형 기반의 시장 성장패턴 지도 구축: ICT제품을 중심으로)

  • Park, Do-Hyung;Chung, Jaekwon;Chung, Yeo Jin;Lee, Dongwon
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.4
    • /
    • pp.1-23
    • /
    • 2014
  • Market forecasting aims to estimate the sales volume of a product or service that is sold to consumers for a specific selling period. From the perspective of the enterprise, accurate market forecasting assists in determining the timing of new product introduction, product design, and establishing production plans and marketing strategies that enable a more efficient decision-making process. Moreover, accurate market forecasting enables governments to efficiently establish a national budget organization. This study aims to generate a market growth curve for ICT (information and communication technology) goods using past time series data; categorize products showing similar growth patterns; understand markets in the industry; and forecast the future outlook of such products. This study suggests the useful and meaningful process (or methodology) to identify the market growth pattern with quantitative growth model and data mining algorithm. The study employs the following methodology. At the first stage, past time series data are collected based on the target products or services of categorized industry. The data, such as the volume of sales and domestic consumption for a specific product or service, are collected from the relevant government ministry, the National Statistical Office, and other relevant government organizations. For collected data that may not be analyzed due to the lack of past data and the alteration of code names, data pre-processing work should be performed. At the second stage of this process, an optimal model for market forecasting should be selected. This model can be varied on the basis of the characteristics of each categorized industry. As this study is focused on the ICT industry, which has more frequent new technology appearances resulting in changes of the market structure, Logistic model, Gompertz model, and Bass model are selected. A hybrid model that combines different models can also be considered. The hybrid model considered for use in this study analyzes the size of the market potential through the Logistic and Gompertz models, and then the figures are used for the Bass model. The third stage of this process is to evaluate which model most accurately explains the data. In order to do this, the parameter should be estimated on the basis of the collected past time series data to generate the models' predictive value and calculate the root-mean squared error (RMSE). The model that shows the lowest average RMSE value for every product type is considered as the best model. At the fourth stage of this process, based on the estimated parameter value generated by the best model, a market growth pattern map is constructed with self-organizing map algorithm. A self-organizing map is learning with market pattern parameters for all products or services as input data, and the products or services are organized into an $N{\times}N$ map. The number of clusters increase from 2 to M, depending on the characteristics of the nodes on the map. The clusters are divided into zones, and the clusters with the ability to provide the most meaningful explanation are selected. Based on the final selection of clusters, the boundaries between the nodes are selected and, ultimately, the market growth pattern map is completed. The last step is to determine the final characteristics of the clusters as well as the market growth curve. The average of the market growth pattern parameters in the clusters is taken to be a representative figure. Using this figure, a growth curve is drawn for each cluster, and their characteristics are analyzed. Also, taking into consideration the product types in each cluster, their characteristics can be qualitatively generated. We expect that the process and system that this paper suggests can be used as a tool for forecasting demand in the ICT and other industries.