• 제목/요약/키워드: clustered data

검색결과 556건 처리시간 0.024초

백화점 고객의 구매 분석 및 고객관계관리 전략 적용 (Analyzing Customer Purchase Behavior of a Department Store and Applying Customer Relationship Management Strategies)

  • 하성호;백경훈
    • 경영과학
    • /
    • 제21권3호
    • /
    • pp.55-69
    • /
    • 2004
  • This study analyzes customer buying-behavior patterns in a department store as time goes on, and predicts moving patterns of its customers. Through them, it suggests in this paper short-term and long-term marketing promotion strategies. RFM techniques are utilized for customer segmentation. Customers are clustered by using the Kohonen's Self Organizing Map as a method of data mining techniques. Then C5.0, a decision tree analysis technique, is used to predict moving patterns of customers. Using real world data, this study evaluates the prediction accuracy of predictive models.

Context-Aware Mobile Gateway Relocation Scheme for Clustered Wireless Sensor Networks

  • Encarnacion, Nico N.;Yang, Hyunho
    • Journal of information and communication convergence engineering
    • /
    • 제10권4호
    • /
    • pp.365-371
    • /
    • 2012
  • In recent years, researchers have been attracted to clustering methods to improve communication and data transmission in a network. Compared with traditional wireless networks, wireless sensor networks are energy constrained and have lower data rates. The concept of implementing a clustering algorithm in an existing project on gateway relocation is being explored here. Low energy adaptive clustering hierarchy (LEACH) is applied to an existing study on relocating a gateway. The study is further improved by moving the gateway to a specific cluster based on the number or significance of the events detected. The protocol is improved so that each cluster head can communicate with a mobile gateway. The cluster heads are the only nodes that can communicate with the mobile gateway when it (the mobile gateway) is out of the cluster nodes' transmission range. Once the gateway is in range, the nodes will begin their transmission of real-time data. This alleviates the load of the nodes that would be located closest to the gateway if it were static.

일반화된 선형 혼합 모형(GENERALIZED LINEAR MIXED MODEL: GLMM)에 관한 최근의 연구 동향 (A Study for Recent Development of Generalized Linear Mixed Model)

  • 이준영
    • 응용통계연구
    • /
    • 제13권2호
    • /
    • pp.541-562
    • /
    • 2000
  • 일반화된 선형 혼합 모형(GLMM)은 자료가 계수의 형태로 나타나는 범주형 자료의 경우, 혹은 집락의 형태나 과산포된 비정규 자료, 또는 비선형 모형에 따르는 자료를 다루기 위한 모형 설정에 사용된다. 본 연구에서는 이에 대한 개요와 더불어, 이 모형의 적합을 위해 제시된 통계적 기법들중 의사가능도(quasi-likelihood: QL)를 이용한 추정 방법 및 Monte-Carlo 기법을 이용한 추정 방법들에 대해 조사하였다. 또한 GLMM에 대한 현재의 연구 방향 및 앞으로의 연구 가능 주제들에 대해서도 언급하였다.

  • PDF

우리나라 기상자료에 대한 군집분석 (Clustering analysis of Korea's meteorological data)

  • 여인권
    • Journal of the Korean Data and Information Science Society
    • /
    • 제22권5호
    • /
    • pp.941-949
    • /
    • 2011
  • 이 논문에서는 1999년 1월 1일부터 2010년 6월 30일까지 전국 72개 관측소에서 측정된 우리나라 기상자료를 평균연결법에 의한 계층적 병합방법을 통해 군집분석을 실시하고 각 기상자료에서 유도된 군집의 특성을 파악해 본다. 이 분석에서 유도된 군집과 2010년 기후변화에 따른 식중독 발생연구에서 사용되었던 산맥을 경계로 구분한 군집을 비교해 본다.

Global Optimization of Clusters in Gene Expression Data of DNA Microarrays by Deterministic Annealing

  • Lee, Kwon Moo;Chung, Tae Su;Kim, Ju Han
    • Genomics & Informatics
    • /
    • 제1권1호
    • /
    • pp.20-24
    • /
    • 2003
  • The analysis of DNA microarry data is one of the most important things for functional genomics research. The matrix representation of microarray data and its successive 'optimal' incisional hyperplanes is a useful platform for developing optimization algorithms to determine the optimal partitioning of pairwise proximity matrix representing completely connected and weighted graph. We developed Deterministic Annealing (DA) approach to determine the successive optimal binary partitioning. DA algorithm demonstrated good performance with the ability to find the 'globally optimal' binary partitions. In addition, the objects that have not been clustered at small non­zero temperature, are considered to be very sensitive to even small randomness, and can be used to estimate the reliability of the clustering.

A visualizing method for investigating individual frailties using frailtyHL R-package

  • Ha, Il Do;Noh, Maengseok
    • Journal of the Korean Data and Information Science Society
    • /
    • 제24권4호
    • /
    • pp.931-940
    • /
    • 2013
  • For analysis of clustered survival data, the inferences of parameters in semi-parametric frailty models have been widely studied. It is also important to investigate the potential heterogeneity in event times among clusters (e.g. centers, patients). For purpose of this analysis, the interval estimation of frailty is useful. In this paper we propose a visualizing method to present confidence intervals of individual frailties across clusters using the frailtyHL R-package, which is implemented from h-likelihood methods for frailty models. The proposed method is demonstrated using two practical examples.

ML estimation using Poisson HGLM approach in semi-parametric frailty models

  • Ha, Il Do
    • Journal of the Korean Data and Information Science Society
    • /
    • 제27권5호
    • /
    • pp.1389-1397
    • /
    • 2016
  • Semi-parametric frailty model with nonparametric baseline hazards has been widely used for the analyses of clustered survival-time data. The frailty models can be fitted via an auxiliary Poisson hierarchical generalized linear model (HGLM). For the inferences of the frailty model marginal likelihood, which gives MLE, is often used. The marginal likelihood is usually obtained by integrating out random effects, but it often requires an intractable integration. In this paper, we propose to obtain the MLE via Laplace approximation using a Poisson HGLM approach for semi-parametric frailty model. The proposed HGLM approach uses hierarchical-likelihood (h-likelihood), which avoids integration itself. The proposed method is illustrated using a numerical study.

연속미디어 저장 서버에서의 실시간 저장 및 검색 기법 (Real-Time Storage and Retrieval Techniques for Continuous Media Storage Server)

  • CheolSu Lim
    • 전자공학회논문지B
    • /
    • 제32B권11호
    • /
    • pp.1365-1373
    • /
    • 1995
  • In this paper, we address the issues related to storage and retrieval of continuous media (CM)data we face in designing multimedia on-demand (MOD) storage servers. To support the two orthogonal factors of MOD server design, i.e., storage and retrieval of CM data, this paper discusses the techniques of disk layout, disk striping and real-time disk scheduling, which are integrated as a combined solution to the high- performance MOD storage subsystem. The proposed clustered striping technique enables either a multiple-disk or a parallel system to guarantee a continuous retrieval of CM data at the bandwidth required to support user playback rate by avoiding the formation of I/O bottlenecks.

  • PDF

부분방전 펄스파형에 대한 TF Map 적용 가능성애 대한 연구 (A Study on the Applicability of TF Map for the Partial Discharge Pulse Shapes)

  • 김정태;이호근;임윤석;김지홍;구자윤
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 2004년도 추계학술대회 논문집 전기물성,응용부문
    • /
    • pp.107-109
    • /
    • 2004
  • It is well known that on-site corona and various noises make severe problems on the PD(partial discharge) measurement. In this paper, for the improvement of the on-site PD measuring performance, the applicability of TF(Time-Frequency) map was investigated. For the purpose, PDs that can be generated from the defect in the power cables as well as corona discharges and ground noises were measured. And through the TF analysis for the measured data, we have tried to distinguish the real PDs, corona discharges and noises. As a result, it is confirmed that the data of PDs can be clustered themselves separable from other data such as corona and noises on the TF map.

  • PDF

Effects on Regression Estimates under Misspecified Generalized Linear Mixed Models for Counts Data

  • Jeong, Kwang Mo
    • 응용통계연구
    • /
    • 제25권6호
    • /
    • pp.1037-1047
    • /
    • 2012
  • The generalized linear mixed model(GLMM) is widely used in fitting categorical responses of clustered data. In the numerical approximation of likelihood function the normality is assumed for the random effects distribution; subsequently, the commercial statistical packages also routinely fit GLMM under this normality assumption. We may also encounter departures from the distributional assumption on the response variable. It would be interesting to investigate the impact on the estimates of parameters under misspecification of distributions; however, there has been limited researche on these topics. We study the sensitivity or robustness of the maximum likelihood estimators(MLEs) of GLMM for counts data when the true underlying distribution is normal, gamma, exponential, and a mixture of two normal distributions. We also consider the effects on the MLEs when we fit Poisson-normal GLMM whereas the outcomes are generated from the negative binomial distribution with overdispersion. Through a small scale Monte Carlo study we check the empirical coverage probabilities of parameters and biases of MLEs of GLMM.