• 제목/요약/키워드: index clustering

검색결과 323건 처리시간 0.039초

Association between hemoglobin glycation index and cardiometabolic risk factors in Korean pediatric nondiabetic population

  • Lee, Bora;Heo, You Jung;Lee, Young Ah;Lee, Jieun;Kim, Jae Hyun;Lee, Seong Yong;Shin, Choong Ho;Yang, Sei Won
    • Annals of Pediatric Endocrinology and Metabolism
    • /
    • 제23권4호
    • /
    • pp.196-203
    • /
    • 2018
  • Purpose: The hemoglobin glycation index (HGI) represents the degree of nonenzymatic glycation and has been positively associated with cardiometabolic risk factors (CMRFs) and cardiovascular disease in adults. This study aimed to investigate the association between HGI, components of metabolic syndrome (MS), and alanine aminotransferase (ALT) in a pediatric nondiabetic population. Methods: Data from 3,885 subjects aged 10-18 years from the Korea National Health and Nutrition Examination Survey (2011-2016) were included. HGI was defined as subtraction of predicted glycated hemoglobin ($HbA1_c$) from measured $HbA1_c$. Participants were divided into 3 groups according to HGI tertile. Components of MS (abdominal obesity, fasting glucose, triglycerides, high-density lipoprotein cholesterol, and blood pressure), and proportion of MS, CMRF clustering (${\geq}2$ of MS components), and elevated ALT were compared among the groups. Results: Body mass index (BMI) z-score, obesity, total cholesterol, ALT, abdominal obesity, elevated triglycerides, and CMRF clustering showed increasing HGI trends from lower-to-higher tertiles. Multiple logistic regression analysis showed the upper HGI tertile was associated with elevated triglycerides (odds ratio, 1.65; 95% confidence interval, 1.18-2.30). Multiple linear regression analysis showed HGI level was significantly associated with BMI z-score, $HbA1_c$, triglycerides, and ALT. When stratified by sex, age group, and BMI category, overweight/obese subjects showed linear HGI trends for presence of CMRF clustering and ALT elevation. Conclusion: HGI was associated with CMRFs in a Korean pediatric population. High HGI might be an independent risk factor for CMRF clustering and ALT elevation in overweight/obese youth. Further studies are required to establish the clinical relevance of HGI for cardiometabolic health in youth.

고정 그리드 공간 색인을 위한 클러스터링 알고리즘의 성능 평가 (Performance Evaluation of Clustering Algorithms for Fixed-Grid Spatial Index)

  • 유진영;김진덕;김동현;홍봉희;김장수
    • 한국정보과학회:학술대회논문집
    • /
    • 한국정보과학회 1998년도 가을 학술발표논문집 Vol.25 No.2 (1)
    • /
    • pp.32-134
    • /
    • 1998
  • 공간 색인의 하나인 그리드 파일은 공간 데이터 영역을 격자 형태의 셀로 분할하여 구성하는데 특히, 셀들의 크기가 모두 동일한 값으로 고정되어진 것을 고정 그리드(fixed grid)라고 한다. 셀들의 크기가 고정된으로 인해 샐 분할선 상에 객체가 존재하는 경우가 자주 발생하게 되고 이러한 객체들은 하나 이상의 셀에 의해 중복으로 참조된다. 중복 참조 객체는 1/10 시간을 증가시켜 질의 처리 시 성능 저하의 주요한 원인이 된다. 따라서 중복 객체를 효율적으로 처리 할 수 있는 클러스터링 알고리즘의 고안이 필요하다. 이 논문에서는 중복 참조 객체를 처리하기 위한 객체 클러스터링(Object clustering)과 셀 단위로 클러스터하기 위한 셀 클러스터링(Cell clustering) 알고리즘을 구현한다. 그리고 공간 질의 수행 시에 각 클러스터기법들에 대한 성능을 평가한다.

A Study of Association Rule Mining by Clustering through Data Fusion

  • Cho, Kwang-Hyun;Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • 제18권4호
    • /
    • pp.927-935
    • /
    • 2007
  • Currently, Gyeongnam province is executing the social index survey every year to the provincials. But, this survey has the limit of the analysis as execution of the different survey per 3 year cycles. The solution of this problem is data fusion. Data fusion is the process of combining multiple data in order to provide information of tactical value to the user. But, data fusion doesn#t mean the ultimate result. Therefore, efficient analysis for the data fusion is also important. In this study, we present data fusion method of statistical survey data. Also, we suggest application methodology of association rule mining by clustering through data fusion of statistical survey data.

  • PDF

진화 프로그램을 이용한 퍼지 클러스터링 (Fuzzy Clustering using Evolution Program)

  • 정창호;임영희;박주영;박대희
    • 한국정보과학회논문지:소프트웨어및응용
    • /
    • 제26권1호
    • /
    • pp.130-130
    • /
    • 1999
  • In this paper, we propose a novel design method for improving performance of existing FCM-type clustering algorithms. First, we define the performance measure which focuses on bothcompactness and separation of clusters. Next, we optimize this measure using evolution program.Especially the proposed method has following merits: ① using evolution program, it solves suchproblems as initialization, number of clusters, and convergence to local optimum ② it reduces searchspace and improves convergence speed of algorithm since it represents chromosome with possiblepotential centers which are selected possible candidates of centers by density measure ③ it improvesperformance of clustering algorithm with the performance index which embedded both compactnessand separation Properties ④ it is robust to noise data since it minimizes its effect on center search.

Validation Measures of Bicluster Solutions

  • Lee, Young-Rok;Lee, Jeong-Hwa;Jun, Chi-Hyuck
    • Industrial Engineering and Management Systems
    • /
    • 제8권2호
    • /
    • pp.101-108
    • /
    • 2009
  • Biclustering is a method to extract subsets of objects and features from a dataset which are characterized in some way. In contrast to traditional clustering algorithms which group objects similar in a whole feature set, biclustering methods find groups of objects which have similar values or patterns in some features. Both in clustering and biclustering, validating how much the result is informative or reliable is a very important task. Whereas validation methods of cluster solutions have been studied actively, there are only few measures to validate bicluster solutions. Furthermore, the existing validation methods of bicluster solutions have some critical problems to be used in general cases. In this paper, we review several well-known validation measures for cluster and bicluster solutions and discuss their limitations. Then, we propose several improved validation indices as modified versions of existing ones.

흡연, 음주와 운동습관의 군집현상을 통한 건강행태의 고위험군: 국민건강영양 조사 (High Risk Groups in Health Behavior Defined by Clustering of Smoking, Alcohol, and Exercise Habits: National Heath and Nutrition Examination Survey)

  • 강기원;성주헌;김창엽
    • Journal of Preventive Medicine and Public Health
    • /
    • 제43권1호
    • /
    • pp.73-83
    • /
    • 2010
  • Objectives: We investigated the clustering of selected lifestyle factors (cigarette smoking, heavy alcohol consumption, lack of physical exercise) and identified the population characteristics associated with increasing lifestyle risks. Methods: Data on lifestyle risk factors, sociodemographic characteristics, and history of chronic diseases were obtained from 7,694 individuals ${\geq}20$ years of age who participated in the 2005 Korea National Health and Nutrition Examination Survey (KNHANES). Clustering of lifestyle risks involved the observed prevalence of multiple risks and those expected from marginal exposure prevalence of the three selected risk factors. Prevalence odds ratio was adopted as a measurement of clustering. Multiple correspondence analysis, Kendall tau correlation, Man-Whitney analysis, and ordinal logistic regression analysis were conducted to identify variables increasing lifestyle risks. Results: In both men and women, increased lifestyle risks were associated with clustering of: (1) cigarette smoking and excessive alcohol consumption, and (2) smoking, excessive alcohol consumption, and lack of physical exercise. Patterns of clustering for physical exercise were different from those for cigarette smoking and alcohol consumption. The increased unhealthy clustering was found among men 20-64 years of age with mild or moderate stress, and among women 35-49 years of age who were never-married, with mild stress, and increased body mass index (>$30\;kg/m^2$). Conclusions: Addressing a lack of physical exercise considering individual characteristics including gender, age, employment activity, and stress levels should be a focus of health promotion efforts.

증분형 K-means 클러스터링 기반 방사형 기저함수 신경회로망 모델 설계 (Design of Incremental K-means Clustering-based Radial Basis Function Neural Networks Model)

  • 박상범;이승철;오성권
    • 전기학회논문지
    • /
    • 제66권5호
    • /
    • pp.833-842
    • /
    • 2017
  • In this study, the design methodology of radial basis function neural networks based on incremental K-means clustering is introduced for learning and processing the big data. If there is a lot of dataset to be trained, general clustering may not learn dataset due to the lack of memory capacity. However, the on-line processing of big data could be effectively realized through the parameters operation of recursive least square estimation as well as the sequential operation of incremental clustering algorithm. Radial basis function neural networks consist of condition part, conclusion part and aggregation part. In the condition part, incremental K-means clustering algorithms is used tweights of the conclusion part are given as linear function and parameters are calculated using recursive least squareo get the center points of data and find the fitness using gaussian function as the activation function. Connection s estimation. In the aggregation part, a final output is obtained by center of gravity method. Using machine learning data, performance index are shown and compared with other models. Also, the performance of the incremental K-means clustering based-RBFNNs is carried out by using PSO. This study demonstrates that the proposed model shows the superiority of algorithmic design from the viewpoint of on-line processing for big data.

다차원 색인을 이용한 하향식 계층 클러스터링 (Top-down Hierarchical Clustering using Multidimensional Indexes)

  • 황재준;문양세;황규영
    • 한국정보과학회논문지:데이타베이스
    • /
    • 제29권5호
    • /
    • pp.367-380
    • /
    • 2002
  • 최근 공간 데이타 분석, 영상 분석 등과 같은 대용량 데이타를 관리하는 다양한 응용 업무들이 증가함에 따라, 대용량의 데이타베이스를 위한 클러스터링 기법이 많이 연구되고 있다. 그 중에서도 계층 클러스터링 기법은 데이타베이스의 계층 분할을 표현하는 계층 트리를 생성하고 이를 이용하여 효율적인 클러스터링을 수행하는 방법으로서, 지금까지는 주로 트리를 하위 계층으로부터 상위 계층으로 생성해 가는 상향식(bottom-up) 계층 클러스터링 기법들이 연구되었다. 이러한 상향식 클러스터링 방법은 트리를 생성하기 위하여 전체 데이타베이스를 한 번 이상 액세스하여야 할 뿐만 아니라, 하위 계층에서부터 검색을 시작하기 때문에 트리의 많은 부분을 검색하여야 하는 문제점이 있다. 본 논문에서는 대부분의 데이타베이스 응용에서 이미 유지하고 있는 다차원 색인을 이용하여 클러스터링을 수행하는 새로운 하향식(top-down) 계층 클러스터링 기법을 제안한다. 일반적으로 다차원 색인에서는 가까운 객체들이 동일한 (혹은 인접한) 페이지에 저장될 가능성이 큰 클러스터링 성질을 가진다. 이러한 다차원 색인의 클러스터링 성질을 사용하면 각 객체들간의 거리를 일일이 계산하지 않고도 이웃한 객체들을 식별할 수 있다. 우선 객체들의 밀도에 기반하여 클러스터를 정형적으로 정의한다. 이를 위하여, 객체를 포함하는 영역의 밀도를 이용한 영역 대조 분할(region contrast partition) 개념을 사용한다. 또, 클러스터링 알고리즘에서의 빠른 검색을 위하여 분기 한정(branch-and-bound) 알고리즘을 사용하며, 여기서의 한계값(bound)을 제안하고 이의 정확성을 이론적으로 증명한다. 실험 결과, 제안한 방법은 상향식 계층 클러스터링 방법인 BIRCH와 비교하여, 정확성 측면에서 우수하거나 유사한 것으로 나타났으며, 데이타 페이지 액세스 횟수를 데이타베이스 크기에 따라 최고 26~187배까지 감소시킨 것으로 나타났다. 이 같은 결과로 볼 때, 제안한 방법은 대용량 데이타베이스에서의 클러스터링 성능을 크게 향상시키는 기법으로서, 일반 데이타베이스 응용에 실용적으로 적용 가능하다고 판단된다.

Retail Outlet Clustering of the Imported Automobile Distributors in Korea

  • Park, Koo-Woong
    • 유통과학연구
    • /
    • 제16권5호
    • /
    • pp.45-59
    • /
    • 2018
  • Purpose - This paper aims to analyze the distinct pattern of clustering of imported automobile distributors and provide evidence for the phenomenon using Korean data. Research design, data, and methodology - In this paper, we use data from Korea Automobile Importers & Distributors Association of 23 foreign automobile brands to evaluate the degree of concentration of showrooms using locational Gini index. We identify possible causes for the high level of clustering from two perspectives; 1) on the distributors' side and 2) on the customers' side. Results - We find a very strong locational concentration of imported automobile showrooms within close vicinity in the major cities and districts in Korea. Locational Gini coefficients are 0.1024 at the national level, 0.1836~0.3763 at city level, and 0.3941~0.4311 at district level on a [0,0.5] scale. Conclusions - Luxury foreign automobile customers tend to shop extensively around multiple brands prior to their ideal model selection. Accordingly, the imported automobile distributors cluster together close to their direct competitors in order to give a good comparison opportunity for the potential customers. This will maximize the probability of the visits of potential customers and lead to successful sales performance.

가정방문을 통한 일 광역시 성인의 대사증후군 유병률 및 위험요인 조사 (Prevalence Rates and Risk Factors of Metabolic Disorder in Urban Adults assessed in Home Visits)

  • 김종임
    • 가정간호학회지
    • /
    • 제16권1호
    • /
    • pp.12-21
    • /
    • 2009
  • Purpose: The survey-based study aimed to determine the distribution and clustering tendency of metabolic syndrome risk factors in urban residents, and cluster odds ratios. Methods: Cluster sampling involved 827 urban participants and analysis of the collected data. Results: Regarding the prevalence of metabolic syndrome risk factors used for diagnosis, abdominal obesity was higher in women(69.5%) than in men(34.3%), high blood pressure was higher in men(57%) than in women(46.5%), and blood sugar was higher in men(6.9%) than in women(5.7%). Clustering increased with increasing body mass index(BMI), weight:height ratio(W/Ht) and abdominal obesity Risk factors for females were 1.7 times higher than for males. Participants with a family history of metabolic syndrome displayed related risk factors 1.5 times more than participants without a family history. Participants having a BMI ranking them as obese were 9.5 times more likely to display metabolic syndrome risk factors than non-obese participants. Obese participants were 20 times more likely to display risk factors than non-obese participants. Conclusion: BMI, W/Ht and abdominal obesity correlate with clustering of metabolic syndrome risk factors. The risk is increased by smoking and family history. Exercise weight control and non-smoking are recommended for comprehensive management of clustering of metabolic syndrome risk factors.

  • PDF