• Title/Summary/Keyword: Clustering coefficient

Search Result 196, Processing Time 0.024 seconds

Reconstruction from Feature Points of Face through Fuzzy C-Means Clustering Algorithm with Gabor Wavelets (FCM 군집화 알고리즘에 의한 얼굴의 특징점에서 Gabor 웨이브렛을 이용한 복원)

  • 신영숙;이수용;이일병;정찬섭
    • Korean Journal of Cognitive Science
    • /
    • v.11 no.2
    • /
    • pp.53-58
    • /
    • 2000
  • This paper reconstructs local region of a facial expression image from extracted feature points of facial expression image using FCM(Fuzzy C-Meang) clustering algorithm with Gabor wavelets. The feature extraction in a face is two steps. In the first step, we accomplish the edge extraction of main components of face using average value of 2-D Gabor wavelets coefficient histogram of image and in the next step, extract final feature points from the extracted edge information using FCM clustering algorithm. This study presents that the principal components of facial expression images can be reconstructed with only a few feature points extracted from FCM clustering algorithm. It can also be applied to objects recognition as well as facial expressions recognition.

  • PDF

Retail Outlet Clustering of the Imported Automobile Distributors in Korea

  • Park, Koo-Woong
    • Journal of Distribution Science
    • /
    • v.16 no.5
    • /
    • pp.45-59
    • /
    • 2018
  • Purpose - This paper aims to analyze the distinct pattern of clustering of imported automobile distributors and provide evidence for the phenomenon using Korean data. Research design, data, and methodology - In this paper, we use data from Korea Automobile Importers & Distributors Association of 23 foreign automobile brands to evaluate the degree of concentration of showrooms using locational Gini index. We identify possible causes for the high level of clustering from two perspectives; 1) on the distributors' side and 2) on the customers' side. Results - We find a very strong locational concentration of imported automobile showrooms within close vicinity in the major cities and districts in Korea. Locational Gini coefficients are 0.1024 at the national level, 0.1836~0.3763 at city level, and 0.3941~0.4311 at district level on a [0,0.5] scale. Conclusions - Luxury foreign automobile customers tend to shop extensively around multiple brands prior to their ideal model selection. Accordingly, the imported automobile distributors cluster together close to their direct competitors in order to give a good comparison opportunity for the potential customers. This will maximize the probability of the visits of potential customers and lead to successful sales performance.

Metamemory and Categorical Organization Strategy for Age, Category Typicality, and Recall Tasks (연령, 범주전형성 및 회상조건에 따른 아동의 상위기억과 범주적 조직화 책략 사용)

  • Lee, Hae Lyun;Lee, Gyung Nim
    • Korean Journal of Child Studies
    • /
    • v.16 no.2
    • /
    • pp.125-138
    • /
    • 1995
  • The purpose of the present research was to study developmental trends in categorical organization strategy. The subjects were 160 children - 40 nine - year - old boys, 40 nine - year - old girls, 40 seven - year - old boys, 40 seven - year - old girls. All subjects received one of three lists of items differing in category representativeness in either a free -recall or a sort -recall task. The selection of list materials permitted separation of the effects of age differences in category knowledge from those of knowledge per se on children's recall behavior. The tasks were administered to children individually with the memory task followed by the metamemory task. The data was analyzed with three - way ANOVA arid Pearson's correlation coefficient. The results were that (1) Children's recall, clustering, and metamemory increased with age, while age effects for clustering were restricted to the sort - recall/high typicality condition. At each age level, children showed higher level of recall, clustering and metamemory for category typical rather than atypical list, and sort - recall than free-recall. Level of clustering and metamemory were superior in the sort - recall task and for items of high category typicality. (2) 9 - year - old children were capable of deliberately and efficiently using category organization as a memory strategy at least when appropriate contextual support was present (as determined by task requirements and list materials: sort - recall/high typicality).

  • PDF

Optimization study of a clustering algorithm for cosmic-ray muon scattering tomography used in fast inspection

  • Hou, Linjun;Huo, Yonggang;Zuo, Wenming;Yao, Qingxu;Yang, Jianqing;Zhang, Quanhu
    • Nuclear Engineering and Technology
    • /
    • v.53 no.1
    • /
    • pp.208-215
    • /
    • 2021
  • Cosmic-ray muon scattering tomography (MST) technology is a new radiation imaging technology with unique advantages. As the performance of its image reconstruction algorithm has a crucial influence on the imaging quality, researches on this algorithm are of great significance to the development and application of this technology. In this paper, a fast inspection algorithm based on clustering analysis for the identification of the existence of nuclear materials is studied and optimized. Firstly, the principles of MST technology and a binned clustering algorithm were introduced, and then several simulation experiments were carried out using Geant4 toolkit to test the effects of exposure time, algorithm parameter, the size and structure of object on the performance of the algorithm. Based on these, we proposed two optimization methods for the clustering algorithm: the optimization of vertical distance coefficient and the displacement of sub-volumes. Finally, several sets of experiments were designed to validate the optimization effect, and the results showed that these two optimization methods could significantly enhance the distinguishing ability of the algorithm for different materials, help to obtain more details in practical applications, and was therefore of great importance to the development and application of the MST technology.

A hybrid algorithm for classifying rock joints based on improved artificial bee colony and fuzzy C-means clustering algorithm

  • Ji, Duofa;Lei, Weidong;Chen, Wenqin
    • Geomechanics and Engineering
    • /
    • v.31 no.4
    • /
    • pp.353-364
    • /
    • 2022
  • This study presents a hybrid algorithm for classifying the rock joints, where the improved artificial bee colony (IABC) and the fuzzy C-means (FCM) clustering algorithms are incorporated to take advantage of the artificial bee colony (ABC) algorithm by tuning the FCM clustering algorithm to obtain the more reasonable and stable result. A coefficient is proposed to reduce the amount of blind random searches and speed up convergence, thus achieving the goals of optimizing and improving the ABC algorithm. The results from the IABC algorithm are used as initial parameters in FCM to avoid falling to the local optimum in the local search, thus obtaining stable classifying results. Two validity indices are adopted to verify the rationality and practicability of the IABC-FCM algorithm in classifying the rock joints, and the optimal amount of joint sets is obtained based on the two validity indices. Two illustrative examples, i.e., the simulated rock joints data and the field-survey rock joints data, are used in the verification to check the feasibility and practicability in rock engineering for the proposed algorithm. The results show that the IABC-FCM algorithm could be applicable in classifying the rock joint sets.

Identification of Unknown Cryptographic Communication Protocol and Packet Analysis Using Machine Learning (머신러닝을 활용한 알려지지 않은 암호통신 프로토콜 식별 및 패킷 분류)

  • Koo, Dongyoung
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.32 no.2
    • /
    • pp.193-200
    • /
    • 2022
  • Unknown cryptographic communication protocols may have advantage of guaranteeing personal and data privacy, but when used for malicious purposes, it is almost impossible to identify and respond to using existing network security equipment. In particular, there is a limit to manually analyzing a huge amount of traffic in real time. Therefore, in this paper, we attempt to identify packets of unknown cryptographic communication protocols and separate fields comprising a packet by using machine learning techniques. Using sequential patterns analysis, hierarchical clustering, and Pearson's correlation coefficient, we found that the structure of packets can be automatically analyzed even for an unknown cryptographic communication protocol.

Two-stage Sampling for Estimation of Prevalence of Bovine Tuberculosis (이단계표본추출을 이용한 소결핵병 유병률 추정)

  • Pak, Son-Il
    • Journal of Veterinary Clinics
    • /
    • v.28 no.4
    • /
    • pp.422-426
    • /
    • 2011
  • For a national survey in which wide geographic region or an entire country is targeted, multi-stage sampling approach is widely used to overcome the problem of simple random sampling, to consider both herd- and animallevel factors associated with disease occurrence, and to adjust clustering effect of disease in the population in the calculation of sample size. The aim of this study was to establish sample size for estimating bovine tuberculosis (TB) in Korea using stratified two-stage sampling design. The sample size was determined by taking into account the possible clustering of TB-infected animals on individual herds to increase the reliability of survey results. In this study, the country was stratified into nine provinces (administrative unit) and herd, the primary sampling unit, was considered as a cluster. For all analyses, design effect of 2, between-cluster prevalence of 50% to yield maximum sample size, and mean herd size of 65 were assumed due to lack of information available. Using a two-stage sampling scheme, the number of cattle sampled per herd was 65 cattle, regardless of confidence level, prevalence, and mean herd size examined. Number of clusters to be sampled at a 95% level of confidence was estimated to be 296, 74, 33, 19, 12, and 9 for desired precision of 0.01, 0.02, 0.03, 0.04, 0.05, and 0.06, respectively. Therefore, the total sample size with a 95% confidence level was 172,872, 43,218, 19,224, 10,818, 6,930, and 4,806 for desired precision ranging from 0.01 to 0.06. The sample size was increased with desired precision and design effect. In a situation where the number of cattle sampled per herd is fixed ranging from 5 to 40 with a 5-head interval, total sample size with a 95% confidence level was estimated to be 6,480, 10,080, 13,770, 17,280, 20.925, 24,570, 28,350, and 31,680, respectively. The percent increase in total sample size resulting from the use of intra-cluster correlation coefficient of 0.3 was 22.2, 32.1, 36.3, 39.6, 41.9, 42.9, 42,2, and 44.3%, respectively in comparison to the use of coefficient of 0.2.

The Clustering Threshold Image Processing Technique in fMRI (핵자기 뇌기능 영상에서 군집경계기법을 이용한 영상처리법)

  • Jeong, Sun-Cheol;No, Yong-Man;Jo, Jang-Hui
    • Journal of Biomedical Engineering Research
    • /
    • v.16 no.4
    • /
    • pp.425-430
    • /
    • 1995
  • The correlation technique has been widely used in ctRl data processing. The proposed CLT (clus- tering threshold) technique is a modified CCT (correlation coefficient threshold) technique and has many advantages compared with the conventional CCT technique. The CLT technique is explained by the following two steps. First, once the correlation coefficient map above the proper TH value is obtained using the CCT technique which is discrete and includes splash noise data, then the spurious pixels are rejected and the real neural activity pixels extracted using an nxn matrix box. Second, a clustering operation is performed by the two correction rules. The real neuronal activated pixels can be clustered and the false spurious pixels can be suppressed by the proposed CLT technique. The proposed CLT technique used in the post processing in ctRl has advantages over other existing techniques. It is especially proved to be robust in noisy environment.

  • PDF

Clustering-Based Recommendation Using Users' Preference (사용자 선호도를 사용한 군집 기반 추천 시스템)

  • Kim, Younghyun;Shin, Won-Yong
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.21 no.2
    • /
    • pp.277-284
    • /
    • 2017
  • In a flood of information, most users will want to get a proper recommendation. If a recommender system fails to give appropriate contents, then quality of experience (QoE) will be drastically decreased. In this paper, we propose a recommender system based on the intra-cluster users' item preference for improving recommendation accuracy indices such as precision, recall, and F1 score. To this end, first, users are divided into several clusters based on the actual rating data and Pearson correlation coefficient (PCC). Afterwards, we give each item an advantage/disadvantage according to the preference tendency by users within the same cluster. Specifically, an item will be received an advantage/disadvantage when the item which has been averagely rated by other users within the same cluster is above/below a predefined threshold. The proposed algorithm shows a statistically significant performance improvement over the item-based collaborative filtering algorithm with no clustering in terms of recommendation accuracy indices such as precision, recall, and F1 score.

SWAT Direct Runoff and Baseflow Evaluation using Web-based Flow Clustering EI Estimation System (웹기반의 유량 군집화 EI 평가시스템을 이용한 SWAT 직접유출과 기저유출 평가)

  • Jang, Won Seok;Moon, Jong Pil;Kim, Nam Won;Yoo, Dong Sun;Kum, Dong Hyuk;Kim, Ik Jae;Mun, Yuri;Lim, Kyoung Jae
    • Journal of Korean Society on Water Environment
    • /
    • v.27 no.1
    • /
    • pp.61-72
    • /
    • 2011
  • In order to assess hydrologic and nonpoint source pollutant behaviors in a watershed with Soil and Water Assessment Tool (SWAT) model, the accuracy evaluation of SWAT model should be conducted prior to the application of it to a watershed. When calibrating and validating hydrological components of SWAT model, the Nash-Sutcliffe efficiency coefficient (EI) has been widely used. However, the EI value has been known as it is affected sensitively by big numbers among the range of numbers. In this study, a Web-based flow clustering EI estimation system using K-means clustering algorithm was developed and used for SWAT hydrology evaluation. Even though the EI of total streamflow was high, the EI values of hydrologic components (i.e., direct runoff and baseflow) were not high. Also when the EI values of flow group I and II (i.e., low and high value group) clustered from direct runoff and baseflow were computed, respectively, the EI values of them were much lower with negative EI values for some flow group comparison. The SWAT auto-calibration tool estimated values also showed negative EI values for most flow group I and II of direct runoff and baseflow although EI value of total streamflow was high. The result obtained in this study indicates that the SWAT hydrology component should be calibrated until all four positive EI values for each flow group of direct runoff and baseflow are obtained for better accuracy both in direct runoff and baseflow.