• Title/Summary/Keyword: nearest-neighbor analysis

Search Result 254, Processing Time 0.023 seconds

Plurality Rule-based Density and Correlation Coefficient-based Clustering for K-NN

  • Aung, Swe Swe;Nagayama, Itaru;Tamaki, Shiro
    • IEIE Transactions on Smart Processing and Computing
    • /
    • v.6 no.3
    • /
    • pp.183-192
    • /
    • 2017
  • k-nearest neighbor (K-NN) is a well-known classification algorithm, being feature space-based on nearest-neighbor training examples in machine learning. However, K-NN, as we know, is a lazy learning method. Therefore, if a K-NN-based system very much depends on a huge amount of history data to achieve an accurate prediction result for a particular task, it gradually faces a processing-time performance-degradation problem. We have noticed that many researchers usually contemplate only classification accuracy. But estimation speed also plays an essential role in real-time prediction systems. To compensate for this weakness, this paper proposes correlation coefficient-based clustering (CCC) aimed at upgrading the performance of K-NN by leveraging processing-time speed and plurality rule-based density (PRD) to improve estimation accuracy. For experiments, we used real datasets (on breast cancer, breast tissue, heart, and the iris) from the University of California, Irvine (UCI) machine learning repository. Moreover, real traffic data collected from Ojana Junction, Route 58, Okinawa, Japan, was also utilized to lay bare the efficiency of this method. By using these datasets, we proved better processing-time performance with the new approach by comparing it with classical K-NN. Besides, via experiments on real-world datasets, we compared the prediction accuracy of our approach with density peaks clustering based on K-NN and principal component analysis (DPC-KNN-PCA).

Face Recognition using Fisherface Method with Fuzzy Membership Degree (퍼지 소속도를 갖는 Fisherface 방법을 이용한 얼굴인식)

  • 곽근창;고현주;전명근
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.6
    • /
    • pp.784-791
    • /
    • 2004
  • In this study, we deal with face recognition using fuzzy-based Fisherface method. The well-known Fisherface method is more insensitive to large variation in light direction, face pose, and facial expression than Principal Component Analysis method. Usually, the various methods of face recognition including Fisherface method give equal importance in determining the face to be recognized, regardless of typicalness. The main point here is that the proposed method assigns a feature vector transformed by PCA to fuzzy membership rather than assigning the vector to particular class. In this method, fuzzy membership degrees are obtained from FKNN(Fuzzy K-Nearest Neighbor) initialization. Experimental results show better recognition performance than other methods for ORL and Yale face databases.

A study on the outlier data estimation method for anomaly detection of photovoltaic system (태양광 발전 이상감지를 위한 아웃라이어 추정 방법에 대한 연구)

  • Seo, Jong Kwan;Lee, Tae Il;Lee, Whee Sung;Park, Jeom Bae
    • Journal of IKEEE
    • /
    • v.24 no.2
    • /
    • pp.403-408
    • /
    • 2020
  • Photovoltaic (PV) has both intermittent and uncertainty in nature, so it is difficult to accurately predict. Thus anomaly detection technology is important to diagnose real time PV generation. This paper identifies a correlation between various parameters and classifies the PV data applying k-nearest neighbor and dynamic time warpping. Results for the two classifications showed that an outlier detection by a fault of some facilities, and a temporary power loss by partial shading and overall shading occurring during the short period. Based on 100kW plant data, machine learning analysis and test results verified actual outliers and candidates of outlier.

Pattern Classification Methods for Keystroke Identification (키스트로크 인식을 위한 패턴분류 방법)

  • Cho Tai-Hoon
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.10 no.5
    • /
    • pp.956-961
    • /
    • 2006
  • Keystroke time intervals can be a discriminating feature in the verification and identification of computer users. This paper presents a comparison result obtained using several classification methods including k-NN (k-Nearest Neighbor), back-propagation neural networks, and Bayesian classification for keystroke identification. Performance of k-NN classification was best with small data samples available per user, while Bayesian classification was the most superior to others with large data samples per user. Thus, for web-based on-line identification of users, it seems to be appropriate to selectively use either k-NN or Bayesian method according to the number of keystroke samples accumulated by each user.

Interpolation of Color Image Scales (칼라 이미지 스케일의 보간)

  • Kim, Sung-Hwan;Jeong, Sung-Hwan;Lee, Joon-Whoan
    • Science of Emotion and Sensibility
    • /
    • v.10 no.3
    • /
    • pp.289-297
    • /
    • 2007
  • Color image scale captures the knowledge of colorists and represents both adjectives and colors in the same adjective image scales in order to select color(s) corresponding to an adjective. Due to the difficulty of psychological experiment and statistical analysis, in general, only a limited number of colors are located in the color image scales. This can make color selection process hard especially to non-expert. In this paper, we propose an interpolation of color image scale based on the fuzzy K-nearest neighbor method, which provides continuous colors according to the coordinates of the image scales. The experimental results show that the interpolated image scales can be practically useful for color selection process.

  • PDF

Analysis of Morton Code Conversion for 32 Bit IEEE 754 Floating Point Variables (IEEE 754 부동 소수점 32비트 float 변수의 Morton Code 변환 분석)

  • Park, Taejung
    • Journal of Digital Contents Society
    • /
    • v.17 no.3
    • /
    • pp.165-172
    • /
    • 2016
  • Morton codes play important roles in many parallel GPU applications for the nearest neighbor (NN) search in huge data and queries with its applications growing. This paper discusses and analyzes the meaning of Tero Karras's 32-bit 'unsigned int' Morton code algorithm for three-dimensional spatial information in $[0,1]^3$ and its geometric implications. Based on this, this paper proposes 64-bit 'unsigned long long' version of Morton code and compares the results in both CPU vs. GPU and 32-bit vs. 64-bit versions. The proposed GPU algorithm runs around 1000 times faster than the CPU version.

Detection and Analysis of DNA Hybridization Characteristics by using Thermodynamic Method (열역학법을 이용한 DNA hybridization 특성 검출 및 해석)

  • Kim, Do-Gyun;Gwon, Yeong-Su
    • The Transactions of the Korean Institute of Electrical Engineers C
    • /
    • v.51 no.6
    • /
    • pp.265-270
    • /
    • 2002
  • The determination of DNA hybridization reaction can apply the molecular biology research, clinic diagnostics, bioengineering, environment monitoring, food science and application area. So, the improvement of DNA hybridization detection method is very important for the determination of this hybridization reaction. Several molecular biological techniques require accurate predictions of matched versus mismatched hybridization thermodynamics, such as PCR, sequencing by hybridization, gene diagnostics and antisense oligonucleotide probes. In addition, recent developments of oligonucleotide chip arrays as means for biochemical assays and DNA sequencing requires accurate knowledge of hybridization thermodynamics and population ratios at matched and mismatched target sites. In this study, we report the characteristics of the probe and matched, mismatched target oligonucleotide hybridization reaction using thermodynamic method. Thermodynamic of 5 oligonucleotides with central and terminal mismatch sequences were obtained by measured UV-absorbance as a function of temperature. The data show that the nearest-neighbor base-pair model is adequate for predicting thermodynamics of oligonucleotides with average deviations for $\Delta$H$^{0}$ , $\Delta$S$^{0}$ , $\Delta$G$_{37}$ $^{0}$ and T$_{m}$, respectively.>$^{0}$ and T$_{m}$, respectively.

Improvement of location positioning using KNN, Local Map Classification and Bayes Filter for indoor location recognition system

  • Oh, Seung-Hoon;Maeng, Ju-Hyun
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.6
    • /
    • pp.29-35
    • /
    • 2021
  • In this paper, we propose a method that combines KNN(K-Nearest Neighbor), Local Map Classification and Bayes Filter as a way to increase the accuracy of location positioning. First, in this technique, Local Map Classification divides the actual map into several clusters, and then classifies the clusters by KNN. And posterior probability is calculated through the probability of each cluster acquired by Bayes Filter. With this posterior probability, the cluster where the robot is located is searched. For performance evaluation, the results of location positioning obtained by applying KNN, Local Map Classification, and Bayes Filter were analyzed. As a result of the analysis, it was confirmed that even if the RSSI signal changes, the location information is fixed to one cluster, and the accuracy of location positioning increases.

Efficient Nearest Neighbor Search on Moving Object Trajectories (이동객체궤적에 대한 효율적인 최근접 이웃 검색)

  • KIm, Gyu-Jae;Park, Young-Hee;Cho, Woo-Hyun
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2014.10a
    • /
    • pp.418-421
    • /
    • 2014
  • Because of the rapid growth of mobile communication and wireless communication, Location-based services are handled in many applications. So, the management and analysis of spatio-temporal data are a hot issue in database research. Index structure and query processing of such contents are very important for these applications. This paper addressees algorithms that make index structure by using Douglas-Peucker Algorithm and process nearest neighbor search query efficiently on moving objects trajectories. We compare and analyze our algorithms by experiments. Our algorithms make small size of index structure and process the query more efficiently.

  • PDF

Spatial Point-pattern Analysis of a Population of Lodgepole Pine

  • Chhin, Sophan;Huang, Shongming
    • Journal of Forest and Environmental Science
    • /
    • v.34 no.6
    • /
    • pp.419-428
    • /
    • 2018
  • Spatial point-patterns analyses were conducted to provide insight into the ecological process behind competition and mortality in two lodgepole pine (Pinus contorta Dougl. ex Loud. var. latifolia Engelm.) stands, one in the Lower Foothills, and the other in the Upper Foothills natural subregions in the boreal forest of Alberta, Canada. Spatial statistical tests were applied to live and dead trees and included Clark-Evans nearest neighbor statistic (R), nearest neighbor distribution function (G(r)), and a variant of Ripley's K function (L(r)). In both lodgepole pine plots, the results indicated that there was significant regularity in the spatial point-pattern of the surviving trees which indicates that competition has been a key driver of mortality and forest dynamics in these plots. Dead trees generally showed a clumping pattern in higher density patches. There were also significant bivariate relationships between live and dead trees, but the relationships differed by natural subregion. In the Lower Foothills plot there was significant attraction between live and dead tees which suggests mainly one-sided competition for light. In contrast, in the Upper Foothills plot, there was significant repulsion between live and dead trees which suggests two-sided competition for soil nutrients and soil moisture.