• Title/Summary/Keyword: K-Nearest Neighbor

Search Result 641, Processing Time 0.024 seconds

Facial Expression Recognition using ICA-Factorial Representation Method (ICA-factorial 표현법을 이용한 얼굴감정인식)

  • Han, Su-Jeong;Kwak, Keun-Chang;Go, Hyoun-Joo;Kim, Sung-Suk;Chun, Myung-Geun
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.13 no.3
    • /
    • pp.371-376
    • /
    • 2003
  • In this paper, we proposes a method for recognizing the facial expressions using ICA(Independent Component Analysis)-factorial representation method. Facial expression recognition consists of two stages. First, a method of Feature extraction transforms the high dimensional face space into a low dimensional feature space using PCA(Principal Component Analysis). And then, the feature vectors are extracted by using ICA-factorial representation method. The second recognition stage is performed by using the Euclidean distance measure based KNN(K-Nearest Neighbor) algorithm. We constructed the facial expression database for six basic expressions(happiness, sadness, angry, surprise, fear, dislike) and obtained a better performance than previous works.

Analysis of Texture Features and Classifications for the Accurate Diagnosis of Prostate Cancer (전립선암의 정확한 진단을 위한 질감 특성 분석 및 등급 분류)

  • Kim, Cho-Hee;So, Jae-Hong;Park, Hyeon-Gyun;Madusanka, Nuwan;Deekshitha, Prakash;Bhattacharjee, Subrata;Choi, Heung-Kook
    • Journal of Korea Multimedia Society
    • /
    • v.22 no.8
    • /
    • pp.832-843
    • /
    • 2019
  • Prostate cancer is a high-risk with a high incidence and is a disease that occurs only in men. Accurate diagnosis of cancer is necessary as the incidence of cancer patients is increasing. Prostate cancer is also a disease that is difficult to predict progress, so it is necessary to predict in advance through prognosis. Therefore, in this paper, grade classification is attempted based on texture feature extraction. There are two main methods of classification: Uses One-way Analysis of Variance (ANOVA) to determine whether texture features are significant values, compares them with all texture features and then uses only one classification i.e. Benign versus. The second method consisted of more detailed classifications without using ANOVA for better analysis between different grades. Results of both these methods are compared and analyzed through the machine learning models such as Support Vector Machine and K-Nearest Neighbor. The accuracy of Benign versus Grade 4&5 using the second method with the best results was 90.0 percentage.

A study on neighbor selection methods in k-NN collaborative filtering recommender system (근접 이웃 선정 협력적 필터링 추천시스템에서 이웃 선정 방법에 관한 연구)

  • Lee, Seok-Jun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.20 no.5
    • /
    • pp.809-818
    • /
    • 2009
  • Collaborative filtering approach predicts the preference of active user about specific items transacted on the e-commerce by using others' preference information. To improve the prediction accuracy through collaborative filtering approach, it must be needed to gain enough preference information of users' for predicting preference. But, a bit much information of users' preference might wrongly affect on prediction accuracy, and also too small information of users' preference might make bad effect on the prediction accuracy. This research suggests the method, which decides suitable numbers of neighbor users for applying collaborative filtering algorithm, improved by existing k nearest neighbors selection methods. The result of this research provides useful methods for improving the prediction accuracy and also refines exploratory data analysis approach for deciding appropriate numbers of nearest neighbors.

  • PDF

Performance analysis of maximum likelihood detection for the spatial multiplexing system with multiple antennas (다중 안테나를 갖는 공간 다중화 시스템을 위한 maximum likelihood 검출기의 성능 분석)

  • Shin Myeongcheol;Song Young Seog;Kwon Dong-Seung;Seo Jeongtae;Lee Chungyong
    • Journal of the Institute of Electronics Engineers of Korea TC
    • /
    • v.42 no.12
    • /
    • pp.103-110
    • /
    • 2005
  • The performance of maximum likelihood(ML) detection for the given channel is analyzed in spatially multiplexed MIMO system. In order to obtain the vector symbol error rate, we define error vectors which represent the geometrical relation between lattice points. The properties of error vectors are analyzed to show that all lattice points in infinite lattice almost surely have four nearest neighbors after random channel transformation. Using this information and minimum distance obtained by the modified sphere decoding algorithm, we formulate the analytical performance of vector symbol error over the given channel. To verify the result, we simulate ML performance over various random channel which are classified into three categories: unitary channel, dense channel, and sparse channel. From the simulation results, it is verified that the derived analytical result gives a good approximation about the performance of ML detector over the all random MIMO channels.

Comparison of the Tracking Methods for Multiple Maneuvering Targets (다중 기동 표적에 대한 추적 방식의 비교)

  • Lim, Sang Seok
    • Journal of Advanced Navigation Technology
    • /
    • v.1 no.1
    • /
    • pp.35-46
    • /
    • 1997
  • Over last decade Multiple Target Tracking (MTT) has been the subject of numerous presentations and conferences [1979-1900]. Various approaches have been proposed to solve the problem. Representative works in the problem are Nearest Neighbor (NN) method based on non-probabilistic data association (DA), Multiple Hypothesis Test (MHT) and Joint Probabilistic Data Association (JPDA) as the probabilistic approaches. These techniques have their own advantages and limitations in computational requirements and in the tracking performances. In this paper, the three promising algorithms based on the NN standard filter, MHT and JPDA methods are presented and their performances against simulated multiple maneuvering targets are compared through numerical simulations.

  • PDF

ENVIRONMENT DEPENDENCE OF DISK MORPHOLOGY OF SPIRAL GALAXIES

  • Ann, Hong Bae
    • Journal of The Korean Astronomical Society
    • /
    • v.47 no.1
    • /
    • pp.1-13
    • /
    • 2014
  • We analyze the dependence of disk morphology (arm class, Hubble type, bar type) of nearby spiral galaxies on the galaxy environment by using local background density (${\Sigma}_n$), projected distance ($r_p$), and tidal index (T I) as measures of the environment. There is a strong dependence of arm class and Hubble type on the galaxy environment, while the bar type exhibits a weak dependence with a high frequency of SB galaxies in high density regions. Grand design fractions and early-type fractions increase with increasing ${\Sigma}_n$, $1/r_p$, and T I, while fractions of flocculent spirals and late-type spirals decrease. Multiple-arm and intermediate-type spirals exhibit nearly constant fractions with weak trends similar to grand design and early-type spirals. While bar types show only a marginal dependence on ${\Sigma}_n$, they show a fairly clear dependence on $r_p$ with a high frequency of SB galaxies at small $r_p$. The arm class also exhibits a stronger correlation with $r_p$ than ${\Sigma}_n$ and T I, whereas the Hubble type exhibits similar correlations with ${\Sigma}_n$ and $r_p$. This suggests that the arm class is mostly affected by the nearest neighbor while the Hubble type is affected by the local densities contributed by neighboring galaxies as well as the nearest neighbor.

Application of Curve Interpolation Algorithm in CAD/CAM to Remove the Blurring of Magnified Image

  • Lee Yong-Joong
    • Proceedings of the Korean Society of Machine Tool Engineers Conference
    • /
    • 2005.05a
    • /
    • pp.115-124
    • /
    • 2005
  • This paper analyzes the problems that occurred in the magnification process for a fine input image and investigates a method to improve the problems. This paper applies a curve interpolation algorithm in CAD/CAM for the same test images with the existing image algorithm in order to improve the problems. As a result. the nearest neighbor interpolation. which is the most frequently applied algorithm for the existing image interpolation algorithm. shows that the identification of a magnified image is not possible. Therefore. this study examines an interpolation of gray-level data by applying a low-pass spatial filter and verifies that a bilinear interpolation presents a lack of property that accentuates the boundary of the image where the image is largely changed. The periodic B-spline interpolation algorithm used for curve interpolation in CAD/CAM can remove the blurring but shows a problem of obscuration, and the Ferguson's curve interpolation algorithm shows a more sharpened image than that of the periodic B-spline algorithm. For the future study, hereafter. this study will develop an interpolation algorithm that has an excel lent improvement for the boundary of the image and continuous and flexible property by using the NURBS. Ferguson's complex surface. and Bezier surface used in CAD/CAM engineering based on. the results of this study.

  • PDF

Detection and Analysis of DNA Hybridization Characteristics by using Thermodynamic Method (열역학법을 이용한 DNA hybridization 특성 검출 및 해석)

  • Kim, Do-Gyun;Gwon, Yeong-Su
    • The Transactions of the Korean Institute of Electrical Engineers C
    • /
    • v.51 no.6
    • /
    • pp.265-270
    • /
    • 2002
  • The determination of DNA hybridization reaction can apply the molecular biology research, clinic diagnostics, bioengineering, environment monitoring, food science and application area. So, the improvement of DNA hybridization detection method is very important for the determination of this hybridization reaction. Several molecular biological techniques require accurate predictions of matched versus mismatched hybridization thermodynamics, such as PCR, sequencing by hybridization, gene diagnostics and antisense oligonucleotide probes. In addition, recent developments of oligonucleotide chip arrays as means for biochemical assays and DNA sequencing requires accurate knowledge of hybridization thermodynamics and population ratios at matched and mismatched target sites. In this study, we report the characteristics of the probe and matched, mismatched target oligonucleotide hybridization reaction using thermodynamic method. Thermodynamic of 5 oligonucleotides with central and terminal mismatch sequences were obtained by measured UV-absorbance as a function of temperature. The data show that the nearest-neighbor base-pair model is adequate for predicting thermodynamics of oligonucleotides with average deviations for $\Delta$H$^{0}$ , $\Delta$S$^{0}$ , $\Delta$G$_{37}$ $^{0}$ and T$_{m}$, respectively.>$^{0}$ and T$_{m}$, respectively.

Spatial Point-pattern Analysis of a Population of Lodgepole Pine

  • Chhin, Sophan;Huang, Shongming
    • Journal of Forest and Environmental Science
    • /
    • v.34 no.6
    • /
    • pp.419-428
    • /
    • 2018
  • Spatial point-patterns analyses were conducted to provide insight into the ecological process behind competition and mortality in two lodgepole pine (Pinus contorta Dougl. ex Loud. var. latifolia Engelm.) stands, one in the Lower Foothills, and the other in the Upper Foothills natural subregions in the boreal forest of Alberta, Canada. Spatial statistical tests were applied to live and dead trees and included Clark-Evans nearest neighbor statistic (R), nearest neighbor distribution function (G(r)), and a variant of Ripley's K function (L(r)). In both lodgepole pine plots, the results indicated that there was significant regularity in the spatial point-pattern of the surviving trees which indicates that competition has been a key driver of mortality and forest dynamics in these plots. Dead trees generally showed a clumping pattern in higher density patches. There were also significant bivariate relationships between live and dead trees, but the relationships differed by natural subregion. In the Lower Foothills plot there was significant attraction between live and dead tees which suggests mainly one-sided competition for light. In contrast, in the Upper Foothills plot, there was significant repulsion between live and dead trees which suggests two-sided competition for soil nutrients and soil moisture.

Product Evaluation Criteria Extraction through Online Review Analysis: Using LDA and k-Nearest Neighbor Approach (온라인 리뷰 분석을 통한 상품 평가 기준 추출: LDA 및 k-최근접 이웃 접근법을 활용하여)

  • Lee, Ji Hyeon;Jung, Sang Hyung;Kim, Jun Ho;Min, Eun Joo;Yeo, Un Yeong;Kim, Jong Woo
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.1
    • /
    • pp.97-117
    • /
    • 2020
  • Product evaluation criteria is an indicator describing attributes or values of products, which enable users or manufacturers measure and understand the products. When companies analyze their products or compare them with competitors, appropriate criteria must be selected for objective evaluation. The criteria should show the features of products that consumers considered when they purchased, used and evaluated the products. However, current evaluation criteria do not reflect different consumers' opinion from product to product. Previous studies tried to used online reviews from e-commerce sites that reflect consumer opinions to extract the features and topics of products and use them as evaluation criteria. However, there is still a limit that they produce irrelevant criteria to products due to extracted or improper words are not refined. To overcome this limitation, this research suggests LDA-k-NN model which extracts possible criteria words from online reviews by using LDA and refines them with k-nearest neighbor. Proposed approach starts with preparation phase, which is constructed with 6 steps. At first, it collects review data from e-commerce websites. Most e-commerce websites classify their selling items by high-level, middle-level, and low-level categories. Review data for preparation phase are gathered from each middle-level category and collapsed later, which is to present single high-level category. Next, nouns, adjectives, adverbs, and verbs are extracted from reviews by getting part of speech information using morpheme analysis module. After preprocessing, words per each topic from review are shown with LDA and only nouns in topic words are chosen as potential words for criteria. Then, words are tagged based on possibility of criteria for each middle-level category. Next, every tagged word is vectorized by pre-trained word embedding model. Finally, k-nearest neighbor case-based approach is used to classify each word with tags. After setting up preparation phase, criteria extraction phase is conducted with low-level categories. This phase starts with crawling reviews in the corresponding low-level category. Same preprocessing as preparation phase is conducted using morpheme analysis module and LDA. Possible criteria words are extracted by getting nouns from the data and vectorized by pre-trained word embedding model. Finally, evaluation criteria are extracted by refining possible criteria words using k-nearest neighbor approach and reference proportion of each word in the words set. To evaluate the performance of the proposed model, an experiment was conducted with review on '11st', one of the biggest e-commerce companies in Korea. Review data were from 'Electronics/Digital' section, one of high-level categories in 11st. For performance evaluation of suggested model, three other models were used for comparing with the suggested model; actual criteria of 11st, a model that extracts nouns by morpheme analysis module and refines them according to word frequency, and a model that extracts nouns from LDA topics and refines them by word frequency. The performance evaluation was set to predict evaluation criteria of 10 low-level categories with the suggested model and 3 models above. Criteria words extracted from each model were combined into a single words set and it was used for survey questionnaires. In the survey, respondents chose every item they consider as appropriate criteria for each category. Each model got its score when chosen words were extracted from that model. The suggested model had higher scores than other models in 8 out of 10 low-level categories. By conducting paired t-tests on scores of each model, we confirmed that the suggested model shows better performance in 26 tests out of 30. In addition, the suggested model was the best model in terms of accuracy. This research proposes evaluation criteria extracting method that combines topic extraction using LDA and refinement with k-nearest neighbor approach. This method overcomes the limits of previous dictionary-based models and frequency-based refinement models. This study can contribute to improve review analysis for deriving business insights in e-commerce market.