• Title/Summary/Keyword: Improved k-Nearest Neighbor

Search Result 47, Processing Time 0.029 seconds

Enhancement of Text Classification Method (텍스트 분류 기법의 발전)

  • Shin, Kwang-Seong;Shin, Seong-Yoon
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2019.05a
    • /
    • pp.155-156
    • /
    • 2019
  • Traditional machine learning based emotion analysis methods such as Classification and Regression Tree (CART), Support Vector Machine (SVM), and k-nearest neighbor classification (kNN) are less accurate. In this paper, we propose an improved kNN classification method. Improved methods and data normalization achieve the goal of improving accuracy. Then, three classification algorithms and an improved algorithm were compared based on experimental data.

  • PDF

Model-Based Object Recognition using PCA & Improved k-Nearest Neighbor (PCA와 개선된 k-Nearest Neighbor를 이용한 모델 기반형 물체 인식)

  • Jung Byeong-Soo;Kim Byung-Gi
    • The KIPS Transactions:PartB
    • /
    • v.13B no.1 s.104
    • /
    • pp.53-62
    • /
    • 2006
  • Object recognition techniques using principal component analysis are disposed to be decreased recognition rate when lighting change of image happens. The purpose of this thesis is to propose an object recognition technique using new PCA analysis method that discriminates an object in database even in the case that the variation of illumination in training images exists. And the object recognition algorithm proposed here represents more enhanced recognition rate using improved k-Nearest Neighbor. In this thesis, we proposed an object recognition algorithm which creates object space by pre-processing and being learned image using histogram equalization and median filter. By spreading histogram of test image using histogram equalization, the effect to change of illumination is reduced. This method is stronger to change of illumination than basic PCA method and normalization, and almost removes effect of illumination, therefore almost maintains constant good recognition rate. And, it compares ingredient projected test image into object space with distance of representative value and recognizes after representative value of each object in model image is made. Each model images is used in recognition unit about some continual input image using improved k-Nearest Neighbor in this thesis because existing method have many errors about distance calculation.

Probabilistic K-nearest neighbor classifier for detection of malware in android mobile (안드로이드 모바일 악성 앱 탐지를 위한 확률적 K-인접 이웃 분류기)

  • Kang, Seungjun;Yoon, Ji Won
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.25 no.4
    • /
    • pp.817-827
    • /
    • 2015
  • In this modern society, people are having a close relationship with smartphone. This makes easier for hackers to gain the user's information by installing the malware in the user's smartphone without the user's authority. This kind of action are threats to the user's privacy. The malware characteristics are different to the general applications. It requires the user's authority. In this paper, we proposed a new classification method of user requirements method by each application using the Principle Component Analysis(PCA) and Probabilistic K-Nearest Neighbor(PKNN) methods. The combination of those method outputs the improved result to classify between malware and general applications. By using the K-fold Cross Validation, the measurement precision of PKNN is improved compare to the previous K-Nearest Neighbor(KNN). The classification which difficult to solve by KNN also can be solve by PKNN with optimizing the discovering the parameter k and ${\beta}$. Also the sample that has being use in this experiment is based on the Contagio.

Object Recognition using Improved k-Nearest Neighbor (개선된 k-Nearest Neighbor를 이용한 물체 인식)

  • Jung Byeongsoo;Wi Seungjung;Kim Jonghyeuk;Kim Byungki
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2005.07b
    • /
    • pp.799-801
    • /
    • 2005
  • 모델 영상내의 각각의 물체의 대표 값을 만든 후에 실험 영상을 물체 공간에 투영 시켜서 나온 성분과 대표 값의 거리를 비교하여 인식하게 된다. 그러나 단순히 기존의 방법인 Point to Point 방식인 단순 거리 계산은 오차가 많기 때문에 된 논문에서는 개선된 Class to Class방식인 k-Nearest Neighbor를 이용하여 몇 개의 연속적인 입력영상에 대해 각각의 모델영상들을 인식의 단위로 이용하였다.

  • PDF

An Improved Text Classification Method for Sentiment Classification

  • Wang, Guangxing;Shin, Seong Yoon
    • Journal of information and communication convergence engineering
    • /
    • v.17 no.1
    • /
    • pp.41-48
    • /
    • 2019
  • In recent years, sentiment analysis research has become popular. The research results of sentiment analysis have achieved remarkable results in practical applications, such as in Amazon's book recommendation system and the North American movie box office evaluation system. Analyzing big data based on user preferences and evaluations and recommending hot-selling books and hot-rated movies to users in a targeted manner greatly improve book sales and attendance rate in movies [1, 2]. However, traditional machine learning-based sentiment analysis methods such as the Classification and Regression Tree (CART), Support Vector Machine (SVM), and k-nearest neighbor classification (kNN) had performed poorly in accuracy. In this paper, an improved kNN classification method is proposed. Through the improved method and normalizing of data, the purpose of improving accuracy is achieved. Subsequently, the three classification algorithms and the improved algorithm were compared based on experimental data. Experiments show that the improved method performs best in the kNN classification method, with an accuracy rate of 11.5% and a precision rate of 20.3%.

Performance of Indoor Positioning using Visible Light Communication System (가시광 통신을 이용한 실내 사용자 단말 탐지 시스템)

  • Park, Young-Sik;Hwang, Yu-Min;Song, Yu-Chan;Kim, Jin-Young
    • Journal of Digital Contents Society
    • /
    • v.15 no.1
    • /
    • pp.129-136
    • /
    • 2014
  • Wi-Fi fingerprinting system is a very popular positioning method used in indoor spaces. The system depends on Wi-Fi Received Signal Strength (RSS) from Access Points (APs). However, the Wi-Fi RSS is changeable by multipath fading effect and interference due to walls, obstacles and people. Therefore, the Wi-Fi fingerprinting system produces low position accuracy. Also, Wi-Fi signals pass through walls. For this reason, the existing system cannot distinguish users' floor. To solve these problems, this paper proposes a LED fingerprinting system for accurate indoor positioning. The proposed system uses a received optical power from LEDs and LED-Identification (LED-ID) instead of the Wi-Fi RSS. In training phase, we record LED fingerprints in database at each place. In serving phase, we adopt a K-Nearest Neighbor (K-NN) algorithm for comparing existing data and new received data of users. We show that our technique performs in terms of CDF by computer simulation results. From simulation results, the proposed system shows that a positioning accuracy is improved by 8.6 % on average.

An Efficient Multidimensional Index Structure for Parallel Environments

  • Bok Koung-Soo;Song Seok-Il;Yoo Jae-Soo
    • International Journal of Contents
    • /
    • v.1 no.1
    • /
    • pp.50-58
    • /
    • 2005
  • Generally, multidimensional data such as image and spatial data require large amount of storage space. There is a limit to store and manage those large amounts of data in single workstation. If we manage the data on parallel computing environment which is being actively researched these days, we can get highly improved performance. In this paper, we propose a parallel multidimensional index structure that exploits the parallelism of the parallel computing environment. The proposed index structure is nP(processor)-nxmD(disk) architecture which is the hybrid type of nP-nD and 1P-nD. Its node structure in-creases fan-out and reduces the height of an index. Also, a range search algorithm that maximizes I/O parallelism is devised, and it is applied to k-nearest neighbor queries. Through various experiments, it is shown that the proposed method outperforms other parallel index structures.

  • PDF

Medical Image Classification and Retrieval Using BoF Feature Histogram with Random Forest Classifier (Random Forest 분류기와 Bag-of-Feature 특징 히스토그램을 이용한 의료영상 자동 분류 및 검색)

  • Son, Jung Eun;Ko, Byoung Chul;Nam, Jae Yeal
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.2 no.4
    • /
    • pp.273-280
    • /
    • 2013
  • This paper presents novel OCS-LBP (Oriented Center Symmetric Local Binary Patterns) based on orientation of pixel gradient and image retrieval system based on BoF (Bag-of-Feature) and random forest classifier. Feature vectors extracted from training data are clustered into code book and each feature is transformed new BoF feature using code book. BoF features are applied to random forest for training and random forest having N classes is constructed by combining several decision trees. For testing, the same OCS-LBP feature is extracted from a query image and BoF is applied to trained random forest classifier. In contrast to conventional retrieval system, query image selects similar K-nearest neighbor (K-NN) classes after random forest is performed. Then, Top K similar images are retrieved from database images that are only labeled K-NN classes. Compared with other retrieval algorithms, the proposed method shows both fast processing time and improved retrieval performance.

Predicting sorptivity and freeze-thaw resistance of self-compacting mortar by using deep learning and k-nearest neighbor

  • Turk, Kazim;Kina, Ceren;Tanyildizi, Harun
    • Computers and Concrete
    • /
    • v.30 no.2
    • /
    • pp.99-111
    • /
    • 2022
  • In this study, deep learning and k-Nearest Neighbor (kNN) models were used to estimate the sorptivity and freeze-thaw resistance of self-compacting mortars (SCMs) having binary and ternary blends of mineral admixtures. Twenty-five environment-friendly SCMs were designed as binary and ternary blends of fly ash (FA) and silica fume (SF) except for control mixture with only Portland cement (PC). The capillary water absorption and freeze-thaw resistance tests were conducted for 91 days. It was found that the use of SF with FA as ternary blends reduced sorptivity coefficient values compared to the use of FA as binary blends while the presence of FA with SF improved freeze-thaw resistance of SCMs with ternary blends. The input variables used the models for the estimation of sorptivity were defined as PC content, SF content, FA content, sand content, HRWRA, water/cementitious materials (W/C) and freeze-thaw cycles. The input variables used the models for the estimation of sorptivity were selected as PC content, SF content, FA content, sand content, HRWRA, W/C and predefined intervals of the sample in water. The deep learning and k-NN models estimated the durability factor of SCM with 94.43% and 92.55% accuracy and the sorptivity of SCM was estimated with 97.87% and 86.14% accuracy, respectively. This study found that deep learning model estimated the sorptivity and durability factor of SCMs having binary and ternary blends of mineral admixtures higher accuracy than k-NN model.

Vehicle trajectory prediction based on Hidden Markov Model

  • Ye, Ning;Zhang, Yingya;Wang, Ruchuan;Malekian, Reza
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.10 no.7
    • /
    • pp.3150-3170
    • /
    • 2016
  • In Intelligent Transportation Systems (ITS), logistics distribution and mobile e-commerce, the real-time, accurate and reliable vehicle trajectory prediction has significant application value. Vehicle trajectory prediction can not only provide accurate location-based services, but also can monitor and predict traffic situation in advance, and then further recommend the optimal route for users. In this paper, firstly, we mine the double layers of hidden states of vehicle historical trajectories, and then determine the parameters of HMM (hidden Markov model) by historical data. Secondly, we adopt Viterbi algorithm to seek the double layers hidden states sequences corresponding to the just driven trajectory. Finally, we propose a new algorithm (DHMTP) for vehicle trajectory prediction based on the hidden Markov model of double layers hidden states, and predict the nearest neighbor unit of location information of the next k stages. The experimental results demonstrate that the prediction accuracy of the proposed algorithm is increased by 18.3% compared with TPMO algorithm and increased by 23.1% compared with Naive algorithm in aspect of predicting the next k phases' trajectories, especially when traffic flow is greater, such as this time from weekday morning to evening. Moreover, the time performance of DHMTP algorithm is also clearly improved compared with TPMO algorithm.