• Title/Summary/Keyword: scale invariant feature

Search Result 235, Processing Time 0.028 seconds

Rotation-Scale-Translation-Intensity Invariant Algorithm for Fingerprint Identigfication (RSTI 불변 지문인식 알고리즘)

  • Kim, Hyun;Kim, Hak-Il
    • Journal of the Korean Institute of Telematics and Electronics S
    • /
    • v.35S no.6
    • /
    • pp.88-100
    • /
    • 1998
  • In this paper, an algorithm for a real-time automatic fingerprint identification system is proposed. The fingerprint feature volume is extracted by considering distinct and local characteristics(such as intensity and image quality difference etc.) in fingerprint images, which makes the algorithm properly adaptive to various image acquisitionj methods. Also the matching technique is designed to be invariant on rotation, scaling and translation (RST) changes while being capable of real-time processing. And the classification of fingerprints is performed based on the ridge flow and the relations among singular points such as cores and deltas. The developed fingerprint identification algorithm has been applied to various sets of fingerprint images such as one from NIST(National Institute of Standards and Technology, USA), a pressed fingerprint database constructed according to Korean population distributions in sex, ages and jobs, and a set of rolled-than-scanned fingerprint images. The overall performance of the algorithm has been analyzed and evaluated to the false rejection ratio of 0.07% while holding the false acceptance ratio of 0%.

  • PDF

Real-time Sign Object Detection in Subway station using Rotation-invariant Zernike Moment (회전 불변 제르니케 모멘트를 이용한 실시간 지하철 기호 객체 검출)

  • Weon, Sun-Hee;Kim, Gye-Young;Choi, Hyung-Il
    • Journal of Digital Contents Society
    • /
    • v.12 no.3
    • /
    • pp.279-289
    • /
    • 2011
  • The latest hardware and software techniques are combined to give safe walking guidance and convenient service of realtime walking assistance system for visually impaired person. This system consists of obstacle detection and perception, place recognition, and sign recognition for pedestrian can safely walking to arrive at their destination. In this paper, we exploit the sign object detection system in subway station for sign recognition that one of the important factors of walking assistance system. This paper suggest the adaptive feature map that can be robustly extract the sign object region from complexed environment with light and noise. And recognize a sign using fast zernike moment features which is invariant under translation, rotation and scale of object during walking. We considered three types of signs as arrow, restroom, and exit number and perform the training and recognizing steps through adaboost classifier. The experimental results prove that our method can be suitable and stable for real-time system through yields on the average 87.16% stable detection rate and 20 frame/sec of operation time for three types of signs in 5000 images of sign database.

Nearest-Neighbors Based Weighted Method for the BOVW Applied to Image Classification

  • Xu, Mengxi;Sun, Quansen;Lu, Yingshu;Shen, Chenming
    • Journal of Electrical Engineering and Technology
    • /
    • v.10 no.4
    • /
    • pp.1877-1885
    • /
    • 2015
  • This paper presents a new Nearest-Neighbors based weighted representation for images and weighted K-Nearest-Neighbors (WKNN) classifier to improve the precision of image classification using the Bag of Visual Words (BOVW) based models. Scale-invariant feature transform (SIFT) features are firstly extracted from images. Then, the K-means++ algorithm is adopted in place of the conventional K-means algorithm to generate a more effective visual dictionary. Furthermore, the histogram of visual words becomes more expressive by utilizing the proposed weighted vector quantization (WVQ). Finally, WKNN classifier is applied to enhance the properties of the classification task between images in which similar levels of background noise are present. Average precision and absolute change degree are calculated to assess the classification performance and the stability of K-means++ algorithm, respectively. Experimental results on three diverse datasets: Caltech-101, Caltech-256 and PASCAL VOC 2011 show that the proposed WVQ method and WKNN method further improve the performance of classification.

Recognition and Pose Estimation of 3-D Objects for Visual Servoing (Visual Servoing을 위한 3차원 물체의 인식 및 자세 추정)

  • Yang, Jae-Ho;Jeong, Moon-Ho;Park, Mig-Non
    • Proceedings of the KIEE Conference
    • /
    • 2006.07d
    • /
    • pp.1931-1932
    • /
    • 2006
  • 로봇이 어떤 물체를 인지하고 그 물체에 대해 어떤 작업을 하고자 할 때 특정 물체의 인식 문제, 3차원 정보를 획득하는 문제, 자세를 추정하는 문제 등 해결해야 될 문제들이 있다. 물체를 인식하는 과정에서는 주위 배경과 물체의 크기의 변화, 회전, 가려짐 등으로 인해 물체 인식을 어렵게 만드는 요소들이 있다. 2차원 이미지를 통해 3차원 정보를 추출하는 과정은 일반적으로 두 대의 카메라를 이용하여 스테레오 이미지를 통해 얻는다. 이 때 좌우 영상간의 매칭의 과정이 필요하다. 자세 추정의 문제는 카메라 좌표와 물체의 좌표간의 관계를 알아야 한다. Visual Servoing을 어렵게 만드는 많은 요인들이 있으며 본 논문에서는 물체의 크기, 회전, 이동에 불변인 디스크립터(descriptor)를 사용하는 SIFT(Scale Invariant Feature Transform)를 통해 3차원 물체의 인식과 자세를 추정하는 방법을 제시한다. 또한 자세 추정을 위해 2차원 Keypoint들의 매칭을 3차원 정보를 통해 검증하는 방법을 제시한다. (SIFT에 의해 추출된 point를 Keypoint라 명한다.)

  • PDF

3D Workspace Modeling Based on Context Understanding for Robotic Manipulation (컨텍스트 이해를 통한 로봇의 작업을 위해 필요한 3D 작업공간 모델링)

  • Kim, Eun-Young;Lee, Suk-Han;Jang, Dae-Sik;Han, Jung-Hyun
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2005.05a
    • /
    • pp.1619-1622
    • /
    • 2005
  • 본 논문에서는 로봇이 작업을 계획하기 위해 필요한 3차원 작업 공간을 세 가지의 컨텍스트(context)들을 이해함으로써 빠르게 모델링하는 새로운 기법을 소개 하고 있다. 로봇이 사람과 비슷한 속도와 정확도로 작업 공간을 이해하고 모델링하는 것에 초점을 두고 있으며 이를 위해 작업 공간상의 특징적인 세 가지의 컨텍스트(작업공간의 간략화를 위한 전체 공간상의 평면특징, 데이터베이스에 미리 정의된 물체 그리고 로봇의 주어진 작업에 따라 다양한 상세함을 갖는 그 외의 장애물)를 정의하였고, 그것들을 빠르게 이해함으로써 어떻게 3차원 작업 공간을 형성하는지 설명하고 있다. 본 논문에서 3 차원 정보를 갖는 scale invariant feature transformation(SIFT)를 stereo-sis SIFT 로 간주했으며 이를 이용하여 위에서 언급한 컨텍스트들을 이해하였고 다양한 카메라의 위치로부터 얻어지는 여러 개의 장면들을 정합하였다. 또한, 실험을 통해 제안한 방법의 타당성도 검증하였다.

  • PDF

Feature Based Techniques for a Driver's Distraction Detection using Supervised Learning Algorithms based on Fixed Monocular Video Camera

  • Ali, Syed Farooq;Hassan, Malik Tahir
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.12 no.8
    • /
    • pp.3820-3841
    • /
    • 2018
  • Most of the accidents occur due to drowsiness while driving, avoiding road signs and due to driver's distraction. Driver's distraction depends on various factors which include talking with passengers while driving, mood disorder, nervousness, anger, over-excitement, anxiety, loud music, illness, fatigue and different driver's head rotations due to change in yaw, pitch and roll angle. The contribution of this paper is two-fold. Firstly, a data set is generated for conducting different experiments on driver's distraction. Secondly, novel approaches are presented that use features based on facial points; especially the features computed using motion vectors and interpolation to detect a special type of driver's distraction, i.e., driver's head rotation due to change in yaw angle. These facial points are detected by Active Shape Model (ASM) and Boosted Regression with Markov Networks (BoRMaN). Various types of classifiers are trained and tested on different frames to decide about a driver's distraction. These approaches are also scale invariant. The results show that the approach that uses the novel ideas of motion vectors and interpolation outperforms other approaches in detection of driver's head rotation. We are able to achieve a percentage accuracy of 98.45 using Neural Network.

A novel hardware design for SIFT generation with reduced memory requirement

  • Kim, Eung Sup;Lee, Hyuk-Jae
    • JSTS:Journal of Semiconductor Technology and Science
    • /
    • v.13 no.2
    • /
    • pp.157-169
    • /
    • 2013
  • Scale Invariant Feature Transform (SIFT) generates image features widely used to match objects in different images. Previous work on hardware-based SIFT implementation requires excessive internal memory and hardware logic [1]. In this paper, a new hardware organization is proposed to implement SIFT with less memory and hardware cost than the previous work. To this end, a parallel Gaussian filter bank is adopted to eliminate the buffers that store intermediate results because parallel operations allow all intermediate results available at the same time. Furthermore, the processing order is changed from the raster-scan order to the block-by-block order so that the line buffer size storing the source image is also reduced. These techniques trade the reduction of memory size with a slight increase of the execution time and external memory bandwidth. As a result, the memory size is reduced by 94.4%. The proposed hardware for SIFT implementation includes the Descriptor generation block, which is omitted in the previous work [1]. The addition of the hardwired descriptor generation improves the computation speed by about 30 times when compared with the previous work.

An Image Retrieving Scheme Using Salient Features and Annotation Watermarking

  • Wang, Jenq-Haur;Liu, Chuan-Ming;Syu, Jhih-Siang;Chen, Yen-Lin
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.8 no.1
    • /
    • pp.213-231
    • /
    • 2014
  • Existing image search systems allow users to search images by keywords, or by example images through content-based image retrieval (CBIR). On the other hand, users might learn more relevant textual information about an image from its text captions or surrounding contexts within documents or Web pages. Without such contexts, it's difficult to extract semantic description directly from the image content. In this paper, we propose an annotation watermarking system for users to embed text descriptions, and retrieve more relevant textual information from similar images. First, tags associated with an image are converted by two-dimensional code and embedded into the image by discrete wavelet transform (DWT). Next, for images without annotations, similar images can be obtained by CBIR techniques and embedded annotations can be extracted. Specifically, we use global features such as color ratios and dominant sub-image colors for preliminary filtering. Then, local features such as Scale-Invariant Feature Transform (SIFT) descriptors are extracted for similarity matching. This design can achieve good effectiveness with reasonable processing time in practical systems. Our experimental results showed good accuracy in retrieving similar images and extracting relevant tags from similar images.

The Comparison of the SIFT Image Descriptor by Contrast Enhancement Algorithms with Various Types of High-resolution Satellite Imagery

  • Choi, Jaw-Wan;Kim, Dae-Sung;Kim, Yong-Min;Han, Dong-Yeob;Kim, Yong-Il
    • Korean Journal of Remote Sensing
    • /
    • v.26 no.3
    • /
    • pp.325-333
    • /
    • 2010
  • Image registration involves overlapping images of an identical region and assigning the data into one coordinate system. Image registration has proved important in remote sensing, enabling registered satellite imagery to be used in various applications such as image fusion, change detection and the generation of digital maps. The image descriptor, which extracts matching points from each image, is necessary for automatic registration of remotely sensed data. Using contrast enhancement algorithms such as histogram equalization and image stretching, the normalized data are applied to the image descriptor. Drawing on the different spectral characteristics of high resolution satellite imagery based on sensor type and acquisition date, the applied normalization method can be used to change the results of matching interest point descriptors. In this paper, the matching points by scale invariant feature transformation (SIFT) are extracted using various contrast enhancement algorithms and injection of Gaussian noise. The results of the extracted matching points are compared with the number of correct matching points and matching rates for each point.

The Construction Method of Precise DTM of UAV Images Using Sobel-median Filtering (소벨-메디언 필터링을 이용한 UAV 영상의 정밀 DTM 구축 방법에 관한 연구)

  • Na, Young-Woo
    • Journal of Urban Science
    • /
    • v.12 no.2
    • /
    • pp.43-52
    • /
    • 2023
  • UAV have the disadvantage that are weak from rainfall or winds due to the light platform, so use Scale-Invariant Feature Transform (SIFT) method which extrude keypoints in image matching process. To find the efficient filtering method for the construction of precise Digital Terrain Model (DTM) using UAV images, comparatively analyzed sobel and Differential of Gaussian (DoG) and found sobel is more efficient way to extrude buildings, trees, and so on. And edges are extruded more clearly when applying median additionally which have the merit of preserving edge and eliminating noise. In this study, applied sobel-median filtering which plus median to sobel and constructed the 1st filtered DTM that extrude building and trees and 2nd filtered DTM that extrude cars by threshold of gradient, Analysis of the degree of accuracy improvement showed that standard deviations of 1st filtered DTM and 2nd filtered DTM are 0.32m, 0.287m respectively, and both are acceptable for the tolerance of 0.33m for elevation points of 1/1,000 digital map, and the accuracy was increased about 10% by filtering automobiles. Plus, moving things are changed those position and direction in every image, and these are not target to filter because of the characteristic that is excluded from SIFT method.