• Title/Summary/Keyword: Visual location recognition

Search Result 47, Processing Time 0.026 seconds

Visual Location Recognition Using Time-Series Streetview Database (시계열 스트리트뷰 데이터베이스를 이용한 시각적 위치 인식 알고리즘)

  • Park, Chun-Su;Choeh, Joon-Yeon
    • Journal of the Semiconductor & Display Technology
    • /
    • v.18 no.4
    • /
    • pp.57-61
    • /
    • 2019
  • Nowadays, portable digital cameras such as smart phone cameras are being popularly used for entertainment and visual information recording. Given a database of geo-tagged images, a visual location recognition system can determine the place depicted in a query photo. One of the most common visual location recognition approaches is the bag-of-words method where local image features are clustered into visual words. In this paper, we propose a new bag-of-words-based visual location recognition algorithm using time-series streetview database. The proposed algorithm selects only a small subset of image features which will be used in image retrieval process. By reducing the number of features to be used, the proposed algorithm can reduce the memory requirement of the image database and accelerate the retrieval process.

Effect of Task-irrelevant Feature Information on Visual Short-term Recognition of Task-relevant Feature (기억자극의 과제 무관련 세부특징 정보가 과제 관련 세부특징에 대한 시각단기재인에 미치는 영향)

  • Hyun, Joo-Seok
    • Korean Journal of Cognitive Science
    • /
    • v.23 no.2
    • /
    • pp.225-248
    • /
    • 2012
  • The summed-similarity model of visual short-term recognition proposes that the estimated amount of summed similarity between remembered items and a recognition probe determines recognition judgement decision (Kahan & Sekuler, 2002). This study examined the effect of a task-irrelevant location change on the recognition decision against two remembered Gabor gratings differing in their spatial frequencies. On each trial in Experiment, participants reported if two gratings displayed across the visual fields are the same or not as the probe grating displayed after about a second of memory delay. The probe grating would be the same as or different from the memory items (lure) by 1 or 4 JND units. The location of the probe would also vary randomly across the left and right visual field with respect to the location of the corresponding memory item. The participants were instructed to perform their recognition task exclusively to the spatial frequencies of the memory items and the probe while ignoring the potential location change of the probe. The results showed that false-recognition rates of the lure probe increased as the summed similarity between the memory items and the probe increased. The rates also further increased in the condition where the probe location was different from the location of the corresponding memory item compared to the condition where the probe location was the same. The increased false-recognition rates indicate that information stored into visual short-term memory is represented as a form of well-bound visual features rather than independent features.

  • PDF

A Salient Based Bag of Visual Word Model (SBBoVW): Improvements toward Difficult Object Recognition and Object Location in Image Retrieval

  • Mansourian, Leila;Abdullah, Muhamad Taufik;Abdullah, Lilli Nurliyana;Azman, Azreen;Mustaffa, Mas Rina
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.10 no.2
    • /
    • pp.769-786
    • /
    • 2016
  • Object recognition and object location have always drawn much interest. Also, recently various computational models have been designed. One of the big issues in this domain is the lack of an appropriate model for extracting important part of the picture and estimating the object place in the same environments that caused low accuracy. To solve this problem, a new Salient Based Bag of Visual Word (SBBoVW) model for object recognition and object location estimation is presented. Contributions lied in the present study are two-fold. One is to introduce a new approach, which is a Salient Based Bag of Visual Word model (SBBoVW) to recognize difficult objects that have had low accuracy in previous methods. This method integrates SIFT features of the original and salient parts of pictures and fuses them together to generate better codebooks using bag of visual word method. The second contribution is to introduce a new algorithm for finding object place based on the salient map automatically. The performance evaluation on several data sets proves that the new approach outperforms other state-of-the-arts.

An Approach for Localization Around Indoor Corridors Based on Visual Attention Model (시각주의 모델을 적용한 실내 복도에서의 위치인식 기법)

  • Yoon, Kook-Yeol;Choi, Sun-Wook;Lee, Chong-Ho
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.17 no.2
    • /
    • pp.93-101
    • /
    • 2011
  • For mobile robot, recognizing its current location is very important to navigate autonomously. Especially, loop closing detection that robot recognize location where it has visited before is a kernel problem to solve localization. A considerable amount of research has been conducted on loop closing detection and localization based on appearance because vision sensor has an advantage in terms of costs and various approaching methods to solve this problem. In case of scenes that consist of repeated structures like in corridors, perceptual aliasing in which, the two different locations are recognized as the same, occurs frequently. In this paper, we propose an improved method to recognize location in the scenes which have similar structures. We extracted salient regions from images using visual attention model and calculated weights using distinctive features in the salient region. It makes possible to emphasize unique features in the scene to classify similar-looking locations. In the results of corridor recognition experiments, proposed method showed improved recognition performance. It shows 78.2% in the accuracy of single floor corridor recognition and 71.5% for multi floor corridors recognition.

Visual Positioning System based on Voxel Labeling using Object Simultaneous Localization And Mapping

  • Jung, Tae-Won;Kim, In-Seon;Jung, Kye-Dong
    • International Journal of Advanced Culture Technology
    • /
    • v.9 no.4
    • /
    • pp.302-306
    • /
    • 2021
  • Indoor localization is one of the basic elements of Location-Based Service, such as indoor navigation, location-based precision marketing, spatial recognition of robotics, augmented reality, and mixed reality. We propose a Voxel Labeling-based visual positioning system using object simultaneous localization and mapping (SLAM). Our method is a method of determining a location through single image 3D cuboid object detection and object SLAM for indoor navigation, then mapping to create an indoor map, addressing it with voxels, and matching with a defined space. First, high-quality cuboids are created from sampling 2D bounding boxes and vanishing points for single image object detection. And after jointly optimizing the poses of cameras, objects, and points, it is a Visual Positioning System (VPS) through matching with the pose information of the object in the voxel database. Our method provided the spatial information needed to the user with improved location accuracy and direction estimation.

Credit Card Number Recognition for People with Visual Impairment (시력 취약 계층을 위한 신용 카드 번호 인식 연구)

  • Park, Dahoon;Kwon, Kon-Woo
    • Journal of IKEEE
    • /
    • v.25 no.1
    • /
    • pp.25-31
    • /
    • 2021
  • The conventional credit card number recognition system generally needs a card to be placed in a designated location before its processing, which is not an ideal user experience especially for people with visual impairment. To improve the user experience, this paper proposes a novel algorithm that can automatically detect the location of a credit card number based on the fact that a group of sixteen digits has a fixed aspect ratio. The proposed algorithm first performs morphological operations to obtain multiple candidates of the credit card number with >4:1 aspect ratio, then recognizes the card number by testing each candidate via OCR and BIN matching techniques. Implemented with OpenCV and Firebase ML, the proposed scheme achieves 77.75% accuracy in the credit card number recognition task.

The Effect of Spatial Attention in Hangul Word Recognition: Depending on Visual Factors (한글 단어 재인에서 시각적 요인에 따른 공간주의의 영향)

  • Ko Eun Lee;Hye-Won Lee
    • Korean Journal of Cognitive Science
    • /
    • v.34 no.1
    • /
    • pp.1-20
    • /
    • 2023
  • In this study, we examined the effects of spatial attention in Hangul word recognition depending on visual factors. The visual complexity of words (Experiment 1) and contrast (Experiment 2) were manipulated to examine whether the effect of spatial attention differs depending on visual quality. Participants responded to words with and without codas in experiment 1 and words in high-contrast and low-contrast conditions in experiment 2. The effects of spatial attention were investigated by calculating the difference in performance between the condition where spatial cues were given at the target location (valid trial) and the condition where the spatial cues were not given at the target location (invalid trial) as the cuing effects. As a result, the cuing effects were similar depending on the complexity of the words. It indicates that the effects of spatial attention were not different across the visual complexity conditions. The cuing effects were greater in the low-contrast condition than in the high-contrast condition. The greater effect of spatial attention when the contrast is low was explained as a mechanism of signal enhancement.

Experimentation on The Recognition of Arithmetic Expressions (수식 표현의 인식에 관한 연구)

  • Lee, Young Kyo;Kim, Young Po
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.10 no.4
    • /
    • pp.29-35
    • /
    • 2014
  • The formula contains up between the text and the structural information, as well as their mathematical symbols. Research on-line or off-line recognition formula is underway actively used in various fields, and various forms of the equation are implemented recognition system. Although many documents are included in the various formulas, it is not easy to enter a formula into the computer. Recognition of the expression is divided into two processes of symbol recognition and structural analysis. After analyzing the location information of each character is specified to recognize the effective area after each symbol, and to the structure analysis based on the proximity between the characters is recognized as an independent single formula. Furthermore, analyzing the relationship between the front and back each time a combination of the position relationship between each symbol, and then to add the symbol which was able to easily update the structure of the entire formula. In this paper, by using a scanner to scan the book formula was used to interpret the meaning of the recognized symbol has a relative size and location information of the expression symbol. An algorithm to remove the formulas for calculation of the number of formula is present at the same time is proposed. Using the proposed algorithms to scan the books in the formula in order to evaluate the performance verification as 100% separation and showed the recognition rate equation.

An approach to visual pattern recognition by neural network system

  • Hatakeyama, Yasuhiro;Kakazu, Yukinori
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1992.10b
    • /
    • pp.61-64
    • /
    • 1992
  • In this paper, a visual pattern recognition system is proposed, which can recognize both a pattern and its location. This system, referred to as the expanded neocognitron, has the following capabilities: (1) A higher performance in extraction of features, and (2) A new capability for recognizing the locations of patterns. This system adopts the learning and recognizing mechanism of the neocognitron. First, the ability to classify pattern is enhanced by improving the mechanisms of feature extraction and learning algorithm. Second, the function of detecting the location of each pattern is realized by developing an architecture which does not reduce structure, i.e., the unit density is constant all the way from the input stage to the output stage.

  • PDF

로보트 아크용접에서 시각인식장치를 이용한 용접선의 추적

  • 손영탁;김재선;조형석
    • Proceedings of the Korean Society of Precision Engineering Conference
    • /
    • 1993.10a
    • /
    • pp.550-555
    • /
    • 1993
  • The aim of this paper is to present the development of visual seam tracking system equipped with visual range finder. The visual range finder, which consists of a CCD camera and a diode laser system with line generating optics, developed to recognize the types of weld joints and detect the location of weld joints. In practical applications, however, images of the weld joints are often degraded due to spatters, are flares, surface specularity, and welding smoke. To overcome the problem, this paper proposes a syntactic approach which is a class of artificial intelligence techniques. In the approach, the type of weld joint is inferred based upon the production rules which are linguiques grammars consisting of a set of line and junction primitives of laser strip image projected on weld joint. The production rules eliminate several noisy primitives to create new primitives through the merging process of primitives. After the recognition of weld joint, arc welding is started and the location of weld joints is repeatedly detected using a spring model-based template matching in which the template model is a by-product of the recognition process of weld joint. To show the effectiveness of the proposed approach a series of experiments-identification and robotic tracking-are conducted for four different types of weld joints.

  • PDF