• Title/Summary/Keyword: spatial recognition

A Tree Regularized Classifier-Exploiting Hierarchical Structure Information in Feature Vector for Human Action Recognition

  • Luo, Huiwu;Zhao, Fei;Chen, Shangfeng;Lu, Huanzhang
    • KSII Transactions on Internet and Information Systems (TIIS) / v.11 no.3 / pp.1614-1632 / 2017
  • Bag of visual words is a popular model in human action recognition, but it usually suffers from the loss of spatial and temporal configuration information of local features and from large quantization error in its feature coding procedure. To overcome these two deficiencies, we combine sparse coding with a spatio-temporal pyramid for human action recognition and take this method as the baseline. More importantly, and as the focus of this paper, we find that the feature vector constructed by the baseline method has a hierarchical structure. To exploit this hierarchical structure information for better recognition accuracy, we propose a tree regularized classifier that conveys it. The main contributions of this paper are as follows. First, we introduce a tree regularized classifier to encode the hierarchical structure information in the feature vector for human action recognition. Second, we present an optimization algorithm to learn the parameters of the proposed classifier. Third, we evaluate the proposed classifier on the YouTube, Hollywood2, and UCF50 datasets; the experimental results show that it performs better than SVM and other popular classifiers and achieves promising results on all three datasets.

Development of Gesture Recognition-Based 3D Serious Games (치매 예방을 위한 제스처 인식 기반 3D 기능성 게임 개발)

  • He, Guan-Feng;Park, Jin-Woong;Kang, Sun-Kyung;Jung, Sung-Tae
    • Journal of Korea Game Society / v.11 no.6 / pp.103-113 / 2011
  • In this paper, we propose gesture recognition-based 3D serious games to prevent dementia. The games are designed to enhance dementia prevention by increasing users' brain usage and physical activity through whole-body gesture recognition. Existing cameras used for gesture recognition are limited in recognition rate and operating range. For more stable recognition of body gestures, we detected users with a 3D depth camera, obtained their joint data, and analyzed joint motions to recognize body gestures. Game contents were designed to exercise memory, reasoning, calculation, and spatial recognition, focusing on the atrophy of brain cells as a major cause of dementia. The game results of each user were saved and analyzed to measure how their recognition skills improved.

Foreign Immigrants' Recognition on Macro-contexts of Transnational Migration (외국인 이주자의 거시적 이주 배경에 관한 인지)

  • Choi, Byung-Doo;Lee, Gyung-Ja
    • Journal of the Economic Geographical Society of Korea / v.13 no.1 / pp.64-88 / 2010
  • Rapidly increasing transnational migration can be seen as a typical process that proceeds under the macro-contexts of the socio-spatial characteristics of origin and destination countries and their relationships, shaped by globally uneven regional development in the process of glocalization and by the development of transportation and communication at the global level. To consider the macro-contexts of transnational migration, this paper emphasizes the concept of multicultural space and some key elements implied in it, namely place, territory, network, and scale (suggested by Jessop et al.), as well as spatial flow and difference. Based on a questionnaire analysis of foreign immigrants' recognition of these macro-contexts, the paper reports several findings: a high level of recognition of global changes among all types of foreign immigrants; the most negative recognition of the economic and social conditions of their origin country among migrant workers, out of the four types of foreign immigrants; a positive recognition of international migration among people from all regions of origin (except a few countries such as Japan); and a low level of recognition, across all types, of S. Korea's characteristics as their destination country.

Character Detection and Recognition of Steel Materials in Construction Drawings using YOLOv4-based Small Object Detection Techniques (YOLOv4 기반의 소형 물체탐지기법을 이용한 건설도면 내 철강 자재 문자 검출 및 인식기법)

  • Sim, Ji-Woo;Woo, Hee-Jo;Kim, Yoonhwan;Kim, Eung-Tae
    • Journal of Broadcast Engineering / v.27 no.3 / pp.391-401 / 2022
  • As deep learning-based object detection and recognition research has advanced recently, its scope of application in industry and everyday life is expanding. However, deep learning-based systems in the construction field are still much less studied. Material calculation in construction is still done manually, so it takes a lot of time, accurate accumulation is difficult, and transactions based on wrong volume calculations occur. A fast and accurate automatic drawing recognition system is required to solve this problem. Therefore, we propose an AI-based automatic drawing recognition and accumulation system that detects and recognizes steel materials in construction drawings. To accurately detect steel materials in construction drawings, we propose data augmentation techniques and spatial attention modules that improve small object detection performance based on YOLOv4. Each detected steel material area is recognized as text, and the number of steel materials is tallied based on the predicted characters. Experimental results show that the proposed method increases accuracy and precision by 1.8% and 16%, respectively, compared with the conventional YOLOv4. The proposed method achieved a precision of 0.938 and a recall of 1; average precision was 99.4% for AP0.5 and 67% for AP0.5:0.95. Character recognition accuracy reached 99.9% after configuring and training on a suitable dataset containing the fonts used in construction drawings, compared with 75.6% using the existing dataset. The average time required per image was 0.013 seconds for detection, 0.65 seconds for character recognition, and 0.16 seconds for accumulation, resulting in 0.84 seconds.

Face Recognition Using Tensor Subspace Analysis in Robot Environments (로봇 환경에서 텐서 부공간 분석기법을 이용한 얼굴인식)

  • Kim, Sung-Suk;Kwak, Keun-Chang
    • The Journal of Korea Robotics Society / v.3 no.4 / pp.300-307 / 2008
  • This paper is concerned with face recognition for human-robot interaction (HRI) in robot environments. For this purpose, we use Tensor Subspace Analysis (TSA) to recognize the user's face through the robot's camera while the robot performs various services in home environments. With TSA, the spatial correlation between the pixels in an image can be naturally characterized. We use a face database collected in the u-robot test bed environment at ETRI. The presented method can serve as a core technique for HRI, enabling natural interaction between humans and robots in home robot applications. Experimental results on the face database show that the presented method performs well in comparison with well-known methods such as Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) in distance-varying environments.

3D Depth Measurement System-based Unpaved Trail Recognition for Mobile Robots (이동 로봇을 위한 3차원 거리 측정 장치기반 비포장 도로 인식)

  • Gim, Seong-Chan;Kim, Jong-Man;Kim, Hyong-Suk
    • Journal of Institute of Control, Robotics and Systems / v.12 no.4 / pp.395-399 / 2006
  • A method for recognizing unpaved road regions with a 3D depth measurement system is proposed for mobile robots. For autonomous maneuvering of mobile robots, recognition of obstacles or of the road region is an essential task. In this paper, a 3D depth measurement system composed of a rotating mirror, a line laser, and a mono-camera is employed to detect depth: the laser light is reflected by the mirror and projected onto the scene objects whose locations are to be determined. The obtained depth information is converted into an image. In such depth images, the road region appears even and planar, while the off-road region is irregular or textured. The problem therefore reduces to a texture identification problem. The road region is detected with a simple spatial differentiation technique that finds the plain-textured area. Identification results for diverse unpaved-trail situations are included in this paper.

Pattern Recognition with Rotation Invariant Multiresolution Features

  • Rodtook, S.;Makhanov, S.S.
    • Institute of Control, Robotics and Systems: Conference Proceedings / 2004.08a / pp.1057-1060 / 2004
  • We propose new rotation moment invariants based on multiresolution filter bank techniques. The multiresolution pyramid motivates our simple but efficient feature selection procedure based on fuzzy C-means clustering combined with the Mahalanobis distance. The procedure verifies the impact of random noise as well as an interesting and lesser-known impact of noise due to spatial transformations. The recognition accuracy of the proposed techniques has been tested against earlier moment invariants as well as some wavelet-based schemes. Numerical experiments with more than 30,000 images demonstrate a tangible accuracy increase of about 3% for low noise, 8% for average noise, and 15% for high-level noise.

Displacement Measurement of Multi-Point Using a Pattern Recognition from Video Signal (영상 신호에서 패턴인식을 이용한 다중 포인트 변위측정)

  • Jeon, Hyeong-Seop;Choi, Young-Chul;Park, Jong-Won
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference / 2008.11a / pp.675-680 / 2008
  • This paper proposes a way to measure displacement at multiple points by applying pattern recognition to a video signal. Displacement is generally measured with a gap sensor, which is a type of displacement sensor. However, it is difficult to measure displacement with a common sensor in places where attaching a sensor is unsuitable, such as high-temperature or radioactive areas. In such places, non-contact methods should be used to measure displacement, and in this study, images from a CCD camera were used. Measuring displacement from camera images makes non-contact measurement possible. The measuring device is simple to install and measures displacement at multiple points, which is advantageous for solving problems of spatial constraints.

Segmentation and Classification of Range Data Using Phase Information of Gabor Filter (Gabor 필터의 위상 정보를 이용한 거리 영상의 분할 및 분류)

  • 현기호;이광호;황병곤;조석제;하영호
    • Journal of the Korean Institute of Telematics and Electronics / v.27 no.8 / pp.1275-1283 / 1990
  • Perception of surfaces from range images plays a key role in 3-D object recognition. Recognition of 3-D objects from range images is performed by matching the perceived surface descriptions with stored object models. The first step of 3-D object recognition from range images is image segmentation. In this paper, an approach for segmenting 3-D range images into symbolic surface descriptions using a spatial Gabor filter is proposed. Since the phase of the data carries a lot of important information, the phase information together with the magnitude information can effectively segment range imagery into regions satisfying a common homogeneity criterion. The phase and magnitude of the Gabor filter can represent a unique feature vector at each point of the range data. As a result, range images are transformed into feature vectors in a 3-parameter representation. Methods are presented not only to extract meaningful features but also to classify patch information from range images.

Development of a visual-data processing system for a polyhedral object recognition by the projection of laser ring beam (다면체 물체 인식을 위한 환상레이져 빔 투사형 시각 정보 처리 시스템 개발)

  • 김종형;조용철;조형석
    • Institute of Control, Robotics and Systems: Conference Proceedings / 1988.10a / pp.428-432 / 1988
  • In this study, some issues in 3-dimensional object recognition and pose determination are discussed. The method employs a laser projector that projects a cylindrical light beam onto the object plane, where it produces a bright ring pattern. The picture is then taken by a TV camera. Mathematically, the ring pattern is an ellipse whose geometrical parameters carry the 3-dimensional features of the object plane. This paper gives the mathematical aspects of the 3-dimensional recognition method and shows experimentally how the ellipse parameters vary with the spatial deviation of the object plane.
