• Title/Summary/Keyword: Recognition Speed

Search Result 770, Processing Time 0.031 seconds

Vehicle Detection Scheme Based on a Boosting Classifier with Histogram of Oriented Gradient (HOG) Features and Image Segmentation] (HOG 특징 및 영상분할을 이용한 부스팅분류 기반 자동차 검출 기법)

  • Choi, Mi-Soon;Lee, Jeong-Hwan;Roh, Tae-Moon;Shim, Jae-Chang
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.16 no.10
    • /
    • pp.955-961
    • /
    • 2010
  • In this paper, we describe a study of a vehicle detection method based on a Boosting Classifier which uses Histogram of Oriented Gradient (HOG) features and Image Segmentation techniques. An input image is segmented by means of a split and merge algorithm. Then, the two largest segmented regions are removed in order to reduce the search region and speed up processing time. The HOG features are then calculated for each pixel in the search region. In order to detect the vehicle region we used the AdaBoost (adaptive boost) method, which is well known for classifying samples with two classes. To evaluate the performance of the proposed method, 537 training images were used to train and learn the classifier, followed by 500 non-training images to provide the recognition rate. From these experiments we were able to detect the proper image 98.34% of the time for the 500 non-training images. In conclusion, the proposed method can be used for detecting the location of a vehicle in an intelligent vehicle control system.

Design and Implementation of High-Resolution Image Transmission Interface for Mobile Device (모바일 환경을 위한 맞춤형 서비스 유비쿼터스 영상전송 시스템의 설계)

  • Lee, Sang-Wook;Ahn, Yong-Beom;Kim, Eung-Soo
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.12 no.4
    • /
    • pp.791-799
    • /
    • 2008
  • An image recognition for surrounding conditions is very important in image transmission. In recently rears, as the information infrastructure is more general, the user-centered demands in which they want to identify by object's states image using wire or wireless environment have increased. However, existing mobile solution could be hard to expect high quality mage, because limitation of software processing according as network based on mobile terminal which has low band width supports software codec. To solve this weak point, this paper describes on hardware codec design based on MPEG-4 which is international video compression standard. Implemented system contains the embedded CPU for optimized design and it works high quality service as transmission speed and resolution in mobile circumstance.

Recognizing that a person doesn't put on a safety cap using DSP. (DSP(Digital signal proccesor)를 이용한 산업현장에서의 안전모 미착용 인식 기술)

  • Lee, Yong-Woog;Song, Kang-Suk;Jeong, Moo-Il;Lim, Chul-Hoo;Moon, Sung-Mo
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2009.10a
    • /
    • pp.530-533
    • /
    • 2009
  • This paper proposes a method of recognizing that a person doesn't put on a safety cap using image processing method in DSP(Digital Signal Processor). It processes inputted images by image input devices that equipped in a industrial settings. If the method recognizes a person that doesn't put on a safety cap, a system transfers relevant recognition result to a supervisor and takes proper measures. If an accident happens and someone doesn't put on a safety cap, additional casualities could be. Proposed method can nip additional casualties in the bud. To recognize that a person don't put on a safety cap, images are processed by object abstraction, removal of noise, decision of a thing or a person, abstraction of a head part in a image, recognizing whether a man puts on a safety cap using HSV color space or not, and so on. Image input and image process are processed by DSP. And C language-based codes are optimized by an eignefunction(Intrinsics) for speed improvement of algorithms.

  • PDF

Fast Algorithm for Polynomial Reconstruction of Fuzzy Fingerprint Vault (지문 퍼지볼트의 빠른 다항식 복원 방법)

  • Choi, Woo-Yong;Lee, Sung-Ju;Chung, Yong-Wha;Moon, Ki-Young
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.18 no.2
    • /
    • pp.33-38
    • /
    • 2008
  • Biometric based authentication can provide strong security guarantee about the identity of users. However, security of biometric data is particularly important as compromise of the data will be permanent. Cancelable biometrics stores a non - invertible transformed version of the biometric data. Thus, even if the storage is compromised, the biometric data remains safe. Cancelable biometrics also provide a higher level of privacy by allowing many templates for the same biometric data and hence non-linkability of user's data stored in different databases. In this paper, we proposed the fast polynomial reconstruction algorithm for fuzzy fingerprint vault. The proposed method needs (k+1) real points to reconstruct the polynomial of degree (k-1). It enhances the speed, however, by $300{\sim}1500$ times according to the degree of polynomial compared with the exhaust search.

Performance analysis in automatic modulation classification based on deep learning (딥러닝 기반 자동 변조 인식 성능 분석)

  • Kang, Jong-Jin;Kim, Jae-Hyun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.3
    • /
    • pp.427-432
    • /
    • 2021
  • In this paper, we conduct performance analysis in automatic modulation classification of unknown communication signal to identify its modulation types based on deep neural network. The modulation classification performance was verified using time domain digital sample data of the modulated signal, frequency domain data to which FFT was applied, and time and frequency domain mixed data as neural network input data. For 11 types of analog and digitally modulated signals, the modulation classification performance was verified in various SNR environments ranging from -20 to 18 dB and reason for false classification was analyzed. In addition, by checking the learning speed according to the type of input data for neural network, proposed method is effective for constructing an practical automatic modulation recognition system that require a lot of time to learn.

Mask Wearing Detection System using Deep Learning (딥러닝을 이용한 마스크 착용 여부 검사 시스템)

  • Nam, Chung-hyeon;Nam, Eun-jeong;Jang, Kyung-Sik
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.1
    • /
    • pp.44-49
    • /
    • 2021
  • Recently, due to COVID-19, studies have been popularly worked to apply neural network to mask wearing automatic detection system. For applying neural networks, the 1-stage detection or 2-stage detection methods are used, and if data are not sufficiently collected, the pretrained neural network models are studied by applying fine-tuning techniques. In this paper, the system is consisted of 2-stage detection method that contain MTCNN model for face recognition and ResNet model for mask detection. The mask detector was experimented by applying five ResNet models to improve accuracy and fps in various environments. Training data used 17,217 images that collected using web crawler, and for inference, we used 1,913 images and two one-minute videos respectively. The experiment showed a high accuracy of 96.39% for images and 92.98% for video, and the speed of inference for video was 10.78fps.

A Reference Frame Selection Method Using RGB Vector and Object Feature Information of Immersive 360° Media (실감형 360도 미디어의 RGB 벡터 및 객체 특징정보를 이용한 대표 프레임 선정 방법)

  • Park, Byeongchan;Yoo, Injae;Lee, Jaechung;Jang, Seyoung;Kim, Seok-Yoon;Kim, Youngmo
    • Journal of IKEEE
    • /
    • v.24 no.4
    • /
    • pp.1050-1057
    • /
    • 2020
  • Immersive 360-degree media has a problem of slowing down the video recognition speed when the video is processed by the conventional method using a variety of rendering methods, and the video size becomes larger with higher quality and extra-large volume than the existing video. In addition, in most cases, only one scene is captured by fixing the camera in a specific place due to the characteristics of the immersive 360-degree media, it is not necessary to extract feature information from all scenes. In this paper, we propose a reference frame selection method for immersive 360-degree media and describe its application process to copyright protection technology. In the proposed method, three pre-processing processes such as frame extraction of immersive 360 media, frame downsizing, and spherical form rendering are performed. In the rendering process, the video is divided into 16 frames and captured. In the central part where there is much object information, an object is extracted using an RGB vector per pixel and deep learning, and a reference frame is selected using object feature information.

An Intelligent Spraying Machine Capable of Selective Spraying Corresponding to the Shape of Fruit Trees Using LiDAR (LiDAR를 활용한 과수 형상에 따라 선택적 방제가 가능한 지능형 방제기)

  • Yang, Changju;Kim, Gookhwan;Lee, Meonghun;Kim, Kyoung-Chul;Hong, Youngki;Kim, Hyunjong;Lee, Siyoung;Ryu, Hee-Suk;Kwon, Kyung-Do;Oh, Min-seok
    • Journal of Drive and Control
    • /
    • v.17 no.4
    • /
    • pp.23-30
    • /
    • 2020
  • Driving on irregular and inclined roads using agricultural machinery such as spraying machines or trucks in orchards causes farmer casualties associated with the overturning of agricultural machinery. In addition, the harm to agricultural workers caused by the excessive inhalation of the scattered pesticide frequently occurs during pest control processes. To address these problems, we introduced precision agricultural technology that could selectively spray pesticides only where the fruit is present by recognizing the presence or shape of the fruit in the orchard. In this paper, a 16-channel LIDAR (VLP-16) made of Velodyne was used to identify the shape of fruit trees. Solenoid valves were attached to the end parts of 12 nozzles of the orchard spraying machine for on/off control. The smart spraying machine implemented in this way was mounted on a vehicle capable of autonomous travel and performed selective control depending upon the shape of the fruit trees while traveling in the orchards. This is expected to significantly reduce the amounts of pesticides used in orchards and production costs.

Road Object Graph Modeling Method for Efficient Road Situation Recognition (효과적인 도로 상황 인지를 위한 도로 객체 그래프 모델링 방법)

  • Ariunerdene, Nyamdavaa;Jeong, Seongmo;Song, Seokil
    • Journal of Platform Technology
    • /
    • v.9 no.4
    • /
    • pp.3-9
    • /
    • 2021
  • In this paper, a graph data model is introduced to effectively recognize the situation between each object on the road detected by vehicles or road infrastructure sensors. The proposed method builds a graph database by modeling each object on the road as a node of the graph and the relationship between objects as an edge of the graph, and updates object properties and edge properties in real time. In this case, the relationship between objects represented as edges is set when there is a possibility of approach between objects in consideration of the position, direction, and speed of each object. Finally, we propose a spatial indexing technique for graph nodes and edges to update the road object graph database represented through the proposed graph modeling method continuously in real time. To show the superiority of the proposed indexing technique, we compare the proposed indexing based database update method to the non-indexing update method through simulation. The results of the simulation show the proposed method outperforms more than 10 times to the non-indexing method.

Dynamic characteristics monitoring of wind turbine blades based on improved YOLOv5 deep learning model

  • W.H. Zhao;W.R. Li;M.H. Yang;N. Hong;Y.F. Du
    • Smart Structures and Systems
    • /
    • v.31 no.5
    • /
    • pp.469-483
    • /
    • 2023
  • The dynamic characteristics of wind turbine blades are usually monitored by contact sensors with the disadvantages of high cost, difficult installation, easy damage to the structure, and difficult signal transmission. In view of the above problems, based on computer vision technology and the improved YOLOv5 (You Only Look Once v5) deep learning model, a non-contact dynamic characteristic monitoring method for wind turbine blade is proposed. First, the original YOLOv5l model of the CSP (Cross Stage Partial) structure is improved by introducing the CSP2_2 structure, which reduce the number of residual components to better the network training speed. On this basis, combined with the Deep sort algorithm, the accuracy of structural displacement monitoring is mended. Secondly, for the disadvantage that the deep learning sample dataset is difficult to collect, the blender software is used to model the wind turbine structure with conditions, illuminations and other practical engineering similar environments changed. In addition, incorporated with the image expansion technology, a modeling-based dataset augmentation method is proposed. Finally, the feasibility of the proposed algorithm is verified by experiments followed by the analytical procedure about the influence of YOLOv5 models, lighting conditions and angles on the recognition results. The results show that the improved YOLOv5 deep learning model not only perform well compared with many other YOLOv5 models, but also has high accuracy in vibration monitoring in different environments. The method can accurately identify the dynamic characteristics of wind turbine blades, and therefore can provide a reference for evaluating the condition of wind turbine blades.