Search | Korea Science

Object Detection and Optical Character Recognition for Mobile-based Air Writing (모바일 기반 Air Writing을 위한 객체 탐지 및 광학 문자 인식 방법)

Kim, Tae-Il;Ko, Young-Jin;Kim, Tae-Young
- The Journal of Korean Institute of Next Generation Computing
- /
- v.15 no.5
- /
- pp.53-63
- /
- 2019
To provide a hand gesture interface through deep learning in mobile environments, research on the light-weighting of networks is essential for high recognition rates while at the same time preventing degradation of execution speed. This paper proposes a method of real-time recognition of written characters in the air using a finger on mobile devices through the light-weighting of deep-learning model. Based on the SSD (Single Shot Detector), which is an object detection model that utilizes MobileNet as a feature extractor, it detects index finger and generates a result text image by following fingertip path. Then, the image is sent to the server to recognize the characters based on the learned OCR model. To verify our method, 12 users tested 1,000 words using a GALAXY S10+ and recognized their finger with an average accuracy of 88.6%, indicating that recognized text was printed within 124 ms and could be used in real-time. Results of this research can be used to send simple text messages, memos, and air signatures using a finger in mobile environments.

Real time instruction classification system

Sang-Hoon Lee;Dong-Jin Kwon
- International Journal of Internet, Broadcasting and Communication
- /
- v.16 no.3
- /
- pp.212-220
- /
- 2024
A recently the advancement of society, AI technology has made significant strides, especially in the fields of computer vision and voice recognition. This study introduces a system that leverages these technologies to recognize users through a camera and relay commands within a vehicle based on voice commands. The system uses the YOLO (You Only Look Once) machine learning algorithm, widely used for object and entity recognition, to identify specific users. For voice command recognition, a machine learning model based on spectrogram voice analysis is employed to identify specific commands. This design aims to enhance security and convenience by preventing unauthorized access to vehicles and IoT devices by anyone other than registered users. We converts camera input data into YOLO system inputs to determine if it is a person, Additionally, it collects voice data through a microphone embedded in the device or computer, converting it into time-domain spectrogram data to be used as input for the voice recognition machine learning system. The input camera image data and voice data undergo inference tasks through pre-trained models, enabling the recognition of simple commands within a limited space based on the inference results. This study demonstrates the feasibility of constructing a device management system within a confined space that enhances security and user convenience through a simple real-time system model. Finally our work aims to provide practical solutions in various application fields, such as smart homes and autonomous vehicles.
https://doi.org/10.7236/IJIBC.2024.16.3.212 인용 PDF

Dynamic Object Detection Architecture for LiDAR Embedded Processors (라이다 임베디드 프로세서를 위한 동적 객체인식 아키텍처 구현)

Jung, Minwoo;Lee, Sanghoon;Kim, Dae-Young
- Journal of Platform Technology
- /
- v.8 no.4
- /
- pp.11-19
- /
- 2020
In an autonomous driving environment, dynamic recognition of objects is essential as the situation changes in real time. In addition, as the number of sensors and control modules built into an autonomous vehicle increases, the amount of data the central control unit has to process also rapidly increases. By minimizing the output data from the sensor, the load on the central control unit can be reduced. This study proposes a dynamic object recognition algorithm solely using the embedded processor on a LiDAR sensor. While there are open source algorithms to process the point cloud output from LiDAR sensors, most require a separate high-performance processor. Since the embedded processors installed in LiDAR sensors often have resource constraints, it is essential to optimize the algorithm for efficiency. In this study, an embedded processor based object recognition algorithm was developed for autonomous vehicles, and the correlation between the size of the point clouds and processing time was analyzed. The proposed object recognition algorithm evaluated that the processing time directly increased with the size of the point cloud, with the processor stalling at a specific point if the point cloud size is beyond the threshold
PDF

The Development of real-time system for taking the dimensions of objects with arbitray shape

Chung, Yun-Su;Won, Jong-Un;Kim, Jin-Seok
- Proceedings of the IEEK Conference
- /
- 2002.07c
- /
- pp.1523-1526
- /
- 2002
In this paper, we propose a method fur measuring the dimensions of an arbitrary object using geometric relationship between a perspective projection image and a rectangular parallelepiped model. For recognizing the vertexes of the rectangular parallelepiped surrounding an arbitrary object, the method adopts a strategy that derives the equations for vertex recognition from the geometrical relationships for image formation between 2D image and the rectangular parallelepiped model. extracts from 2D image with vertical view features (or junctions) of minimum quadrangle circumscribing an arbitrary shape object, and then recognizes vertexes from the features with the equations. Finally, the dimensions of the object are calculated from these results of vertex recognition. By the experimental results, it is demonstrated that this method is very effective to recognize the vertexes of the arbitrary objects.
PDF

Development of Remote-Controlled Object-Recognizing Mobile Home CCTV Using Smartphone and Arduino (스마트폰과 아두이노를 이용한 원격제어 객체인식 이동형 홈 CCTV 개발)

Kim, Dong-Ju;Lim, Chae-Won;Choi, Hyun-Ho
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.24 no.11
- /
- pp.1546-1549
- /
- 2020
This paper introduces the development process of mobile home CCTV that enables remote control and object recognition using unused smartphones and Arduino. Clients can control motors connected to Arduino through button, enable bidirectional voice communication between client-server and receive video from the server in real time. The server sends a PUSH notification to the client when its battery is low. When the server recognizes the charger, the client's remote control allows the server to dock to the charger and charge it. It was confirmed that video and voice delivery between client and server works well without any problems, and that object recognition works smoothly.
https://doi.org/10.6109/jkiice.2020.24.11.1546 인용 PDF KSCI

Realtime Markerless 3D Object Tracking for Augmented Reality (증강현실을 위한 실시간 마커리스 3차원 객체 추적)

Min, Jae-Hong;Islam, Mohammad Khairul;Paul, Anjan Kumar;Baek, Joong-Hwan
- Journal of Advanced Navigation Technology
- /
- v.14 no.2
- /
- pp.272-277
- /
- 2010
AR(Augmented Reality) needs medium between real and virtual, world, and recognition techniques are necessary to track an object continuously. Optical tracking using marker is mainly used, but it takes time and is inconvenient to attach marker onto the target objects. Therefore, many researchers try to develop markerless tracking techniques nowaday. In this paper, we extract features and 3D position from 3D objects and suggest realtime tracking based on these features and positions, which do not use just coplanar features and 2D position. We extract features using SURF, get rotation matrix and translation vector of 3D object using POSIT with these features and track the object in real time. If the extracted features are nor enough and it fail to track the object, then new features are extracted and re-matched to recover the tracking. Also, we get rotation in matrix and translation vector of 3D object using POSIT and track the object in real time.
PDF KSCI

Real-Time Object Recognition Using Local Features (지역 특징을 사용한 실시간 객체인식)

Kim, Dae-Hoon;Hwang, Een-Jun
- Journal of IKEEE
- /
- v.14 no.3
- /
- pp.224-231
- /
- 2010
Automatic detection of objects in images has been one of core challenges in the areas such as computer vision and pattern analysis. Especially, with the recent deployment of personal mobile devices such as smart phone, such technology is required to be transported to them. Usually, these smart phone users are equipped with devices such as camera, GPS, and gyroscope and provide various services through user-friendly interface. However, the smart phones fail to give excellent performance due to limited system resources. In this paper, we propose a new scheme to improve object recognition performance based on pre-computation and simple local features. In the pre-processing, we first find several representative parts from similar type objects and classify them. In addition, we extract features from each classified part and train them using regression functions. For a given query image, we first find candidate representative parts and compare them with trained information to recognize objects. Through experiments, we have shown that our proposed scheme can achieve resonable performance.
PDF KSCI

Image Recognition Using Colored-hear Transformation Based On Human Synesthesia (인간의 공감각에 기반을 둔 색청변환을 이용한 영상 인식)

Shin, Seong-Yoon;Moon, Hyung-Yoon;Pyo, Seong-Bae
- Journal of the Korea Society of Computer and Information
- /
- v.13 no.2
- /
- pp.135-141
- /
- 2008
In this paper, we propose colored-hear recognition that distinguishing feature of synesthesia for human sensing by shared vision and specific sense of hearing. We perceived what potential influence of human's structured object recognition by visual analysis through the camera, So we've studied how to make blind persons can feel similar vision of real object. First of all, object boundaries are detected in the image data representing a specific scene. Then, four specific features such as object location in the image focus, feeling of average color, distance information of each object, and object area are extracted from picture. Finally, mapping these features to the audition factors. The audition factors are used to recognize vision for blind persons. Proposed colored-hear transformation for recognition can get fast and detail perception, and can be transmit information for sense at the same time. Thus, we were get a food result when applied this concepts to blind person's case of image recognition.
PDF

Object Recognition Face Detection With 3D Imaging Parameters A Research on Measurement Technology (3D영상 객체인식을 통한 얼굴검출 파라미터 측정기술에 대한 연구)

Choi, Byung-Kwan;Moon, Nam-Mee
- Journal of the Korea Society of Computer and Information
- /
- v.16 no.10
- /
- pp.53-62
- /
- 2011
In this paper, high-tech IT Convergence, to the development of complex technology, special technology, video object recognition technology was considered only as a smart - phone technology with the development of personal portable terminal has been developed crossroads. Technology-based detection of 3D face recognition technology that recognizes objects detected through the intelligent video recognition technology has been evolving technologies based on image recognition, face detection technology with through the development speed is booming. In this paper, based on human face recognition technology to detect the object recognition image processing technology is applied through the face recognition technology applied to the IP camera is the party of the mouth, and allowed the ability to identify and apply the human face recognition, measurement techniques applied research is suggested. Study plan: 1) face model based face tracking technology was developed and applied 2) algorithm developed by PC-based measurement of human perception through the CPU load in the face value of their basic parameters can be tracked, and 3) bilateral distance and the angle of gaze can be tracked in real time, proved effective.
https://doi.org/10.9708/jksci.2011.16.10.053 인용 PDF KSCI

The Recognition of Crack Detection Using Difference Image Analysis Method based on Morphology (모폴로지 기반의 차영상 분석기법을 이용한 균열검출의 인식)

Byun Tae-bo;Kim Jang-hyung;Kim Hyung-soo
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.10 no.1
- /
- pp.197-205
- /
- 2006
This paper presents the moving object tracking method using vision system. In order to track object in real time, the image of moving object have to be located the origin of the image coordinate axes. Accordingly, Fuzzy Control System is investigated for tracking the moving object, which control the camera module with Pan/Tilt mechanism. Hereafter, so the this system is applied to mobile robot, we design and implement image processing board for vision system. Also fuzzy controller is implemented to the StrongArm board. Finally, the proposed fuzzy controller is useful for the real-time moving object tracking system by experiment.
PDF KSCI

Search Result 279, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)