Search | Korea Science

Real time instruction classification system

Sang-Hoon Lee;Dong-Jin Kwon
- International Journal of Internet, Broadcasting and Communication
- /
- v.16 no.3
- /
- pp.212-220
- /
- 2024
A recently the advancement of society, AI technology has made significant strides, especially in the fields of computer vision and voice recognition. This study introduces a system that leverages these technologies to recognize users through a camera and relay commands within a vehicle based on voice commands. The system uses the YOLO (You Only Look Once) machine learning algorithm, widely used for object and entity recognition, to identify specific users. For voice command recognition, a machine learning model based on spectrogram voice analysis is employed to identify specific commands. This design aims to enhance security and convenience by preventing unauthorized access to vehicles and IoT devices by anyone other than registered users. We converts camera input data into YOLO system inputs to determine if it is a person, Additionally, it collects voice data through a microphone embedded in the device or computer, converting it into time-domain spectrogram data to be used as input for the voice recognition machine learning system. The input camera image data and voice data undergo inference tasks through pre-trained models, enabling the recognition of simple commands within a limited space based on the inference results. This study demonstrates the feasibility of constructing a device management system within a confined space that enhances security and user convenience through a simple real-time system model. Finally our work aims to provide practical solutions in various application fields, such as smart homes and autonomous vehicles.
https://doi.org/10.7236/IJIBC.2024.16.3.212 인용 PDF

Dynamic swarm particle for fast motion vehicle tracking

Jati, Grafika;Gunawan, Alexander Agung Santoso;Jatmiko, Wisnu
- ETRI Journal
- /
- v.42 no.1
- /
- pp.54-66
- /
- 2020
Nowadays, the broad availability of cameras and embedded systems makes the application of computer vision very promising as a supporting technology for intelligent transportation systems, particularly in the field of vehicle tracking. Although there are several existing trackers, the limitation of using low-cost cameras, besides the relatively low processing power in embedded systems, makes most of these trackers useless. For the tracker to work under those conditions, the video frame rate must be reduced to decrease the burden on computation. However, doing this will make the vehicle seem to move faster on the observer's side. This phenomenon is called the fast motion challenge. This paper proposes a tracker called dynamic swarm particle (DSP), which solves the challenge. The term particle refers to the particle filter, while the term swarm refers to particle swarm optimization (PSO). The fundamental concept of our method is to exploit the continuity of vehicle dynamic motions by creating dynamic models based on PSO. Based on the experiments, DSP achieves a precision of 0.896 and success rate of 0.755. These results are better than those obtained by several other benchmark trackers.
https://doi.org/10.4218/etrij.2018-0435 인용 PDF KSCI

Night-time Vehicle Detection Based On Multi-class SVM (다중-클래스 SVM 기반 야간 차량 검출)

Lim, Hyojin;Lee, Heeyong;Park, Ju H.;Jung, Ho-Youl
- IEMEK Journal of Embedded Systems and Applications
- /
- v.10 no.5
- /
- pp.325-333
- /
- 2015
Vision based night-time vehicle detection has been an emerging research field in various advanced driver assistance systems(ADAS) and automotive vehicle as well as automatic head-lamp control. In this paper, we propose night-time vehicle detection method based on multi-class support vector machine(SVM) that consists of thresholding, labeling, feature extraction, and multi-class SVM. Vehicle light candidate blobs are extracted by local mean based thresholding following by labeling process. Seven geometric and stochastic features are extracted from each candidate through the feature extraction step. Each candidate blob is classified into vehicle light or not by multi-class SVM. Four different multi-class SVM including one-against-all(OAA), one-against-one(OAO), top-down tree structured and bottom-up tree structured SVM classifiers are implemented and evaluated in terms of vehicle detection performances. Through the simulations tested on road video sequences, we prove that top-down tree structured and bottom-up tree structured SVM have relatively better performances than the others.
https://doi.org/10.14372/IEMEK.2015.10.5.325 인용 PDF KSCI

Development of a Vision-based Lane Change Assistance System for Safe Driving (안전주행을 위한 비전 기반의 차선변경보조시스템 개발)

Sung, Jun-Yong;Han, Min-Hong;Ro, Kwang-Hyun
- Journal of the Korea Society of Computer and Information
- /
- v.11 no.5 s.43
- /
- pp.329-336
- /
- 2006
This paper describes a lane change assistance system for the help of safe lane change, which detects vehicles approaching from the rear side by using a computer vision algorithm and notifies the possibility of safe lane change to a driver. In case a driver tries to lane change, the proposed system can detect vehicles and keep track of them. After detecting side lane lines, region of interest for vehicle detection is decided. For detection a vehicle, optical flow technique is applied. The experimental result of the proposed algorithm and system showed that the vehicle detection rate was 91% and the embedded system would have application to a lane change assistance system being commercialized in the near future.
PDF

A Study on Machine Vision System Module based on high speed realtime triggering (초고속 실시간 트리거에 의한 머신 비전 시스템 모듈에 관한 연구)

Lee, Myeongsoo;Kim, Dongmin
- Proceedings of the Korea Information Processing Society Conference
- /
- 2017.11a
- /
- pp.1118-1119
- /
- 2017
머신 비전 시스템은 영상 처리와 영상 분석을 함께 사용하여 공장의 조립라인 등 다양한 분야에서 응용 되고 있는 시스템이다. 하지만 기존의 시스템은 고가의 초고속 카메라와 다양한 센서를 동시에 이용하는 문제로 시스템 구축 및 확장성 등의 불편함을 야기하고 있다. 이에 본 연구에서는 초고속 실시간 트리거를 이용한 저가형 이미지 센서 기반의 머신 비전 시스템을 구성하고자 한다.
https://doi.org/10.3745/PKIPS.y2017m11a.1118 인용 PDF

Design of OpenCV based Finger Recognition System using binary processing and histogram graph

Baek, Yeong-Tae;Lee, Se-Hoon;Kim, Ji-Seong
- Journal of the Korea Society of Computer and Information
- /
- v.21 no.2
- /
- pp.17-23
- /
- 2016
NUI is a motion interface. It uses the body of the user without the use of HID device such as a mouse and keyboard to control the device. In this paper, we use a Pi Camera and sensors connected to it with small embedded board Raspberry Pi. We are using the OpenCV algorithms optimized for image recognition and computer vision compared with traditional HID equipment and to implement a more human-friendly and intuitive interface NUI devices. comparison operation detects motion, it proposed a more advanced motion sensors and recognition systems fused connected to the Raspberry Pi.
https://doi.org/10.9708/jksci.2016.21.2.017 인용 PDF KSCI

Meme Analysis using Image Captioning Model and GPT-4

Marvin John Ignacio;Thanh Tin Nguyen;Jia Wang;Yong-Guk Kim
- Proceedings of the Korea Information Processing Society Conference
- /
- 2023.11a
- /
- pp.628-631
- /
- 2023
We present a new approach to evaluate the generated texts by Large Language Models (LLMs) for meme classification. Analyzing an image with embedded texts, i.e. meme, is challenging, even for existing state-of-the-art computer vision models. By leveraging large image-to-text models, we can extract image descriptions that can be used in other tasks, such as classification. In our methodology, we first generate image captions using BLIP-2 models. Using these captions, we use GPT-4 to evaluate the relationship between the caption and the meme text. The results show that OPT_6.7B provides a better rating than other LLMs, suggesting that the proposed method has a potential for meme classification.
https://doi.org/10.3745/PKIPS.y2023m11a.628 인용 PDF

Multiview Stereo Matching on Mobile Devices Using Parallel Processing on Embedded GPU (임베디드 GPU에서의 병렬처리를 이용한 모바일 기기에서의 다중뷰 스테레오 정합)

Jeon, Yun Bae;Park, In Kyu
- Journal of Broadcast Engineering
- /
- v.24 no.6
- /
- pp.1064-1071
- /
- 2019
Multiview stereo matching algorithm is used to reconstruct 3D shape from a set of 2D images. Conventional multiview stereo algorithms have been implemented on high-performance hardware due to the heavy complexity that contains a large number of calculations in each step. However, as the performance of mobile graphics processors has recently increased rapidly, complex computer vision algorithms can now be implemented on mobile devices like a smartphone and an embedded board. In this paper we parallelize an multiview stereo algorithm using OpenCL on mobile GPU and provide various optimization techniques on the embedded hardware with limited resource.
https://doi.org/10.5909/JBE.2019.24.6.1064 인용 PDF KSCI KPUBS

Light-weight Gender Classification and Age Estimation based on Ensemble Multi-tasking Deep Learning (앙상블 멀티태스킹 딥러닝 기반 경량 성별 분류 및 나이별 추정)

Huy Tran, Quoc Bao;Park, JongHyeon;Chung, SunTae
- Journal of Korea Multimedia Society
- /
- v.25 no.1
- /
- pp.39-51
- /
- 2022
Image-based gender classification and age estimation of human are classic problems in computer vision. Most of researches in this field focus just only one task of either gender classification or age estimation and most of the reported methods for each task focus on accuracy performance and are not computationally light. Thus, running both tasks together simultaneously on low cost mobile or embedded systems with limited cpu processing speed and memory capacity are practically prohibited. In this paper, we propose a novel light-weight gender classification and age estimation method based on ensemble multitasking deep learning with light-weight processing neural network architecture, which processes both gender classification and age estimation simultaneously and in real-time even for embedded systems. Through experiments over various well-known datasets, it is shown that the proposed method performs comparably to the state-of-the-art gender classification and/or age estimation methods with respect to accuracy and runs fast enough (average 14fps) on a Jestson Nano embedded board.
https://doi.org/10.9717/kmms.2022.25.1.039 인용 PDF KSCI HTML

The Design and Implementation of Internet Outlet with Multiple User Interface Using TCP/IP Processor (TCP/IP프로세서를 이용한 다중 사용자 인터페이스 지원 인터넷 전원 콘센트의 설계 및 구현)

Baek, Jeong-Hyun
- Journal of the Korea Society of Computer and Information
- /
- v.17 no.9
- /
- pp.103-112
- /
- 2012
Recently, the infrastructure to be connected to the internet is much provided, there is more and more need to connect electric or electronic products to the internet to monitor or control them remotely. However, most of the existing products lack the network interface, so it was very inconvenient to be connected to the internet. Therefore, this article designs and realizes the internet outlet allowing real-time scheduling that can control the power remotely on the internet by using the hardware TCP/IP processor. The realized product consumes low production cost because it can be realized by using the hardware TCP/IP processor and the 8-bit small microprocessor. In addition, the product can be used widely in both wired and wireless environments with a variety of user interface, including the dedicated control program which provides the environment configuration functions; embedded web service that enables the webpage to be saved on the external flash memory; Android smartphone application; motion recognition control environment that uses the OpenCV computer vision library, etc.
https://doi.org/10.9708/jksci/2012.17.9.103 인용 PDF KSCI

Search Result 70, Processing Time 0.022 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)