• Title/Summary/Keyword: Vision Assistant

Search Result 29, Processing Time 0.032 seconds

Performance Analysis of Vision-based Positioning Assistance Algorithm (비전 기반 측위 보조 알고리즘의 성능 분석)

  • Park, Jong Soo;Lee, Yong;Kwon, Jay Hyoun
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.37 no.3
    • /
    • pp.101-108
    • /
    • 2019
  • Due to recent improvements in computer processing speed and image processing technology, researches are being actively carried out to combine information from camera with existing GNSS (Global Navigation Satellite System) and dead reckoning. In this study, developed a vision-based positioning assistant algorithm to estimate the distance to the object from stereo images. In addition, GNSS/on-board vehicle sensor/vision based positioning algorithm is developed by combining vision based positioning algorithm with existing positioning algorithm. For the performance analysis, the velocity calculated from the actual driving test was used for the navigation solution correction, simulation tests were performed to analyse the effects of velocity precision. As a result of analysis, it is confirmed that about 4% of position accuracy is improved when vision information is added compared to existing GNSS/on-board based positioning algorithm.

The Development of X-ray image processing system for product inspection. (물품 검사를 위한 X-선 영상 처리 시스템 개발)

  • Moon, Ha-jung;Lee, Dong-hoon
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2014.05a
    • /
    • pp.826-828
    • /
    • 2014
  • Recently trend of product is miniaturization. As a result, We need products surface as well as products internal defect inspection. Generally, Inspection products in production process uses a lot of optical inspection. However, This is difficult to internal inspection of products. We used optical device instead of X-ray generator. At the same time, We have developed system to determine the product defect. First, obtain X-ray image from Machine vision function. Next, Measured value is recognize suitability within error range. otherwise recognize defect. Results presence of defective products can be stored by user.

  • PDF

A Study on the Web Building Assistant System Using GUI Object Detection and Large Language Model (웹 구축 보조 시스템에 대한 GUI 객체 감지 및 대규모 언어 모델 활용 연구)

  • Hyun-Cheol Jang;Hyungkuk Jang
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2024.05a
    • /
    • pp.830-833
    • /
    • 2024
  • As Large Language Models (LLM) like OpenAI's ChatGPT[1] continue to grow in popularity, new applications and services are expected to emerge. This paper introduces an experimental study on a smart web-builder application assistance system that combines Computer Vision with GUI object recognition and the ChatGPT (LLM). First of all, the research strategy employed computer vision technology in conjunction with Microsoft's "ChatGPT for Robotics: Design Principles and Model Abilities"[2] design strategy. Additionally, this research explores the capabilities of Large Language Model like ChatGPT in various application design tasks, specifically in assisting with web-builder tasks. The study examines the ability of ChatGPT to synthesize code through both directed prompts and free-form conversation strategies. The researchers also explored ChatGPT's ability to perform various tasks within the builder domain, including functions and closure loop inferences, basic logical and mathematical reasoning. Overall, this research proposes an efficient way to perform various application system tasks by combining natural language commands with computer vision technology and LLM (ChatGPT). This approach allows for user interaction through natural language commands while building applications.

The Manufacture of Digital X-ray Devices and Implementation of Image Processing Algorithm (디지털 X-ray 장치 제작 및 영상 처리 알고리즘 구현)

  • Kim, So-young;Park, Seung-woo;Lee, Dong-hoon
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.21 no.4
    • /
    • pp.195-201
    • /
    • 2020
  • This study studied scoliosis, one of the most common modern diseases caused by lifestyle patterns of office workers sitting in front of computers all day and modern people who use smart phones frequently. Scoliosis is a typical complication that takes more than 80% of the nation's total population at least once. X-ray are used to test for these complications. X-ray, a non-destructive testing method that allows scoliosis to be easily performed and filmed in various areas such as the chest, abdomen and bone without contrast agents or other instruments. We uses NI DAQ to miniaturize digital X-ray imaging devices and image intensifier in self-shielding housing with Vision Assistant for drawing lines to the top and the bottom of the spine to acquire angles, i.e. curvature in real-time. In this way, the research was conducted to see scoliosis patients and their condition easily and to help rapid treatment for solving the problem of posture correction in modern people.

Lane-Level Positioning based on 3D Tracking Path of Traffic Signs (교통 표지판의 3차원 추적 경로를 이용한 자동차의 주행 차로 추정)

  • Park, Soon-Yong;Kim, Sung-ju
    • The Journal of Korea Robotics Society
    • /
    • v.11 no.3
    • /
    • pp.172-182
    • /
    • 2016
  • Lane-level vehicle positioning is an important task for enhancing the accuracy of in-vehicle navigation systems and the safety of autonomous vehicles. GPS (Global Positioning System) and DGPS (Differential GPS) are generally used in navigation service systems, which however only provide an accuracy level up to 2~3 m. In this paper, we propose a 3D vision based lane-level positioning technique which can provides accurate vehicle position. The proposed method determines the current driving lane of a vehicle by tracking the 3D position of traffic signs which stand at the side of the road. Using a stereo camera, the 3D tracking paths of traffic signs are computed and their projections to the 2D road plane are used to determine the distance from the vehicle to the signs. Several experiments are performed to analyze the feasibility of the proposed method in many real roads. According to the experimental results, the proposed method can achieve 90.9% accuracy in lane-level positioning.

Improvement of Stixel Segmentation Using Additive Image Domain Features and Genetic Algorithm-based Optimization (영상 영역 특징 추가 및 유전 알고리즘 기반 최적화를 통한 스틱셀 분할 개선 방법)

  • Lee, Sunyoung;Suhr, Jae Kyu;Jung, Ho Gi
    • Transactions of the Korean Society of Automotive Engineers
    • /
    • v.23 no.6
    • /
    • pp.565-574
    • /
    • 2015
  • Recently, a medium-level representation named "Stixel" has been extensively researched in stereo vision-based environmental perception. Obstacle detection using Stixel representation consists of three steps: static Stixel generation, dynamic Stixel generation, and Stixel segmentation. This paper focuses on the Stixel segmentation step and has two contributions. One is that it shows that Stixel segmentation performance can be enhanced by utilizing both image domain and real world domain features. The other is that it suggests that parameters used for Stixel segmentation can be effectively tuned based on genetic algorithm. The proposed method was quantitatively evaluated and the result showed that the proposed method increased Stixel segmentation accuracy compared with the previous method.

Lane Detection for Adaptive Control of Autonomous Vehicle (지능형 자동차의 적응형 제어를 위한 차선인식)

  • Kim, Hyeon-Koo;Ju, Yeonghwan;Lee, Jonghun;Park, Yongwan;Jeong, Ho-Yeol
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.4 no.4
    • /
    • pp.180-189
    • /
    • 2009
  • Currently, most automobile companies are interested in research on intelligent autonomous vehicle. They are mainly focused on driver's intelligent assistant and driver replacement. In order to develop an autonomous vehicle, lateral and longitudinal control is necessary. This paper presents a lateral and longitudinal control system for autonomous vehicle that has only mono-vision camera. For lane detection, we present a new lane detection algorithm using clothoid parabolic road model. The proposed algorithm in compared with three other methods such as virtual line method, gradient method and hough transform method, in terms of lane detection ratio. For adaptive control, we apply a vanishing point estimation to fuzzy control. In order to improve handling and stability of the vehicle, the modeling errors between steering angle and predicted vanishing point are controlled to be minimized. So, we established a fuzzy rule of membership functions of inputs (vanishing point and differential vanishing point) and output (steering angle). For simulation, we developed 1/8 size robot (equipped with mono-vision system) of the actual vehicle and tested it in the athletics track of 400 meter. Through the test, we prove that our proposed method outperforms 98 % in terms of detection rate in normal condition. Compared with virtual line method, gradient method and hough transform method, our method also has good performance in the case of clear, fog and rain weather.

  • PDF

PDA-based Text Extraction System using Client/Server Architecture (Client/Server구조를 이용한 PDA기반의 문자 추출 시스템)

  • Park Anjin;Jung Keechul
    • Journal of KIISE:Software and Applications
    • /
    • v.32 no.2
    • /
    • pp.85-98
    • /
    • 2005
  • Recently, a lot of researches about mobile vision using Personal Digital Assistant(PDA) has been attempted. Many CPUs for PDA are integer CPUs, which have no floating-computation component. It results in slow computation of the algorithms peformed by vision system or image processing, which have much floating-computation. In this paper, in order to resolve this weakness, we propose the Client(PDA)/server(PC) architecture which is connected to each other with a wireless LAN, and we construct the system with pipelining processing using two CPUs of the Client(PDA) and the Server(PC) in image sequence. The Client(PDA) extracts tentative text regions using Edge Density(ED). The Server(PC) uses both the Multi-1.aver Perceptron(MLP)-based texture classifier and Connected Component(CC)-based filtering for a definite text extraction based on the Client(PDA)'s tentativel99-y extracted results. The proposed method leads to not only efficient text extraction by using both the MLP and the CC, but also fast running time using Client(PDA)/server(PC) architecture with the pipelining processing.

Development a Meal Support System for the Visually Impaired Using YOLO Algorithm (YOLO알고리즘을 활용한 시각장애인용 식사보조 시스템 개발)

  • Lee, Gun-Ho;Moon, Mi-Kyeong
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.16 no.5
    • /
    • pp.1001-1010
    • /
    • 2021
  • Normal people are not deeply aware of their dependence on sight when eating. However, since the visually impaired do not know what kind of food is on the table, the assistant next to them holds the blind spoon and explains the position of the food in a clockwise direction, front and rear, left and right, etc. In this paper, we describe the development of a meal assistance system that recognizes each food image and announces the name of the food by voice when a visually impaired person looks at their table using a smartphone camera. This system extracts the food on which the spoon is placed through the YOLO model that has learned the image of food and tableware (spoon), recognizes what the food is, and notifies it by voice. Through this system, it is expected that the visually impaired will be able to eat without the help of a meal assistant, thereby increasing their self-reliance and satisfaction.

The Relationships Between Low Vision and Socioeconomic Status in Korean Adults (저시력과 사회경제적 상태와의 관계)

  • Park, Jee-Hyun
    • Journal of Korean Ophthalmic Optics Society
    • /
    • v.16 no.3
    • /
    • pp.319-325
    • /
    • 2011
  • Purpose: The relativity of factors between low vision and socioeconomic status were investigated. This study represented the preliminary data for establishment of public eye health policy. Further, this report would encourage people to change the social attitudes about the eye health equity of the nation. Methods: The number of people (2,514 people) who have been tested the forced visual activity were examined as it was referred the Korea National Health and Nutrition Examination Survey (KNHNE) of 2009-year data. The prevalence rate of low vision of subjects which are related with house income, education level and occupations were conducted with ttest and chi square test. Besides, the Binominal Logistic Regression was conducted to measure the odds ratio of the subjects. Results: In outline, the prevalence rate of low vision was high with low house income, low education level and low function. The odds ratio represented that 2.77(95% CI, 1.72-4.47) at low house income group and 4.02(95% CI, 1.75-9.23) at the case of below primary school education level. Moreover, the results of unemployed group showed 3.65(1.14-11.68) from the odds ratio measurement. Conclusions: The eye health policy need be instituted which is broad and meticulous support to ease the eye health equity of low eye sight patients. For instance, the education about eye health, examination business of eye disease, and education of assistant units which are useful for low eye sight would suggest practical solution.