• Title/Summary/Keyword: a vision system

Design and Implementation of OpenCV-based Inventory Management System to build Small and Medium Enterprise Smart Factory (중소기업 스마트공장 구축을 위한 OpenCV 기반 재고관리 시스템의 설계 및 구현)

  • Jang, Su-Hwan;Jeong, Jopil
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.19 no.1
    • /
    • pp.161-170
    • /
    • 2019
  • Small and medium-sized enterprise (SME) factories engaged in multi-product mass production handle a wide variety and a large number of products, and waste manpower and money on inventory management. In addition, they have no way to check inventory status in real time, so they suffer economic losses from both excess inventory and stock shortages. There are many ways to build a real-time data collection environment, but most are unaffordable for SMEs. As a result, SME smart factories face a difficult reality and struggle to find appropriate countermeasures. In this paper, we extend the existing inventory management method by extracting characters from labels bearing barcodes and QR codes, which are widely adopted as current product management technology, and evaluate the effect. Technically, the existing management of product input and output is handled through computer image processing: stock labels and barcodes are preprocessed with OpenCV for automatic recognition and classification, the characters are read with the OCR (Optical Character Recognition) function of the Google Vision API, and the barcodes are recognized with ZBar. We propose a method to manage inventory by real-time image recognition on a Raspberry Pi, without using expensive equipment.

A Study on Utilization of Vision Transformer for CTR Prediction (CTR 예측을 위한 비전 트랜스포머 활용에 관한 연구)

  • Kim, Tae-Suk;Kim, Seokhun;Im, Kwang Hyuk
    • Knowledge Management Research
    • /
    • v.22 no.4
    • /
    • pp.27-40
    • /
    • 2021
  • Click-through rate (CTR) prediction is a key function of a recommendation system: it determines the ranking of candidate items and recommends the high-ranking ones, reducing customer information overload and maximizing profit through sales promotion. Natural language processing and image classification are advancing remarkably through the use of deep neural networks. Recently, a transformer model based on an attention mechanism, distinct from the mainstream models in those fields, has been proposed and has achieved state-of-the-art results there. In this study, we present a method for improving the performance of a transformer model for CTR prediction. CTR data are discrete and categorical, unlike natural language and image data; to analyze how these characteristics affect performance, we run experiments on embedding regularization and transformer normalization. The experimental results confirm that the prediction performance of the transformer improves significantly when L2 regularization is applied in the embedding process for CTR input data and when batch normalization is applied to the transformer in place of layer normalization, its default normalization method.
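
The difference between the two normalization schemes named in this abstract, and the form of an L2 penalty on an embedding table, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy batch, and the `weight_decay` value are assumptions made for illustration, and the learnable scale/shift parameters and inference-time running statistics of real normalization layers are omitted.

```python
import math


def _normalize(values, eps=1e-5):
    """Shift a list of numbers to zero mean and scale it to unit variance."""
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    return [(v - mean) / math.sqrt(var + eps) for v in values]


def layer_norm(batch):
    """Layer normalization: each sample (row) is normalized over its own features."""
    return [_normalize(row) for row in batch]


def batch_norm(batch):
    """Batch normalization: each feature (column) is normalized over the batch."""
    columns = [_normalize(list(col)) for col in zip(*batch)]
    return [list(row) for row in zip(*columns)]


def l2_penalty(embedding_table, weight_decay):
    """L2 regularization term that would be added to the training loss."""
    return weight_decay * sum(w * w for row in embedding_table for w in row)


# Hypothetical batch: 3 samples, 2 features on very different scales.
batch = [[1.0, 200.0], [3.0, 400.0], [5.0, 600.0]]
print(layer_norm(batch)[0])   # statistics taken within the first sample
print(batch_norm(batch)[0])   # statistics taken per feature, across samples
print(l2_penalty([[1.0, 2.0], [3.0, 4.0]], weight_decay=0.01))
```

With features on very different scales, layer normalization compares a sample's features with each other, while batch normalization rescales each feature independently using statistics from the other samples in the batch.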

Performance of Mini-Sprinkler - (2) Size of Droplets (미니 스프링클러의 살수 기능 - (2) 살수 입자의 크기)

  • 서상룡;성제훈
    • Journal of Bio-Environment Control
    • /
    • v.6 no.3
    • /
    • pp.183-189
    • /
    • 1997
  • This study was performed to investigate the size of droplets sprinkled from mini-sprinklers. Twelve kinds of sprinklers with various structures and nozzle orifice sizes were selected and tested. The diameters of droplets reaching several distances from a sprinkler were measured by a machine vision system, and the volume median diameters (VMD) were determined statistically. Droplet size was not affected much by the size of the nozzle orifice; it was affected more by the structure of the sprinkler, especially the shape of its spreader. An experiment with varying sprinkling water pressure confirmed that droplet size is inversely proportional to the water pressure raised to the power of 1/3. Hence the droplet size at any water pressure can be easily estimated from experimental data. Droplet size increased with the travel distance of the droplet in a relationship of a 2nd-order function. The droplet sizes of the tested sprinklers were in the ranges of 100~300 $\mu\mathrm{m}$ within 1 m of travel distance, 230~470 $\mu\mathrm{m}$ at 1~2 m, and 300~770 $\mu\mathrm{m}$ at 2~3 m.
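
The pressure relationship stated in this abstract, $d \propto P^{-1/3}$, implies that a droplet size measured at one pressure can be rescaled to another as $d_2 = d_1 (P_1 / P_2)^{1/3}$. The following is a minimal sketch of that arithmetic; the function name and the numbers in the example are hypothetical and are not data from the paper.

```python
def droplet_size_at_pressure(d_ref_um, p_ref, p_new):
    """Rescale a measured droplet size to a new water pressure.

    Assumes droplet size is inversely proportional to pressure**(1/3).
    p_ref and p_new must be positive and in the same unit.
    """
    if p_ref <= 0 or p_new <= 0:
        raise ValueError("pressures must be positive")
    return d_ref_um * (p_ref / p_new) ** (1.0 / 3.0)


# Hypothetical reference point: 400 um measured at 100 kPa.
# Raising the pressure eightfold should roughly halve the droplet size.
print(droplet_size_at_pressure(400.0, p_ref=100.0, p_new=800.0))
```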

A study on the accuracy of optotypes test chart (자동 시력표 정확도에 관한 연구)

  • Song, Kyung-Sek;Kim, Tae-Hun;Sung, A-Young
    • Journal of Korean Ophthalmic Optics Society
    • /
    • v.10 no.2
    • /
    • pp.111-118
    • /
    • 2005
  • The optotypes that are indispensable in optometry are internationally recognized versions containing items such as Landolt rings, the Snellen chart, Arabic numerals, Korean letters, and pictures. In Korea, the Han-chun-suk test chart is in general use, and the Chung-san and Jin-yong-han test charts are also used as wall charts. However, wall-mounted test charts have problems, such as test results that differ with the level of illumination, so a more accurate method is required. To solve this problem of inaccuracy in optometry, projected charts using digital instruments such as a beam projector have recently been developed. Such a chart projector, with consistently high resolution and the ability to provide various charts, can help the examiner perform an effective examination and is therefore regarded positively as an automated total optometry system. The purpose of this study is to examine the accuracy of the projected chart by comparing it with a frequently used test chart. The results of the experiment are as follows. With the projected chart, subjects read one step higher than their fully corrected vision in 10% of cases and two steps higher in 2%. With the Han-chun-suk test chart, subjects read one step higher in 12% of cases and two steps higher in 4%.

An Adaptive Multi-Level Thresholding and Dynamic Matching Unit Selection for IC Package Marking Inspection (IC 패키지 마킹검사를 위한 적응적 다단계 이진화와 정합단위의 동적 선택)

  • Kim, Min-Ki
    • The KIPS Transactions:PartB
    • /
    • v.9B no.2
    • /
    • pp.245-254
    • /
    • 2002
  • An IC package marking inspection system using machine vision locates and identifies the target elements in an input image and judges the quality of the marking by comparing the extracted target elements with standard patterns. This paper proposes an adaptive multi-level thresholding (AMLT) method suited to a series of operations such as locating the target IC package, extracting the characters, and detecting the Pin 1 dimple. It also proposes a dynamic matching unit selection (DMUS) method that is robust to noise and effective at catching local marking errors. The main idea of the AMLT method is to restrict the input of Otsu's thresholding algorithm to a specified area and a partial range of gray values, so that it can adapt to the specific domain. The DMUS method dynamically selects the matching unit according to the results of character extraction and layout analysis. Therefore, despite the various errors that can occur during character extraction and layout analysis, it can select the minimal matching unit in any environment. In an experiment with 280 IC package images of eight types, the correct extraction rate for the IC package and the Pin 1 dimple was 100%, and the correct decision rate for marking quality was 98.8%. These results show that the proposed methods are effective for IC package marking inspection.
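
The core idea described here, feeding Otsu's algorithm only the pixels inside a specified area and a partial gray range, can be sketched as follows. This is an illustration of that restriction under stated assumptions, not the paper's AMLT procedure: the function names, the `(top, left, bottom, right)` region format, and the toy image are hypothetical, and the multi-level and adaptive parts of the method are not reproduced.

```python
def otsu_threshold(pixels, lo=0, hi=255):
    """Otsu's threshold computed only from gray values in [lo, hi].

    Returns t such that values <= t form one class and values > t the other,
    chosen to maximize the between-class variance.
    """
    hist = [0] * 256
    for p in pixels:
        if lo <= p <= hi:
            hist[p] += 1
    total = sum(hist)
    if total == 0:
        raise ValueError("no pixels in the requested gray range")
    sum_all = sum(g * hist[g] for g in range(256))

    best_t, best_var = lo, -1.0
    w0 = 0      # pixel count of the darker class
    sum0 = 0    # gray-value sum of the darker class
    for t in range(lo, hi):
        w0 += hist[t]
        sum0 += t * hist[t]
        w1 = total - w0
        if w0 == 0 or w1 == 0:
            continue  # one class is empty, no valid split at t
        m0 = sum0 / w0
        m1 = (sum_all - sum0) / w1
        between = w0 * w1 * (m0 - m1) ** 2
        if between > best_var:
            best_var, best_t = between, t
    return best_t


def restricted_otsu(image, region, lo=0, hi=255):
    """Apply Otsu's method to a rectangular area and a partial gray range."""
    top, left, bottom, right = region  # bottom and right are exclusive
    pixels = [image[r][c] for r in range(top, bottom) for c in range(left, right)]
    return otsu_threshold(pixels, lo, hi)


# Hypothetical 4x4 image: a bright frame (255) around a 2x2 area of interest.
image = [
    [255, 255, 255, 255],
    [255, 10, 200, 255],
    [255, 10, 200, 255],
    [255, 255, 255, 255],
]
print(restricted_otsu(image, region=(1, 1, 3, 3)))
```

Restricting the region keeps the bright frame from dominating the histogram, so the threshold separates the two gray levels inside the area of interest.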

An Experience of Living Lab as Energy Transition Experiment: The Case of Urban Living Lab for Mini-PV System in Seong-Dae-Gol, Seoul, KOREA (에너지전환 실험의 장으로서 한국 리빙랩의 경험: 성대골의 도시지역 미니태양광 사례를 중심으로)

  • Kim, Jun han;Han, Jae kak
    • Journal of Science and Technology Studies
    • /
    • v.18 no.1
    • /
    • pp.219-265
    • /
    • 2018
  • Recently, interest in energy transition has been rising. Energy transition requires the active participation and cooperation of diverse stakeholders, including users and citizens, because it requires not only changes in technological factors but also changes in, and coordination of, various social factors. Living labs are attracting attention as one way to achieve this. This article is a detailed analysis of the activities of the mini-PV living lab carried out from 2016 to 2017 in the urban area of Seong-Dae-Gol, Seoul. Through the living lab, mini-PV DIY products, backup centers, local financial services, and a variety of education and training strategies were developed. These activities and achievements were analyzed through questions raised at the strategic, tactical, and operational levels, as well as through the multi-level perspective and the interaction between initiative, regime, and niche. In conclusion, this living lab activity confirmed the possibility of a 'transition lab' for solving social problems such as the sustainability of energy production and utilization. In particular, it achieved remarkable results at the operational level of transition management governance, that is, the transition experiment, and it was also remarkable in that it was initiated by citizens. However, it did not proceed without difficulty. In particular, structural problems appeared, such as the conflict between the flexibility inherent in a living lab and the bureaucratic rigidity of the financial support organization. A further limitation was the absence of a 'transition field' at the strategic level, which is needed to replicate and expand strategic niches while spreading the knowledge gained from the transition experiment and forming a vision of transition.

The Characteristics of View Landscape in Modern Daegu (근대 대구시의 조망경관 특성분석)

  • Park, Jin-Wook;Hwang, Guk-Woong
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.16 no.3
    • /
    • pp.54-67
    • /
    • 2013
  • This study analyzes the characteristics of the view landscape of modern Daegu using a geographic information system (GIS). The view landscape analysis used GIS to overlay the land use map on the map of the range of visibility, together with 3-D simulation. The results are as follows. First, the ratio of forest within the range of visibility is extremely high. The distribution of landscape components gives dwellers a clear view toward forests from anywhere. The landscape components include the western eroded lowlands, the eastern open rolling lands, the eastern eroded lowlands, and high mountain areas: Apsan (Mt.) in the south, Waryoungsan (Mt.) in the west, and Hamjisan (Mt.) and Hakbong (Mt.) in the north. From their summits, people have a clear view from the viewpoint toward the surrounding mountains, because rural areas extend continuously from the viewpoint to the mountains. These natural environmental factors have formed a continuous view landscape. Finally, in the space between the urban area and the outer mountains there are multiple forest-covered view targets of relatively high altitude, lower than the outer mountains behind them, which produces a scenery of mountains overlapped by higher mountains.
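
The overlay step mentioned in this abstract, combining a land use map with a visibility map to obtain the share of each land use class within the visible range, can be sketched on small raster grids. The grid values, class names, and function name below are hypothetical; the study's actual GIS workflow and data are not reproduced.

```python
from collections import Counter


def land_use_share_in_view(land_use, visible):
    """Share of each land use class among the cells marked as visible.

    land_use: 2-D grid of class labels.
    visible:  2-D grid of booleans with the same shape (True = in view).
    """
    counts = Counter(
        cls
        for lu_row, vis_row in zip(land_use, visible)
        for cls, vis in zip(lu_row, vis_row)
        if vis
    )
    total = sum(counts.values())
    if total == 0:
        return {}
    return {cls: n / total for cls, n in counts.items()}


# Hypothetical 2x3 rasters.
land_use = [
    ["forest", "forest", "urban"],
    ["forest", "farm", "urban"],
]
visible = [
    [True, True, False],
    [True, False, True],
]
print(land_use_share_in_view(land_use, visible))
```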

A Novel Fast and High-Performance Image Quality Assessment Metric using a Simple Laplace Operator (단순 라플라스 연산자를 사용한 새로운 고속 및 고성능 영상 화질 측정 척도)

  • Bae, Sung-Ho;Kim, Munchurl
    • Journal of Broadcast Engineering
    • /
    • v.21 no.2
    • /
    • pp.157-168
    • /
    • 2016
  • In the image processing and computer vision fields, mean squared error (MSE) has been widely used as an objective metric in image quality optimization problems because of its desirable mathematical properties, such as metricability, differentiability, and convexity. However, because MSE is known not to correlate highly with perceived visual quality, much effort has gone into developing new image quality assessment (IQA) metrics that have both these desirable mathematical properties and high prediction performance for subjective visual quality scores. Although recent IQA metrics with the desirable mathematical properties have shown promising prediction performance for visual quality scores, they also have high computational complexity. To alleviate this problem, we propose a new fast IQA metric using a simple Laplace operator. Because the Laplace operator used in our IQA metric not only effectively mimics the operation of receptive fields in the retina for luminance stimuli but is also simple to compute, our IQA metric yields both very fast processing and high prediction performance. To verify the effectiveness of the proposed IQA metric, we compare it with several state-of-the-art IQA metrics. The experimental results show that the proposed IQA metric has the fastest running speed among the compared IQA methods, except for MSE. Moreover, it achieves the best prediction performance for subjective image quality scores among the state-of-the-art IQA metrics under test.
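
The two building blocks this abstract names, MSE and a simple Laplace operator, can be sketched as follows. The abstract does not give the formula of the proposed metric, so `laplacian_mse` below is only an assumed illustration of comparing Laplacian-filtered images; it is not the authors' metric. The 4-neighbour kernel and the zeroed border are likewise assumptions.

```python
def mse(a, b):
    """Mean squared error between two equally sized 2-D images."""
    n = len(a) * len(a[0])
    return sum((x - y) ** 2 for ra, rb in zip(a, b) for x, y in zip(ra, rb)) / n


def laplacian(img):
    """4-neighbour discrete Laplacian; border pixels are left at 0."""
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for r in range(1, h - 1):
        for c in range(1, w - 1):
            out[r][c] = (
                img[r - 1][c] + img[r + 1][c] + img[r][c - 1] + img[r][c + 1]
                - 4 * img[r][c]
            )
    return out


def laplacian_mse(ref, dist):
    """Illustrative only: MSE between the Laplacian responses of two images."""
    return mse(laplacian(ref), laplacian(dist))


# Hypothetical 3x3 images: a flat reference and a copy with one altered pixel.
ref = [[10, 10, 10], [10, 10, 10], [10, 10, 10]]
dist = [[10, 10, 10], [10, 14, 10], [10, 10, 10]]
print(mse(ref, dist), laplacian_mse(ref, dist))
```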

Edge-Enhanced Error Diffusion Halftoning using Local mean and Spatial Activity (국부 평균과 공간 활성도를 이용한 에지 강조 오차확산법)

  • Kwak Nae-Joung;Kwon Dong-Jin;Kim Young-Gil;Ahn Jae-Hyeong
    • The KIPS Transactions:PartB
    • /
    • v.13B no.2 s.105
    • /
    • pp.77-82
    • /
    • 2006
  • Digital halftoning is a technique for obtaining a bilevel image from a continuous-tone image. Among halftoning methods, error diffusion gives better subjective quality than the others, but it also blurs the edges of objects. To overcome this defect, we propose a modified error diffusion method that enhances edges by exploiting the property that human vision perceives the local average luminance and does not perceive small spatial variations. The proposed method computes a spatial activity, which is the difference between a pixel's luminance and the average luminance of its $3{\times}3$ neighborhood pixels, weighted according to spatial position. The system also uses information of edge enhancement (IEE), which is computed from the normalized spatial activity multiplied by the average luminance. The IEE is added to the quantizer's input pixel and fed into the halftoning quantizer, which produces a halftone image with enhanced edges. Computer experiments show that the proposed method produces clearer bilevel images than conventional methods and preserves the edges of objects well. The proposed method also outperforms the conventional method when measured by edge correlation and local average accordance over a range of viewing distances.
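
For orientation, conventional error diffusion and a simplified spatial activity measure can be sketched as below. Several points are assumptions, since the abstract does not specify them: the Floyd-Steinberg weights (7/16, 3/16, 5/16, 1/16) stand in for the unspecified diffusion kernel, the 3x3 average is unweighted and includes the center pixel rather than being position-weighted, and the IEE term of the proposed method is not implemented.

```python
def spatial_activity(img, r, c):
    """Pixel luminance minus the unweighted mean of its 3x3 neighbourhood.

    Coordinates outside the image are clamped to the nearest edge pixel.
    """
    h, w = len(img), len(img[0])
    total = 0.0
    for dr in (-1, 0, 1):
        for dc in (-1, 0, 1):
            rr = min(max(r + dr, 0), h - 1)
            cc = min(max(c + dc, 0), w - 1)
            total += img[rr][cc]
    return img[r][c] - total / 9.0


def error_diffusion(img, threshold=128):
    """Conventional error diffusion with Floyd-Steinberg weights.

    Input: 2-D grid of luminance values in 0..255. Output: grid of 0 or 255.
    """
    h, w = len(img), len(img[0])
    work = [[float(v) for v in row] for row in img]
    out = [[0] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            old = work[r][c]
            new = 255 if old >= threshold else 0
            out[r][c] = new
            err = old - new  # quantization error pushed to unprocessed pixels
            if c + 1 < w:
                work[r][c + 1] += err * 7 / 16
            if r + 1 < h:
                if c > 0:
                    work[r + 1][c - 1] += err * 3 / 16
                work[r + 1][c] += err * 5 / 16
                if c + 1 < w:
                    work[r + 1][c + 1] += err * 1 / 16
    return out


# Hypothetical 4x4 mid-gray patch.
patch = [[100] * 4 for _ in range(4)]
for row in error_diffusion(patch):
    print(row)
```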

Gesture Spotting by Web-Camera in Arbitrary Two Positions and Fuzzy Garbage Model (임의 두 지점의 웹 카메라와 퍼지 가비지 모델을 이용한 사용자의 의미 있는 동작 검출)

  • Yang, Seung-Eun
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.1 no.2
    • /
    • pp.127-136
    • /
    • 2012
  • Much research has been conducted on vision-based hand gesture recognition, which lets users operate various electronic devices more easily. To recognize hand gestures accurately, the system must calculate 3D position and distinguish meaningful gestures from similar ones. This paper describes a simple and cost-effective method for 3D position calculation and gesture spotting (the task of recognizing a meaningful gesture among other similar, meaningless gestures). The 3D position is obtained by calculating the relative position of two cameras using a pan/tilt module and a marker, regardless of where the cameras are placed. A fuzzy garbage model is proposed to provide a variable reference value for deciding whether a user gesture is a command gesture. The reference is obtained from a fuzzy command gesture model and a fuzzy garbage model, which return scores indicating the degree of membership in the command gesture and the garbage gesture, respectively. To enhance performance, a two-stage user adaptation is proposed: off-line (batch) adaptation for inter-personal differences and on-line (incremental) adaptation for intra-personal differences. An experiment was conducted with 5 different users. The command recognition rate (discriminating the command gesture) was more than 95% when only one command-like meaningless gesture existed and more than 85% when the command was mixed with many other similar gestures.