• Title/Summary/Keyword: Multi-Label Recognition

Search Result 22, Processing Time 0.028 seconds

Deep Learning based Sentence Analysis for Query Generation (검색어 생성을 위한 딥 러닝 기반 문장 분석 연구)

  • Na, Seong-Won;Yoon, Kyoungro
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2018.06a
    • /
    • pp.336-337
    • /
    • 2018
  • 최근 이미지의 Visual 정보를 추출하고 Multi label 분류를 통해 나온 결과의 상관관계를 modeling하여 문장으로 출력하는 CNN-RNN 아키텍처가 많은 발전을 이뤘다. 이 아키텍처의 출력은 이미지의 정보가 요약되어 문장으로 표현되기 때문에 Semantic정보가 풍부하여 유사 콘텐츠 검색에도 사용 가능하다. 하지만 결과 문장에 사람이 포함 되면 광범위한 검색 결과를 얻게 되고 부정확한 결과를 초래하게 된다. 이에 본 논문에서는 문장에서 사람을 인식하여 Identity를 부여함으로써 검색어를 좀 더 구체적으로 생성하고자 한다. 이 문제를 해결하기 위해 자연어 처리의 분야 중 하나인 개체명 인식(Named Entity Recognition) 문제로 다루며, 가장 많이 사용되고 있는 모델인 Bidirectional-LSTM-CRF와 CoNLL2003 dataset을 사용하여 수행 한다.

  • PDF

Design and Implementation of OpenCV-based Inventory Management System to build Small and Medium Enterprise Smart Factory (중소기업 스마트공장 구축을 위한 OpenCV 기반 재고관리 시스템의 설계 및 구현)

  • Jang, Su-Hwan;Jeong, Jopil
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.19 no.1
    • /
    • pp.161-170
    • /
    • 2019
  • Multi-product mass production small and medium enterprise factories have a wide variety of products and a large number of products, wasting manpower and expenses for inventory management. In addition, there is no way to check the status of inventory in real time, and it is suffering economic damage due to excess inventory and shortage of stock. There are many ways to build a real-time data collection environment, but most of them are difficult to afford for small and medium-sized companies. Therefore, smart factories of small and medium enterprises are faced with difficult reality and it is hard to find appropriate countermeasures. In this paper, we implemented the contents of extension of existing inventory management method through character extraction on label with barcode and QR code, which are widely adopted as current product management technology, and evaluated the effect. Technically, through preprocessing using OpenCV for automatic recognition and classification of stock labels and barcodes, which is a method for managing input and output of existing products through computer image processing, and OCR (Optical Character Recognition) function of Google vision API. And it is designed to recognize the barcode through Zbar. We propose a method to manage inventory by real-time image recognition through Raspberry Pi without using expensive equipment.

Development of a sdms (Self-diagnostic monitoring system) with prognostics for a reciprocating pump system

  • Kim, Wooshik;Lim, Chanwoo;Chai, Jangbom
    • Nuclear Engineering and Technology
    • /
    • v.52 no.6
    • /
    • pp.1188-1200
    • /
    • 2020
  • In this paper, we consider a SDMS (Self-Diagnostic Monitoring System) for a reciprocating pump for the purpose of not only diagnosis but also prognosis. We have replaced a multi class estimator that selects only the most probable one with a multi label estimator such that we are able to see the state of each of the components. We have introduced a measure called certainty so that we are able to represent the symptom and its state. We have built a flow loop for a reciprocating pump system and presented some results. With these changes, we are not only able to detect both the dominant symptom as well as others but also to monitor how the degree of severity of each component changes. About the dominant ones, we found that the overall recognition rate of our algorithm is about 99.7% which is slightly better than that of the former SDMS. Also, we are able to see the trend and to make a base to find prognostics to estimate the remaining useful life. With this we hope that we have gone one step closer to the final goal of prognosis of SDMS.

MCBP Neural Netwoek for Effcient Recognition of Tire Claddification Code (타이어 분류 코드의 효율적 인식을 위한 MCBP망)

  • Koo, Gun-Seo;O, Hae-Seok
    • The Transactions of the Korea Information Processing Society
    • /
    • v.4 no.2
    • /
    • pp.465-482
    • /
    • 1997
  • In this paper, we have studied on cinstructing code-recognition shstem by neural network according to a image process taking the DOT classification code stamped on tire surface.It happened to a few problems that characters distorted in edge by diffused reflection and two adjacent characters take the same label,even very sen- sitive to illumination ofr recognition the stamped them on tire.Thus,this paper would propose the algorithm for tire code under being cinscious of these properties and prove the algorithm drrciency with a simulation.Also,we have suggerted the MCBP network composing of multi-linked recognizers of dffcient identify the DOT code being tire classification code.The MCBP network extracts the projection balue for classifying each character's rdgion after taking out the prjection of each chracter's region on X,Y axis,processes each chracters by taking 7$\times$8 normalization.We have improved error rate 3% through the MCBP network and post-process comparing the DOT code Database. This approach has a accomplished that learming time get's improvenent at 60% and recognition rate has become to 95% from 90% than BckPropagation with including post- processing it has attained greate rates of entire of tire recoggnition at 98%.

  • PDF

Multi-Stage Object Tracking Technique for Label Recognition (다단계 객체 추적을 통한 표시 정보의 인식 기법)

  • Choi, Ji-Su;Jung, Dongju;Min, Kyeongsic;Lee, Byungjeong
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2019.10a
    • /
    • pp.972-975
    • /
    • 2019
  • 건강 보조 식품, 의약품, 화장품 등 현대 제품에는 성분에 대한 제품의 구성정보가 라벨 형태로 상세히 기재 되어있다. 이러한 제품들은 실생활에서 접하기 쉽지만, 비전공자인 일반 사용자들이 이러한 성분들을 모두 기억하고 구분하여 사용하기에는 물질의 종류가 너무 많으며, 각 성분의 역할에 대해 면밀히 조사하기란 사실상 불가능하다. 하지만 제품에 대한 정확한 이해 없이는 제품을 사용 및 섭취함으로써 특정 부작용이 생길 수 있으며, 오용 및 남용할 가능성 또한 다분하다. 따라서, 제품 소비자가 사용하고 있는 제품이 어떠한 성분을 가지고 있는지를 정확히 파악할 필요가 있다. 이를 해결하기 위해, 본 논문에서는 기계 학습을 통한 객체 인식에 사용되는 실시간 객체 추적 기법을 활용하여 제품의 라벨을 1 차적으로 인식하고, 2 차적으로 라벨에 기재되어 있는 제품의 구성성분을 객체 인식하는 기법을 제안하고자 한다. 추가적으로, 해당 기법을 모바일 어플리케이션에 적용하여 건강 보조 식품 관리에 활용할 수 있는 방법에 대해 소개한다.

Railway Object Recognition Using Mobile Laser Scanning Data (모바일 레이저 스캐닝 데이터로부터 철도 시설물 인식에 관한 연구)

  • Luo, Chao;Jwa, Yoon Seok;Sohn, Gun Ho;Won, Jong Un;Lee, Suk
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.19 no.2
    • /
    • pp.85-91
    • /
    • 2014
  • The objective of the research is to automatically recognize railway objects from MLS data in which 9 key objects including terrain, track, bed, vegetation, platform, barrier, posts, attachments, powerlines are targeted. The proposed method can be divided into two main sub-steps. First, multi-scale contextual features are extracted to take the advantage of characterizing objects of interest from different geometric levels such as point, line, volumetric and vertical profile. Second, by considering contextual interactions amongst object labels, a contextual classifier is utilized to make a prediction with local coherence. In here, the Conditional Random Field (CRF) is used to incorporate the object context. By maximizing the object label agreement in the local neighborhood, CRF model could compensate the local inconsistency prediction resulting from other local classifiers. The performance of proposed method was evaluated based on the analysis of commission and omission error and shows promising results for the practical use.

Extraction of Worker Behavior at Manufacturing Site using Mask R-CNN and Dense-Net (Mask R-CNN과 Dense-Net을 이용한 제조 현장에서의 작업자 행동 추출)

  • Rijayanti, Rita;Hwang, Mintae;Jin, Kyohong
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.05a
    • /
    • pp.150-153
    • /
    • 2022
  • This paper reports a technique that automatically extracts object shapes through Dense-Net, and subsequently, detects the objects using Mask R-CNN in a manufacturing site, in which workers and objects are mixed. It is based on the customized factory dataset by targeting workers, machines, tools, control boxes, and products as the objects. Mask R-CNN supports multi-object recognition as a well-known object recognition method, while Dense-Net effectively extracts a feature from multiple and overlapping objects. After immediate implementation using the two technologies, the object is naturally extracted from a still image of the manufacturing site to describe image. Afterwards, the result is planned to be used to detect workers' abnormal behavior by adding a label on the objects.

  • PDF

Ultrastructural Change of the Bile Duct Fibroblast at Infected Rat with Clonorchis sinensis (간흡충에 감염된 실험쥐 담관 섬유모세포의 미세구조적 변화)

  • Kim, Soo-Jin;Min, Byoung-Hoon
    • Applied Microscopy
    • /
    • v.34 no.2
    • /
    • pp.121-130
    • /
    • 2004
  • In this study, ultrastructural change of the bile duct fibroblast at infected rat with Clonorchis sinensis, and the distribution of lectin receptors and actin protein in cultured bile duct infected with Clonorchis sinensis. It explored using colloidal gold label complex with lectin WGA purified from wheat germ (Triticum vulgaris) and anti actin antibody purified actin (43 kDa) isolated from chicken back muscle. The lectin WGA with protein A gold complex labeled sections of the cultured fibroblast revealed gold particles specifically distributed on the multi vesicular form Golgi complex and cell surface of the fibroblast. The actin antibody with protein A gold complex labeled sections of the cultured fibroblast revealed gold particles specifically distributed on the cytoplasm of the fibroblast. Labeling of cultured fibroblast in rat bile duct infected with Clonorchis sinensis was then quantified and compared to that of cultured Fibroblast in Rat Bile duct. These results indicate that lectin WGA receptors are located in the multi vesicular form Golgi complex in the cytoplasm to the cytoplasmic process of the Rat bile duct fibroblast infected with Clonorchis sinensis. Therefore, the GlcNAc and NeuNac regions on the cell surface and cytoplasmic process appear to be functionally associated with cell-recognition and protection from other cell of the tissue, and linked with secretion and exocytosis of the fibroblst cytoplasm. GlcNAc and NeuNAc product in the multi vesicular form Golgi complex then it is transported to cell surface. Actin protein is many appears that infected fibroblast rather than normal fibroblast. The fibroblast of infected with Clonorchis sinensis are against of the physical and chemical stimulation. Then development of cytoplasmic process is relative some stimulation.

Event Cognition-based Daily Activity Prediction Using Wearable Sensors (웨어러블 센서를 이용한 사건인지 기반 일상 활동 예측)

  • Lee, Chung-Yeon;Kwak, Dong Hyun;Lee, Beom-Jin;Zhang, Byoung-Tak
    • Journal of KIISE
    • /
    • v.43 no.7
    • /
    • pp.781-785
    • /
    • 2016
  • Learning from human behaviors in the real world is essential for human-aware intelligent systems such as smart assistants and autonomous robots. Most of research focuses on correlations between sensory patterns and a label for each activity. However, human activity is a combination of several event contexts and is a narrative story in and of itself. We propose a novel approach of human activity prediction based on event cognition. Egocentric multi-sensor data are collected from an individual's daily life by using a wearable device and smartphone. Event contexts about location, scene and activities are then recognized, and finally the users" daily activities are predicted from a decision rule based on the event contexts. The proposed method has been evaluated on a wearable sensor data collected from the real world over 2 weeks by 2 people. Experimental results showed improved recognition accuracies when using the proposed method comparing to results directly using sensory features.

Training Performance Analysis of Semantic Segmentation Deep Learning Model by Progressive Combining Multi-modal Spatial Information Datasets (다중 공간정보 데이터의 점진적 조합에 의한 의미적 분류 딥러닝 모델 학습 성능 분석)

  • Lee, Dae-Geon;Shin, Young-Ha;Lee, Dong-Cheon
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.40 no.2
    • /
    • pp.91-108
    • /
    • 2022
  • In most cases, optical images have been used as training data of DL (Deep Learning) models for object detection, recognition, identification, classification, semantic segmentation, and instance segmentation. However, properties of 3D objects in the real-world could not be fully explored with 2D images. One of the major sources of the 3D geospatial information is DSM (Digital Surface Model). In this matter, characteristic information derived from DSM would be effective to analyze 3D terrain features. Especially, man-made objects such as buildings having geometrically unique shape could be described by geometric elements that are obtained from 3D geospatial data. The background and motivation of this paper were drawn from concept of the intrinsic image that is involved in high-level visual information processing. This paper aims to extract buildings after classifying terrain features by training DL model with DSM-derived information including slope, aspect, and SRI (Shaded Relief Image). The experiments were carried out using DSM and label dataset provided by ISPRS (International Society for Photogrammetry and Remote Sensing) for CNN-based SegNet model. In particular, experiments focus on combining multi-source information to improve training performance and synergistic effect of the DL model. The results demonstrate that buildings were effectively classified and extracted by the proposed approach.