• Title/Summary/Keyword: Mask Recognition

Search Result 181, Processing Time 0.023 seconds

Block Label-based Binary Connected-component Labeling using an efficient pixel-based scan mask (효율적인 화소기반 스캔마스크를 이용한 블록라벨기반 이진연결요소 라벨링)

  • Kim, Kyoil
    • Journal of Digital Convergence
    • /
    • v.11 no.4
    • /
    • pp.259-266
    • /
    • 2013
  • Binary connected-components labeling, which is widely used in the field of the pattern recognition, has been researched for a long time as one of the basic image processing techniques. Two-scan algorithm has been mainly used in the researches of the connected-components labeling. Recently, for the first scan in the two-scan algorithm, block-based labeling approaches have been used and reported as the fastest methods. In this paper, a new efficient scan mask for connected-components labeling with a block-based labeling approach is proposed. Labeling with the new pixel-based scan mask is more efficient than any other existing method. The results of the experiments show that the proposed method is faster than the existing fastest method.

Impacts of label quality on performance of steel fatigue crack recognition using deep learning-based image segmentation

  • Hsu, Shun-Hsiang;Chang, Ting-Wei;Chang, Chia-Ming
    • Smart Structures and Systems
    • /
    • v.29 no.1
    • /
    • pp.207-220
    • /
    • 2022
  • Structural health monitoring (SHM) plays a vital role in the maintenance and operation of constructions. In recent years, autonomous inspection has received considerable attention because conventional monitoring methods are inefficient and expensive to some extent. To develop autonomous inspection, a potential approach of crack identification is needed to locate defects. Therefore, this study exploits two deep learning-based segmentation models, DeepLabv3+ and Mask R-CNN, for crack segmentation because these two segmentation models can outperform other similar models on public datasets. Additionally, impacts of label quality on model performance are explored to obtain an empirical guideline on the preparation of image datasets. The influence of image cropping and label refining are also investigated, and different strategies are applied to the dataset, resulting in six alternated datasets. By conducting experiments with these datasets, the highest mean Intersection-over-Union (mIoU), 75%, is achieved by Mask R-CNN. The rise in the percentage of annotations by image cropping improves model performance while the label refining has opposite effects on the two models. As the label refining results in fewer error annotations of cracks, this modification enhances the performance of DeepLabv3+. Instead, the performance of Mask R-CNN decreases because fragmented annotations may mistake an instance as multiple instances. To sum up, both DeepLabv3+ and Mask R-CNN are capable of crack identification, and an empirical guideline on the data preparation is presented to strengthen identification successfulness via image cropping and label refining.

New Approach to Optimize the Size of Convolution Mask in Convolutional Neural Networks

  • Kwak, Young-Tae
    • Journal of the Korea Society of Computer and Information
    • /
    • v.21 no.1
    • /
    • pp.1-8
    • /
    • 2016
  • Convolutional neural network (CNN) consists of a few pairs of both convolution layer and subsampling layer. Thus it has more hidden layers than multi-layer perceptron. With the increased layers, the size of convolution mask ultimately determines the total number of weights in CNN because the mask is shared among input images. It also is an important learning factor which makes or breaks CNN's learning. Therefore, this paper proposes the best method to choose the convolution size and the number of layers for learning CNN successfully. Through our face recognition with vast learning examples, we found that the best size of convolution mask is 5 by 5 and 7 by 7, regardless of the number of layers. In addition, the CNN with two pairs of both convolution and subsampling layer is found to make the best performance as if the multi-layer perceptron having two hidden layers does.

Facial Recognition Algorithm Based on Edge Detection and Discrete Wavelet Transform

  • Chang, Min-Hyuk;Oh, Mi-Suk;Lim, Chun-Hwan;Ahmad, Muhammad-Bilal;Park, Jong-An
    • Transactions on Control, Automation and Systems Engineering
    • /
    • v.3 no.4
    • /
    • pp.283-288
    • /
    • 2001
  • In this paper, we proposed a method for extracting facial characteristics of human being in an image. Given a pair of gray level sample images taken with and without human being, the face of human being is segmented from the image. Noise in the input images is removed with the help of Gaussian filters. Edge maps are found of the two input images. The binary edge differential image is obtained from the difference of the two input edge maps. A mask for face detection is made from the process of erosion followed by dilation on the resulting binary edge differential image. This mask is used to extract the human being from the two input image sequences. Features of face are extracted from the segmented image. An effective recognition system using the discrete wave let transform (DWT) is used for recognition. For extracting the facial features, such as eyebrows, eyes, nose and mouth, edge detector is applied on the segmented face image. The area of eye and the center of face are found from horizontal and vertical components of the edge map of the segmented image. other facial features are obtained from edge information of the image. The characteristic vectors are extrated from DWT of the segmented face image. These characteristic vectors are normalized between +1 and -1, and are used as input vectors for the neural network. Simulation results show recognition rate of 100% on the learned system, and about 92% on the test images.

  • PDF

Object Detection based on Mask R-CNN from Infrared Camera (적외선 카메라 영상에서의 마스크 R-CNN기반 발열객체검출)

  • Song, Hyun Chul;Knag, Min-Sik;Kimg, Tae-Eun
    • Journal of Digital Contents Society
    • /
    • v.19 no.6
    • /
    • pp.1213-1218
    • /
    • 2018
  • Recently introduced Mask R - CNN presents a conceptually simple, flexible, general framework for instance segmentation of objects. In this paper, we propose an algorithm for efficiently searching objects of images, while creating a segmentation mask of heat generation part for an instance which is a heating element in a heat sensed image acquired from a thermal infrared camera. This method called a mask R - CNN is an algorithm that extends Faster R - CNN by adding a branch for predicting an object mask in parallel with an existing branch for recognition of a bounding box. The mask R - CNN is added to the high - speed R - CNN which training is easy and fast to execute. Also, it is easy to generalize the mask R - CNN to other tasks. In this research, we propose an infrared image detection algorithm based on R - CNN and detect heating elements which can not be distinguished by RGB images. As a result of the experiment, a heat-generating object which can not be discriminated from Mask R-CNN was detected normally.

Recognition of Unconstrained Handwritten Numerals using Fully-connected RNN (완전궤환 신경망을 이용한 무제약 서체 숫자 인식)

  • 원상철;배수정;최한고
    • Proceedings of the IEEK Conference
    • /
    • 1999.11a
    • /
    • pp.1007-1010
    • /
    • 1999
  • This paper describes the recognition of totally unconstrained handwritten numerals using neural networks. Neural networks with multiple output nodes have been successfully used to classify complex handwritten numerals. The recognition system consists of the preprocessing stage to extract features using Kirsch mask and the classification stage to recognize the numerals using the fully-connected recurrent neural networks (RNN). Simulation results with the numeral database of Concordia university, Montreal, Canada, are presented. The recognition system proposed in this paper outperforms other recognition systems reported on the same database.

  • PDF

Gesture Recognition Using Zernike Moments Masked By Duel Ring (이중 링 마스크 저니키 모멘트를 이용한 손동작 인식)

  • Park, Jung-Su;Kim, Tae-Yong
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.50 no.10
    • /
    • pp.171-180
    • /
    • 2013
  • Generally, when we apply zernike moments value for matching, we can use those moments value obtained from projecting image information under circumscribed circle to zernike basis function. However, the problem is that the power of discrimination can be reduced because hand images include lots of overlapped information due to its special characteristic. On the other hand, when distinguishing hand poses, information in specific area of image information except for overlapped information can increase the power of discrimination. In this paper, in order to solve problems like those, we design R3 ring mask by combining image obtained from R2 ring mask, which can weight information of the power of discrimination and image obtained from R1 ring mask, which eliminate the overlapped information. The moments which are obtained by R3 ring mask decrease operational time by reducing dimension through principle component analysis. In order to confirm the superiority of the suggested method, we conducted some experiments by comparing our method to other method using seven different hand poses.

A Study of the Mask and Hand Contamination in Dental Clinic (치과에서의 마스크 및 손의 미생물 오염정도 비교)

  • Pyo, Eunji;Lee, Kyunghee
    • Journal of The Korean Society of Integrative Medicine
    • /
    • v.7 no.3
    • /
    • pp.85-94
    • /
    • 2019
  • Purpose: The purpose of this study was to observe the degree of mask contamination in dental hygienist for general and oral bacteria and to identify areas of mask contamination after treatment. Methods: Masks were collected with every fifty dental hygienists who currently working in the department of preventive dentistry, prosthodontics, and orthodontics in Busan. The mask bacteria were collected in specific upper and side parts of the mask. Hand germs were collected using sterile cotton swabs, and then placed in a sterile conical tube. These were transferred to the laboratory. Hand germs and mask bacteria were incubated with nutrient broth (NB) and brain heart infusion broth (BHI) for 24 hrs and each cultured with NB and BHI plate at $37^{\circ}C$ for 48 hrs. Collected data were analyzed using the SPSS Window 20. Results: The number of bacteria was observed in the order of the department of preventive dentistry ($10.1{\times}10^5CFU/ml$), prosthodontics ($14.7{\times}10^5CFU/ml$), and orthodontics ($23.3{\times}10^5CFU/ml$) in the hand. In general bacteria, the difference of contamination was seen by the parts of the mask, but there was no significant difference. However, the oral bacteria were observed highly contaminated upper part of the mask in preventive dentistry. The mask contamination according to the medical departments was observed. Especially, the contamination of mask in preventive dentistry was significantly higher than other departments in oral bacteria. Conclusion: This study suggested that correct mask replacement and recognition of contamination areas can contribute to the prevention of infectious disease. and it would be necessary to increase hand hygiene performance to prevent cross-infection with masks. Also, this study may give an idea for making guidelines for mask management and supporting to establish clear criteria for the education program of personal protective equipment.

A Study on Edge Detection using Directional Mask in Impulse Noise Image (임펄스 잡음 영상에서 방향성 마스크를 이용한 에지 검출에 관한 연구)

  • Lee, Chang-Young;Kim, Nam-Ho
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.15 no.4
    • /
    • pp.135-140
    • /
    • 2014
  • As the digital image devices are widely used, interests in the software- and the hardware-related image processing become higher and the image processing techniques are applied in various fields such as object recognition, object detection, fingerprint recognition, and etc. For the edge detections Sobel, Prewitt, Laplacian, Roberts and Canny detectors are used and these existing methods can excellently detect the edges of the images without noise. However, in the images corrupted by the impulse noise, these methods are insufficent in noise elimination characteristics, showing unsatisfactory edge detection. Therefore in this paper, in order to obtain excellent edge detection characteristics in the corrupted image by the impulse noise, an detection algorithm is porposed, which uses the central pixel of mask divided by four regions along the axis, calculates the estimated mask according to the representing pixel values in each regions, and detects the final edges by applying the estimates mask and the new directional one.

Design of Face with Mask Detection System in Thermal Images Using Deep Learning (딥러닝을 이용한 열영상 기반 마스크 검출 시스템 설계)

  • Yong Joong Kim;Byung Sang Choi;Ki Seop Lee;Kyung Kwon Jung
    • Convergence Security Journal
    • /
    • v.22 no.2
    • /
    • pp.21-26
    • /
    • 2022
  • Wearing face masks is an effective measure to prevent COVID-19 infection. Infrared thermal image based temperature measurement and identity recognition system has been widely used in many large enterprises and universities in China, so it is totally necessary to research the face mask detection of thermal infrared imaging. Recently introduced MTCNN (Multi-task Cascaded Convolutional Networks)presents a conceptually simple, flexible, general framework for instance segmentation of objects. In this paper, we propose an algorithm for efficiently searching objects of images, while creating a segmentation of heat generation part for an instance which is a heating element in a heat sensed image acquired from a thermal infrared camera. This method called a mask MTCNN is an algorithm that extends MTCNN by adding a branch for predicting an object mask in parallel with an existing branch for recognition of a bounding box. It is easy to generalize the R-CNN to other tasks. In this paper, we proposed an infrared image detection algorithm based on R-CNN and detect heating elements which can not be distinguished by RGB images.