• Title/Summary/Keyword: Image Recognition Technologies

Automatic Poster Generation System Using Protagonist Face Analysis

  • Yeonhwi You;Sungjung Yong;Hyogyeong Park;Seoyoung Lee;Il-Young Moon
    • Journal of information and communication convergence engineering / v.21 no.4 / pp.287-293 / 2023
  • With the rapid development of domestic and international over-the-top markets, a large amount of video content is being created. As the volume of video content increases, consumers increasingly check information about videos before watching them. To address this demand, video summaries in the form of plot descriptions, thumbnails, posters, and other formats are provided to consumers. This study proposes an approach that automatically generates posters to effectively convey video content while reducing the cost of video summarization. In the automatic generation of posters, face recognition and clustering are used to gather and classify character data, and keyframes are extracted from the video to learn its overall atmosphere. This study used the facial data of the characters and the keyframes as training data and employed technologies such as DreamBooth, a text-to-image generation model, to automatically generate video posters. This process significantly reduces the time and cost of video-poster production.

Development of an Integrated Quarantine System Using Thermographic Cameras (열화상 카메라를 이용한 통합 방역 시스템 개발)

  • Jung, Bum-Jin;Lee, Jung-Im;Seo, Gwang-Deok;Jeong, Kyung-Ok
    • Journal of the Korea Safety Management & Science / v.24 no.1 / pp.31-38 / 2022
  • The most common symptoms of COVID-19 are high fever, cough, and headache. These symptoms may vary from person to person, but checking for fever is the government's most basic measure, and many facilities use thermographic cameras for this purpose. Because previously developed thermographic cameras measure body temperature one person at a time, measurement takes a long time in places where many people enter and exit, such as multi-use facilities. To prevent malfunctions and errors and to avoid collecting sensitive personal information, this research team attempted to develop a facial recognition thermographic camera. The purpose of this study is to compensate for the shortcomings of existing thermographic cameras with a disaster-safety IoT integrated solution and to provide quarantine systems using advanced facial recognition technologies. In addition, captured image information should be protected as sensitive personal information, yet a leak of such information to China recently occurred. To prevent further leaks of personal information, a thermographic camera that addresses this issue is urgently needed. The thermal imaging camera system based on facial recognition technology developed in this study had resulted in two patents and one patent application as of January 2022. In the COVID-19 infectious disease disaster, quarantine is an essential element of the preventive stage. Therefore, we hope that this development will be useful in the quarantine management field.

User Customized Realization of Virtual Earthquakes based on Visual Intelligence and Dynamic Simulation (시각지능 및 동적 시뮬레이션 기반의 사용자 맞춤형 가상 지진 실감화)

  • Kwon, Jihoe;Ryu, Dongwoo;Lee, Sangho
    • Journal of the Korean Society of Mineral and Energy Resources Engineers / v.55 no.6 / pp.614-623 / 2018
  • The recent occurrence of consecutive large earthquakes in the southeastern part of the Korean peninsula has brought significant attention to the prevention of earthquake damage in Korea. This article explores a technology-based approach to earthquake drills using state-of-the-art visual intelligence and virtual reality technologies. The technical process consists of several stages: acquisition of image information in living spaces using a camera, recognition of objects from the acquired image information, extraction of three-dimensional geometric information, simulation of virtual earthquakes using dynamic modelling techniques such as the discrete element method, and realization of the simulated earthquake in a virtual reality environment. This article provides a comprehensive analysis of the individual processes at each stage, a survey of the current status of related technologies, and a discussion of the technical challenges in its execution.

Evaluation of CAPTCHA Efficiency

  • Youthasoontorn, Passzarkorn;Phaibulpanich, Akarin;Piromsopa, Krerk
    • Journal of Information Technology Applications and Management / v.22 no.3 / pp.55-64 / 2015
  • We propose statistical methods for evaluating the efficiency of CAPTCHA. It has been widely assumed that machines are not capable of reading text precisely. This assumption led to the invention of CAPTCHA, a distorted word or short phrase designed to thwart computers and separate humans from machines. However, advances in image recognition technologies mean that machines are constantly getting better at recognizing CAPTCHAs. This forces CAPTCHA designers to design even more difficult CAPTCHAs to prevent their systems from being gamed by malicious bots. This arms race has an unintended side effect on ordinary users: many CAPTCHAs are now so hard that many people are unable to read them, which conflicts with the purpose for which CAPTCHA was invented in the first place. Our analysis shows that some CAPTCHAs are more user-friendly than others. In particular, the Yahoo-style CAPTCHA is the most user-friendly. This suggests that a good CAPTCHA could be simple text with some distortion that prevents machines from correctly segmenting characters.

Deep Learning Model Parallelism (딥러닝 모델 병렬 처리)

  • Park, Y.M.;Ahn, S.Y.;Lim, E.J.;Choi, Y.S.;Woo, Y.C.;Choi, W.
    • Electronics and Telecommunications Trends / v.33 no.4 / pp.1-13 / 2018
  • Deep learning (DL) models have been widely applied to AI applications such as image recognition and language translation with big data. Recently, DL models have become larger and more complicated, and have merged together. To accelerate the training of a large-scale deep learning model, a few distributed deep learning frameworks provide model parallelism, which partitions the model parameters for non-shared parallel access and updates across multiple machines. As a training acceleration method, however, model parallelism is not as commonly used as data parallelism because it is difficult to make efficient. This paper provides a comprehensive survey of the state of the art in model parallelism by comparing the implementation technologies in several deep learning frameworks that support it, and suggests future research directions for improving model parallelism technology.
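
The idea of partitioning model parameters across machines, as described in this abstract, can be illustrated with a minimal sketch. This is not taken from the paper or from any framework: the "workers" are plain Python lists rather than real machines, and the function names `matvec`, `partition_rows`, and `model_parallel_forward` are hypothetical. Each worker holds a non-overlapping shard of one layer's weight rows and computes only its slice of the output.

```python
# Illustrative sketch only: workers are simulated in-process, not real machines.

def matvec(weights, x):
    """Multiply a row-major weight matrix (a list of rows) by the vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


def partition_rows(weights, num_workers):
    """Split the rows of a layer into contiguous, non-overlapping shards.

    Returns at most num_workers shards (fewer if the rows do not divide evenly).
    """
    shard_size = -(-len(weights) // num_workers)  # ceiling division
    return [weights[i:i + shard_size] for i in range(0, len(weights), shard_size)]


def model_parallel_forward(weights, x, num_workers):
    """Forward pass in which each worker owns only its shard of the parameters."""
    shards = partition_rows(weights, num_workers)
    # In a real system these partial products would be computed concurrently.
    partial_outputs = [matvec(shard, x) for shard in shards]
    # Gather step: concatenate the output slices in shard order.
    return [y for part in partial_outputs for y in part]


W = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0]]
x = [1.0, -1.0]
sharded = model_parallel_forward(W, x, num_workers=2)
full = matvec(W, x)
```

In this sketch the sharded result matches the single-machine result because no parameter is shared between shards; the input vector still has to be sent to every worker, which hints at the communication cost that makes efficient model parallelism difficult.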

Technology Trends and Analysis of Deep Learning Based Object Classification and Detection (딥러닝 기반 객체 분류 및 검출 기술 분석 및 동향)

  • Lee, S.J.;Lee, K.D.;Lee, S.W.;Ko, J.G.;Yoo, W.Y.
    • Electronics and Telecommunications Trends / v.33 no.4 / pp.33-42 / 2018
  • Object classification and detection are fundamental technologies in computer vision and its applications. Recently, deep-learning based approaches have shown significant improvements in object classification and detection. This report reviews the progress of deep-learning based object classification and detection from the perspective of the ImageNet Large Scale Visual Recognition Challenge (ILSVRC), and analyzes recent trends in object classification and detection technology and its applications.

Comparative Analysis for Emotion Expression Using Three Methods Based by CNN (CNN기초로 세 가지 방법을 이용한 감정 표정 비교분석)

  • Yang, Chang Hee;Park, Kyu Sub;Kim, Young Seop;Lee, Yong Hwan
    • Journal of the Semiconductor & Display Technology / v.19 no.4 / pp.65-70 / 2020
  • CNN techniques used for emotion detection include the basic CNN algorithm, batch normalization, and dropout. We present the methods and data of three experiments in this paper. The training database and the test database are set up differently. The first experiment extracts emotions using Batch Normalization, which compensates for shortcomings in the distribution of the data. The second experiment extracts emotions using Dropout, which is used for rapid computation. The third experiment uses a CNN with convolution and max pooling. All three experiments show a low detection rate. To address this problem, we will develop a deep learning algorithm using a feature extraction method specialized for the image processing field.
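
The three building blocks named in this abstract can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names are hypothetical, batch normalization omits the learnable scale and shift parameters, dropout is written in its common "inverted" form, and max pooling is shown on a 1-D signal rather than an image.

```python
import math
import random


def batch_norm(values, eps=1e-5):
    """Normalize a batch of activations to zero mean and (near) unit variance.

    Simplification: the learnable scale and shift parameters are omitted.
    """
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    return [(v - mean) / math.sqrt(var + eps) for v in values]


def dropout(values, rate, rng):
    """Inverted dropout: zero each activation with probability `rate`,
    and rescale the survivors by 1 / (1 - rate)."""
    keep = 1.0 - rate
    return [v / keep if rng.random() < keep else 0.0 for v in values]


def max_pool_1d(values, size=2):
    """Non-overlapping max pooling over a 1-D signal."""
    return [max(values[i:i + size]) for i in range(0, len(values), size)]


normalized = batch_norm([1.0, 2.0, 3.0, 4.0])
dropped = dropout([2.0] * 1000, rate=0.5, rng=random.Random(0))
pooled = max_pool_1d([1, 3, 2, 5, 4])
```

Dropout is applied only during training; at inference time the activations pass through unchanged, which is why the surviving values are rescaled here.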

Extraction of Worker Behavior at Manufacturing Site using Mask R-CNN and Dense-Net (Mask R-CNN과 Dense-Net을 이용한 제조 현장에서의 작업자 행동 추출)

  • Rijayanti, Rita;Hwang, Mintae;Jin, Kyohong
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2022.05a / pp.150-153 / 2022
  • This paper reports a technique that automatically extracts object shapes through Dense-Net and subsequently detects the objects using Mask R-CNN at a manufacturing site in which workers and objects are mixed. It is based on a customized factory dataset that targets workers, machines, tools, control boxes, and products as the objects. Mask R-CNN is a well-known object recognition method that supports multi-object recognition, while Dense-Net effectively extracts features from multiple overlapping objects. After the two technologies are implemented, objects are extracted from a still image of the manufacturing site to describe the image. The results are planned to be used to detect workers' abnormal behavior by adding labels to the objects.

Robust Deep Age Estimation Method Using Artificially Generated Image Set

  • Jang, Jaeyoon;Jeon, Seung-Hyuk;Kim, Jaehong;Yoon, Hosub
    • ETRI Journal / v.39 no.5 / pp.643-651 / 2017
  • Human age estimation is one of the key factors in the field of Human-Robot Interaction/Human-Computer Interaction (HRI/HCI). Owing to the development of deep-learning technologies, age recognition has recently been attempted. In general, however, deep learning techniques require a large-scale database, and for age learning with variations, a conventional database is insufficient. For this reason, we propose an age estimation method using artificially generated data. Image data are artificially generated through 3D information, thus solving the problem of shortage of training data, and helping with the training of the deep-learning technique. Augmentation using 3D has advantages over 2D because it creates new images with more information. We use a deep architecture as a pre-trained model, and improve the estimation capacity using artificially augmented training images. The deep architecture can outperform traditional estimation methods, and the improved method showed increased reliability. We have achieved state-of-the-art performance using the proposed method in the Morph-II dataset and have proven that the proposed method can be used effectively using the Adience dataset.

Detection of Facial Direction using Facial Features (얼굴 특징 정보를 이용한 얼굴 방향성 검출)

  • Park Ji-Sook;Dong Ji-Youn
    • Journal of Internet Computing and Services / v.4 no.6 / pp.57-67 / 2003
  • The recent rapid development of multimedia and optical technologies has brought great attention to application systems that process facial image features. Previous research in facial image processing has mainly focused on human face recognition and facial expression analysis using frontal face images. Little research has been carried out on image-based detection of face direction. Moreover, existing approaches to detecting face direction, which normally use sequential images captured by a single camera, have the limitation that the frontal image must be given before any other images. In this paper, we propose a method to detect face direction using facial features such as the facial trapezoid, which is defined by the two eyes and the lower lip. Specifically, the proposed method forms a facial direction formula, defined with statistical data about the ratio of the right and left areas in the facial trapezoid, to identify whether the face is directed toward the right or the left. The proposed method can be used effectively in automatic photo arrangement systems, which often need to set a different left or right margin for a photo according to the face direction of the person in it.
