• Title/Summary/Keyword: Image Recognition Technologies

Search Result 157, Processing Time 0.033 seconds

Trends and Prospects in the Application of AI Technology for Creative Contents (차세대 콘텐츠를 위한 AI 기술 활용 동향 및 전망)

  • Hong, S.J.;Lee, S.W.;Yoon, M.S.;Park, J.Y.;Lee, S.W.;Kim, A.Y.;Jeong, I.K.
    • Electronics and Telecommunications Trends
    • /
    • v.35 no.5
    • /
    • pp.123-133
    • /
    • 2020
  • With the development of artificial intelligence (AI) and 5G technology, an ecosystem of digital content is gradually becoming intelligent, immersive, and convergent. However, there is not enough ultra-realistic content for the ecosystem. For ultra-realistic content services, creative content technologies using AI are being developed. This paper introduces the trends in and prospects of creative content technologies such as 3D content creation, digital holography, image-based motion recognition, content analysis/understanding/searching, sport AI, and content distribution.

A Study on the History, Classification and Development Direction of Artificial Intelligence (인공지능의 역사, 분류 그리고 발전 방향에 관한 연구)

  • Cho, Min-Ho
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.16 no.2
    • /
    • pp.307-312
    • /
    • 2021
  • Artificial Intelligence has a long history and is used in various fields including image recognition and automatic translation. Therefore, when we first encounter artificial intelligence, many terms, concepts and technologies often have difficulty in setting or implementing research direction. This study summarized important concepts related to artificial intelligence and summarized the progress of the past 60 years to help researcher suffering from these difficulties. Through this, it is possible to establish the basis for the use of vast artificial intelligence technologies and establish the right direction for research.

A Study on Smart Touch Projector System Technology Using Infrared (IR) Imaging Sensor (적외선 영상센서를 이용한 스마트 터치 프로젝터 시스템 기술 연구)

  • Lee, Kuk-Seon;Oh, Sang-Heon;Jeon, Kuk-Hui;Kang, Seong-Soo;Ryu, Dong-Hee;Kim, Byung-Gyu
    • Journal of Korea Multimedia Society
    • /
    • v.15 no.7
    • /
    • pp.870-878
    • /
    • 2012
  • Recently, very rapid development of computer and sensor technologies induces various kinds of user interface (UI) technologies based on user experience (UX). In this study, we investigate and develop a smart touch projector system technology on the basis of IR sensor and image processing. In the proposed system, a user can control computer by understanding the control events based on gesture of IR pen as an input device. In the IR image, we extract the movement (or gesture) of the devised pen and track it for recognizing gesture pattern. Also, to correct the error between the coordinate of input image sensor and display device (projector), we propose a coordinate correction algorithm to improve the accuracy of operation. Through this system technology as the next generation human-computer interaction, we can control the events of the equipped computer on the projected image screen without manipulating the computer directly.

Car Driver Drowsiness Detection Technology (자동차 운전자 졸림 감지 기술)

  • Chung, Wan-Young;Kim, Jong-Jin;Kwon, Tae-Ha
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2011.05a
    • /
    • pp.481-484
    • /
    • 2011
  • Recent Automotive technology is driven from mechanical device to the electronic components which improve the vehicle's safety and convenience. The future competitiveness of the car will come from safety issues and energy efficiency, convenience and the application of the technologies. In this study, various techniques for driver drowsiness detection are introduced and compared with each others. The advantages and disadvantages of commercially available technologies and developed technologies are compared. To enhance the detection resolution, multiple sensing technologies are introduced in this paper. The feasibility of two drowsiness detection methods, that is, existing camera image recognition method and bio signal analysis method, are tested. The direct drowsiness detection by the camera image of eyes and driver's vital signs detected indirectly are combined and analyzed by the developed noble algorithm for stress, fatigue, drowsiness detection with a more accurate high-drowsiness detection.

  • PDF

Color Analysis for the Quantitative Aesthetics of Qiong Kiln Ceramics

  • Wang, Fei;Cha, Hang;Leng, Lu
    • Journal of Multimedia Information System
    • /
    • v.7 no.2
    • /
    • pp.97-106
    • /
    • 2020
  • The subjective experience would degrade the current artificial artistic aesthetic analysis. Since Qiong kiln ceramics have a long history and occupy a very important position in ceramic arts, we employed computer-aided technologies to quickly automatically accurately and quantitatively process a large number of Qiong kiln ceramic images and generate the detailed statistical data. Because the color features are simple and significant visual characteristics, the color features of Qiong kiln ceramics are analyzed for the quantitative aesthetics. The Qiong kiln ceramic images are segmented with GrabCut algorithm. Three moments (1st-order, 2nd-order, and 3rd-order) are calculated in two typical color spaces, namely RGB and HSV. The discrimination powers of the color features are analyzed according to various dynasties (Tang Dynasty, Five Dynasties, Song Dynasty) and various utensils (Pot, kettle, bowl), which are helpful to the selection of the discriminant color features among various dynasties and utensils. This paper is helpful to promoting the quantitative aesthetic research of Qiong kiln ceramics and is also conducive to the research on the aesthetics of other ceramics.

Building Information-rich Maps for Intuitive Human Interface Using Networked Knowledge Base

  • Ryu, Jae-Kwan;Kanayama, Chie;Chong, Nak-Young
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2005.06a
    • /
    • pp.1887-1891
    • /
    • 2005
  • Despite significant advances in multimedia transferring technologies in various fields of robotics, it is sometimes quite difficult for the operator to fully understand the context of 3D remote environments from 2D image feedback. Particularly, in the remote control of mobile robots, the recognition of the object associated with the task is very important, because the operator has to control the robot safely in various situations not through trial and error. Therefore, it is necessary to provide the operator with 3D volumetric models of the object and object-related information as well such as locations, shape, size, material properties, and so on. Thus, in this paper, we propose a vision-based human interface system that provides an interactive, information-rich map through network-based information brokering. The system consists of an object recognition part, a 3D map building part, a networked knowledge base part, and a control part of the mobile robot.

  • PDF

Smart Mirror for Facial Expression Recognition Based on Convolution Neural Network (컨볼루션 신경망 기반 표정인식 스마트 미러)

  • Choi, Sung Hwan;Yu, Yun Seop
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2021.05a
    • /
    • pp.200-203
    • /
    • 2021
  • This paper introduces a smart mirror technology that recognizes a person's facial expressions through image classification among several artificial intelligence technologies and presents them in a mirror. 5 types of facial expression images are trained through artificial intelligence. When someone looks at the smart mirror, the mirror recognizes my expression and shows the recognized result in the mirror. The dataset fer2013 provided by kaggle used the faces of several people to be separated by facial expressions. For image classification, the network structure is trained using convolution neural network (CNN). The face is recognized and presented on the screen in the smart mirror with the embedded board such as Raspberry Pi4.

  • PDF

Efficient Object Recognition by Masking Semantic Pixel Difference Region of Vision Snapshot for Lightweight Embedded Systems (경량화된 임베디드 시스템에서 의미론적인 픽셀 분할 마스킹을 이용한 효율적인 영상 객체 인식 기법)

  • Yun, Heuijee;Park, Daejin
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.26 no.6
    • /
    • pp.813-826
    • /
    • 2022
  • AI-based image processing technologies in various fields have been widely studied. However, the lighter the board, the more difficult it is to reduce the weight of image processing algorithm due to a lot of computation. In this paper, we propose a method using deep learning for object recognition algorithm in lightweight embedded boards. We can determine the area using a deep neural network architecture algorithm that processes semantic segmentation with a relatively small amount of computation. After masking the area, by using more accurate deep learning algorithm we could operate object detection with improved accuracy for efficient neural network (ENet) and You Only Look Once (YOLO) toward executing object recognition in real time for lightweighted embedded boards. This research is expected to be used for autonomous driving applications, which have to be much lighter and cheaper than the existing approaches used for object recognition.

A Deep Learning-Based Image Recognition Model for Illegal Parking Enforcement (불법 주정차 단속을 위한 딥러닝 기반 이미지 인식 모델)

  • Min Kyu Cho;Minjun Kim;Jae Hwan Kim;Jinwook Kim;Byungsun Hwang;Seongwoo Lee;Joonho Seon;Jin Young Kim
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.24 no.1
    • /
    • pp.59-64
    • /
    • 2024
  • Recently, research on the convergence of drones and artificial intelligence technologies have been conducted in various industrial fields. In this paper, we propose an illegal parking vehicle recognition model using deep learning-based object recognition and classification algorithms. The model of object recognition and classification consist of YOLOv8 and ResNet18, respectively. The proposed model was trained using image data collected in general road environment, and the trained model showed high accuracy in determining illegal parking. From simulation results, it was confirmed that the proposed model has generalization performance to identify illegal parking vehicles from various images.

Spam Image Detection Model based on Deep Learning for Improving Spam Filter

  • Seong-Guk Nam;Dong-Gun Lee;Yeong-Seok Seo
    • Journal of Information Processing Systems
    • /
    • v.19 no.3
    • /
    • pp.289-301
    • /
    • 2023
  • Due to the development and dissemination of modern technology, anyone can easily communicate using services such as social network service (SNS) through a personal computer (PC) or smartphone. The development of these technologies has caused many beneficial effects. At the same time, bad effects also occurred, one of which was the spam problem. Spam refers to unwanted or rejected information received by unspecified users. The continuous exposure of such information to service users creates inconvenience in the user's use of the service, and if filtering is not performed correctly, the quality of service deteriorates. Recently, spammers are creating more malicious spam by distorting the image of spam text so that optical character recognition (OCR)-based spam filters cannot easily detect it. Fortunately, the level of transformation of image spam circulated on social media is not serious yet. However, in the mail system, spammers (the person who sends spam) showed various modifications to the spam image for neutralizing OCR, and therefore, the same situation can happen with spam images on social media. Spammers have been shown to interfere with OCR reading through geometric transformations such as image distortion, noise addition, and blurring. Various techniques have been studied to filter image spam, but at the same time, methods of interfering with image spam identification using obfuscated images are also continuously developing. In this paper, we propose a deep learning-based spam image detection model to improve the existing OCR-based spam image detection performance and compensate for vulnerabilities. The proposed model extracts text features and image features from the image using four sub-models. First, the OCR-based text model extracts the text-related features, whether the image contains spam words, and the word embedding vector from the input image. Then, the convolution neural network-based image model extracts image obfuscation and image feature vectors from the input image. The extracted feature is determined whether it is a spam image by the final spam image classifier. As a result of evaluating the F1-score of the proposed model, the performance was about 14 points higher than the OCR-based spam image detection performance.