• Title/Summary/Keyword: Video Images

Search Result 1,446, Processing Time 0.026 seconds

Analysis of Deep Learning Model for the Development of an Optimized Vehicle Occupancy Detection System (최적화된 차량 탑승인원 감지시스템 개발을 위한 딥러닝 모델 분석)

  • Lee, JiWon;Lee, DongJin;Jang, SungJin;Choi, DongGyu;Jang, JongWook
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.1
    • /
    • pp.146-151
    • /
    • 2021
  • Currently, the demand for vehicles from one family is increasing in many countries at home and abroad, reducing the number of people on the vehicle and increasing the number of vehicles on the road. The multi-passenger lane system, which is available to solve the problem of traffic congestion, is being implemented. The system allows police to monitor fast-moving vehicles with their own eyes to crack down on illegal vehicles, which is less accurate and accompanied by the risk of accidents. To address these problems, applying deep learning object recognition techniques using images from road sites will solve the aforementioned problems. Therefore, in this paper, we compare and analyze the performance of existing deep learning models, select a deep learning model that can identify real-time vehicle occupants through video, and propose a vehicle occupancy detection algorithm that complements the object-ident model's problems.

Acquisition of Region of Interest through Illumination Correction in Dynamic Image Data (동영상 데이터에서 조명 보정을 사용한 관심 영역의 획득)

  • Jang, Seok-Woo
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.22 no.3
    • /
    • pp.439-445
    • /
    • 2021
  • Low-cost, ultra-high-speed cameras, made possible by the development of image sensors and small displays, can be very useful in image processing and pattern recognition. This paper introduces an algorithm that corrects irregular lighting from a high-speed image that is continuously input with a slight time interval, and which then obtains an exposed skin color region that is the area of interest in a person from the corrected image. In this study, the non-uniform lighting effect from a received high-speed image is first corrected using a frame blending technique. Then, the region of interest is robustly obtained from the input high-speed color image by applying an elliptical skin color distribution model generated from iterative learning in advance. Experimental results show that the approach presented in this paper corrects illumination in various types of color images, and then accurately acquires the region of interest. The algorithm proposed in this study is expected to be useful in various types of practical applications related to image recognition, such as face recognition and tracking, lighting correction, and video indexing and retrieval.

A Study on the Characteristics of Christian Dior's Brand Communication through YouTube Channel Fashion Film Analysis (유튜브 채널 패션필름 분석을 통한 크리스찬 디올의 브랜드 커뮤니케이션 특성 연구)

  • Baek, Jeong Hyun;Bae, Soo Jeong
    • Fashion & Textile Research Journal
    • /
    • v.22 no.6
    • /
    • pp.716-726
    • /
    • 2020
  • This study presents methods and alternative examples for fashion brands to effectively use video-based communication channels to form brand identity that analyzes the definition, status and type of YouTube channel fashion films as well as enables the ability to derive brand identity characteristics. Literature studies focused on Christian Dior's official website and related previous studies. The temporal range of the case studies was from October 7, 2010, the date when the first fashion film was uploaded to current Christian Dior YouTube to July 17, 2020 (the survey date), and there are a total of 550 subjects for quantitative analysis. The succession of the couture spirit means that Christian Dior's craftsmanship was created and passed down by Musée Christian Dior to act as a contemporary key element of brand identity. The iconic expression of femininity is Dior's core design philosophy that began when the woman image of a new era was presented through a new look, and Dior's femininity means a woman that reflects the character of the times as is interpreted as her own personality from the perspective of modernism through the creative directors of future generations. The brand's core identity code 'Miss Dior' expresses the brand's vision and eternity through perfume as well as targets Z generation male consumers through an emotional approach based on forms that used emotional images such as movie-type films.

Deep-Learning Based Real-time Fire Detection Using Object Tracking Algorithm

  • Park, Jonghyuk;Park, Dohyun;Hyun, Donghwan;Na, Youmin;Lee, Soo-Hong
    • Journal of the Korea Society of Computer and Information
    • /
    • v.27 no.1
    • /
    • pp.1-8
    • /
    • 2022
  • In this paper, we propose a fire detection system based on CCTV images using an object tracking technology with YOLOv4 model capable of real-time object detection and a DeepSORT algorithm. The fire detection model was learned from 10800 pieces of learning data and verified through 1,000 separate test sets. Subsequently, the fire detection rate in a single image and fire detection maintenance performance in the image were increased by tracking the detected fire area through the DeepSORT algorithm. It is verified that a fire detection rate for one frame in video data or single image could be detected in real time within 0.1 second. In this paper, our AI fire detection system is more stable and faster than the existing fire accident detection system.

Pyramid Feature Compression with Inter-Level Feature Restoration-Prediction Network (계층 간 특징 복원-예측 네트워크를 통한 피라미드 특징 압축)

  • Kim, Minsub;Sim, Donggyu
    • Journal of Broadcast Engineering
    • /
    • v.27 no.3
    • /
    • pp.283-294
    • /
    • 2022
  • The feature map used in the network for deep learning generally has larger data than the image and a higher compression rate than the image compression rate is required to transmit the feature map. This paper proposes a method for transmitting a pyramid feature map with high compression rate, which is used in a network with an FPN structure that has robustness to object size in deep learning-based image processing. In order to efficiently compress the pyramid feature map, this paper proposes a structure that predicts a pyramid feature map of a level that is not transmitted with pyramid feature map of some levels that transmitted through the proposed prediction network to efficiently compress the pyramid feature map and restores compression damage through the proposed reconstruction network. Suggested mAP, the performance of object detection for the COCO data set 2017 Train images of the proposed method, showed a performance improvement of 31.25% in BD-rate compared to the result of compressing the feature map through VTM12.0 in the rate-precision graph, and compared to the method of performing compression through PCA and DeepCABAC, the BD-rate improved by 57.79%.

Weighted Filter Algorithm based on Distribution Pattern of Pixel Value for AWGN Removal (AWGN 제거를 위한 화소값 분포패턴에 기반한 가중치 필터 알고리즘)

  • Cheon, Bong-Won;Kim, Nam-Ho
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.23 no.1
    • /
    • pp.44-49
    • /
    • 2022
  • Abstract Recently, with the development of IoT technology and communication media, various video equipment is being used in industrial fields. Image data acquired from cameras and sensors are easily affected by noise during transmission and reception, and noise removal is essential as it greatly affects system reliability. In this paper, we propose a weight filter algorithm based on the pixel value distribution pattern to preserve details in the process of restoring images damaged in AWGN. The proposed algorithm calculates weights according to the pixel value distribution pattern of the image and restores the image by applying a filtering mask. In order to analyze the noise removal performance of the proposed algorithm, it was simulated using enlarged image and PSNR compared to the existing method. The proposed algorithm preserves important characteristics of the image and shows the performance of efficiently removing noise compared to the existing method.

A Study on the Creation of Interactive Text Collage using Viewer Narratives (관람자 내러티브를 활용한 인터랙티브 텍스트 콜라주 창작 연구)

  • Lim, Sooyeon
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.4
    • /
    • pp.297-302
    • /
    • 2022
  • Contemporary viewers familiar with the digital space show their desire for self-expression and use voice, text and gestures as tools for expression. The purpose of this study is to create interactive art that expresses the narrative uttered by the viewer in the form of a collage using the viewer's figure, and reproduces and expands the story by the viewer's movement. The proposed interactive art visualizes audio and video information acquired from the viewer in a text collage, and uses gesture information and a natural user interface to easily and conveniently interact in real time and express personalized emotions. The three pieces of information obtained from the viewer are connected to each other to express the viewer's current temporary emotions. The rigid narrative of the text has some degree of freedom through the viewer's portrait images and gestures, and at the same time produces and expands the structure of the story close to reality. The artwork space created in this way is an experience space where the viewer's narrative is reflected, updated, and created in real time, and it is a reflection of oneself. It also induces active appreciation through the active intervention and action of the viewer.

Multimodal Interaction Framework for Collaborative Augmented Reality in Education

  • Asiri, Dalia Mohammed Eissa;Allehaibi, Khalid Hamed;Basori, Ahmad Hoirul
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.7
    • /
    • pp.268-282
    • /
    • 2022
  • One of the most important technologies today is augmented reality technology, it allows users to experience the real world using virtual objects that are combined with the real world. This technology is interesting and has become applied in many sectors such as the shopping and medicine, also it has been included in the sector of education. In the field of education, AR technology has become widely used due to its effectiveness. It has many benefits, such as arousing students' interest in learning imaginative concepts that are difficult to understand. On the other hand, studies have proven that collaborative between students increases learning opportunities by exchanging information, and this is known as Collaborative Learning. The use of multimodal creates a distinctive and interesting experience, especially for students, as it increases the interaction of users with the technologies. The research aims at developing collaborative framework for developing achievement of 6th graders through designing a framework that integrated a collaborative framework with a multimodal input "hand-gesture and touch", considering the development of an effective, fun and easy to use framework with a multimodal interaction in AR technology that was applied to reformulate the genetics and traits lesson from the science textbook for the 6th grade, the first semester, the second lesson, in an interactive manner by creating a video based on the science teachers' consultations and a puzzle game in which the game images were inserted. As well, the framework adopted the cooperative between students to solve the questions. The finding showed a significant difference between post-test and pre-test of the experimental group on the mean scores of the science course at the level of remembering, understanding, and applying. Which indicates the success of the framework, in addition to the fact that 43 students preferred to use the framework over traditional education.

Latest Information Technologies in the UK Adults Education System

  • Tverezovska, Nina;Bilyk, Ruslana;Rozman, Iryna;Semerenko, Zhanna;Orlova, Nataliya;Vytrykhovska, Oksana;Oros, Ildiko
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.8
    • /
    • pp.25-34
    • /
    • 2022
  • Today, further education of adults in the UK is one of the developing areas of continuing education. The Open University with distance learning, in the process of which innovative forms and methods based on computer and telecommunication technologies are used, is particularly successful in the organization of additional education of the adult population. The advantages of distance learning, multimedia - the latest information technologies, which provide the combination of graphic images, video, sound with the help of modern computer tools, are noted. The basic principles and forms underlying the technologies and forms of work with the elderly are defined. The international experience of implementing "Universities of the Third Age" is summarized. The most widespread approach in adult education in Great Britain is informational. The use of computer technologies motivates a new paradigm in educational methods and strategies, which requires new approaches, forms of learning, and innovative ways of delivering educational materials to adult learners. Information technologies have gained great popularity in such activities as distance learning, online learning, assistance in the education management system, development of programs and virtual textbooks in various subjects, online search for information for the educational process, computer testing of students' knowledge, creation of electronic libraries, formation of a single scientific electronic environment, publication of virtual magazines and newspapers on pedagogical topics, teleconferences, expansion of international cooperation in the field of Internet education. The information technology of synchronous distance learning "online" has gained considerable popularity in the educational process today. A promising direction is the use of multimedia technologies in educational activities to create a design of a virtual computer environment by decoding audiovisual information.

'Gwangju Light+' Laser Linked Projection Mapping Study ('광주의 빛+' 레이저 연동 프로젝션 맵핑 연구)

  • Park, Sunghun;Kim, Hyung Gi
    • The Journal of the Korea Contents Association
    • /
    • v.22 no.2
    • /
    • pp.35-43
    • /
    • 2022
  • The 2020 Gwangju Media Art Festival (2020 GMAF) was held in the area of the National Asian Cultural Center. Under the slogan "The entire Gwangju shines under the theme of "Aesthetics of Light and Coexistence," the media festival demonstrated projection mapping to Jeonnam (former) Provincial Office at the DATA+ Research Institute of Chung-Ang University's Graduate School of Advanced Video. This paper focuses on explaining the overall production process and content development of projection mapping demonstrated in the Jeollanam-do Provincial Government, which is a symbol of Korean democratization and is located in the center of Gwangju, Jeolla-do. It was intended to faithfully express the history of the 2020 GMAF and Jeonnam (former) provincial government, Gwangju's history, and democratization records. It was intended to show images, background sounds, sound effects, and visual effects using various special effects and high-power laser devices using unique characteristics of projection mapping. To this end, about 5 minutes and 30 seconds of content were planned, and it was divided into parts and topics, and one individual story was developed for each chapter.