• Title/Summary/Keyword: Video image processing

Search Result 866, Processing Time 0.032 seconds

Aesthetic Strategies in Steina and Woody Vasulka's Video Art (비디오아티스트 슈테이너 바술카와 우디 바술카의 미적 전략)

  • Lim, Shan
    • The Journal of the Convergence on Culture Technology
    • /
    • v.6 no.3
    • /
    • pp.261-266
    • /
    • 2020
  • As pioneers of the early video art, Steina Vasulka(1940-) and Woody Vasulka(1937-2019) had lead not only their own experimental arts, but also entire changes of contemporary avant-garde performance, music, and visual art. Two artists invented and developed electronic machines for video image-processing by collaborating with engineers, and performed creative experiment on transformation of digital image. For them, video art is not just a means of documentation. The Vasulkas' artistic practices were not bounded by conventional canons and rules in art world, and preferably were parts of active aesthetic strategies for coexistence of vision of human and vision of machine. Particularly, their video art recognized the video as the key medium in an era where media technology began to dominate the system of communication, and established artist's authority over manipulation of moving image electronically without depending on video camera. In that regard, we can value on their video art. Therefore, the paper reflects on the Vasulkas' art and life which have not yet been studied, and suggests academic interests in the context of their artistic activities and aesthetic strategies.

CNN based Image Restoration Method for the Reduction of Compression Artifacts (압축 왜곡 감소를 위한 CNN 기반 이미지 화질개선 알고리즘)

  • Lee, Yooho;Jun, Dongsan
    • Journal of Korea Multimedia Society
    • /
    • v.25 no.5
    • /
    • pp.676-684
    • /
    • 2022
  • As realistic media are widespread in various image processing areas, image or video compression is one of the key technologies to enable real-time applications with limited network bandwidth. Generally, image or video compression cause the unnecessary compression artifacts, such as blocking artifacts and ringing effects. In this study, we propose a Deep Residual Channel-attention Network, so called DRCAN, which consists of an input layer, a feature extractor and an output layer. Experimental results showed that the proposed DRCAN can reduced the total memory size and the inference time by as low as 47% and 59%, respectively. In addition, DRCAN can achieve a better peak signal-to-noise ratio and structural similarity index measure for compressed images compared to the previous methods.

Video Expression Recognition Method Based on Spatiotemporal Recurrent Neural Network and Feature Fusion

  • Zhou, Xuan
    • Journal of Information Processing Systems
    • /
    • v.17 no.2
    • /
    • pp.337-351
    • /
    • 2021
  • Automatically recognizing facial expressions in video sequences is a challenging task because there is little direct correlation between facial features and subjective emotions in video. To overcome the problem, a video facial expression recognition method using spatiotemporal recurrent neural network and feature fusion is proposed. Firstly, the video is preprocessed. Then, the double-layer cascade structure is used to detect a face in a video image. In addition, two deep convolutional neural networks are used to extract the time-domain and airspace facial features in the video. The spatial convolutional neural network is used to extract the spatial information features from each frame of the static expression images in the video. The temporal convolutional neural network is used to extract the dynamic information features from the optical flow information from multiple frames of expression images in the video. A multiplication fusion is performed with the spatiotemporal features learned by the two deep convolutional neural networks. Finally, the fused features are input to the support vector machine to realize the facial expression classification task. The experimental results on cNTERFACE, RML, and AFEW6.0 datasets show that the recognition rates obtained by the proposed method are as high as 88.67%, 70.32%, and 63.84%, respectively. Comparative experiments show that the proposed method obtains higher recognition accuracy than other recently reported methods.

Image Processing of Pseudo-rate-distortion Function Based on MSSSIM and KL-Divergence, Using Multiple Video Processing Filters for Video Compression (MSSSIM 및 쿨백-라이블러 발산 기반 의사 율-왜곡 평가 함수와 복수개의 영상처리 필터를 이용한 동영상 전처리 방법)

  • Seok, Jinwuk;Cho, Seunghyun;Kim, Hui Yong;Choi, Jin Soo
    • Journal of Broadcast Engineering
    • /
    • v.23 no.6
    • /
    • pp.768-779
    • /
    • 2018
  • In this paper, we propose a novel video quality function for video processing based on MSSSIM to select an appropriate video processing filter and to accommodate multiple processing filters to each pixel block in a picture frame by a mathematical selection law so as to maintain video quality and to reduce the bitrate of compressed video. In viewpoint of video compression, since the properties of video quality and bitrate is different for each picture of video frames and for each areas in the same frame, it is difficult for the video filter with single property to satisfy the object of increasing video quality and decreasing bitrate. Consequently, to maintain the subjective video quality in spite of decreasing bitrate, we propose the methodology about the MSSSIM as the measure of subjective video quality, the KL-Divergence as the measure of bitrate, and the combination method of those two measurements. Moreover, using the proposed combinatorial measurement, when we use the multiple image filters with mutually different properties as a pre-processing filter for video, we can verify that it is possible to compress video with maintaining the video quality under decreasing the bitrate, as possible.

The Analysis of Digital Watermarking for MPEG-21 Digital Item Adaptation (디지털 영상 워터마킹에 대한 MPEG-21 DIA의 영향 분석)

  • Bae, Tae Meon;Kang, Seok Jun;Ro, Yong Man;Ine, So Ran
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2004.05a
    • /
    • pp.139-142
    • /
    • 2004
  • 본 논문에서는 MPEG-21 Digital Item Adaptation(DIA)에 의한 워터마크 신호의 영향을 실험하고 분석한다. MPEG-21 DIA에서는 다양한 소비환경에 맞게 멀티미디어 컨텐츠를 변할 수 있는 기능들을 제공하고 있다. 그러나 컨텐츠 변환기능들은 저작권 보호를 위해 컨텐츠에 삽입된 워터마크신호를 홰손시킬 수 있으므로, DIA 환경에서 워터마킹기술을 사용하기 위해서는 워터마킹기술에 대한 DIA의 영향을 분석할 필요가 있다. 본 논문에서는 일반적으로 널리 알려진 대표적인 워터마킹기술을 이용하여 MPEG-21 DIA에서 정의하고 있는 각각의 적응변환기능에 대한 워터마크의 강인성을 실험하여, 그 결과를 바탕으로 DIA 환경에서 워터마킹기술을 적용할 때 필요한 요구사항을 분석하였다.

  • PDF

Face detection enhancement using independent color channels (독립적 컬러채널을 이용한 얼굴검출 성능개선)

  • Lee, Young-Bok;Min, Hyun-Seok;Ro, Yong-Man
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2008.05a
    • /
    • pp.95-98
    • /
    • 2008
  • 본 논문은 기존의 질감기반 (texture) 얼굴검출 시스템에서 컬러 영상을 도입하여 성능개선의 중요한 부분인 얼굴 오검출율을 줄이는 방법을 제안한다. 얼굴 영상의 컬러 성분은 흑백 성분과 비교하여 낮은 공간 주파수 영역을 가지는 특징이 있다. 질감기반 얼굴검출에서 높은 대비 (contrast) 성분의 에지는 얼굴이 아닌 영역에서 얼굴로 오인할 수가 있다. 본 논문에서는 이런 오인을 감소하기 위해 독립적인 컬러 채널 성분들을 질감기반 얼굴 검출에 각각 이용하여 그 얻어진 결과들을 융합 (fusion) 하는 방법을 제안한다. 실험결과로 제안한 칼라 채널 융합 방법을 통해 얻은 얼굴 검출율은 기존 흑백 영상과 비슷하게 유지되며 오검출율을 현저히 줄이는 것을 보였다.

Development of Facial Nerve Palsy Grading System with Image Processing (영상처리를 이용한 안면신경마비 평가시스템 개발)

  • Jang, Min;Shin, Sang-Hoon
    • The Journal of the Society of Korean Medicine Diagnostics
    • /
    • v.17 no.3
    • /
    • pp.233-240
    • /
    • 2013
  • Objectives The objective and universal grading system for the facial nerve palsy is needed to the objectification of treatment in Oriental medicine. In this study, the facial nerve palsy grading was developed with combination of image processing technique and Nottingham scale. Methods The developed system is composed of measurement part, image processing part, facial nerve palsy evaluation part, and display part. With the video data recorded by webcam at measurement part, the positions of marker were measured at image processing part. In evaluation part, Nottingham scales were calculated in four different facial expressions with measured marker position. The video of facial movement, time history of marker position, and Nottingham scale were displayed in display part. Results & Conclusion The developed system was applied to a normal subject and a abnormal subject with facial nerve palsy. The left-right difference of Nottingham scores was large in the abnormal compared with the normal. In normal case, the change of the length between supraorbital point and infraorbital point was larger than that of the length between lateral canthus and angle of mouth. The abnormal case showed an opposite result. The developed system showed the possibilities of the objective and universal grading system for the facial nerve palsy.

Attentional mechanisms for video retargeting and 3D compressive processing (비디오 재설정 및 3D 압축처리를 위한 어텐션 메커니즘)

  • Hwang, Jae-Jeong
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.15 no.4
    • /
    • pp.943-950
    • /
    • 2011
  • In this paper, we presented an attention measurement method in 2D and 3D image/video to be applied for image and video retargeting and compressive processing. 2D attention is derived from the three main components, intensity, color, and orientation, while depth information is added for 3D attention. A rarity-based attention method is presented to obtain more interested region or objects. Displaced depth information is matched to attention probability in distorted stereo images and finally a stereo distortion predictor is designed by integrating low-level HVS responses. As results, more efficient attention scheme is developed from the conventional methods and performance is proved by applying for video retargeting.

Video Coding Using Wavelet Decomposition for Very Low Bit - rate Networks (초저속 전송 네트웍을 위한 웨이브릿 변환을 이용한 비디오 코딩)

  • Oh, Hwang-Seok;Lee, Heung-Kyu
    • The Transactions of the Korea Information Processing Society
    • /
    • v.4 no.10
    • /
    • pp.2629-2639
    • /
    • 1997
  • The video coding for very low bit-rate has recently received considerable attention, but the conventional coding schemes with block based transform suffer from the blocky effect for the constraints of limited bit-rate. In this paper, we present a video coding system based on wavelet transform and multiresolution motion estimation/compensation for very low bit-rate video. The proposed scheme uses the wavelet transform which is flexible to represent non-stationary image signals and adaptable to the human visual characteristics. The wavelet transformed coefficients are coded by various coding modes in accordance with the sum of absolute error after motion estimation/compensation in wavelet decomposed domain. And simple buffer control technique is applied to handle constant image quality. It is shown that the presented scheme has more acceptable image quality without blocky effects than conventional block based transform video coding.

  • PDF

A Hardware/Software Codesign for Image Processing in a Processor Based Embedded System for Vehicle Detection

  • Moon, Ho-Sun;Moon, Sung-Hwan;Seo, Young-Bin;Kim, Yong-Deak
    • Journal of Information Processing Systems
    • /
    • v.1 no.1 s.1
    • /
    • pp.27-31
    • /
    • 2005
  • Vehicle detector system based on image processing technology is a significant domain of ITS (Intelligent Transportation System) applications due to its advantages such as low installation cost and it does not obstruct traffic during the installation of vehicle detection systems on the road[1]. In this paper, we propose architecture for vehicle detection by using image processing. The architecture consists of two main parts such as an image processing part, using high speed FPGA, decision and calculation part using CPU. The CPU part takes care of total system control and synthetic decision of vehicle detection. The FPGA part assumes charge of input and output image using video encoder and decoder, image classification and image memory control.