• Title/Summary/Keyword: Image description

Search Result 339, Processing Time 0.025 seconds

Multi-Description Image Compression Coding Algorithm Based on Depth Learning

  • Yong Zhang;Guoteng Hui;Lei Zhang
    • Journal of Information Processing Systems
    • /
    • v.19 no.2
    • /
    • pp.232-239
    • /
    • 2023
  • Aiming at the poor compression quality of traditional image compression coding (ICC) algorithm, a multi-description ICC algorithm based on depth learning is put forward in this study. In this study, first an image compression algorithm was designed based on multi-description coding theory. Image compression samples were collected, and the measurement matrix was calculated. Then, it processed the multi-description ICC sample set by using the convolutional self-coding neural system in depth learning. Compressing the wavelet coefficients after coding and synthesizing the multi-description image band sparse matrix obtained the multi-description ICC sequence. Averaging the multi-description image coding data in accordance with the effective single point's position could finally realize the compression coding of multi-description images. According to experimental results, the designed algorithm consumes less time for image compression, and exhibits better image compression quality and better image reconstruction effect.

Shape Description and Retrieval Using Included-Angular Ternary Pattern

  • Xu, Guoqing;Xiao, Ke;Li, Chen
    • Journal of Information Processing Systems
    • /
    • v.15 no.4
    • /
    • pp.737-747
    • /
    • 2019
  • Shape description is an important and fundamental issue in content-based image retrieval (CBIR), and a number of shape description methods have been reported in the literature. For shape description, both global information and local contour variations play important roles. In this paper a new included-angular ternary pattern (IATP) based shape descriptor is proposed for shape image retrieval. For each point on the shape contour, IATP is derived from its neighbor points, and IATP has good properties for shape description. IATP is intrinsically invariant to rotation, translation and scaling. To enhance the description capability, multiscale IATP histogram is presented to describe both local and global information of shape. Then multiscale IATP histogram is combined with included-angular histogram for efficient shape retrieval. In the matching stage, cosine distance is used to measure shape features' similarity. Image retrieval experiments are conducted on the standard MPEG-7 shape database and Swedish leaf database. And the shape image retrieval performance of the proposed method is compared with other shape descriptors using the standard evaluation method. The experimental results of shape retrieval indicate that the proposed method reaches higher precision at the same recall value compared with other description method.

A Study on Improving the Direction of Moving Image Material Descriptions (영상기록물 기술의 개선 방향 연구)

  • Shim, Bomee;Chang, Yunkeum
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.29 no.1
    • /
    • pp.325-344
    • /
    • 2018
  • Since the year 2000, the need for an improvement of archival descriptions has been an increasing issue, due to the growing usage and amount of archival materials. Unlike the development of descriptions for paper records, however, the technological development and research for moving image descriptions has been limited due to its diversity and specificity. This research investigated the current status and the specificity of the moving image descriptions and also examined major international archival description cases. In-depth interviews with archival professionals were also conducted. Based on the findings, this study suggested the need for redefinition of and continuous research on the fundamental values of moving image information, moving image description and management based on digilog view points, the development of user-centric description and search aides, the creation of moving image values using a relevant information management system, and the improvement of moving image description elements throughout the life-cycle of the material.

Multiple Description Coding Using Directional Discrete Cosine Transform

  • Lama, Ramesh Kumar;Kwon, Goo-Rak
    • Journal of information and communication convergence engineering
    • /
    • v.11 no.4
    • /
    • pp.293-297
    • /
    • 2013
  • Delivery of high quality video over a wide area network with large number of users poses great challenges for the video communication system. To ensure video quality, multiple descriptions have recently attracted various attention as a way of encoding and visual information delivery over wireless network. We propose a new efficient multiple description coding (MDC) technique. Quincunx lattice sub-sampling is used for generating multiple descriptions of an image. In this paper, we propose the application of a directional discrete cosine transform (DCT) to a sub-sampled quincunx lattice to create an MDC representation. On the decoder side, the image is decoded from the received side information. If all the descriptions arrive successfully, the image is reconstructed by combining the descriptions. However, if only one side description is received, decoding is executed using an interpolation process. The experimental results show that such the directional DCT can achieve a better coding gain as well as energy packing efficiency than the conventional DCT with re-alignment.

Standardized Description Method of Optical Characteristics Tests for Image Sensor Modules (이미지 센서 모듈의 광학적 특성 테스트를 위한 표준화된 기술 방법)

  • Lee, Seongsoo
    • Journal of IKEEE
    • /
    • v.18 no.4
    • /
    • pp.603-611
    • /
    • 2014
  • When image sensor and lens are fixed on the module, mechanical errors often induce tilt, rotation, or narrow field-of-view of the acquired image. Therefore, the optical characteristics of image sensor modules should be tested by test equipments. This paper explains how to test the optical characteristics of images sensors. It also proposes the standardized description methods of optical characteristics tests which are similar with those of image acquisition characteristics tests. The proposed method helps the test equipments to perform image acquisition characteristics tests and optical characteristics tests together.

Convolutional auto-encoder based multiple description coding network

  • Meng, Lili;Li, Hongfei;Zhang, Jia;Tan, Yanyan;Ren, Yuwei;Zhang, Huaxiang
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.14 no.4
    • /
    • pp.1689-1703
    • /
    • 2020
  • When data is transmitted over an unreliable channel, the error of the data packet may result in serious degradation. The multiple description coding (MDC) can solve this problem and save transmission costs. In this paper, we propose a deep multiple description coding network (MDCN) to realize efficient image compression. Firstly, our network framework is based on convolutional auto-encoder (CAE), which include multiple description encoder network (MDEN) and multiple description decoder network (MDDN). Secondly, in order to obtain high-quality reconstructed images at low bit rates, the encoding network and decoding network are integrated into an end-to-end compression framework. Thirdly, the multiple description decoder network includes side decoder network and central decoder network. When the decoder receives only one of the two multiple description code streams, side decoder network is used to obtain side reconstructed image of acceptable quality. When two descriptions are received, the high quality reconstructed image is obtained. In addition, instead of quantization with additive uniform noise, and SSIM loss and distance loss combine to train multiple description encoder networks to ensure that they can share structural information. Experimental results show that the proposed framework performs better than traditional multiple description coding methods.

Comparison of Common Methods from Intertwined Application in Image Processing

  • Shin, Seong-Yoon;Lee, Hyun-Chang;Rhee, Yang-Won
    • Journal of information and communication convergence engineering
    • /
    • v.8 no.4
    • /
    • pp.405-410
    • /
    • 2010
  • Image processing operations like smoothing and edge detection, and many more are very widely used in areas like Computer Vision. We classify the image processing domain as seven branches-image acquirement and output, image coding and compression, image enhancement and restoration, image transformation, image segmentation, image description, and image recognition and description. We implemented algorithms of gaussian smoothing, laplace sharpening, image contrast effect, image black and white effect, image fog effect, image bright and dark effect, image median filter, and canny edge detection. Such experimental results show the figures respectively.

Multiple Description Embedded Zerotree Wavelet Coding Using Threshold Separation (문턱값 분리를 이용한 다중 기술 엠베디드 제로트리 웨이블릿 압축)

  • 엄일규
    • Proceedings of the IEEK Conference
    • /
    • 2000.06d
    • /
    • pp.19-22
    • /
    • 2000
  • In this paper, we present a new multiple description embedded zerotree wavelet coding method using the two splitted thresholds. We first model a half EZW coder and then we present multiple description coder which has two coding channels using wide threshold EZW(WTEZW) coders. To evaluate the performance of the proposed coder, we provide an image coding applications with two descriptions and compare MDC image coding results reported to date.

  • PDF

A Pen Drawing Method by Tensor-based Strokes Generation (텐서 기반 스트로크 생성에 의한 펜화기법)

  • Shin, Do-kyung;Ahn, Eun-young
    • Journal of Korea Multimedia Society
    • /
    • v.20 no.4
    • /
    • pp.713-720
    • /
    • 2017
  • We present a non-photo realistic pen-ink drawing method for outlining and shading of the input image. Especially, we focus on the detailed illustration of the image of which stroke's direction is important. The pen-ink renderer is an alternative display models user can generate traditional illustration renderings of their photo realistic image. The previously proposed pen drawing methods produce feasible description in general image but it is difficult to express in detail for the sophisticated images that need to consider the direction of stroke for each image region. In order to overcome the disadvantages of the conventional method, a direction vector is extracted from a tensor field and we determine a stroke's direction in consideration of not only an edge area but also a gradient of a surrounding area in the image. For more detailed description for the sophisticated image, we generate white noises based on the light and shade of the input image and determine the direction and length of the stroke by using the tensor field for each generated white noise. The proposed method works particularly well for traditional architectural images where the direction and detailed description of the pen is important.

Performance Analysis of Modified LLAH Algorithm under Gaussian Noise (가우시안 잡음에서 변형된 LLAH 알고리즘의 성능 분석)

  • Ryu, Hosub;Park, Hanhoon
    • Journal of Korea Multimedia Society
    • /
    • v.18 no.8
    • /
    • pp.901-908
    • /
    • 2015
  • Methods of detecting, describing, matching image features, like corners and blobs, have been actively studied as a fundamental step for image processing and computer vision applications. As one of feature description/matching methods, LLAH(Locally Likely Arrangement Hashing) describes image features based on the geometric relationship between their neighbors, and thus is suitable for scenes with poor texture. This paper presents a modified LLAH algorithm, which includes the image features themselves for robustly describing the geometric relationship unlike the original LLAH, and employes a voting-based feature matching scheme that makes feature description much simpler. Then, this paper quantitatively analyzes its performance with synthetic images in the presence of Gaussian noise.