• 제목/요약/키워드: Image description

검색결과 340건 처리시간 0.019초

Multi-Description Image Compression Coding Algorithm Based on Depth Learning

  • Yong Zhang;Guoteng Hui;Lei Zhang
    • Journal of Information Processing Systems
    • /
    • 제19권2호
    • /
    • pp.232-239
    • /
    • 2023
  • Aiming at the poor compression quality of traditional image compression coding (ICC) algorithm, a multi-description ICC algorithm based on depth learning is put forward in this study. In this study, first an image compression algorithm was designed based on multi-description coding theory. Image compression samples were collected, and the measurement matrix was calculated. Then, it processed the multi-description ICC sample set by using the convolutional self-coding neural system in depth learning. Compressing the wavelet coefficients after coding and synthesizing the multi-description image band sparse matrix obtained the multi-description ICC sequence. Averaging the multi-description image coding data in accordance with the effective single point's position could finally realize the compression coding of multi-description images. According to experimental results, the designed algorithm consumes less time for image compression, and exhibits better image compression quality and better image reconstruction effect.

Shape Description and Retrieval Using Included-Angular Ternary Pattern

  • Xu, Guoqing;Xiao, Ke;Li, Chen
    • Journal of Information Processing Systems
    • /
    • 제15권4호
    • /
    • pp.737-747
    • /
    • 2019
  • Shape description is an important and fundamental issue in content-based image retrieval (CBIR), and a number of shape description methods have been reported in the literature. For shape description, both global information and local contour variations play important roles. In this paper a new included-angular ternary pattern (IATP) based shape descriptor is proposed for shape image retrieval. For each point on the shape contour, IATP is derived from its neighbor points, and IATP has good properties for shape description. IATP is intrinsically invariant to rotation, translation and scaling. To enhance the description capability, multiscale IATP histogram is presented to describe both local and global information of shape. Then multiscale IATP histogram is combined with included-angular histogram for efficient shape retrieval. In the matching stage, cosine distance is used to measure shape features' similarity. Image retrieval experiments are conducted on the standard MPEG-7 shape database and Swedish leaf database. And the shape image retrieval performance of the proposed method is compared with other shape descriptors using the standard evaluation method. The experimental results of shape retrieval indicate that the proposed method reaches higher precision at the same recall value compared with other description method.

영상기록물 기술의 개선 방향 연구 (A Study on Improving the Direction of Moving Image Material Descriptions)

  • 심보미;장윤금
    • 한국비블리아학회지
    • /
    • 제29권1호
    • /
    • pp.325-344
    • /
    • 2018
  • 2000년 이후 국내 기관별 기록물 소장량의 지속적인 증가와 이에 대한 활용 요구가 증가하면서 기록물 기술 개선의 필요성이 제기되었다. 하지만 종이기록물에 대한 기술 개발 및 연구는 활발히 진행된 반면 영상기록물 기술 분야는 그 가치와 중요성은 인식되면서도 영상기록물의 다양성과 특수성으로 인해 전문적 기술 개발 및 연구가 미비하였다. 이에 본 연구에서는 영상기록물 기술의 개선 방향을 도출하기 위해 영상기록물 기술의 특수성 및 국내 영상기록물 기술현황, 해외 기록물 기술 사례 및 영상기록관리 전문가 대상 심층면담을 진행하였다. 이를 통해 영상기록물 정보 본질의 재규정 및 지속 연구, 디지로그적 관점의 영상기록물 기술 및 관리, 이용자 중심 기술 및 다양한 검색도구 개발, 연관정보 관리 강화로 영상기록물 가치창출, 영상기록물 생애주기를 관통하는 기술요소 관리 등의 개선 방향을 제안하였다.

Multiple Description Coding Using Directional Discrete Cosine Transform

  • Lama, Ramesh Kumar;Kwon, Goo-Rak
    • Journal of information and communication convergence engineering
    • /
    • 제11권4호
    • /
    • pp.293-297
    • /
    • 2013
  • Delivery of high quality video over a wide area network with large number of users poses great challenges for the video communication system. To ensure video quality, multiple descriptions have recently attracted various attention as a way of encoding and visual information delivery over wireless network. We propose a new efficient multiple description coding (MDC) technique. Quincunx lattice sub-sampling is used for generating multiple descriptions of an image. In this paper, we propose the application of a directional discrete cosine transform (DCT) to a sub-sampled quincunx lattice to create an MDC representation. On the decoder side, the image is decoded from the received side information. If all the descriptions arrive successfully, the image is reconstructed by combining the descriptions. However, if only one side description is received, decoding is executed using an interpolation process. The experimental results show that such the directional DCT can achieve a better coding gain as well as energy packing efficiency than the conventional DCT with re-alignment.

이미지 센서 모듈의 광학적 특성 테스트를 위한 표준화된 기술 방법 (Standardized Description Method of Optical Characteristics Tests for Image Sensor Modules)

  • 이성수
    • 전기전자학회논문지
    • /
    • 제18권4호
    • /
    • pp.603-611
    • /
    • 2014
  • 이미지 센서와 렌즈를 모듈 상에 고정할 때, 기계적인 오차로 인해 취득된 영상의 기울임 또는 회전이 발생하기도 하고 화각이 좁아지기도 한다. 따라서 테스트 장비에서 이미지 센서 모듈의 광학적 특성을 테스트하여야 한다. 본 논문에서는 이미지 센서 모듈의 광학적 특성을 테스트하는 방법을 설명하고, 이를 영상 취득 특성과 유사한 방식으로 표준화한 기술 방법을 제안한다. 제안된 방법은 테스트 장비가 영상 취득 특성과 광학적 특성을 함께 테스트하는데 도움이 된다.

Convolutional auto-encoder based multiple description coding network

  • Meng, Lili;Li, Hongfei;Zhang, Jia;Tan, Yanyan;Ren, Yuwei;Zhang, Huaxiang
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제14권4호
    • /
    • pp.1689-1703
    • /
    • 2020
  • When data is transmitted over an unreliable channel, the error of the data packet may result in serious degradation. The multiple description coding (MDC) can solve this problem and save transmission costs. In this paper, we propose a deep multiple description coding network (MDCN) to realize efficient image compression. Firstly, our network framework is based on convolutional auto-encoder (CAE), which include multiple description encoder network (MDEN) and multiple description decoder network (MDDN). Secondly, in order to obtain high-quality reconstructed images at low bit rates, the encoding network and decoding network are integrated into an end-to-end compression framework. Thirdly, the multiple description decoder network includes side decoder network and central decoder network. When the decoder receives only one of the two multiple description code streams, side decoder network is used to obtain side reconstructed image of acceptable quality. When two descriptions are received, the high quality reconstructed image is obtained. In addition, instead of quantization with additive uniform noise, and SSIM loss and distance loss combine to train multiple description encoder networks to ensure that they can share structural information. Experimental results show that the proposed framework performs better than traditional multiple description coding methods.

Comparison of Common Methods from Intertwined Application in Image Processing

  • Shin, Seong-Yoon;Lee, Hyun-Chang;Rhee, Yang-Won
    • Journal of information and communication convergence engineering
    • /
    • 제8권4호
    • /
    • pp.405-410
    • /
    • 2010
  • Image processing operations like smoothing and edge detection, and many more are very widely used in areas like Computer Vision. We classify the image processing domain as seven branches-image acquirement and output, image coding and compression, image enhancement and restoration, image transformation, image segmentation, image description, and image recognition and description. We implemented algorithms of gaussian smoothing, laplace sharpening, image contrast effect, image black and white effect, image fog effect, image bright and dark effect, image median filter, and canny edge detection. Such experimental results show the figures respectively.

문턱값 분리를 이용한 다중 기술 엠베디드 제로트리 웨이블릿 압축 (Multiple Description Embedded Zerotree Wavelet Coding Using Threshold Separation)

  • 엄일규
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2000년도 하계종합학술대회 논문집(4)
    • /
    • pp.19-22
    • /
    • 2000
  • In this paper, we present a new multiple description embedded zerotree wavelet coding method using the two splitted thresholds. We first model a half EZW coder and then we present multiple description coder which has two coding channels using wide threshold EZW(WTEZW) coders. To evaluate the performance of the proposed coder, we provide an image coding applications with two descriptions and compare MDC image coding results reported to date.

  • PDF

텐서 기반 스트로크 생성에 의한 펜화기법 (A Pen Drawing Method by Tensor-based Strokes Generation)

  • 신도경;안은영
    • 한국멀티미디어학회논문지
    • /
    • 제20권4호
    • /
    • pp.713-720
    • /
    • 2017
  • We present a non-photo realistic pen-ink drawing method for outlining and shading of the input image. Especially, we focus on the detailed illustration of the image of which stroke's direction is important. The pen-ink renderer is an alternative display models user can generate traditional illustration renderings of their photo realistic image. The previously proposed pen drawing methods produce feasible description in general image but it is difficult to express in detail for the sophisticated images that need to consider the direction of stroke for each image region. In order to overcome the disadvantages of the conventional method, a direction vector is extracted from a tensor field and we determine a stroke's direction in consideration of not only an edge area but also a gradient of a surrounding area in the image. For more detailed description for the sophisticated image, we generate white noises based on the light and shade of the input image and determine the direction and length of the stroke by using the tensor field for each generated white noise. The proposed method works particularly well for traditional architectural images where the direction and detailed description of the pen is important.

가우시안 잡음에서 변형된 LLAH 알고리즘의 성능 분석 (Performance Analysis of Modified LLAH Algorithm under Gaussian Noise)

  • 류호섭;박한훈
    • 한국멀티미디어학회논문지
    • /
    • 제18권8호
    • /
    • pp.901-908
    • /
    • 2015
  • Methods of detecting, describing, matching image features, like corners and blobs, have been actively studied as a fundamental step for image processing and computer vision applications. As one of feature description/matching methods, LLAH(Locally Likely Arrangement Hashing) describes image features based on the geometric relationship between their neighbors, and thus is suitable for scenes with poor texture. This paper presents a modified LLAH algorithm, which includes the image features themselves for robustly describing the geometric relationship unlike the original LLAH, and employes a voting-based feature matching scheme that makes feature description much simpler. Then, this paper quantitatively analyzes its performance with synthetic images in the presence of Gaussian noise.