• Title/Summary/Keyword: Image Decoding

Search Result 222, Processing Time 0.039 seconds

Design and Implementation of Smart Pen based User Interface System for U-learning (U-Learning 을 위한 스마트펜 인터페이스 시스템 디자인 및 개발)

  • Shim, Jae-Youen;Kim, Seong-Whan
    • Annual Conference of KIPS
    • /
    • 2010.11a
    • /
    • pp.1388-1391
    • /
    • 2010
  • In this paper, we present a design and implementation of U-learning system using pen based augmented reality approach. Student has been given a smart pen and a smart study book, which is similar to the printed material already serviced. However, we print the study book using CMY inks, and embed perceptually invisible dot patterns using K ink. Smart pen includes (1) IR LED for illumination, IR pass filter for extracting the dot patterns, and (3) camera for image captures. From the image sequences, we perform topology analysis which determines the topological distance between dot pixels, and perform error correction decoding using four position symbols and five CRC symbols. When a student touches a smart study books with our smart pen, we show him/her multimedia (visual/audio) information which is exactly related with the selected region. Our scheme can embed 16 bit information, which is more than 200% larger than previous scheme, which supports 7 bits or 8 bits information.

Joint Training of Neural Image Compression and Super Resolution Model (신경망 이미지 부호화 모델과 초해상화 모델의 합동훈련)

  • Cho, Hyun Dong;Kim, YeongWoong;Cha, Junyeong;Kim, DongHyun;Lim, Sung Chang;Kim, Hui Yong
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2022.06a
    • /
    • pp.1191-1194
    • /
    • 2022
  • 인터넷의 발전으로 수많은 이미지와 비디오를 손쉽게 이용할 수 있게 되었다. 이미지와 비디오 데이터의 양이 기하급수적으로 증가함에 따라, JPEG, HEVC, VVC 등 이미지와 비디오를 효율적으로 저장하기 위한 부호화 기술들이 등장했다. 최근에는 인공신경망을 활용한 학습 기반 모델이 발전함에 따라, 이를 활용한 이미지 및 비디오 압축 기술에 관한 연구가 빠르게 진행되고 있다. NNIC (Neural Network based Image Coding)는 이러한 학습 가능한 인공신경망 기반 이미지 부호화 기술을 의미한다. 본 논문에서는 NNIC 모델과 인공신경망 기반의 초해상화(Super Resolution) 모델을 합동훈련하여 기존 NNIC 모델보다 더 높은 성능을 보일 수 있는 방법을 제시한다. 먼저 NNIC 인코더(Encoder)에 이미지를 입력하기 전 다운 스케일링(Down Scaling)으로 쌍삼차보간법을 사용하여 이미지의 화소를 줄인 후 부호화(Encoding)한다. NNIC 디코더(Decoder)를 통해 부호화된 이미지를 복호화(Decoding)하고 업 스케일링으로 초해상화를 통해 복호화된 이미지를 원본 이미지로 복원한다. 이때 NNIC 모델과 초해상화 모델을 합동훈련한다. 결과적으로 낮은 비트량에서 더 높은 성능을 볼 수 있는 가능성을 보았다. 또한 합동훈련을 함으로써 전체 성능의 향상을 보아 학습 시간을 늘리고, 압축 잡음을 위한 초해상화 모델을 사용한다면 기존의 NNIC 보다 나은 성능을 보일 수 있는 가능성을 시사한다.

  • PDF

Near-lossless Coding of Multiview Texture and Depth Information for Graphics Applications (그래픽스 응용을 위한 다시점 텍스처 및 깊이 정보의 근접 무손실 부호화)

  • Yoon, Seung-Uk;Ho, Yo-Sung
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.46 no.1
    • /
    • pp.41-48
    • /
    • 2009
  • This Paper introduces representation and coding schemes of multiview texture and depth data for complex three-dimensional scenes. We represent input color and depth images using compressed texture and depth map pairs. The proposed X-codec encodes them further to increase compression ratio in a near-lossless way. Our system resolves two problems. First, rendering time and output visual quality depend on input image resolutions rather than scene complexity since a depth image-based rendering techniques is used. Second, the random access problem of conventional image-based rendering could be effectively solved using our image block-based compression schemes. From experimental results, the proposed approach is useful to graphics applications because it provides multiview rendering, selective decoding, and scene manipulation functionalities.

A Still Image Coding of Wavelet Transform Mode by Rearranging DCT Coefficients (DCT계수의 재배열을 통한 웨이브렛 변환 형식의 정지 영상 부호화)

  • Kim, Jeong-Sik;Kim, Eung-Seong;Lee, Geun-Yeong
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.38 no.5
    • /
    • pp.464-473
    • /
    • 2001
  • Since DCT algorithm divides an image into blocks uniformly in both the spatial domain and the frequency domain, it has a weak point that it can not reflect HVS(Human Visual System) efficiently To avoid this problem, we propose a new algorithm, which combines only the merits of DCT and wavelet transform. The proposed algorithm uses the high compaction efficiency of DCT, and applies wavelet transform mode to DCT coefficients, so that the algorithm can utilize interband and intraband correlations of wavelet simultaneously After that, the proposed algorithm quantizes each coefficient based on the characteristic of each coefficient's band. In terms of coding method, the quantized coefficients of important DCT coefficients have symmetrical distribution, the bigger that value Is, the smaller occurrence probability is. Using the characteristic, we propose a new still image coding algorithm of symmetric and bidirectional tree structure with simple algorithm and fast decoding time. Comparing the proposed method with JPEG, the proposed method yields better image quality both objectively and subjectively at the same bit rate.

  • PDF

Development of Compression and Transmission Technology of GIS-based High Resolution Image Data in Flood Disaster Situation (홍수재난 상황에서 GIS 기반의 고해상도 영상데이터의 압축 및 전송 기술 개발)

  • Lee, Seung Hyeon;Lee, Eung Joon;Choung, Yun Jae
    • Journal of Korea Multimedia Society
    • /
    • v.20 no.7
    • /
    • pp.1038-1045
    • /
    • 2017
  • The increase in frequency and scale of natural disasters is the typical negative examples of the global climate change and the change of the human living environment. The damage caused by natural disasters in particular including human and physical damage is directly linked to the safety and properties of citizens. Besides, damage occurs directly or indirectly to the SOC facility, and the damaged SOC facility violates the citizens' safety rights. Therefore, a plan to provide prompt and effective risk map information by linking a 3D disaster information display system, which handles the information of the damage that may occur to SOC facilities at the time of disasters, with an on-site assistance application is suggested in this study. The prompt provision of risk map information is defined as a dynamic expression technology in this study. It also processes and compresses the system to display disaster information, a spreading system that can utilize on-site information, and a module developed to organically link with the DB system that builds information and relationships. Based on the module, the effective disaster information compression plan will be prepared, and the prompt information transmission system will be secured.

Design and Implementation of Multi-View 3D Video Player (다시점 3차원 비디오 재생 시스템 설계 및 구현)

  • Heo, Young-Su;Park, Gwang-Hoon
    • Journal of Broadcast Engineering
    • /
    • v.16 no.2
    • /
    • pp.258-273
    • /
    • 2011
  • This paper designs and implements a multi-view 3D video player system which is operated faster than existing video player systems. The structure for obtaining the near optimum speed in a multi-processor environment by parallelizing the component modules is proposed to process large volumes of multi-view image data at high speed. In order to use the concurrency of bottleneck, we designed image decoding, synthesis and rendering modules in a pipeline structure. For load balancing, the decoder module is divided into the unit of viewpoint, and the image synthesis module is geometrically divided based on synthesized images. As a result of this experiment, multi-view images were correctly synthesized and the 3D sense could be felt when watching the images on the multi-view autostereoscopic display. The proposed application processing structure could be used to process large volumes of multi-view image data at high speed, using the multi-processors to their maximum capacity.

A Neural Network based Block Classifier for High Speed Fractal Image Compression (고속 프랙탈 영상압축을 위한 신경회로망 기반 블록분류기)

  • 이용순;한헌수
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.10 no.3
    • /
    • pp.179-187
    • /
    • 2000
  • Fractal theory has strengths such as high compression rate and fast decoding time in application to image compression, but it suffers from long comparison time necessary for finding an optimally similar domain block in the encoding stage. This paper proposes a neural network based block classifier which enhances the encoding time significantly by classifying domain blocks into 4 patterns and searching only those blocks having the same pattern with the range block to be encoded. Size of a block is differently determined depending on the image complexity of the block. The proposed algorithm has been tested with three different images having various featrues. The experimental results have shown that the proposed algorithm enhances the compression time by 40% on average compared to the conventional fractal encoding algorithms, while maintaining allowable image qualify of PSNR 30 dB.

  • PDF

A Robust Digital Watermarking based on Virtual Optics (가상 광학에 기반한 강인한 디지털 워터마킹)

  • Lee, Geum-Boon;Cho, Beom-Joon
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.15 no.5
    • /
    • pp.1073-1080
    • /
    • 2011
  • In this paper, we propose a novel digital watermarking method by virtual optics which secures multimedia information such as images, videos and sounds. To secure the multimedia data, we use Fresnel transform which describes the diffraction phenomena of the waves. Also, this method attaches the random phase function to Fresnel transform so that original image and watermark image would be gaussian random vectors. The complex numbers of watermark by Fresnel transform are separated the real part and the imaginary part. The former is embedded in original image as a encoding key imperceptibly and the latter is used for detecting the watermark as a decoding key. This method for digital watermarking ensures that watermark can be successfully registered and extracted from the watermarked image. Further, it provides the robustness to signal processing operation and geometric distortion and proves the strong resilience against cropping attack. The performance evaluation of the experiment is carried out with PSNR, and the numerical simulation results show the efficiency of the proposed method.

Image Compression using Validity and Zero Coefficients by DCT(Discrete Cosine Transform) (DCT에서 유효계수와 Zero계수를 이용한 영상 압축)

  • Kim, Jang Won;Han, Sang Soo
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.1 no.3
    • /
    • pp.97-103
    • /
    • 2008
  • In this paper, $256{\times}256$ input image is classified into a validity block and an edge block of $8{\times}8$ block for image compression. DCT(Discrete Cosine Transform) is executed only for the DC coefficient that is validity coefficients for a validity block. Predict the position where a quantization coefficient becomes 0 for an edge block, I propose new algorithm to execute DCT in the reduced region. Not only this algorithm that I proposed reduces computational complexity of FDCT(Forward DCT) and IDCT(Inverse DCT) and decreases encoding time and decoding time. I let compressibility increase by accomplishing other stability verticality zigzag scan by the block size that was classified for each block at the time of huffman encoding each. In addition, the algorithm that I suggested reduces Run-Length by accomplishing the level verticality zigzag scan that is good for a classified block characteristic and, I offer the compressibility that improved thereby.

  • PDF

A Study on the Textuality Represented in Modern Fashion Photographs (현대 패션사진에 나타난 텍스트성 연구)

  • Park, Mi-Joo;Yang, Sook-Hi
    • The Research Journal of the Costume Culture
    • /
    • v.18 no.5
    • /
    • pp.977-990
    • /
    • 2010
  • Today, as individuals show their social identities and reflect their being as the members of society with a culture, an art style and communication function are stood out in fashion photographs. Accordingly, the meanings of images into text are expanded in its interpretative width through the acceptor's various terms. This researcher looked into four theories of both positions on the textuality of language and image, and considered the point of discussion on image of each theory through modern fashion photographs. First, the theory which divides language and image as auditory and visual recognitions in the textuality of language and image is limited from the view it focuses on only one side without considering the ambivalent elements of each field. For the textuality in modern fashion photographs, the observer attempts to turn it into text to give meaning to it as the recognition through five senses conforming to the acceptor's condition. Second, the theory dividing language and image into the text of time properties and spacial properties has limitation in the text, for acceptor's experience of the object appears as the structured form in time and space rather than being defined as two things like time and space. Third, the theory classifying the language and image text into conventional taste and natural taste has limitation from the view that image text is hardly an object of consistent classification in ease of recognition by the code accepted in society. Thus, this can't be fundamental approach for the understanding of the text of decoding trend represented in modern fashion photographs. Fourth, accordingly, this researcher focussed on contextual and arbitrary text of fashion photographs through the theory of Nelson Goodman which discusses image text through the differences in textuality. Basic mechanism of perceiving and recognizing and distinguish image is closely related to habit and custom like language. So, each acceptor perceives the image as a text through arbitrary interpretation obtained by individual, empirical, historical, and educational viewpoints. The textuality of modern fashion photographs aims to widen the range of diverse knowledge and understanding, transcending the regulations of simple function of existing fashion photographs. Consequently, this researcher puts forward the opinion of consistent and diverse follow-up studies on instilling meaning into fashion photographs for the understanding de-regulatory and de-constructive through various senses by avoiding only one sense-dependent fixed and regulatory properties of it.