Search | Korea Science

Discrete Wavelet Transform Network based on Deep Learning (딥러닝 기반 이산웨이블릿변환 네트워크)

Lee, Ju-Won;Park, Chan-Seung;Yoon, Young-Jae;Kim, Dong-Wook
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2020.11a / pp.347-350 / 2020
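In this paper, we implement the discrete wavelet transform (DWT), an image transform technique, as a deep-learning-based network. The network is designed on a CNN basis and consists only of layers that do not depend on resolution. Label datasets are constructed using a Python library. Using 128×128 gray-scale images as input together with the corresponding label dataset, we train a network that performs a 1-level DWT. For the inverse transform, a network is likewise designed, a dataset is constructed, and training is performed. A multi-level DWT network is built by repeatedly applying the trained 1-level DWT network. We also run a simple image compression experiment based on quantization to show the performance of the DWT network and its applicability to areas such as compression. The designed DWT network achieved a PSNR of 42.18 dB for the 1-level forward transform and 50.13 dB for the 1-level inverse transform.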
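As a reference point for what such a network is trained to approximate, the following is a minimal sketch of a conventional 1-level 2-D DWT, its inverse, the repeated application that yields a multi-level transform, and a PSNR measure. The Haar filter, the subband names, and all function names are assumptions made for illustration; the abstract does not state which wavelet or which Python library was used to build the labels.

```python
import numpy as np

SQRT2 = np.sqrt(2.0)

def haar_dwt2(x):
    """One level of a 2-D orthonormal Haar DWT (assumed filter).

    Expects even height and width; returns four half-size subbands.
    """
    x = np.asarray(x, dtype=np.float64)
    # Horizontal pass: pairwise sums (low-pass) and differences (high-pass).
    lo = (x[:, 0::2] + x[:, 1::2]) / SQRT2
    hi = (x[:, 0::2] - x[:, 1::2]) / SQRT2
    # Vertical pass on each intermediate band.
    ll = (lo[0::2] + lo[1::2]) / SQRT2
    lh = (lo[0::2] - lo[1::2]) / SQRT2
    hl = (hi[0::2] + hi[1::2]) / SQRT2
    hh = (hi[0::2] - hi[1::2]) / SQRT2
    return ll, lh, hl, hh

def haar_idwt2(ll, lh, hl, hh):
    """Inverse of haar_dwt2: undo the vertical pass, then the horizontal pass."""
    rows, cols = ll.shape
    lo = np.empty((rows * 2, cols))
    hi = np.empty((rows * 2, cols))
    lo[0::2], lo[1::2] = (ll + lh) / SQRT2, (ll - lh) / SQRT2
    hi[0::2], hi[1::2] = (hl + hh) / SQRT2, (hl - hh) / SQRT2
    x = np.empty((rows * 2, cols * 2))
    x[:, 0::2], x[:, 1::2] = (lo + hi) / SQRT2, (lo - hi) / SQRT2
    return x

def multilevel_dwt2(x, levels):
    """Multi-level DWT built by re-applying the 1-level transform to LL."""
    details = []
    ll = np.asarray(x, dtype=np.float64)
    for _ in range(levels):
        ll, lh, hl, hh = haar_dwt2(ll)
        details.append((lh, hl, hh))
    return ll, details

def psnr(reference, test, peak=255.0):
    """Peak signal-to-noise ratio in dB for images with the given peak value."""
    mse = np.mean((np.asarray(reference, float) - np.asarray(test, float)) ** 2)
    return float("inf") if mse == 0 else 10.0 * np.log10(peak ** 2 / mse)
```

A quantization experiment in the spirit of the abstract could round each subband to a multiple of a step size, call `haar_idwt2` on the rounded subbands, and report `psnr` against the original image.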

Integrity Authentication Algorithm of JPEG Compressed Images through Reversible Watermarking (가역 워터마킹 기술을 통한 JPEG 압축 영상의 무결성 인증 알고리즘)

Jo, Hyun-Wu;Yeo, Dong-Gyu;Lee, Hae-Yeoun
- The KIPS Transactions:PartB / v.19B no.2 / pp.83-92 / 2012
Multimedia content can be copied and manipulated without quality degradation, which makes it vulnerable to digital forgery and illegal distribution. As multimedia security grows in importance, a variety of multimedia security techniques are being studied. In this paper, we propose a content authentication algorithm based on reversible watermarking that supports JPEG compression, which is commonly used for multimedia content. After the image is split into blocks, an authentication code for each block is extracted and embedded into the quantized coefficients produced during JPEG compression, which are preserved against lossy processing. During decoding, the watermarked JPEG image is authenticated by extracting the embedded code and is then restored to the original image quality. To evaluate the performance of the proposed algorithm, we analyzed image quality and compression ratio on various test images. The average PSNR and compression ratio of the watermarked JPEG images were 33.13 dB and 90.65%, respectively, differing from standard JPEG compression by 2.44 dB and 1.63%.
https://doi.org/10.3745/KIPSTB.2012.19B.2.083

Uncompressed 3D HD Video and Multi-channel Sound Transport (비압축 3D HD 영상 및 다채널 음성 전송)

Chae, Jong-Kwon;Lee, Young-Han;Kim, Jong-Won;Kim, Hong-Kook
- Proceedings of the HCI Society of Korea Conference / 2007.02a / pp.706-712 / 2007
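Advances in ultra-high-speed optical network technology, established for international research purposes, call for new applications. High-quality, low-latency immersive collaboration applications not only fit this research purpose but are also expected to meet the needs of future community-based applications. In this paper, we build an uncompressed HD stereoscopic video transport system, as required for immersive collaboration, so that users can experience 3D HD video. We also address software-based multi-channel audio playback and show through experiments the feasibility of building a directional collaboration environment. For stereoscopic media playback, we build a parallel left/right sending and receiving system, transmit uncompressed stereoscopic video, and propose a design for inter-media synchronization between the left and right video sessions. The audio playback software is implemented using ALSA, and a buffer is added at the preprocessing stage of the playback module to prevent channel swapping caused by variable data lengths and frame loss. Because combining ultra-high-speed networks with uncompressed media transport enables immersive HDTV with multi-channel audio over IP, we examine usage scenarios that can put this combination to good use.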

Detection of Frame Deletion for HEVC-coded Video Using CNN (CNN 기반 HEVC 압축된 동영상의 삭제 검출 기법)

Hong, Jin Hyung;Yang, Yoonmo;Oh, Byung Tae
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2018.06a / pp.190-192 / 2018
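As progress in deep learning accelerates, research that combines it with existing algorithms to achieve substantial performance gains is increasing rapidly. In this paper, we propose an algorithm that uses deep learning to detect whether some frames have been deleted from HEVC-compressed video. HEVC coding parameters that carry frame-deletion information are extracted, and simple preprocessing effectively reduces the data size; a CNN is then constructed so that the temporal characteristics of the video are taken into account. Experiments confirmed that the method detects frame deletion effectively and remains robust across a variety of compression settings.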

A Preprocessing Algorithm for Layered Depth Image Coding (계층적 깊이영상 정보의 압축 부호화를 위한 전처리 방법)

윤승욱;김성열;호요성
- Journal of Broadcast Engineering / v.9 no.3 / pp.207-213 / 2004
The layered depth image (LDI) is an efficient approach to representing three-dimensional objects with complex geometry for image-based rendering (IBR). An LDI contains several attribute values together with multiple layers at each pixel location. In this paper, we propose an efficient preprocessing algorithm to compress the depth information of an LDI. Considering each depth value as a point in two-dimensional space, we compute the minimum distance between the current depth value and a straight line passing through the previous two values. This minimum distance then replaces the current attribute value. The proposed algorithm reduces the variance of the depth information and therefore improves the transform and coding efficiency.
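The point-to-line step can be sketched as follows. This is one possible reading of the abstract, not the authors' implementation: depth value number i is treated as the 2-D point (i, d[i]), the first two values are kept unchanged, a signed perpendicular distance is used so that the mapping can be inverted, and the function names are hypothetical.

```python
import numpy as np

def encode_depths(depths):
    """Replace each depth by its signed distance to the line through the previous two."""
    d = np.asarray(depths, dtype=np.float64)
    out = d.copy()  # the first two values are stored as-is (assumption)
    for i in range(2, len(d)):
        slope = d[i - 1] - d[i - 2]      # line through (i-2, d[i-2]) and (i-1, d[i-1])
        predicted = d[i - 1] + slope     # where that line reaches x = i
        # Perpendicular distance = vertical offset / sqrt(1 + slope^2).
        out[i] = (d[i] - predicted) / np.hypot(1.0, slope)
    return out

def decode_depths(residuals):
    """Invert encode_depths using the already reconstructed previous two values."""
    r = np.asarray(residuals, dtype=np.float64)
    d = r.copy()
    for i in range(2, len(r)):
        slope = d[i - 1] - d[i - 2]
        d[i] = d[i - 1] + slope + r[i] * np.hypot(1.0, slope)
    return d
```

For a run of depths that changes almost linearly, such as `[10, 12, 14, 16, 18.5, 21]`, the encoded values after the first two stay near zero, which illustrates the variance reduction the abstract describes.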

Quadtree Based Infrared Image Compression in Wavelet Transform Domain (웨이브렛 변환 영역에서 쿼드트리 기반 적외선 영상 압축)

조창호;이상효
- The Journal of Korean Institute of Communications and Information Sciences / v.29 no.3C / pp.387-397 / 2004
The wavelet transform, which provides both frequency and spatial information about an image, has proved very effective for image compression, and many recent studies address coding algorithms for images decomposed by the wavelet transform together with multi-resolution theory. This paper proposes a quadtree decomposition method for compressing wavelet-decomposed images that exploits the correlations between pixels and the grouping of zero-valued data. Since the coefficients obtained by the wavelet transform are highly correlated across scales and highly concentrated, the quadtree method can reduce the data quantity effectively. An experimental 8-bit infrared image of size 256×256 was used to compare the performance of the existing and the proposed compression methods.
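The idea of grouping zeros with a quadtree can be illustrated with a generic sketch; it is not the coder proposed in the paper. It assumes a square block of integer (already quantized) coefficients whose side is a power of two, and it mixes flags and values in one symbol list for simplicity. The function names are hypothetical.

```python
import numpy as np

def quadtree_encode(block, out):
    """Append a symbol stream for `block` to `out`.

    An all-zero block costs a single 0 flag; otherwise a 1 flag is written
    and the block is split into four quadrants, down to single coefficients.
    """
    if not block.any():
        out.append(0)
        return
    out.append(1)
    n = block.shape[0]
    if n == 1:
        out.append(int(block[0, 0]))  # nonzero coefficient value
        return
    h = n // 2
    for sub in (block[:h, :h], block[:h, h:], block[h:, :h], block[h:, h:]):
        quadtree_encode(sub, out)

def quadtree_decode(stream, n):
    """Rebuild the n-by-n block from a stream written by quadtree_encode."""
    symbols = iter(stream)
    block = np.zeros((n, n), dtype=np.int64)

    def fill(row, col, size):
        if next(symbols) == 0:        # whole region is zero
            return
        if size == 1:
            block[row, col] = next(symbols)
            return
        h = size // 2
        for dr, dc in ((0, 0), (0, h), (h, 0), (h, h)):
            fill(row + dr, col + dc, h)

    fill(0, 0, n)
    return block
```

On an 8×8 block with only two nonzero coefficients near the top-left corner, the stream is much shorter than the 64 raw values, because each empty quadrant collapses to a single flag.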

Security for Real-Time Desktop Video Conferencing System (실시간 영상회의 시스템보안)

이상하;장준교;신성철;김동규
- Proceedings of the Korean Information Science Society Conference / 1998.10a / pp.556-558 / 1998
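Attempts are being made to use real-time video conferencing systems in a variety of ways over the Internet. Research in this area is active in audio and video compression, multimedia synchronization, and the IP multicast MBone for supporting multiparty video conferencing, and as line speeds increase, a variety of video-based multimedia services are being provided over the Internet. In an open, distributed Internet environment, the security of video conferencing data, namely video and audio, becomes a serious issue. This paper presents security methods suited to the characteristics of multimedia data in real-time video conferencing.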

Design and Implementation of an MPEG2 Non-linear Editor in Client/Server Environment (클라이언트/서버 환경에서의 MPEG-2 비선형 편집기의 설계 및 구현)

김소만;송기택;유지욱;이성환
- Proceedings of the Korean Information Science Society Conference / 2000.04b / pp.541-543 / 2000
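Recently, along with advances in communications and multimedia, research on the effective management and processing of large-volume video data has been actively pursued. Accordingly, in this paper we implemented a non-linear editing system in a client/server environment for the effective management and processing of data in MPEG-2, a video compression standard. The system consists mainly of an indexing part, which uses compressed-domain information to perform scene change detection and indexing with minimal decoding, and an editing part based on DirectX. Using compressed-domain information makes indexing fast, and using DirectX lets the system provide a variety of editing functions and an intuitive, simple user interface. In addition, by providing editing functions in a client/server environment, the system allows MPEG-2 video to be edited effectively in software even on low-end client computers through a high-performance editing server.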

An algorithm for generating temporal texture for video retrieval (동영상 검색을 위한 템포럴 텍스처 생성 알고리즘)

Kim, Do-Nyun;Cho, Dong-Sub
- Proceedings of the KIEE Conference / 2000.11d / pp.839-841 / 2000
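Texture provides a great deal of information not only for still images but also for video analysis. We use texture information for motion classification in video: in video retrieval systems that use color, the layout of color regions, reference shapes, brightness texture, and the like as basic search keys, we apply texture features to motion information and show that motion information can be extracted directly from low-level information. The advantage of this method is that motion information can be extracted from compressed video without computationally expensive operations such as background removal, object extraction and tracking, and reference curve search. Moreover, because video data is very large, compression is essential; this work exploits the fact that motion information in wavelet-compressed video is concentrated in the high-frequency part and generates temporal texture directly, without an inverse transform. This improves computation speed, and the computation is kept simple by basing it on matrix operations.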

A Study on Evolutionary Computation of Fractal Image Compression (프랙탈 영상 압축의 진화적인 계산에 관한 연구)

Yoo, Hwan-Young;Choi, Bong-Han
- The Transactions of the Korea Information Processing Society / v.7 no.2 / pp.365-372 / 2000
This paper introduces evolutionary computing to fractal image compression (FIC). FIC requires a partitioning of the image into ranges, and we propose applying evolutionary computation to this partitioning problem. Here ranges are connected sets of small square image blocks. Populations consist of $N_p$ configurations, each of which is a partitioning with a fractal code. In the evolution, each configuration produces $\sigma$ children, which inherit their parent's partitioning except for two random neighboring ranges, which are merged. From the offspring, the best ones are selected for the next-generation population according to a fitness criterion based on the Collage Theorem. Because the optimized representation exploits duplication in the image data, it requires less storage, is faster, and yields better image quality than techniques that use other coding methods. In multimedia image processing, FIC using evolutionary computation applies to fields such as image restoration and animation, which need high image quality and a high compression ratio.
