Search | Korea Science

Adaptive Chroma Block Partitioning Method using Comparison of Similarity between Channels (채널 간 유사도 비교를 이용한 적응형 색차 블록 분할 방법)

Baek, A Ram;Choi, Sanggyu;Choi, Haechul
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2018.06a
- /
- pp.260-261
- /
- 2018
MPEG과 VCEG은 차세대 비디오 부호화 표준 기술 개발를 위한 JVET(Joint Video Exploration Team)을 구성하여 현재 비디오 표준화인 HEVC 대비 높은 부호화 효율을 목표로 연구를 진행하며 CfP(Call for Proposal) 단계를 진행 중이다. JVET의 공통 플랫폼인 JEM(Joint Exploration Test Model)은 HEVC의 quad-tree 기반 블록 분할 구조를 대신하여 더 많은 유연성을 제공하는 QTBT(Quad-tree plus binary-tree)가 적용되었다. QTBT는 화면 내 부호화 효율을 높이기 위한 하나의 방법으로 휘도와 색차 신호에 대해 분할된 블록 구조를 지원한다. 이러한 방법은 채널 간 블록 분할 모양이 동일하거나 비슷한 경우에 중복되는 블록 분할 신호가 발생할 수 있는 단점이 있다. 따라서 본 논문에서는 화면 내 부호화에서 채널 간 유사도 비교를 이용하여 적응형 색차 블록 방법을 제안한다. 제안한 방법의 실험 결과로 JEM 6.0과 비교하여 CfE(Call for Evidence) 영상에서 평균 0.28%의 Y BD-rate 감소와 함께 평균 124.5%의 부호화 복잡도 증가를 확인하였다.
PDF

Moving Object Segmentation for MPEG-4 Object-based Coding (MPEG-4객체 분할 코팅을 위한 움직임 객체 분할)

Kim, Jun-Ki;Chang, Jun;Lee, Ho-Suk
- Proceedings of the Korean Information Science Society Conference
- /
- 2001.10b
- /
- pp.385-387
- /
- 2001
비디오 객체 분할은 MPEG-4와 같은 객체 기반 코딩 단계를 위한 중요한 구성 요소이다. 새로운 MPEG-4 비디오 표준은 움직임 객체의 모양 정보를 고려하여 높은 효율의 부호화 뿐만 아니라 움직임 객체에 대한 내용기반 기능의 부호화를 수행한다. 본 논문은 비디오 시퀀스에서 움직임 객체 분할을 위한 새로운 알고리즘과 VOP(Video Object Plane) 추출 방법을 소개한다. 본 알고리즘은 첫 번째 프레임을 기준영상으로 설정한 후 두 개의 연속된 프레임 사이의 차이 값으로부터 시작된다. 즉 차이영상을 추출한 후 차이영상에 Canny 에지를 적용하고 다음 프레임의 영상에 Canny 에지와 morphologic일 연산을 적용하여 정확한 움직임 객체 에지(Moving Object Edge)를 생성한다. 이후 생성된 에지를 이용하여 VOP를 추출한다. VOP 추출 단계에서 더욱 정확한 움직임 객체 에지를 얻기 위하여 morphological 연산을 수행하였다.
PDF

Block Shape Adaptive Candidate List Derivation for Inter Prediction in Versatile Video Coding (VVC) (VVC 의 블록모양 적응적 화면간 예측 후보 리스트 유도 기법)

Do, JiHoon;Park, Dohyeon;Kim, Jae-Gon;Jeong, Dae-Gwon
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2018.06a
- /
- pp.257-259
- /
- 2018
최근 JVET(Joint Video Experts Team)는 새로운 비디오 압축 표준을 VVC(Versatile Video Coding)으로 이름 짓고 2020 년 완료를 목표로 그 표준화를 시작하였다. HEVC 및 VVC 에서는 화면간 예측의 부호화 효율을 위하여 공간적/시간적 주변블록의 움직임 정보로부터 Merge/AMVP(Advanced Motion Vector Prediction)의 후보 리스트를 구성하고 최적의 움직임 정보를 활용한다. 본 논문에서는 Merge/AMVP 의 후보 리스트를 유도할 때, 현재블록의 모양을 고려하여 상관성이 높은 주변블록의 움직임 정보를 우선 순위로 유도하는 기법을 제안한다. 실험을 통하여 VTM(VVC TM) 대비 제안기법의 성능을 확인한다.
PDF

An Adaptive ROI Mask Generation for ROI coding of JPEG2000 (JPEG200의 관심영역 부호화를 위한 적응적인 관심영역 마스크 생성 방법)

Kang, Ki-Jun;Seo, Yeong-Geon
- Journal of the Korea Society of Computer and Information
- /
- v.12 no.5
- /
- pp.39-47
- /
- 2007
In this thesis, a method of generating an adaptable Region-Of-Interest(ROI) Mask for the Region-Of-Interest coding is suggested. In the method, an ROI Mask is generated using the information of the ROI designated by a user. In the existed method of ROI coding, after scanning all the pixels in order and discriminating an ROI, an ROI Mask is generated. But, in our method, after scanning a part of pixels based on the shape pattern of an ROI and discriminating a ROI by one code block unit, an ROI Mask is generated. Moreover, from the method, a pattern number, threshold of a ROI and background threshold parameter are provided. According to the result of its comparing test with the existed methods to show the usability, it is proved that our method is superior in speed to the existed ones.
PDF

Waveform Coding for Anti-Multipath Mod/Demodulation Systems (내다중파변복조 방식을 위한 파형부호화)

이정재
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.18 no.9
- /
- pp.1323-1331
- /
- 1993
In this paper, novel anti-multipath mod/demodulation techniques are introduced, The principal anti-multipath concepts of the double phase shift keying(DSK) system with a differntial detector and the diversity effect to be obtained from the phase shifts at the middle of symbol duration T are described. A generalized form of DSK, referred to as the $\theta$-DSK, is studied, and comparison of bandwidth is mode for various values of $\theta$ and shaping functions.
PDF

Efficient Coding of Motion Vector and Mode Information for H.264/AVC (H.264/AVC에서 효율적인 움직임 벡터와 모드 정보의 압축)

Lee, Dong-Shik;Kim, Young-Mo
- Journal of Korea Multimedia Society
- /
- v.11 no.10
- /
- pp.1359-1365
- /
- 2008
The portion of header in H.264 gets higher than those of previous standards instead of its better compression efficiency. Therefore, this paper proposes a new technique to compress the header of H.264. Unifying a sentence elementary in H.264, H.264 does not consider the distribution of element which be encoded and uses existing Exp-Golomb method, but it is uneffective for variable length coding. Most of the header are block type(s) and motion vector difference(s), and there are redundancies in the header of H.264. The redundancies in the header of H.264 which are analyzed in this paper are three. There are frequently appearing symbols and non-frequently appearing symbols in block types. And when mode 8 is selected in macroblock, all of four sub-macroblock types are transferred. At last, same values come in motion vector difference, especially '0.' This paper proposes the algorithm using type code and quadtree, and with them presents the redundant information of header in H.264. The type code indicates shape of the macroblock and the quadtree does the tree structured motion compensation. Experimental results show that proposed algorithm achieves lower total number of encoded bits over JM12.4 up to 32.51% bit reduction.
PDF

Local Prominent Directional Pattern for Gender Recognition of Facial Photographs and Sketches (Local Prominent Directional Pattern을 이용한 얼굴 사진과 스케치 영상 성별인식 방법)

Makhmudkhujaev, Farkhod;Chae, Oksam
- Convergence Security Journal
- /
- v.19 no.2
- /
- pp.91-104
- /
- 2019
In this paper, we present a novel local descriptor, Local Prominent Directional Pattern (LPDP), to represent the description of facial images for gender recognition purpose. To achieve a clearly discriminative representation of local shape, presented method encodes a target pixel with the prominent directional variations in local structure from an analysis of statistics encompassed in the histogram of such directional variations. Use of the statistical information comes from the observation that a local neighboring region, having an edge going through it, demonstrate similar gradient directions, and hence, the prominent accumulations, accumulated from such gradient directions provide a solid base to represent the shape of that local structure. Unlike the sole use of gradient direction of a target pixel in existing methods, our coding scheme selects prominent edge directions accumulated from more samples (e.g., surrounding neighboring pixels), which, in turn, minimizes the effect of noise by suppressing the noisy accumulations of single or fewer samples. In this way, the presented encoding strategy provides the more discriminative shape of local structures while ensuring robustness to subtle changes such as local noise. We conduct extensive experiments on gender recognition datasets containing a wide range of challenges such as illumination, expression, age, and pose variations as well as sketch images, and observe the better performance of LPDP descriptor against existing local descriptors.
https://doi.org/10.33778/kcsa.2019.19.2.091 인용 PDF KSCI

Functional Mapping of the Neural Basis for the Encoding and Retrieval of Human Episodic Memory Using ${H_2}^{15}O$ PET ({H_2}^{15}O$ PET을 이용한 정상인의 삽화기억 부호화 및 인출 중추 뇌기능지도화)

Lee, Jae-Sung;Nam, Hyun-Woo;Lee, Dong-Soo;Lee, Sang-Kun;Jang, Myoung-Jin;Ahn, Ji-Young;Park, Kwang-Suk;Chung, June-Key;Lee, Myung-Chul
- The Korean Journal of Nuclear Medicine
- /
- v.34 no.1
- /
- pp.10-21
- /
- 2000
Purpose: Episodic memory is described as an 'autobiographical' memory responsible for storing a record of the events in our lives. We performed functional brain activation study using ${H_2}^{15}O$ PET to reveal the neural basis of the encoding and the retrieval of episodic memory in human normal volunteers. Materials and Methods: Four repeated ${H_2}^{15}O$ PET scans with two reference and two activation tasks were performed on 6 normal volunteers to activate brain areas engaged in encoding and retrieval with verbal materials. Images from the same subject were spatially registered and normalized using linear and nonlinear transformation. Using the means and variances for every condition which were adjusted with analysis of covariance, t-statistic analysis were performed voxel-wise. Results: Encoding of episodic memory activated the opercular and triangular parts of left inferior frontal gyrus, right prefrontal cortex, medial frontal area, cingulate gyrus, posterior middle and inferior temporal gyri, and cerebellum, and both primary visual and visual association areas. Retrieval of episodic memory activated the triangular part of left inferior frontal gyrus and inferior temporal gyrus, right prefrontal cortex and medial temporal area, and both cerebellum and primary visual and visual association areas. The activations in the opercular part of left inferior frontal gyrus and the right prefrontal cortex meant the essential role of these areas in the encoding and retrieval of episodic memory. Conclusion: We could localize the neural basis of the encoding and retrieval of episodic memory using ${H_2}^{15}O$ PET, which was partly consistent with the hypothesis of hemispheric encoding/retrieval asymmetry.
PDF

Design of A Stateless Minimum-Bandwidth Binary Line Code MB46d (Stateless 최소대역폭 2진 선로부호 MB46d의 설계)

Lee, Dong-Il;Kim, Dae-Young
- Journal of the Korean Institute of Telematics and Electronics S
- /
- v.35S no.10
- /
- pp.11-18
- /
- 1998
A binary line code, called MB46d, is designed by use of the BUDA(Binary Unit DSV and ASV) cell concept to retain the property of being runlength limited, DC tree, and with a power spectral null at the Nyquist frequency. This new code is a stateless line code with a simple encoding and a decoding rule and enables efficient error monitoring. The power spectrum and the eye pattern of the new line code are simulated for a minimum-bandwidth digital transmission system where the sinc function is used as a basic pulse. The obtained power null at the Nyquist frequency is wide enough to enable easy band-limiting as well as secure insertion of a clock pilot where necessary. The eye is also substantially wide to tolerate a fair amount of timing jitter in the receiver.
PDF

Video Segmentation using the Level Set Method (Level Set 방법을 이용한 영상분할 알고리즘)

김대희;호요성
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.40 no.5
- /
- pp.303-311
- /
- 2003
Since the MPEG-4 visual standard enables content-based functionalities, it is necessary to extract video object from natural video sequences. Segmentation algorithms can largely be classified into automatic segmentation and user-assisted segmentation. In this paper, we propose a user-assisted VOP generation method based on the geometric active contour. Since the geometric active contour, unlike the parametric active contour, employs the level set method to evolve the curve, we can draw the initial curve independent of the shape of the object. In order to generate the edge function from a smoothed image, we propose a vector-valued diffusion process in the LUV color space. We also present a discrete 3-D diffusion model for easy implementation. By combining the curve shrinkage in the vector field space with the curve expansion in the empty vector space, we can make accurate extraction of visual objects from video sequences.
PDF KSCI

Search Result 22, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)