• Title/Summary/Keyword: hierarchical B layer

Multi-resolution hierarchical motion estimation in the wavelet transform domain (웨이브렛 변환된 다해상도 영상을 이용한 계층적 움직임 추정)

  • 김진태;장준필;김동욱;최종수
    • Journal of the Korean Institute of Telematics and Electronics B / v.33B no.8 / pp.50-59 / 1996
  • In this paper, a new hierarchical motion estimation scheme using wavelet-transformed multi-resolution image layers is proposed. Compared with the full-search motion estimation method, existing hierarchical methods remarkably reduce the amount of computation, but their efficiency is degraded by the local minima problem. To solve the local minima problem, the multi-resolution image layers are composed using the wavelet transform, and the number of layers participating in the motion estimation for a block is determined by considering its low-band energy and higher-band energy on the first wavelet-transformed layer. The ratio between the higher-band energy and the low-band energy of each block is evaluated; for blocks that contain relatively large higher-band energy, motion estimation is carried out in the high-resolution layer. Otherwise, all layers are used. The final motion vectors are obtained in the first wavelet-transformed layer, so fewer bits are transmitted for motion vectors, and reconstructing the received image with the inverse wavelet transform decreases the blocking effect.
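
The per-block layer decision described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes a one-level orthonormal Haar transform as the wavelet, and the names `haar_band_energies` and `layers_for_block`, the threshold of 0.1, and the three-layer pyramid are all hypothetical choices that do not come from the abstract.

```python
def haar_band_energies(block):
    """One-level 2D Haar transform of an even-sized block.

    Returns (low_band_energy, higher_band_energy), where the higher-band
    energy sums the three detail bands (LH, HL, HH).
    """
    low = high = 0.0
    for r in range(0, len(block), 2):
        for c in range(0, len(block[0]), 2):
            a, b = block[r][c], block[r][c + 1]
            d, e = block[r + 1][c], block[r + 1][c + 1]
            ll = (a + b + d + e) / 2.0
            lh = (a - b + d - e) / 2.0
            hl = (a + b - d - e) / 2.0
            hh = (a - b - d + e) / 2.0
            low += ll * ll
            high += lh * lh + hl * hl + hh * hh
    return low, high


def layers_for_block(block, total_layers=3, threshold=0.1):
    """Choose which pyramid layers take part in the search for one block.

    Layer 1 is the highest-resolution layer. The threshold value and the
    number of layers are assumptions made for illustration only.
    """
    low, high = haar_band_energies(block)
    if low > 0:
        ratio = high / low
    else:
        ratio = 0.0 if high == 0 else float("inf")
    if ratio > threshold:
        # Detail-rich block: search in the high-resolution layer only.
        return [1]
    # Smooth block: coarse-to-fine search through every layer.
    return list(range(total_layers, 0, -1))


flat = [[10] * 4 for _ in range(4)]
checker = [[255 * ((r + c) % 2) for c in range(4)] for r in range(4)]
print(layers_for_block(flat))     # smooth block uses all layers
print(layers_for_block(checker))  # detail-rich block uses layer 1 only
```

Skipping the coarse layers for detail-rich blocks reflects the motivation stated in the abstract: fine texture is lost at low resolution, which is where a hierarchical search is most likely to lock onto a local minimum.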

Hangul Recognition Using a Hierarchical Neural Network (계층구조 신경망을 이용한 한글 인식)

  • 최동혁;류성원;강현철;박규태
    • Journal of the Korean Institute of Telematics and Electronics B / v.28B no.11 / pp.852-858 / 1991
  • An adaptive hierarchical classifier (AHCL) for Korean character recognition using a neural net is designed. This classifier has two neural nets: USACL (Unsupervised Adaptive Classifier) and SACL (Supervised Adaptive Classifier). USACL has an input layer and an output layer, which are fully connected. The nodes in the output layer are generated by the unsupervised nearest-neighbor learning rule during learning. SACL has an input layer, a hidden layer, and an output layer. The input layer and the hidden layer are fully connected, and the hidden layer and the output layer are partially connected. The nodes in the SACL are generated by the supervised nearest-neighbor learning rule during learning. USACL has a pre-attentive effect: it performs a partial search instead of a full search during SACL classification to enhance processing speed. The input of USACL and SACL is a directional edge feature with a directional receptive field. To test the performance of the AHCL, various multi-font printed Hangul characters are used in learning and testing, and its processing speed and classification rate are compared with those of the conventional LVQ (Learning Vector Quantizer), which uses the nearest-neighbor learning rule.

Temporal Prediction Structure for Multi-view Video Coding (다시점 비디오 부호화를 위한 시간적 예측 구조)

  • Yoon, Hyo-Sun;Kim, Mi-Young
    • Journal of Korea Multimedia Society / v.15 no.9 / pp.1093-1101 / 2012
  • Multi-view video is obtained by capturing one three-dimensional scene with many cameras at different positions. Multi-view video coding exploits inter-view correlations among pictures of neighboring views and temporal correlations among pictures of the same view. Because it uses many cameras, multi-view video coding requires a method to reduce the computational complexity. In this paper, we propose an efficient prediction structure to improve the performance of multi-view video coding. The proposed prediction structure exploits the average distance between the current picture and its reference pictures. It divides every GOP into several small groups to decide the maximum index of the hierarchical B layer and the number of pictures in each B layer. Experimental results show that the proposed prediction structure performs well in image quality and bit rate. Compared with the hierarchical B pictures of Fraunhofer-HHI, the proposed prediction structure achieved a PSNR gain of 0.07 to 0.13 dB and reduced the bit rate by 6.5 kbps.
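
For orientation, the two quantities this abstract works with (the hierarchical B layer index and the distance between a picture and its references) can be sketched for a conventional dyadic hierarchical B GOP. This is only the baseline structure, assumed here to have a power-of-two GOP size; the abstract does not give enough detail to reproduce the proposed grouping. The function names are hypothetical.

```python
def temporal_layer(poc, gop_size):
    """Layer index of a picture in a dyadic hierarchical B GOP.

    Key pictures (position 0 in each GOP) are layer 0; the B picture in the
    middle of the GOP is layer 1, and so on down to the odd-numbered pictures.
    """
    if gop_size < 1 or gop_size & (gop_size - 1):
        raise ValueError("gop_size must be a power of two")
    pos = poc % gop_size
    if pos == 0:
        return 0
    layer = gop_size.bit_length() - 1  # log2(gop_size), the deepest layer
    while pos % 2 == 0:
        pos //= 2
        layer -= 1
    return layer


def reference_distance(poc, gop_size):
    """Temporal distance from a picture to each of its two nearest references."""
    return gop_size >> temporal_layer(poc, gop_size)


gop = 8
print([temporal_layer(p, gop) for p in range(gop + 1)])
b_pictures = range(1, gop)
print(sum(reference_distance(p, gop) for p in b_pictures) / len(b_pictures))
```

In this baseline the maximum layer index is fixed at log2 of the GOP size. The abstract describes choosing that maximum index and the number of pictures per layer by splitting the GOP into smaller groups instead.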

Improved Prediction Structure and Motion Estimation Method for Multi-view Video Coding (다시점 비디오 부호화를 위한 개선된 예측 구조와 움직임 추정 기법)

  • Yoon, Hyo Sun;Kim, Mi Young
    • Journal of KIISE / v.41 no.11 / pp.900-910 / 2014
  • Multi-view video is obtained by capturing one three-dimensional scene with many cameras at different positions. The computational complexity of multi-view video coding increases in proportion to the number of cameras. To reduce computational complexity while maintaining image quality, an improved prediction structure and motion estimation method are proposed in this paper. The proposed prediction structure exploits the average distance between the current picture and its reference pictures. It divides every GOP into several groups to decide the maximum index of the hierarchical B layer and the number of pictures in each B layer. The proposed motion estimation method uses a hierarchical search strategy that consists of a modified diamond search pattern, a progressive diamond search pattern, and a modified raster search pattern. Experimental results show that, compared with the JMVC (Joint Multiview Video Coding) reference model using the hierarchical B pictures of Fraunhofer-HHI and the TZ search method, the proposed prediction structure and motion estimation method reduce complexity by 40 to 70% while maintaining similar video quality and bit rates.

Temporal Prediction Structure and Motion Estimation Method based on the Characteristic of the Motion Vectors (시간적 예측 구조와 움직임 벡터의 특성을 이용한 움직임 추정 기법)

  • Yoon, Hyo Sun;Kim, Mi Young
    • Journal of Korea Multimedia Society / v.18 no.10 / pp.1205-1215 / 2015
  • Efficient multi-view coding techniques are needed to reduce the complexity of multi-view video coding, which increases in proportion to the number of cameras. To reduce the complexity while maintaining image quality and bit rates, a motion estimation method and a temporal prediction structure are proposed in this paper. The proposed motion estimation method exploits the characteristics of the motion vector distribution and the motion direction and motion size of the block to place search points and decide the search pattern adaptively. The proposed prediction structure divides every GOP to decide the maximum index of the hierarchical B layer and the number of pictures in each B layer. Experimental results show that, compared with the hierarchical B picture prediction structure and TZ search method used in the JMVC (Joint Multi-view Video Coding) reference model, the proposed temporal prediction structure and motion estimation method reduce complexity by 45 to 70% while maintaining similar video quality and bit rates.

A Cross-Layer Unequal Error Protection Scheme for Prioritized H.264 Video using RCPC Codes and Hierarchical QAM

  • Chung, Wei-Ho;Kumar, Sunil;Paluri, Seethal;Nagaraj, Santosh;Annamalai, Annamalai Jr.;Matyjas, John D.
    • Journal of Information Processing Systems / v.9 no.1 / pp.53-68 / 2013
  • We investigate rate-compatible punctured convolutional (RCPC) codes concatenated with hierarchical QAM for designing a cross-layer unequal error protection scheme for H.264 coded sequences. We first divide the H.264 encoded video slices into three priority classes based on their relative importance. We investigate the system constraints and propose an optimization formulation to compute the optimal parameters of the proposed system for the given source significance information. An upper bound on the significance-weighted bit error rate of the proposed system is derived as a function of system parameters, including the code rate and the geometry of the constellation. An example is given with design rules for H.264 video communications, and a 3.5-4 dB PSNR improvement over existing RCPC-based techniques for AWGN wireless channels is shown through simulations.

Parallel, self-organizing, hierarchical neural networks for handwritten digit recognition (필기체 숫자인식을 위한 병렬 자구성 계층 신경회로망)

  • 방극준;조남신;강창언;홍대식
    • Journal of the Korean Institute of Telematics and Electronics B / v.33B no.7 / pp.173-182 / 1996
  • In this paper, we propose parallel, self-organizing, hierarchical neural networks (PSHNN) as a handwritten digit recognition system. This system can absorb the various shape variations of handwritten digits by using a different feature extraction method in each stage neural network (SNN) of the PSHNN, can reduce training time by using a single-layer neural network as the SNN, and can obtain a high rate of correct recognition by using the certainty area in each output node individually. Experiments have been performed with the NIST database, in which we use 21,315 digits (10,625 digits for training and 10,663 digits for testing). The results show that the correct rate is 97.48%, the error rate is 1.72%, and the reject rate is 0.78%.

Improving the Base-Layer BER performance at AT-DMB using a Channel Estimation (AT-DMB 시스템에서 채널추정을 이용한 기본계층 수신 성능 향상기법)

  • Bang, Keuk-Joon
    • 전자공학회논문지 IE / v.49 no.2 / pp.46-51 / 2012
  • The transmit signal of the enhancement layer in the AT-DMB system is received by coherent detection, whereas the base layer of AT-DMB adopts differential modulation and demodulation, the same as T-DMB. Coherent detection of the enhancement layer in the AT-DMB system requires channel estimation. In this paper, I show that the BER performance of the base layer in the AT-DMB system can be improved by using the channel estimation information. The suggested method focuses the equalized constellation onto the nearest $\pi/4$-shift DQPSK constellation points. Simulation results show that, for an uncoded AWGN channel, a gain of about 2 dB can be achieved at a BER of $10^{-4}$.
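
One possible reading of the "nearest constellation point" step is sketched below. It is an illustration under stated assumptions, not the paper's algorithm: the sample is taken to be already equalized with the channel estimate, even-indexed symbols are assumed to use the phases 0, π/2, π and 3π/2, odd-indexed symbols are assumed to be offset by π/4, and the function name `snap_to_constellation` is hypothetical.

```python
import cmath
import math


def snap_to_constellation(sample, symbol_index):
    """Return the unit-magnitude pi/4-shift DQPSK point nearest to `sample`.

    Assumption: even symbol indices use the phases 0, pi/2, pi, 3*pi/2 and
    odd symbol indices use the same set rotated by pi/4.
    """
    offset = (symbol_index % 2) * math.pi / 4
    # On the unit circle the nearest point is the one with the nearest phase.
    k = round((cmath.phase(sample) - offset) / (math.pi / 2))
    return cmath.rect(1.0, offset + k * math.pi / 2)


print(snap_to_constellation(0.9 + 0.1j, 0))   # close to 1+0j
print(snap_to_constellation(0.6 + 0.8j, 1))   # close to exp(j*pi/4)
```

Differential demodulation would then operate on the snapped points rather than on the noisy equalized samples.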

A Study on the Subband Coding System Using Motion Compensation Techniques (이동 보상 기법을 이용한 서브밴드 부호화 시스템에 관한 연구)

  • 이기승;박용철;서정태;윤대희
    • Journal of the Korean Institute of Telematics and Electronics B / v.31B no.10 / pp.99-111 / 1994
  • A motion picture compression scheme using subband coding with motion compensation is presented in this paper. A hierarchical subband decomposition is used to split the image signal into 10 subbands with a 3-layer pyramid structure, and motion compensation is used in each band. In this case, however, the amount of motion vector information increases drastically; therefore, initial motion vectors are estimated in the highest pyramid layer and are then refined using the reconstructed subband signal in each layer. Simulation results show that the proposed method compares favorably, in terms of prediction error energy and side information, with methods requiring additional information. Images reconstructed by the proposed method show good quality compared with those reconstructed using blockwise DCT.

On the Hybrid Prediction Pyramid Compatible Coding Technique (혼성 예측 피라미드 호환 부호화 기법)

  • 이준서;이상욱
    • The Journal of Korean Institute of Communications and Information Sciences / v.21 no.1 / pp.33-46 / 1996
  • In this paper, we investigate the compatible coding technique, which has received much interest ever since the introduction of HDTV. First, we analyze the theoretical transform coding gains for various hierarchical decomposition techniques, namely subband, pyramid, and DCT-based decomposition. It is shown that the spatial domain techniques provide higher transform coding gains than the DCT-based coding technique. Second, we compare the performance of these spatial domain techniques in terms of the PSNR versus various rate allocations to each layer. Based on these analyses, it is believed that the pyramid decomposition is more appropriate for compatible coding. We also propose a hybrid prediction pyramid coding technique that combines the spatio-temporal prediction in MPEG-2[3] and the adaptive MC (Motion Compensation)[1]. In the proposed coding technique, we also employ an adaptive DCT coefficient scanning technique to exploit the direction information of the 2nd-layer signal. In computer simulations, the proposed hybrid prediction with the adaptive scanning technique improves the PSNR by about 0.46-1.78 dB at a low 1st-layer rate (about 0.1 bpp) over the adaptive MC[1], and by about 0.33-0.63 dB at a high 1st-layer rate (about 0.32-0.43 bpp) over the spatio-temporal prediction[3].
