• Title/Summary/Keyword: 고속 연산

Search Result 897, Processing Time 0.026 seconds

High Speed Implementation of HomePNA 2.0 Frame Processor (HomePNA 2.0 프레임 프로세서의 고속 구현 기법)

  • 강민수;이원철;신요안
    • Proceedings of the IEEK Conference
    • /
    • 2003.07a
    • /
    • pp.533-536
    • /
    • 2003
  • 본 논문에서는 전화선을 이용한 고속 홈네트워크인 HomePNA 2.0 시스템에서 HomePNA 2.0 (H2) 프레임을 만들기 위한 프레임 프로세싱 중, 다항식 나누기 연산을 통한 CRC (Cyclic Redundancy Check) 16비트 생성, HCS (Header Check Sequence) 8비트 생성 및 혼화(Scrambling) 처리에 있어서 입력 8 비트를 동시에 병렬 처리함으로써 기존의 1 비트 입력을 LFSR (Linear Feedback Shift Register)를 사용한 다항식 나누기 연산을 수행했을 때보다 빠른 속도로 H2 프레임을 구현하고자 하는 고속 처리 기법을 제시하고 이의 성능을 검증하였다.

  • PDF

Design of a high-speed 4-2 compressor for fast multiplication (고속 곱셈연산을 위한 고속 4-2 compressor 설계)

  • Lee, Sung-Tae;Kim, Jeong-Beom
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2009.11a
    • /
    • pp.401-402
    • /
    • 2009
  • 4-2 compressor는 곱셈기의 부분 곱 합 트리(partial product summation tree)의 기본적인 구성요소이다. 본 논문은 고속 연산이 가능한 4-2 compressor 구조를 제안한다. 제안한 회로는 최적화된 XORXNOR와 MUX로 구성하였다. 이 회로는 기존의 회로와 비교하였을 때 회로 구성에 필요한 트랜지스터수가 12개 감소하였으며, 지연시간이 32.2% 감소하였다. 제안한 회로는 Samsung 0.18um CMOS 공정을 이용하여 HSPICE로 시뮬레이션 하였다.

A Study on High Performances Floating Point Unit (고성능 부동 소수점 연산기에 대한 연구)

  • Park, Woo-Chan;Han, Tack-Don
    • The Transactions of the Korea Information Processing Society
    • /
    • v.4 no.11
    • /
    • pp.2861-2873
    • /
    • 1997
  • An FPU(Floating Point unit) is the principle component in high performance computer and is placed on a chip together with main processing unit recently. As a Processing speed of the FPU is accelerated, the rounding stage, which occupies one of the floating point Processing steps for floating point operations, has a considerable effect on overall floating point operations. In this paper, by studying and analyzing the processing flows of the conventional floating point adder/subtractor, multipler and divider, which are main component of the FPU, efficient rounding mechanisms are presented. Proposed mechanisms do not require any additional execution time and any high speed adder for rounding operation. Thus, performance improvement and cost-effective design can be achieved by this approach.

  • PDF

A Efficient Architecture of MBA-based Parallel MAC for High-Speed Digital Signal Processing (고속 디지털 신호처리를 위한 MBA기반 병렬 MAC의 효율적인 구조)

  • 서영호;김동욱
    • Journal of the Institute of Electronics Engineers of Korea SD
    • /
    • v.41 no.7
    • /
    • pp.53-61
    • /
    • 2004
  • In this paper, we proposed a new architecture of MAC(Multiplier-Accumulator) to operate high-speed multiplication-accumulation. We used the MBA(Modified radix-4 Booth Algorithm) which is based on the 1's complement number system, and CSA(Carry Save Adder) for addition of the partial products. During the addition of the partial product, the signed numbers with the 1's complement type after Booth encoding are converted in the 2's complement signed number in the CSA tree. Since 2-bit CLA(Carry Look-ahead Adder) was used in adding the lower bits of the partial product, the input bit width of the final adder and whole delay of the critical path were reduced. The proposed MAC was applied into the DWT(Discrete Wavelet Transform) filtering operation for JPEG2000, and it showed the possibility for the practical application. Finally we identified the improved performance according to the comparison with the previous architecture in the aspect of hardware resource and delay.

Fast block matching algorithm for constrained one-bit transform-based motion estimation using binomial distribution (이항 분포를 이용한 제한된 1비트 변환 움직임 예측의 고속 블록 정합 알고리즘)

  • Park, Han-Jin;Choi, Chang-Ryoul;Jeong, Je-Chang
    • Journal of Broadcast Engineering
    • /
    • v.16 no.5
    • /
    • pp.861-872
    • /
    • 2011
  • Many fast block-matching algorithms (BMAs) in motion estimation field reduce computational complexity by screening the number of checking points. Although many fast BMAs reduce computations, sometimes they should endure matching errors in comparison with full-search algorithm (FSA). In this paper, a novel fast BMA for constrained one-bit transform (C1BT)-based motion estimation is proposed in order to decrease the calculations of the block distortion measure. Unlike the classical fast BMAs, the proposed algorithm shows a new approach to reduce computations. It utilizes the binomial distribution based on the characteristic of binary plane which is composed of only two elements: 0 and 1. Experimental results show that the proposed algorithm keeps its peak signal-to-noise ratio (PSNR) performance very close to the FSA-C1BT while the computation complexity is reduced considerably.

A VLSI Architecture of an 8$\times$8 OICT for HDTV Application (HDTU용 8$\times$8 최적화 정수형 여현 변환의 VLSE 구조)

  • 송인준;황상문;이종하;류기수;곽훈성
    • Journal of the Korean Institute of Telematics and Electronics T
    • /
    • v.36T no.1
    • /
    • pp.1-7
    • /
    • 1999
  • We present VLSI architecture for a high performance 2-D DCT processor which is used compressing system of real time image processing or HDTV using fast computational algorithm of the Optimized Integer Cosine Transform(OICT). The coefficients of the OICT are integer, so the OICT performs only the integer operations for both forward and inverse transform. Therefore the proposed architecture could be greatly enhanced in improving the speed, reduced the hardware cost considerably by replacing the multiplication operations with shift and addition operations compared with DCT which performs floating-point operations.

  • PDF

A Study on the New BC-ABBM Motion Estimation Algorithm for Low Bit Rate Video Coding (저 전송률 비디오 압축을 위한 새로운 BC-ABBM 움직임 추정 알고리즘에 관한 연구)

  • 이완범;김환용
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.29 no.7C
    • /
    • pp.946-953
    • /
    • 2004
  • Fast search and conventional boolean matching motion estimation algorithms reduce computational complexity and data processing time but this algorithms have disadvantages that is difficult of implementation of hardware because of high control overhead and that is less performance than Full search Algorithm(FA). This paper present new all binary block matching algorithm, called Bit Converted All Binary Block Matching(BC-ABBM). Proposed algorithm have performance closed to the FA by boolean only block matching that may be very efficiently implemented in hardware for low bit rate video communication. Simulation results show that the PSNR of the proposed algorithm is about 0.04dB loss than FA but is about 0.6 ∼ 1.4dB gain than fast search algorithm and conventional boolean matching algorithm.

An Area Efficient High Speed FIR Filter Design and Its Applications (면적 절약형 고속 FIR 필터의 설계 및 응용)

  • Lee, Kwang-Hyun;Rim, Chong-Suck
    • Journal of the Institute of Electronics Engineers of Korea SD
    • /
    • v.37 no.11
    • /
    • pp.85-95
    • /
    • 2000
  • FIR digital filter is one of important blocks in DSP application. For more effective operation, lots of architecture are proposed. In our paper, we proposed a high speed FIR filter with area efficiency. To fast operation, we used transposed form filter as basic architecute. And, we used dual path registers line to wupport variation of filter operation, and filter cascade is also considered. To reduce area, we adopted truncated Booth multiplier to our filter design. As a result, we showed that filter area is reduced when filter optimization using of dual path registers line and truncated multiplier with same constraints againt previous method.

  • PDF

A fast scalar multiplication on elliptic curves (타원곡선에서 스칼라 곱의 고속연산)

  • 박영호;한동국;오상호;이상진;임종인;주학수
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.12 no.2
    • /
    • pp.3-10
    • /
    • 2002
  • For efficient implementation of scalar multiplication in Kobliz elliptic curves, Frobenius endomorphism is useful. Instead of binary expansion of scalar, using Frobenius expansion of scalar we can speed up scalar multiplication and so fast scalar multiplication is closely related to the expansion length of integral multipliers. In this paper we propose a new idea to reduce the length of Frobenius expansion of integral multipliers of scalar multiplication, which makes speed up scalar multiplication. By using the element whose norm is equal to a prime instead of that whose norm is equal to the order of a given elliptic curve we optimize the length of the Frobenius expansion. It can reduce more the length of the Frobenius expansion than that of Solinas, Smart.

The Study of Digital Servo Controller for LBLDCM Drives Based on TMS320F2812 (TMS320F2812를 이용한 LBLDCM의 디지털 서보제어기 개발에 관한 연구)

  • Cho, Hoon-Hee;Ahn, Jae-Young;Kim, Kwang-Heon
    • Proceedings of the KIPE Conference
    • /
    • 2005.07a
    • /
    • pp.770-773
    • /
    • 2005
  • 최근 산업 분야에 걸쳐서 고속, 고정밀도의 요구사항에 따라, 회전 모터와 볼 스크류, 벨트를 이용한 직선구동방식보다 빠르고 정확하며, 효율이 높은 직접구동 방식의 리니어모터 및 컨트롤러의 개발이 요구되고 있다. 이런 상황에 고속 연산을 수행할 수 있는 DSP(Digital Signal Processor)의 사용이 불가피하며, 기존의 칩들은 A/D변환기, PWM발생장치 등이 내장되지 않아 제어장치의 부품 수증가 및 복잡성을 피할 수 없었다. 따라서 본 논문에서는 SVPWM(Space Vector Pulse Width Modulation) 및 QEP(Quadrature Encoder Pulse) 회로와 PWM 발생기, 12bit의 고속 A/D변환기, 파워 드라이버보호회로 등을 내장한 TMS320F2812 DSP를 사용하여 반도체장비분야, 자동화분야 등에 사용되는 LBLDCM의 제어를 가능하게 만들었다. 또한, 기존의 DSP 시리즈 보다 연산속도가 고속화되어 고속연산에 의한 시간적 제한을 극복 할 수 있게 되었고, 제어에 필요한 하드웨어적인 기능들을 내장하고 있어서 주변회로가 필요 없게 되었다. 따라서 하드웨어의 간소화와 개발 시간의 단축 및 신뢰도의 향상과 모터 효율의 향상을 가져오도록 하였다. 제안된 제어장치는 제작되어, 실험을 통하여 그 타당성을 입증하였다.

  • PDF