• Title/Summary/Keyword: Verilog HDL

Search Result 416, Processing Time 0.033 seconds

SMART7F: VARIABLE PIPELINE STAGE FOR 32-BIT MICROCONTROLLER (내장형 32비트 마이크로콘트롤러에 적합한 VARIABLE PIPELINE STAGE 설계)

  • Cheong Young-Seok;Yang Dong-Hun;Kwak Seung-Ho;Lee Moon-Key
    • Proceedings of the IEEK Conference
    • /
    • 2004.06b
    • /
    • pp.597-600
    • /
    • 2004
  • In this paper. the soft IP (Intellectual Property) of pipeline of 32-bit microcontroller for embedded and portable application is presented. This IP supports variable pipeline stage according to the performance that user wants. In this architecture, three pipeline stages are basically employed and extended to the five pipeline stages. To this purpose, control logic has been partitioned to reflect each pipeline stage. FPGA platform is used for rapidly prototyping the IP. This is designed using Verilog HDL

  • PDF

Parameterized FFT/IFFT Core Generator for ODFM Modulation/Demodulation (OFDM 변복조를 위한 파라메터화된 FFT/IFFT 코어 생성기)

  • Lee, J.W.;Kim, J.H.;Shin, K.W.;Baek, Y.S.;Eo, I.S.
    • Proceedings of the IEEK Conference
    • /
    • 2005.11a
    • /
    • pp.659-662
    • /
    • 2005
  • A parameterized FFT/IFFT core generator (PFFT_CoreGen) is designed, which can be used as an essential IP (Intellectual Property) in various OFDM modem designs. The PFFT_CoreGen generates Verilog-HDL models of FFT cores in the range of 64 ${\sim}$ 2048-point. To optimize the performance of the generated FFT cores, the PFFT_CoreGen can select the word-length of input data, internal data and twiddle factors in the range of 8-b ${\sim}$ 24-b. Some design techniques for low-power design are considered from algorithm level to circuit level.

  • PDF

A CDMA-Based Communication Network for a Multiprocessor SoC (다중 프로세서를 갖는 SoC 를 위한 CDMA 기술에 기반한 통신망 설계)

  • Chun, Ik-Jae;Kim, Bo-Gwan
    • Proceedings of the IEEK Conference
    • /
    • 2005.11a
    • /
    • pp.707-710
    • /
    • 2005
  • In this paper, we propose a new communication network for on-chip communication. The network is based on a direct sequence code division multiple access (DS-CDMA) technique. The new communication network is suitable for a parallel processing system and also drastically reduces the I/O pin count. Our network architecture is mainly divided into a CDMA-based network interface (CNI), a communication channel, a synchronizer. The network includes a reverse communication channel for reducing latency. The network decouples computation task from communication task by the CNI. An extreme truncation is considered to simplify the communication link. For the scalability of the network, we use a PN-code reuse method and a hierarchical structure. The network elements have a modular architecture. The communication network is done using fully synthesizable Verilog HDL to enhance the portability between process technologies.

  • PDF

Design of High-Performance Unified Circuit for Linear and Non-Linear SVM Classifications

  • Kim, Soo-Jin;Lee, Seon-Young;Cho, Kyeong-Soon
    • JSTS:Journal of Semiconductor Technology and Science
    • /
    • v.12 no.2
    • /
    • pp.162-167
    • /
    • 2012
  • This paper describes the design of a high-performance unified SVM classifier circuit. The proposed circuit supports both linear and non-linear SVM classifications. In order to ensure efficient classification, a 48x96 or 64x64 sliding window with 20 window strides is used. We reduced the circuit size by sharing most of the resources required for both types of classification. We described the proposed unified SVM classifier circuit using the Verilog HDL and synthesized the gate-level circuit using 65nm standard cell library. The synthesized circuit consists of 661,261 gates, operates at the maximum operating frequency of 152 MHz and processes up to 33.8 640x480 image frames per second.

IP Design of Corrected Block TEA Cipher with Variable-Length Message for Smart IoT

  • Yeo, Hyeopgoo;Sonh, Seungil;Kang, Mingoo
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.14 no.2
    • /
    • pp.724-737
    • /
    • 2020
  • Corrected Block TEA(or XXTEA) is a block cipher designed to correct security weakness in the original block TEA in 1998. In this paper, XXTEA cipher hardware which can encrypt or decrypt between 64-bit and 256-bit messages using 128-bit master key is implemented. Minimum message block size is 64-bit wide and maximal message block size is 256-bit wide. The designed XXTEA can encrypt and decrypt variable-length message blocks which are some arbitrary multiple of 32 bits in message block sizes. XXTEA core of this paper is described using Verilog-HDL and downloaded on Vertex4. The operation frequency is 177MHz. The maximum throughput for 64-bit message blocks is 174Mbps and that of 256-bit message blocks is 467Mbps. The cryptographic IP of this paper is applicable as security module of the mobile areas such as smart card, internet banking, e-commerce and IoT.

Implementation of the Multi-Segment Karatsuba Multiplier for Binary Field (멀티 세그먼트 카라츄바 유한체 곱셈기의 구현)

  • Oh, Jong-Soo
    • Proceedings of the KIEE Conference
    • /
    • 2004.11c
    • /
    • pp.129-131
    • /
    • 2004
  • Elliptic Curve Cryptography (ECC) coprocessors support massive scalar multiplications of a point. We research the design for multi-segment multipliers in fixed-size ECC coprocessors using the multi-segment Karatsuba algorithm on GF($2^m$). ECC coprocessors of the proposed multiplier is verified on the SoC-design verification kit which embeds ALTERA EXCALIBUR FPGAs. As a result of our experiment, the multi-segment Karatsuba multiplier, which has more efficient performance about twice times than the traditional multi-segment multiplier, can be implemented as adding few H/W resources. Therefore the multi-segment Karatsuba multiplier which satisfies performance for the cryptographic algorithm, is adequate for a low cost embedded system, and is implemented in the minimum area.

  • PDF

A Prefetch Architecture with Efficient Branch Prediction for a 64-bit 4-way Superscalar Microprocessor (64비트 4-way 수퍼스칼라 마이크로프로세서의 효율적인 분기 예측을 수행하는 프리페치 구조)

  • 문상국;문병인;이용환;이용석
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.25 no.11B
    • /
    • pp.1939-1947
    • /
    • 2000
  • 본 논문에서는 명령어의 효율적인 페치를 위해 분기 타겟 주소 전체를 사용하지 않고 캐쉬 메모리(cache memory) 내의 적은 비트 수로 인덱싱 하여 한 클럭 사이클 안에 최대 4개의 명령어를 다음 파이프라인으로 보내줄 수 있는 방법을 제시한다. 본 프리페치 유닛은 크게 나누어 3개의 영역으로 나눌 수 있는데, 분기에 관련하여 미리 부분적으로 명령어를 디코드 하는 프리디코드(predecode) 블록, 타겟 주소(NTA : Next Target Address) 테이블 영역을 추가시킨 명령어 캐쉬(instruction cache) 블록, 전체 유닛을 제어하고 가상 주소를 관리하는 프리페치(prefetch) 블록으로 나누어진다. 사용된 명령어들은 SPARC(Scalable Processor ARChitecture) V9에 기준 하였고 구현은 Verilog-HDL(Hardwave Description Language)을 사용하여 기능 수준으로 기술되고 검증되었다. 구현된 프리페치 유닛은 명령어 흐름에 분기가 존재하더라도 단일 사이클 안에 4개까지의 명령어들을 정확한 예측 하에 다음 파이프라인으로 보내줄 수 있다. 또한 NTA를 사용한 방법은 같은 수의 레지스터 비트를 사용하였을 때 BTB(Branch Target Buffer)를 사용하는 방법과 비교하여 2배정도 많은 개수의 분기 명령 주소를 저장할 수 있는 장점이 있다.

  • PDF

Design of an Image Interpolator for Low Computation Complexity

  • Jun, Young-Hyun;Yun, Jong-Ho;Park, Jin-Sung;Choi, Myung-Ryul
    • Journal of Information Processing Systems
    • /
    • v.2 no.3 s.4
    • /
    • pp.153-158
    • /
    • 2006
  • In this paper, we propose an image interpolator for low computational complexity. The proposed image interpolator supports the image scaling using a modified cubic convolution interpolation between the input and output resolutions for a full screen display. In order to reduce the computational complexity, we use the difference in value of the adjacent pixels for selecting interpolation methods and linear function of the cubic convolution. The proposed image interpolator is compared with the conventional one for the computational complexity and image quality. The proposed image interpolator has been designed and verified by Verilog HDL(Hardware Description Language). It has been synthesized using the Xilinx VirtexE FPGA, and implemented using an FPGA-based prototype board.

Efficient Signal Reordering Unit Implementation for FFT (FFT를 위한 효율적인 Signal Reordering Unit 구현)

  • Yang, Seung-Won;Lee, Jang-Yeol
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.58 no.6
    • /
    • pp.1241-1245
    • /
    • 2009
  • As FFT(Fast Fourier Transform) processor is used in OFDM(Orthogonal Frequency Division Multiplesing) system. According to increase requirement about mobility and broadband, Research about low power and low area FFT processor is needed. So research concern in reduction of memory size and complex multiplier is in progress. Increasing points of FFT increase memory area of FFT processor. Specially, SRU(Signal Reordering Unit) has the most memory in FFT processor. In this paper, we propose a reduced method of memory size of SRU in FFT processor. SRU of 64, 1024 point FFT processor performed implementation by VerilogHDL coding and it verified by simulation. We select the APEX20KE family EP20k1000EPC672-3 device of Altera Corps. SRU implementation is performed by synthesis of Quartus Tool. The bits of data size decide by 24bits that is 12bits from real, imaginary number respectively. It is shown that, the proposed SRU of 64point and 1024point achieve more than 28%, 24% area reduction respectively.

Design of only PWM Controller Based on FPGA (FPGA를 기반으로 한 PWM전용 컨트롤러 설계)

  • Kim Sang-Ug;Rho Youn-Sung;Lee Jae-Young;Jeon Hee-Jong
    • Proceedings of the KIPE Conference
    • /
    • 2004.07b
    • /
    • pp.842-845
    • /
    • 2004
  • PWM 입력 전압원 반브리지 직류-직류 변환기의 예로 통상적인 스위칭 방법으로 동작시킬 때, 직류 전압원의 단락을 방지하기 위하여 변환기 신호에 데드타임을 삽입한다. 이러한 데드타임은 변환기 시스템의 제어 성능에 좋지 않은 영향을 발생시킨다. 따라서 이 데드타임을 효과적으로 보상하는 것이 필요하다. 또 이 데드타임의 영향을 방지 할 수 있는 최선의 방법으로써 데드타임 최소화 스위칭 방법을 고려할 수도 있다. 본 논문에서는 데드타임 최소화 스위칭 방법의 하나인 아날로그 방법이 아닌 디지털 방법으로 스위칭 방법을 제안하였다. 제안한 스위칭 방법은 SOC를 기반으로 한 Verilog-HDL 언어를 이용하여 PWM전용 컨트롤러를 설계하고, 기존에 문제점으로 PWM 스위칭 발생시 과도기간 동안 턴오프 손실이 발생되어 저하된 효율를 제안된 방법으로 방지하였다.

  • PDF