• 제목/요약/키워드: High-performance processor

검색결과 616건 처리시간 0.026초

디지탈 신호처리소자 TMS320C30을 이용한 고속 영상처리 프로세서의 개발 (Development of a High-speed Image Processing Processor using TMS320C30 DSP)

  • 변중남;오상록;유범재;한동일;김재옥
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 1990년도 추계학술대회 논문집 학회본부
    • /
    • pp.439-442
    • /
    • 1990
  • A powerful general purpose image processing processor is developed using a high-speed DSP chip, TMS320C30. The image processing processor, compatible to the standard VME bus, is composed of VME bus interface unit, video rate image grabbing/coding unit, TMS320C30 interface unit and bank of high-speed SRAMs. The performance is evaluated experimentally with the general image processing algorithms and the results show that the developed processor is capable of high speed image processing.

  • PDF

고성능 512-point FFT 프로세서의 설계 (A Design of High Throughput 512-point FFT Processor)

  • 김선호;김정우;오길남;김기철
    • 한국방송∙미디어공학회:학술대회논문집
    • /
    • 한국방송공학회 1999년도 학술대회
    • /
    • pp.255-260
    • /
    • 1999
  • 본 논문에서는 데이터 입출력을 고속으로 수행하며 작은 지연시간을 갖는 512-point FFT프로세서의 구조및 설계에 대하여 보인다. 설계된 512-point FFT프로세서는 OFDM방송에서 요구하는 심볼 레이트로 테이타를 처리할 수 있는 것을 목표로 하였다. 설계된 512-point FFT프로세서는 써플메모리를 이용하여 메모리의 요구사항을 최소화하며, 새로운 strength reduction method를 적용한 복소곱셈기를 이용하여 기존의 복소곱셈기에 비하여 하드웨어의 비용이 적은 특징을 갖는다.

  • PDF

지능형 구조물을 위한 간섭형 광섬유 센서 신호처리기 (Interferometric fiberoptic sensor signal processor for smart structures)

  • 홍영준;예윤해
    • 한국광학회지
    • /
    • 제14권6호
    • /
    • pp.588-593
    • /
    • 2003
  • 지능형 구조물에서의 진동 등과 같이 주파수가 1 KHz에 이르는 물리량을 고감도로 측정할 수 있는 광섬유센서를 위한 신호 처리기를 구현하기 위하여 광섬유자이로스코프에 적용되었던 전디지털 위상추적신호처리(ADPT)를 교류량 측정용(다이나믹)으로 변경하여 설계하였다. 구현된 다이나믹 ADPT 신호처리기를 광섬유 Mach-Zehnder간섭계에 적용하여 성능을 평가한 결과 ADPT신호처리방식의 한계인 -50 ㏈에 가까운 전고조파 왜율을 가짐을 확인하였다.

비트 및 워드 연산용 초고속 프로세서 설계 (The Design of High Speed Bit and Word Processor)

  • 허재동;양오
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 2002년도 하계학술대회 논문집 D
    • /
    • pp.2534-2536
    • /
    • 2002
  • This paper presents the design of high speed bit and word processor for sequence logic control using a FPGA. This FPGA is able to execute sequence instruction during program fetch cycle, because the program memory was separated from the data memory for high speed execution at 40MHz clock. Also this processor has 274 instructions set with a 32bit fixed width, so instruction decoding time and data memory interface time was reduced. This FPGA was synthesized by V600EHQ240 and Foundation tool of Xilinx company. The final simulation was successfully performed under Foundation tool simulation environment. And the FPGA programmed by VHDL for a 240 pin HQFP package. Finally the benchmark was performed to prove that the designed for bit and word processor has better performance than Q4A of Mitsubishi for the sequence logic control.

  • PDF

임베디드 환경에서의 32-bit RISC-V RV32IM 파이프라인 프로세서 설계 및 구현 (A Design and Implementation of 32-bit RISC-V RV32IM Pipelined Processor in Embedded Systems)

  • 박수빈;김용우
    • 반도체디스플레이기술학회지
    • /
    • 제22권4호
    • /
    • pp.81-86
    • /
    • 2023
  • Recently, demand for embedded systems requiring low power and high specifications has been increasing, and RISC-V processors are being widely applied. RISC-V, a RISC-based open instruction set architecture (ISA), has been developed and researched by UC Berkeley and other researchers since 2010. RV32I ISA is sufficient to support integer operations such as addition and subtraction instructions, but M-extension should be defined for multiplication and division instructions. This paper proposes an RV32I, RV32IM processor, and indicates benchmark performance scores compared to an existing processor. Additionally, A non-stalling method was proposed to support a 2-stage pipelined DSP multiplier to the 5-stage pipelined RV32IM processor. Proposed RV32I and RV32IM processors satisfied a maximum operating frequency of 50 MHz on Artix-7 FPGA. The performance of the proposed processors was verified using benchmark programs from Dhrystone and Coremark. As a result, the Coremark benchmark results of the proposed processor showed that it outperformed the existing RV32IM processor by 23.91%.

  • PDF

저가의 HDTV를 위한 영상출력 모듈의 설계 및 구현 (Design and Implementation of Image Display Module for Low-cost High Definition Television)

  • 최재승;김익환;남재열;하영호
    • 대한전자공학회논문지SP
    • /
    • 제42권3호
    • /
    • pp.65-72
    • /
    • 2005
  • 본 논문은 재료비의 절감을 위하여 저성능의 프로세서를 사용할 수 있도록 영상출력에 할당되는 프로세서 코어성능을 최대한 줄이고자 하는 것을 목적으로 한다. 본 논문은 저성능의 프로세서가 탑재된 전자앨범 기능의 모듈에 고해상도 영상출력 기능을 지원하기 위한 영상출력 시스템을 구현한다. 본 시스템은 영상데이터 처리부로부터의 15프레임의 HD 영상입력을 TV 시스템에서 사용 가능한 60프레임의 HD영상으로 출력하는 기능을 수행한다. 이 결과, 제안된 시스템은 프로세서 성능을 저프레임 영상출력에 해당하는 정도로 줄여줄 수 있으므로 이는 시스템의 비용 절감 및 다양한 부가기능 추가로 연결 되어진다. 결론적으로, 영상출력 시스템을 이용한 전자앨범 기능의 모듈 시스템을 개발하여 본 방식의 유효성을 확인한다.

AB9: A neural processor for inference acceleration

  • Cho, Yong Cheol Peter;Chung, Jaehoon;Yang, Jeongmin;Lyuh, Chun-Gi;Kim, HyunMi;Kim, Chan;Ham, Je-seok;Choi, Minseok;Shin, Kyoungseon;Han, Jinho;Kwon, Youngsu
    • ETRI Journal
    • /
    • 제42권4호
    • /
    • pp.491-504
    • /
    • 2020
  • We present AB9, a neural processor for inference acceleration. AB9 consists of a systolic tensor core (STC) neural network accelerator designed to accelerate artificial intelligence applications by exploiting the data reuse and parallelism characteristics inherent in neural networks while providing fast access to large on-chip memory. Complementing the hardware is an intuitive and user-friendly development environment that includes a simulator and an implementation flow that provides a high degree of programmability with a short development time. Along with a 40-TFLOP STC that includes 32k arithmetic units and over 36 MB of on-chip SRAM, our baseline implementation of AB9 consists of a 1-GHz quad-core setup with other various industry-standard peripheral intellectual properties. The acceleration performance and power efficiency were evaluated using YOLOv2, and the results show that AB9 has superior performance and power efficiency to that of a general-purpose graphics processing unit implementation. AB9 has been taped out in the TSMC 28-nm process with a chip size of 17 × 23 ㎟. Delivery is expected later this year.

Porting LLVM Compiler to a Custom Processor Architecture Using Synopsys Processor Designer

  • Jung, Hyungyun;Shin, Jangseop;Heo, Ingoo;Paek, Yunheung
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2014년도 추계학술발표대회
    • /
    • pp.53-56
    • /
    • 2014
  • Application specific instruction-set processor (ASIP) is a suitable design choice for system designers who seek both flexibility to handle various applications in the domain together with the performance. Successful development of an ASIP, however, requires a software development kit (SDK) to be provided along with the processor. Synopsys Processor Designer is an ASIP development tool, which takes as input a set of files written in a high-level architecture description language called LISA (Language for Instruction Set Architecture), and generates SDK as well as RTL. Recently, they have added support for the generation of LLVM compiler backend, though some manual work is required. In this paper, we introduce some details in porting LLVM compiler to a custom processor architecture in Synopsys Processor Designer.

RISC-V 프로세서상에서의 효율적인 ARIA 암호 확장 명령어 (Efficient ARIA Cryptographic Extension to a RISC-V Processor)

  • 이진재;박종욱;김민재;김호원
    • 정보보호학회논문지
    • /
    • 제31권3호
    • /
    • pp.309-322
    • /
    • 2021
  • 본 연구에서는 저성능 IoT 디바이스에서의 고속 암호화 연산을 지원하기 위해 블록암호 알고리즘 ARIA의 RISC-V 프로세서상에서의 고속 연산을 위한 확장 명령어 셋을 추가한다. 하드웨어상에서의 효율적인 구조로 ARIA 알고리즘을 구현하여 32bit 프로세서에서 동작하기 때문에 효과적인 확장 명령어 셋을 구현한다. 기존의 소프트웨어 암호화 연산과 비교하여 유의미한 성능 향상을 보인다.

FPGA를 이용한 시퀀스 로직 제어용 고속 프로세서 설계 (The Design of High Speed Processor for a Sequence Logic Control using FPGA)

  • 양오
    • 대한전기학회논문지:전력기술부문A
    • /
    • 제48권12호
    • /
    • pp.1554-1563
    • /
    • 1999
  • This paper presents the design of high speed processor for a sequence logic control using field programmable gate array(FPGA). The sequence logic controller is widely used for automating a variety of industrial plants. The FPGA designed by VHDL consists of program and data memory interface block, input and output block, instruction fetch and decoder block, register and ALU block, program counter block, debug control block respectively. Dedicated clock inputs in the FPGA were used for high speed execution, and also the program memory was separated from the data memory for high speed execution of the sequence instructions at 40 MHz clock. Therefore it was possible that sequence instructions could be operated at the same time during the instruction fetch cycle. In order to reduce the instruction decoding time and the interface time of the data memory interface, an instruction code size was implemented by 16 bits or 32 bits respectively. And the real time debug operation was implemented for easy debugging the designed processor. This FPGA was synthesized by pASIC 2 SpDE and Synplify-Lite synthesis tool of Quick Logic company. The final simulation for worst cases was successfully performed under a Verilog HDL simulation environment. And the FPGA programmed for an 84 pin PLCC package was applied to sequence control system with inputs and outputs of 256 points. The designed processor for the sequence logic was compared with the control system using the DSP(TM320C32-40MHz) and conventional PLC system. The designed processor for the sequence logic showed good performance.

  • PDF