통합 검색 | Korea Science

Optimization of ARIA Block-Cipher Algorithm for Embedded Systems with 16-bits Processors

Lee, Wan Yeon;Choi, Yun-Seok
- International Journal of Internet, Broadcasting and Communication
- /
- 제8권1호
- /
- pp.42-52
- /
- 2016
In this paper, we propose the 16-bits optimization design of the ARIA block-cipher algorithm for embedded systems with 16-bits processors. The proposed design adopts 16-bits XOR operations and rotated shift operations as many as possible. Also, the proposed design extends 8-bits array variables into 16-bits array variables for faster chained matrix multiplication. In evaluation experiments, our design is compared to the previous 32-bits optimized design and 8-bits optimized design. Our 16-bits optimized design yields about 20% faster execution speed and about 28% smaller footprint than 32-bits optimized code. Also, our design yields about 91% faster execution speed with larger footprint than 8-bits optimized code.
https://doi.org/10.7236/IJIBC.2016.8.1.42 인용 PDF

SoC 플랫폼 기반 모바일용 3차원 그래픽 Hardwired T&L Accelerator 구현 (Implementation of a 3D Graphics Hardwired T&L Accelerator based on a SoC Platform for a Mobile System)

이광엽;구용서
- 대한전자공학회논문지SD
- /
- 제44권9호
- /
- pp.59-70
- /
- 2007
본 논문에서는 휴대 정보기기 시스템에서 더욱 향상된 실시간 3D 그래픽 가속 능력을 갖는 SoC(System on a Chip) 구현을 위해 효과적인 T&L(Transform & Lighting) Processor 구조를 연구하였다. T&L 과정에 필요한 IP들을 설계하였으며, 이를 바탕으로 SoC Platform 기반으로 검증하였다. 설계된 T&L Processor는 24 bits 부동소수점 형식과 16 bits 고정소수점 형식을 적절하게 혼용하고 계산식의 병렬성을 최대한 활용하여 Transform 과정 연산과 Lighting 과정 연산의 지연시간을 균일하게 배분하여 Transform 과정만 처리할 때와 Lighting과 혼용으로 처리할 때 연산 속도의 차이가 없이 동작이 가능하다. 설계된 T&L Processor는 SoC 플랫폼을 이용하여 성능 측정 실험 및 검증을 하였고, Xilinx-Virtex4 FPGA에서 80 MHz의 동작 주파수를 확인하였고 초당 20M개의 정점(Vertex) 처리 성능을 확인하였다.
PDF KSCI

FPGA를 이용한 시퀀스 로직 제어용 고속 프로세서 설계 (The Design of High Speed Processor for a Sequence Logic Control using FPGA)

양오
- 대한전기학회논문지:전력기술부문A
- /
- 제48권12호
- /
- pp.1554-1563
- /
- 1999
This paper presents the design of high speed processor for a sequence logic control using field programmable gate array(FPGA). The sequence logic controller is widely used for automating a variety of industrial plants. The FPGA designed by VHDL consists of program and data memory interface block, input and output block, instruction fetch and decoder block, register and ALU block, program counter block, debug control block respectively. Dedicated clock inputs in the FPGA were used for high speed execution, and also the program memory was separated from the data memory for high speed execution of the sequence instructions at 40 MHz clock. Therefore it was possible that sequence instructions could be operated at the same time during the instruction fetch cycle. In order to reduce the instruction decoding time and the interface time of the data memory interface, an instruction code size was implemented by 16 bits or 32 bits respectively. And the real time debug operation was implemented for easy debugging the designed processor. This FPGA was synthesized by pASIC 2 SpDE and Synplify-Lite synthesis tool of Quick Logic company. The final simulation for worst cases was successfully performed under a Verilog HDL simulation environment. And the FPGA programmed for an 84 pin PLCC package was applied to sequence control system with inputs and outputs of 256 points. The designed processor for the sequence logic was compared with the control system using the DSP(TM320C32-40MHz) and conventional PLC system. The designed processor for the sequence logic showed good performance.
PDF

생체 신호처리용 Bit-slice Signal Processor에 관한 연구 (A Study on the Bit-slice Signal Processor for the Biological Signal Processing)

김영호;김동록;민병구
- 대한의용생체공학회:의공학회지
- /
- 제6권2호
- /
- pp.15-22
- /
- 1985
We have developed a microprogramir!able signal processor for real-time ultrasonic signal processing. Processing speed was increased by the parallelism in horizontal microprogram using 104bits microcode and the Pipelined architecture. Control unit of the signal processor was designed by microprogrammed architec- ture and writable control store (WCS) which was interfaced with host computer, APPLE- ll . This enables the processor to develop and simulate various digital signal processing algorithms. The performance of the processor was evaluated by the Fast Fourier Transform (FFT) program. The execution time to perform 16 bit 1024 points complex FF7, radix-2 DIT algorithm, was about 175 msec with IMHz master Clock. We can use this processor to Bevelop more efficient signal processing algorithms on the biological signal processing.
PDF

Computer Application to ECG Signal Processing

Okajima, Mitsuharu
- 대한의용생체공학회:의공학회지
- /
- 제6권2호
- /
- pp.13-14
- /
- 1985
We have developed a microprogramir!able signal processor for real-time ultrasonic signal processing. Processing speed was increased by the parallelism in horizontal microprogram using 104bits microcode and the Pipelined architecture. Control unit of the signal processor was designed by microprogrammed architec- ture and writable control store (WCS) which was interfaced with host computer, APPLE- ll . This enables the processor to develop and simulate various digital signal processing algorithms. The performance of the processor was evaluated by the Fast Fourier Transform (FFT) program. The execution time to perform 16 bit 1024 points complex FF7, radix-2 DIT algorithm, was about 175 msec with IMHz master Clock. We can use this processor to Bevelop more efficient signal processing algorithms on the biological signal processing.
PDF

BIT SLICE SIGNAL PROCESSOR를 이용한 DCT의 구현 (Implementation of DCT using Bit Slice Signal Processor)

김동록;고석빈;백승권;이태수;민병구
- 대한전기학회:학술대회논문집
- /
- 대한전기학회 1987년도 전기.전자공학 학술대회 논문집(II)
- /
- pp.1449-1453
- /
- 1987
A microprogrammable Bit Slice Sinal Processor for image processing is implemented. Processing speed is increased by the parallelism in horizontal microprogram using 120bits microcode, pipelined architecture, 2 bank memory switching that interfaces with the Host through DMA, a variable clock control, overflow checking H/W,look-up table method and cache memory. With this processor, a DCT algorithm which uses 2-D FFT is performed. The execution time for $512{\times}512{\times}8$ image is 12 sec when 16 bit operation is runned, and the recovered image has acceptable quality with MSE 0.276%.
PDF

내장형 신호처리를 위한 응용분야 전용 프로세서의 설계 (Design of An Application Specific Instruction-set Processor for Embedded DSP Applications)

이성원;최훈;박인철
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 1999년도 추계종합학술대회 논문집
- /
- pp.228-231
- /
- 1999
This paper describes the design and implementation of an application specific instruction-set processor developed for embedded DSP applications. The instruction-set has an uniform size of 16 bits, and supports 3 types of instructions: Primitive, Complex, and Specific. To reduce code size and cycle count we introduce complex instructions that can be selected according to the application under consideration, which leads to 50% code size reduction maximally. The processor has two independent data memories to double the data throughput and the address space. The processor is synthesized by 0.6$\mu$m single-poly double-metal technology. Critical path simulation shows that the maximum frequency is 110MHz and total gate count is 132, 000.
PDF

고성능 가산기의 최적화 연구 (Study of Optimization for High Performance Adders)

허석원;김문경;이용주;이용석
- 한국통신학회논문지
- /
- 제29권5A호
- /
- pp.554-565
- /
- 2004
본 논문에서는 단일 클락 사이클과 다중 클락 사이클에 수행되는 여러 가산기를 구현하고 area와 time을 비교한다. 가산기의 크기를 64, 128, 256-비트로 다양화 시키면서, 특히 하이브리드 구조의 가산기는 소그룹을 4, 8, 16-비트로 나누어서 group / ungroup으로 합성을 하여 비교하였다. 제안된 가산기들은 Verilog-HDL을 이용하여 하향식 설계 방법으로 구현되었다. Cadence의 Verilog-XL.을 이용하여 설계된 가산기와 behavioral model을 이용한 가산기의 출력이 일치하는지를 비교하여 검증하였다. 검증된 모델은 삼성 0.35um 3.3(V) CMOS standard cell 라이브러리를 이용하여 합성되었으며, 최악 조건 2.7(V), 85($^{\circ}C$)에서 동작하였다. 스마트 카드 IC의 Crypto-Processor에 사용할 수 있는 최적화된 가산기는 64-비트를 기준으로 할 때, group으로 합성된 16-비트 캐리 예측 가산기를 기반으로 하는 리플 캐리 가산기(RCA_CLA)이다. 이 가산기는 198(MHz)의 속도로 동작하며, 게이트 수는 nand2 게이트 기준으로 약 967개이다.
PDF KSCI

32 비트 저전력 스마트카드 IC 설계 (Design of 32 bits tow Power Smart Card IC)

김승철;김원종;조한진;정교일
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 2002년도 하계종합학술대회 논문집(2)
- /
- pp.349-352
- /
- 2002
In this Paper, we introduced 32 bit SOC implementation for multi-application Smart Card and described the methodology for reducing power consumption. It consists of ARMTTDMI micro-processor, 192 KBytes EEPROM, 16 KB SRAM, crypto processors and card reader interface based on AMBA bus system. We used Synopsys Power Compiler to estimate and optimize power consumption. Experimental results show that we can reduce Power consumption up to 62 % without increasing the chip area.
PDF

마이크로 프로세서를 이용한 위성용 전력 시스템 제어에 관한 연구 (A Study on the Space Power System using Micro-Processor)

김희준;김영태;김인기
- 대한전기학회:학술대회논문집
- /
- 대한전기학회 1992년도 하계학술대회 논문집 B
- /
- pp.1032-1034
- /
- 1992
There are increasing demands for the Space Power System to get more useful performance. These demands are high-efficiency, high-confidency, light-weight, small-size, and the systems's flexibility which can shorten the development time. These demands can be achieved when we make the use of $\mu$-Processor. This paper, therefore, shows an analysis and experimental results for the Space Power System with MC 68000, Motorola's 16 bits MPU, to find the system's characteristics.
PDF

검색결과 27건 처리시간 0.027초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)