Search | Korea Science

Design and Performance Evaluation of Expansion Buffer Cache (확장 버퍼 캐쉬의 설계 및 성능 평가)

Hong Won-Kee
- The KIPS Transactions:PartA
- /
- v.11A no.7 s.91
- /
- pp.489-498
- /
- 2004
VLIW processor is considered to be an appropriate processor for the embedded system, provided with high performance and low power con-sumption due to its simple hardware structure. Unfortunately, the VLIW processor often suffers from high memory access latency due to the variable length of I-packets, which consist of independent instructions to be issued in parallel. It is because of the variable I-packet length that some I-packets must be placed over two cache blocks, which are called straddle I-packets, so that two cache accesses are required to fetch such I-packets. In this paper, an expansion buffer cache is proposed to improve not only the instruction fetch bandwidth, but also the power consumption of the I-cache with moderate hardware cost. The expansion buffer cache has a small expansion buffer containing a fraction of a straddle packet along with the main cache to reduce the additional cache accesses due to the straddle I-packets. With a great reduction in the cache accesses due to the straddle packets, the expansion buffer cache can achieve $5{\~}9{\%}$improvement over the conventional I-caches in the $Delay{\cdot}Power{\cdot}Area$ metric.
https://doi.org/10.3745/KIPSTA.2004.11A.7.489 인용 PDF KSCI

Further Specialization of Clustered VLIW Processors: A MAP Decoder for Software Defined Radio

Ituero, Pablo;Lopez-Vallejo, Marisa
- ETRI Journal
- /
- v.30 no.1
- /
- pp.113-128
- /
- 2008
Turbo codes are extensively used in current communications standards and have a promising outlook for future generations. The advantages of software defined radio, especially dynamic reconfiguration, make it very attractive in this multi-standard scenario. However, the complex and power consuming implementation of the maximum a posteriori (MAP) algorithm, employed by turbo decoders, sets hurdles to this goal. This work introduces an ASIP architecture for the MAP algorithm, based on a dual-clustered VLIW processor. It displays the good performance of application specific designs along with the versatility of processors, which makes it compliant with leading edge standards. The machine deals with multi-operand instructions in an innovative way, the fetching and assertion of data is serialized and the addressing is automatized and transparent for the programmer. The performance-area trade-off of the proposed architecture achieves a throughput of 8 cycles per symbol with very low power dissipation.
PDF

A VLIW Code Generation Technique Utilizing NOP Instruction Slot (NOP 명령어 슬롯을 활용하는 VLIW 코드 생성기법)

문현주;이승수;김석주;김석일
- Proceedings of the Korean Information Science Society Conference
- /
- 2000.10c
- /
- pp.615-617
- /
- 2000
본 논문에서는 VLIW 목적코드에 존재하는 NOP 명령어 슬롯에 의미있는 명령어를 중복 삽입하도록 함으로써 원래의 방법에서 존재하였던 자료의존관계를 해소하여 실행시간의 지연을 방지하는 기법을 연구하였다. 이 경우에 하나의 긴 명령어에 동일한 명령어가 둘 이상 포함될 수 있으므로 연산 관계에 이은 쓰기 단계에서 여러개의 명령어가 동일한 레지스터 파일의 주소에 쓰기를 함에 따른 충돌을 피할 수 없다. 본 논문에서는 연산처리 별로 쓰기 단계에서 연산 결과를 레지스터 파일에 쓰도록 허용할 것인지에 대한 정보를 명령어에 포함하는 TiPS 구조와 TiPS 구조에 적합한 목적코드 생성 알고리즘을 제안하였다. 목적코드 생성 알고리즘은 연산처리기별로 연속적으로 실행되는 명령어간의 자료의존관계를 해소하기 위하여 NOP 대신에 다른 연산처리기에서 실행할 명령어를 수행하도록 동일한 명령어를 복사하여 할당할 수 있다. 실험 결과, 명령어 복사 기법은 기존의 기법에 비하여 전체 실행 사이클을 크게 단축시킬 수 있음을 보여주었다.
PDF

An Improved Implementation of Block Matching Algorithm on a VLIW-based DSP (VLIW 기반 DSP에서의 개선된 블록매칭 알고리즘 구현)

You, Hui-Jae;Chung, Sun-Tae;Jung, Sou-Hwan
- Proceedings of the IEEK Conference
- /
- 2007.07a
- /
- pp.225-226
- /
- 2007
In this paper, we present our study about the optimization of the block matching algorithm on a VLIW based DSP. The block matching algorithm is well known for its computational burden in motion picture encoding. As supposed to the previous researches where the optimization is achieved by optimizing SAD, the most heavy routine of the block matching, we optimize the block matching algorithm by applying software pipelining technique to the whole routine of the algorithm. Through experiments, the efficiency of the proposed optimization is verified.
PDF

VLIW ASIP Processor for Integer Transform and Quantization in H.264 (H.264 정수변환 및 양자화를 위한 VLIW ASIP 프로세서)

Yang, Seungjun;Park, Sanghyun;Heo, Ingoo;Kim, Yongjoo;Kim, Kyoungwon;Paek, Yunheung
- Proceedings of the Korea Information Processing Society Conference
- /
- 2009.11a
- /
- pp.9-10
- /
- 2009
H.264 비디오 코덱은 비디오 어플리케이션 중에서 중요한 역할을 차지하고 있다. 본 논문에서는 H.264의 여러 과정 중에서 정수 변환 및 양자화 과정을 효과적으로 처리하기 위한 VLIW ASIP 아키텍처를 제안한다.
https://doi.org/10.3745/PKIPS.y2009m11a.9 인용 PDF

3-Way 32 bit VLIW Multimedia Signal Processor

Park, Jaebok;Jaehee You
- Proceedings of the IEEK Conference
- /
- 2001.06b
- /
- pp.97-100
- /
- 2001
A 3-way VLIW multimedia signal processor capable of efficient repeated operations as well as both load/store and type transformations for various data types is presented. It is composed of a 32-bit execution unit that can execute two instructions in parallel, an independent load/store unit and a control unit. The processor is implemented with 0.6${\mu}{\textrm}{m}$ gate array and the results are discussed.
PDF

VLIW DSP 프로세서 아키텍처의 설계 기술

이상정
- The Magazine of the IEIE
- /
- v.31 no.11
- /
- pp.59-71
- /
- 2004
PDF

Implementing Swing Modulo Scheduler for VLIW Processor (VLIW 프로세서를 위한 Swing Modulo Scheduler 구현)

Shin, Jangseop;Han, Sangjun;Jung, Hyungyun;Ahn, Minwook;Youn, Jonghee M.;Paek, Yunheung
- Proceedings of the Korea Information Processing Society Conference
- /
- 2014.04a
- /
- pp.12-14
- /
- 2014
하드웨어가 해저드(hazard) 검출을 지원하지 않는 멀티이슈 VLIW 프로세서의 성능을 높이기 위해서는 컴파일러가 명령어 의존성과 하드웨어 자원의 제약을 지키는 범위 안에서 최대한 명령어수준 병렬성(ILP)을 활용하는 것이 중요하다. 기본 블록(basic block) 스케쥴링은 Branch 등 제어 흐름(control flow)의 경계를 넘어선 스케쥴링을 행하지 않아 그 효과가 제한적이다. 소프트웨어 파이프라이닝(software pipelining)은 루프(loop)의 경계를 허물어 여러반본(iteration)의 명령어가 동시에 수행되도록 하는 것으로 모듈로 스케쥴링(modulo scheduling)은 그 중에 한 범주의 스케쥴링 기법들을 일컫는다. 본 연구에서는 그 중 한가지인 스윙 모듈로 스케쥴러(swing modulo scheduler)[1]를 구현하여 그 효과를 알아보고자 한다.
https://doi.org/10.3745/PKIPS.y2014m04a.12 인용 PDF

Soft Error Detection & Correction for VLIW Architecture (VLIW 프로세서를 위한 소프트에러 검출 및 수정 기법)

Li, Yunrong;Lee, Jongwon;Heo, Ingoo;Kwon, Yongin;Lee, Kyoungwoo;Paek, Yunheung
- Proceedings of the Korea Information Processing Society Conference
- /
- 2011.11a
- /
- pp.9-10
- /
- 2011
임베디드 시스템에서 저전력 공급, 칩사이즈 축소, 낮은 노이즈 마진 등 설계기법이 날로 향상됨에 따라 소프트에러가 기하급수적으로 늘어나고 있다. 본 논문에서는 VLIW 아키텍처에서 치명적인 오류를 일으키는 이런 소프트에러들을 검출하고 수정하는 기법을 제안하고자 한다.
https://doi.org/10.3745/PKIPS.y2011m11a.9 인용 PDF

VLIW 시스템을 위한 컴파일러 최적화 기법

박명순;이진호;양정일;박성순
- Communications of the Korean Institute of Information Scientists and Engineers
- /
- v.12 no.5
- /
- pp.67-82
- /
- 1994
PDF

Search Result 54, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)