Study of an In-order SMT Architecture and Grouping Schemes

Moon, Byung-In;Kim, Moon-Gyung;Hong, In-Pyo;Kim, Ki-Chang;Lee, Yong-Surk;

International Journal of Control, Automation, and Systems

Volume 1 Issue 3
/
Pages.339-350
/
2003
/
1598-6446(pISSN)
/
2005-4092(eISSN)

Institute of Control, Robotics and Systems (제어로봇시스템학회)

Study of an In-order SMT Architecture and Grouping Schemes

Moon, Byung-In (SP Division of System IC, Hynix Semiconductor Inc.,) ;
Kim, Moon-Gyung (Department of Electrical & Electronic Engineering, Yonsei University) ;
Hong, In-Pyo (Department of Electrical & Electronic Engineering, Yonsei University) ;
Kim, Ki-Chang (School of Information & Communication Engineering, Inha University) ;
Lee, Yong-Surk (Department of Electrical & Electronic Engineering, Yonsei University)

Published : 2003.09.01

PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

In this paper, we propose a simultaneous multithreading (SMT) architecture that improves instruction throughput by exploiting instruction level parallelism (ILP) and thread level parallelism (TLP). The proposed architecture issues and completes instructions belonging to the same thread in exact program order. The issue and completion policy greatly reduces the design complexity and hardware cost of our architecture, compared with others that employ out-of-order issue and completion. On the other hand, when the instructions belong to different threads, the issue and completion orders for those instructions may not necessarily be identical to the fetch order. The processor issues instructions simultaneously from multiple threads to functional units by exploiting ILP and TLP, and by dynamic resource sharing. That parallel execution notably improves performance and resource utilization with minimal additional hardware cost over the conventional superscalar processors. This paper proposes an SMT architecture with grouping as well as one without grouping. Without grouping, all threads dynamically and flexibly share most resources. On the other hand, in the SMT architecture with grouping, in which resources and threads are divided into several groups for design simplification, resources are shared only among threads belonging to the same group as those resources. Simulation results show that our processors with four and eight threads improve performance by three or more times over the conventional superscalar processor with comparable execution resources and policies, and that reasonable grouping reduces the design complexity of SMT processors with little negative effect on performance.

Keywords

References

Microprocessor Report v.11 no.9 Multithreading comes of age P. Song
Proc. 5th International Parallel Processing Symposium Special features of a VLIW architecture A. Abnous;N. Bagherzadeh
Proc. 22nd International Symposium on Computer Architecture Simultaneous multithreading: maximizing on-chip parallelism D. M. Tullsen;S. J. Eggers;H. M. Levy
Microprocessor Report v.13 no.16 Compaq chooses SMT for Alpha K. Diefendorff
Proc. 23rd International Symposium on Computer Architecture Exploiting choice: instruction fetch and issue on an implementable simultaneous multithreading processor D. M. Tuilsen;S. J. Eggers;J. S. Emer;H.M. Levy;J. L. Lo;R. L. Stamm
Computer Architecture: A Quantitative Approach(Second Edition) D. A. Patterson;J. L. Hennesy
Superscalar Microprocessor Design M. Johnson
Proc. 19th International Symposium on Computer Architecture An elementary processor architecture with simultaneous instruction issuing from multiple threads H. Hirata;K. Kimura;S. Nagamine;Y Mochizuki;A. Nishimura;Y. Nakase;T. Nishizawa
IEEE Trans. Comput. v.42 no.1 High-bandwidth interleaved memories for vector processors-a simulation study G. S. Sohi;M. Flanklin
IEEE Computer v.33 no.7 SPEC CPU2000: measuring CPU performance in the New Millennium J. L. Henning
ARM Developer Suite: Compiler Linker and Utilities Guide
Proc. IFIP WG10.3 Working Conference on Parallel Architectures and Compilation Techniques Increasing superscalar performance through multistreaming W. Yamamoto;M. Nemirovsky
Proc. International Conference on Parallel Processing v.1 A benchmark evaluation of a multi-threaded RISC processor architecture R. Prasadh;C.-L. Wu

International Journal of Control, Automation, and Systems

Study of an In-order SMT Architecture and Grouping Schemes

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)