• Title/Summary/Keyword: scalable architecture

Search Result 210, Processing Time 0.026 seconds

Hierarchical Performance Modeling and Simulation of Scalable Computer System (확장성을 고려한 계층적 시스템 성능 모델 및 시뮬레이션)

  • 김흥준
    • Journal of the Korea Society for Simulation
    • /
    • v.4 no.2
    • /
    • pp.1-16
    • /
    • 1995
  • The performance of a computer system depends on the system architecture and workload, and the high performance required in many applications can be achieved by the scalability of the system architecture and workload. This paper presents scalable workload, a performance metric of scalable speedup and hierarchical modeling for the scalable computer system as well as the development of the object-oriented simulator spmplC++ Which is an advanced C++ version of the discrete event-driven simulation environment smplE. In addition, this paper presents two examples of applying scalable speedup, hierarchical modeling and simulator smplC++ to analyze the performance effect of the sclcbility in a multiprocessor system and a network-based client/server system.

  • PDF

Design of Scalable Intra-prediction Architecture for H.264 Decoders (H.264 복호기를 위한 스케일러블 인트라 예측기 구조 설계)

  • Lee, Chan-Ho
    • Journal of the Institute of Electronics Engineers of Korea SD
    • /
    • v.45 no.11
    • /
    • pp.77-82
    • /
    • 2008
  • H.264 is a video coding standard of ITU-T and ISO/IEC, and widely spreads its application due to its high compression ratio more than twice that of MPEG-2 and high image quality. It has different architecture depending on demands since it is a lied from small image of QVGA to large size of HD. In this paper, We propose a scalable architecture for intra-prediction of H.264 decoders. The proposed scheme has a scalable architecture that can accommodate up to 4 processing elements depending on performance demands and can reduce the number of access to memory using efficient memory management so as to be energy-efficient. We design the intra-prediction unit using Verilog-HDL and verily it by prototyping using an FPGA. The performance is analyzed using the results of design.

Efficient Motion Prediction Architecture and Design of DPB for Scalable Multi-view Video Coding (스케일러블 다시점 비디오 부호화를 위한 효율적인 움직임 예측구조와 DPB 설계)

  • Kim, Ji-Hoon;Jung, Tae-Jun;Lee, Hong-Rae;Seo, Kwang-Deok;Kim, Jin-Soo;Lee, Hahyun;Kang, Jung Won
    • Journal of Broadcast Engineering
    • /
    • v.17 no.6
    • /
    • pp.976-989
    • /
    • 2012
  • In this paper, we propose an efficient motion prediction architecture and DPB design mechanism for scalable multi-view video coding which is implemented by integrating SVC and MVC coding algorithms. In the proposed motion prediction architecture, we employ pictures associated with other views as a candidate for reference picture for improved motion prediction performance. By the proposed prediction architecture, we could enormously reduce the size of compressed video data. When performing scalable multi-view video coding, an integrated DPB design mechanism is also proposed. It is shown by various simulations that the proposed motion prediction architecture for scalable multi-view video coding can result in reduced data size in the compressed bitstream.

An Efficient Architecture of Inter Layer Up-sampling in Scalable Video Decoder (SVC 복호화기에서 Inter Layer 업-샘플링의 효과적인 구조)

  • Ki, Dae-Wook;Kim, Jae-Ho
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.14 no.3
    • /
    • pp.621-627
    • /
    • 2010
  • This paper proposes an efficient architecture of Inter layer up-sampling in decoder for SVC(scalable video coding). A register bank for horizontal and vertical up-sampling and interpolation units are designed, by introducing the proposed architecture, 41% memory bandwidth is reduced compared to JSVM. For real-time operation for HD 6 layer decoder having CIF, SD, HD resolution for CGS layer, the hardware is designed to operate at 127MHz. The gate count is about 3000.

Efficient VLSI architecture for one-dimensional discrete wavelet transform using a sealable data reorder unit

  • Park, Taegeun
    • Proceedings of the IEEK Conference
    • /
    • 2002.07a
    • /
    • pp.353-356
    • /
    • 2002
  • In this paper, we design an efficient, scalable one-dimensional discrete wavelet transform (1DDWT) filter using data reorder unit (DRU). At each level, the required hardware is optimized by sharing multipliers and adders because the input rate is reduced by a factor of two at each level due to decimation. The proposed architecture shows 100% hardware utilization by balancing hardware with input rate. Furthermore, sharing the coefficients of the highpass and the lowpass filters using the mirror filter property reduces the number of multipliers and adders by half. We designed a scalable DRU that efficiently reorders and feeds inputs to highpass and lowpass filters. The proposed DRU-based architecture is so regular and scalable that it can be easily extended to an arbitrary 1D DWT structure with M taps and J levels. Compared to other architectures, the proposed DWT filter shows efficiency in performance with relatively less hardware.

  • PDF

A Scalable Montgomery Modular Multiplier (확장 가능형 몽고메리 모듈러 곱셈기)

  • Choi, Jun-Baek;Shin, Kyung-Wook
    • Journal of IKEEE
    • /
    • v.25 no.4
    • /
    • pp.625-633
    • /
    • 2021
  • This paper describes a scalable architecture for flexible hardware implementation of Montgomery modular multiplication. Our scalable modular multiplier architecture, which is based on a one-dimensional array of processing elements (PEs), performs word parallel operation and allows us to adjust computational performance and hardware complexity depending on the number of PEs used, NPE. Based on the proposed architecture, we designed a scalable Montgomery modular multiplier (sMM) core supporting eight field sizes defined in SEC2. Synthesized with 180-nm CMOS cell library, our sMM core was implemented with 38,317 gate equivalents (GEs) and 139,390 GEs for NPE=1 and NPE=8, respectively. When operating with a 100 MHz clock, it was evaluated that 256-bit modular multiplications of 0.57 million times/sec for NPE=1 and 3.5 million times/sec for NPE=8 can be computed. Our sMM core has the advantage of enabling an optimized implementation by determining the number of PEs to be used in consideration of computational performance and hardware resources required in application fields, and it can be used as an IP (intellectual property) in scalable hardware design of elliptic curve cryptography (ECC).

Scalable Network Architecture for Flow-Based Traffic Control

  • Song, Jong-Tae;Lee, Soon-Seok;Kang, Kug-Chang;Park, No-Ik;Park, Heuk;Yoon, Sung-Hyun;Chun, Kyung-Gyu;Chang, Mi-Young;Joung, Jin-Oo;Kim, Young-Sun
    • ETRI Journal
    • /
    • v.30 no.2
    • /
    • pp.205-215
    • /
    • 2008
  • Many control schemes have been proposed for flow-level traffic control. However, flow-level traffic control is implemented only in limited areas such as traffic monitoring and traffic control at edge nodes. No clear solution for end-to-end architecture has been proposed. Scalability and the lack of a business model are major problems for deploying end-to-end flow-level control architecture. This paper introduces an end-to-end transport architecture and a scalable control mechanism to support the various flow-level QoS requests from applications.

  • PDF

The Montgomery Multiplier Using Scalable Carry Save Adder (분할형 CSA를 이용한 Montgomery 곱셈기)

  • 하재철;문상재
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.10 no.3
    • /
    • pp.77-83
    • /
    • 2000
  • This paper presents a new modular multiplier for Montgomery multiplication using iterative small carry save adder. The proposed multiplier is more flexible and suitable for long bit multiplication due to its scalable property according to design area and required computing time. We describe the word-based Montgomery algorithm and design architecture of the multiplier. Our analysis and simulation show that the proposed multiplier provides area/time tradeoffs in limited design area such as IC cards.

An Architecture of Scalable ATM Switching System and Its Call Processing Capacity Estimation

  • Kim, Young-Boo;Lee, Soon-Seok;Oh, Chang-Hwan;Kim, Young-Sun;Han, Chi-Moon;Yim, Chu-Hwan
    • ETRI Journal
    • /
    • v.18 no.3
    • /
    • pp.107-125
    • /
    • 1996
  • In this paper, we define the general requirements of ATM switching systems such as scalability, distributed fashion, and modularity. Also we propose a practical implementation of a scalable ATM switching system whose capacity can be easily expanded. Firstly, the architecture of the system is discussed with an emphasis on system scalability, modularity of subsystems and the simple control network of the design requirements. Secondly, we suggest the three types of distributed call/connection control schemes that are suitable for our switching system. We also estimate their call processing capacity on the average and make a comparison of them under the various system architectures. Since our scalable switching system can be constructed to perform the call processing functions on the various levels of the system capacity, it has much adaptability at the various evolution phases or regions of the network environment.

  • PDF

Scalable Extension of HEVC for Flexible High-Quality Digital Video Content Services

  • Lee, Hahyun;Kang, Jung Won;Lee, Jinho;Choi, Jin Soo;Kim, Jinwoong;Sim, Donggyu
    • ETRI Journal
    • /
    • v.35 no.6
    • /
    • pp.990-1000
    • /
    • 2013
  • This paper describes the scalable extension of High Efficiency Video Coding (HEVC) to provide flexible high-quality digital video content services. The proposed scalable codec is designed on multi-loop decoding architecture to support inter-layer sample prediction and inter-layer motion parameter prediction. Inter-layer sample prediction is enabled by inserting the reconstructed picture of the reference layer (RL) into the decoded picture buffer of the enhancement layer (EL). To reduce the motion parameter redundancies between layers, the motion parameter of the RL is used as one of the candidates in merge mode and motion vector prediction in the EL. The proposed scalable extension can support scalabilities with minimum changes to the HEVC and provide average Bj${\o}$ntegaard delta bitrate gains of about 24% for spatial scalability and of about 21% for SNR scalability compared to simulcast coding with HEVC.