Search | Korea Science

Partitioning for hardware-software codesign (하드웨어-소프트웨어 통합 설계를 위한 분할)

윤경로;박동하;신현철
- Journal of the Korean Institute of Telematics and Electronics A / v.33A no.7 / pp.261-268 / 1996
Hardware-software codesign is becoming important for effectively satisfying performance goals, because designers can make trade-offs in the way hardware and software components work together to exhibit a specified behavior. In this paper, a hardware-software partitioning algorithm is presented, in which a system behavioral description containing a mixture of hardware and software components is partitioned into a hardware part and a software part. The partitioning algorithm tries to minimize a given cost function under constraints on hardware resources or latency. Recursive moving of operations between the hardware and software parts is used to find a near-optimum partition, and the list scheduling approach is used to estimate the hardware area and latency. Since memory may take up a substantial portion of the hardware part, memory cost is included in the hardware cost. Experimental results show that our algorithm is effective.
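The move-based partitioning idea in this abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the `Op` fields, the example operations, and the area limit are hypothetical, and latency is estimated by simply summing execution times rather than by list scheduling. Memory cost is not modeled.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Op:
    name: str
    sw_time: int  # cycles when executed in software (assumed figure)
    hw_time: int  # cycles when executed in hardware (assumed figure)
    hw_area: int  # area units consumed if placed in hardware (assumed figure)


def latency(ops, in_hw):
    # Crude estimate: operations run one after another.
    # The paper uses list scheduling instead; this sum is a stand-in.
    return sum(op.hw_time if op.name in in_hw else op.sw_time for op in ops)


def area(ops, in_hw):
    return sum(op.hw_area for op in ops if op.name in in_hw)


def partition(ops, area_limit):
    """Greedily move one operation at a time across the HW/SW boundary."""
    in_hw = set()  # start with everything in software
    while True:
        current = latency(ops, in_hw)
        best_gain, best_trial = 0, None
        for op in ops:
            trial = in_hw ^ {op.name}  # move op to the other side
            if area(ops, trial) > area_limit:
                continue  # violates the hardware resource constraint
            gain = current - latency(ops, trial)
            if gain > best_gain:
                best_gain, best_trial = gain, trial
        if best_trial is None:
            return in_hw  # no single move improves the cost
        in_hw = best_trial


ops = [Op("fir", 100, 10, 50), Op("fft", 200, 20, 80), Op("ctrl", 30, 25, 40)]
hw = partition(ops, area_limit=130)
print(sorted(hw), latency(ops, hw))
```

Because every accepted move strictly lowers the estimated latency, the loop terminates; like any greedy search it can stop at a local optimum.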

High Performance Integer Multiplier on FPGA with Radix-4 Number Theoretic Transform

Chang, Boon-Chiao;Lee, Wai-Kong;Goi, Bok-Min;Hwang, Seong Oun
- KSII Transactions on Internet and Information Systems (TIIS) / v.16 no.8 / pp.2816-2830 / 2022
Number Theoretic Transform (NTT) is a method for designing efficient multipliers for large integer multiplication, which is widely used in cryptography and scientific computation. It has also received wide attention from the research community for designing efficient hardware architectures for large-size RSA, fully homomorphic encryption, and lattice-based cryptography. Existing NTT hardware architectures reported in the literature are mainly based on radix-2 NTT, due to its small area consumption. However, NTT with a larger radix (e.g., radix-4) may achieve higher speed at the expense of more hardware resources. In this paper, we present a performance evaluation of NTT architectures in terms of hardware resource consumption and latency, based on the proposed radix-2 and radix-4 techniques. Our experimental results show that the 16-point radix-4 architecture is 2× faster than the radix-2 architecture at the expense of approximately 4× additional hardware. The proposed architecture can be extended to support large integer multiplication in cryptography applications (e.g., RSA). The experimental results show that the proposed 3072-bit multiplier outperforms the best 3k-multiplier from Chen et al. [16] by 3.06%, but it also costs about 40% more LUTs and 77.8% more DSP resources.
https://doi.org/10.3837/tiis.2022.08.020
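As a software illustration of NTT-based large integer multiplication, here is a minimal radix-2 sketch. It does not reproduce the paper's hardware architecture or its radix-4 variant; the modulus 998244353 with primitive root 3 and the 8-bit digit size are assumptions chosen for convenience.

```python
MOD = 998244353  # prime of the form 119 * 2**23 + 1 (assumed for this sketch)
ROOT = 3         # a primitive root modulo MOD


def ntt(values, invert=False):
    """Iterative radix-2 NTT; len(values) must be a power of two."""
    a = list(values)
    n = len(a)
    # Bit-reversal permutation.
    j = 0
    for i in range(1, n):
        bit = n >> 1
        while j & bit:
            j ^= bit
            bit >>= 1
        j ^= bit
        if i < j:
            a[i], a[j] = a[j], a[i]
    # Butterfly stages.
    length = 2
    while length <= n:
        w_len = pow(ROOT, (MOD - 1) // length, MOD)
        if invert:
            w_len = pow(w_len, MOD - 2, MOD)
        half = length // 2
        for start in range(0, n, length):
            w = 1
            for k in range(start, start + half):
                u, v = a[k], a[k + half] * w % MOD
                a[k] = (u + v) % MOD
                a[k + half] = (u - v) % MOD
                w = w * w_len % MOD
        length <<= 1
    if invert:
        n_inv = pow(n, MOD - 2, MOD)
        a = [x * n_inv % MOD for x in a]
    return a


def to_digits(x, bits):
    mask = (1 << bits) - 1
    digits = []
    while x:
        digits.append(x & mask)
        x >>= bits
    return digits or [0]


def multiply(x, y, bits=8):
    """Multiply non-negative integers by convolving their base-2**bits digits.

    With 8-bit digits every convolution coefficient stays below MOD for
    operands of up to roughly 15000 digits each.
    """
    a, b = to_digits(x, bits), to_digits(y, bits)
    n = 1
    while n < len(a) + len(b):
        n <<= 1
    fa = ntt(a + [0] * (n - len(a)))
    fb = ntt(b + [0] * (n - len(b)))
    coeffs = ntt([p * q % MOD for p, q in zip(fa, fb)], invert=True)
    # Python integers absorb the carries when the coefficients are recombined.
    return sum(c << (bits * i) for i, c in enumerate(coeffs))


x = (1 << 3072) - 1
print(multiply(x, x) == x * x)
```

A radix-4 transform merges two of the butterfly stages above into one, which is where the speed versus area trade-off discussed in the abstract comes from.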

The Relationship of Information System Resources distribution Between the System Plan.Control and System Development and System Operation (시스템 계획 및 통제, 개발, 운영 차원에서의 정보시스템 자원 분산화에 관한 연구)

Jung Lee-Sang;Han Jung-Hee
- Management & Information Systems Review / v.2 / pp.133-167 / 1998
This article discusses the findings of an empirical study conducted on 62 large organizations. The major purpose of the study was to analyze the relationship of information system resource distribution among system plan and control, system development, and system operation. In this study, information system resources are broadly identified as computer hardware, software, data, procedures, and operators, because the real centralization/decentralization issue facing organizations is much broader than the choice between alternative computer hardware configurations. There are three separate resources of the information system that can be decentralized: system plan and control, system development, and system operations. The decision regarding how to organize each of these three separate resources is based on a different set of criteria. Furthermore, each decision can be made relatively independently of the others. The results of the study are as follows. A positive relationship was found among the degrees of decentralization of information system resources in system plan and control, system development, and system operations. Therefore, the more information system resources are decentralized in one dimension, the more they are decentralized in the other dimensions, and the more they are centralized in one dimension, the more they are centralized in the other dimensions.

High Throughput Radix-4 SISO Decoding Architecture with Reduced Memory Requirement

Byun, Wooseok;Kim, Hyeji;Kim, Ji-Hoon
- JSTS:Journal of Semiconductor Technology and Science / v.14 no.4 / pp.407-418 / 2014
As the throughput requirement of next-generation communication systems increases, it becomes essential to implement a high-throughput SISO (Soft-Input Soft-Output) decoder with minimal hardware resources. In this paper, we present comparison results between cascaded radix-4 ACS (Add-Compare-Select) and LUT (Look-Up Table)-based radix-4 ACS in terms of delay, area, and power consumption. The hardware overhead incurred by the retiming technique used for high-speed radix-4 ACS operation is also analyzed. Based on these analysis results, a high-throughput radix-4 SISO decoding architecture based on a simple path metric recovery circuit is proposed to minimize the hardware resources. The proposed architecture is implemented in a 65 nm CMOS process, and the memory requirement and power consumption can be reduced by up to 78% and 32%, respectively, while meeting the high-throughput requirement.
https://doi.org/10.5573/JSTS.2014.14.4.407
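To make the radix-4 ACS idea concrete, the sketch below compares one radix-4 add-compare-select step with two consecutive radix-2 steps. The 4-state shift-register trellis, the max-based (max-log style) selection, and the random integer metrics are assumptions for illustration; the paper's cascaded and LUT-based circuits and its path metric recovery scheme are not modeled.

```python
import random

N_STATES = 4  # hypothetical 4-state trellis (2-bit shift register)


def next_state(s, u):
    # Shift the input bit u into the state register.
    return ((s << 1) | u) & (N_STATES - 1)


def radix2_acs(pm, gamma):
    """One trellis step: each state selects among 2 incoming branches."""
    new = [float("-inf")] * N_STATES
    for s in range(N_STATES):
        for u in (0, 1):
            t = next_state(s, u)
            new[t] = max(new[t], pm[s] + gamma[s][u])  # add, compare, select
    return new


def radix4_acs(pm, g1, g2):
    """Two trellis steps merged: each state selects among 4 incoming paths."""
    new = [float("-inf")] * N_STATES
    for s in range(N_STATES):
        for u1 in (0, 1):
            mid = next_state(s, u1)
            for u2 in (0, 1):
                t = next_state(mid, u2)
                new[t] = max(new[t], pm[s] + g1[s][u1] + g2[mid][u2])
    return new


def random_metrics(rng):
    return [[rng.randint(-9, 9) for _ in (0, 1)] for _ in range(N_STATES)]


rng = random.Random(0)
pm = [rng.randint(-20, 20) for _ in range(N_STATES)]
g1, g2 = random_metrics(rng), random_metrics(rng)
print(radix4_acs(pm, g1, g2) == radix2_acs(radix2_acs(pm, g1), g2))
```

Both routes give the same path metrics; the radix-4 form halves the number of sequential recursion steps at the cost of a wider compare-select per state.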

Color Correction with Optimized Hardware Implementation of CIE1931 Color Coordinate System Transformation (CIE1931 색좌표계 변환의 최적화된 하드웨어 구현을 통한 색상 보정)

Kim, Dae-Woon;Kang, Bong-Soon
- Journal of IKEEE / v.25 no.1 / pp.10-14 / 2021
This paper presents hardware that reduces the complexity of the CIE1931 color coordinate algorithm. The conventional algorithm has the disadvantage of growing hardware size, due to the 4-Split Multiply operations used to handle large bit widths in the computation. The proposed algorithm instead pre-calculates the R2X and X2R matrix operations defined in the conventional algorithm and combines them into a matrix. By applying this matrix to images to correct the color, it is possible to reduce the amount of computation and the hardware size. By comparing Xilinx synthesis results for the hardware designed in Verilog, we confirm real-time processing performance in 4K environments with reduced hardware resources. Furthermore, this paper validates the behavior of the hardware when mounted on a board by presenting execution results on an FPGA board.
https://doi.org/10.7471/ikeee.2021.25.1.10
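The benefit of pre-calculating the matrix chain can be sketched as follows. This is an illustration under assumptions, not the paper's design: `R2X` is taken to be the standard linear sRGB (D65) to CIE XYZ matrix, `X2R` its inverse, and `CORRECTION` is a hypothetical diagonal adjustment applied in XYZ space.

```python
# Linear sRGB (D65) to CIE 1931 XYZ; assumed to play the role of "R2X".
R2X = [[0.4124, 0.3576, 0.1805],
       [0.2126, 0.7152, 0.0722],
       [0.0193, 0.1192, 0.9505]]


def matmul(a, b):
    return [[sum(a[r][k] * b[k][c] for k in range(3)) for c in range(3)]
            for r in range(3)]


def matvec(m, v):
    return [sum(m[r][k] * v[k] for k in range(3)) for r in range(3)]


def inverse3(m):
    """Invert a 3x3 matrix via its adjugate."""
    (a, b, c), (d, e, f), (g, h, i) = m
    det = a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)
    return [[(e * i - f * h) / det, (c * h - b * i) / det, (b * f - c * e) / det],
            [(f * g - d * i) / det, (a * i - c * g) / det, (c * d - a * f) / det],
            [(d * h - e * g) / det, (b * g - a * h) / det, (a * e - b * d) / det]]


X2R = inverse3(R2X)

# Hypothetical correction applied in XYZ space (illustrative values only).
CORRECTION = [[1.02, 0.0, 0.0],
              [0.0, 1.00, 0.0],
              [0.0, 0.0, 0.98]]

# Computed once, offline: a single 3x3 matrix replaces the whole chain.
COMBINED = matmul(X2R, matmul(CORRECTION, R2X))


def correct_stepwise(rgb):
    return matvec(X2R, matvec(CORRECTION, matvec(R2X, rgb)))


def correct_combined(rgb):
    return matvec(COMBINED, rgb)


pixel = [0.2, 0.5, 0.7]
print(correct_stepwise(pixel))
print(correct_combined(pixel))
```

Per pixel, the combined form needs one 3x3 matrix-vector product instead of three, which is the kind of saving a hardware implementation can exploit.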

VLSI Architecture of Digital Image Scaler Combining Linear Interpolation and Cubic Convolution Interpolation (선형 보간법과 3차회선 보간법을 결합한 디지털 영상 스케일러의 VLSI 구조)

Moon, Hae Min;Pan, Sung Bum
- Journal of the Institute of Electronics and Information Engineers / v.51 no.3 / pp.112-118 / 2014
As higher image quality is demanded of digital image scaling, longer processing time is required. Therefore, a technique that can produce higher-quality images quickly is needed. We propose double linear-cubic convolution interpolation, which creates high-quality images with low complexity and few hardware resources. The proposed interpolation method, which is made up of four one-dimensional linear interpolations and one one-dimensional cubic convolution, performs linear-cubic convolution interpolation in the horizontal and vertical directions. When compared in terms of peak signal-to-noise ratio (PSNR), processing time, and amount of hardware resources, the proposed interpolation provided better PSNR, lower complexity, and fewer hardware resources than bicubic convolution interpolation.
https://doi.org/10.5573/ieie.2014.51.3.112
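The two one-dimensional building blocks named in this abstract can be sketched as follows. The cubic convolution kernel with parameter a = -0.5 is an assumed, commonly used choice; how the paper combines four linear interpolations with one cubic convolution is not specified here and is not reproduced.

```python
def linear(p0, p1, t):
    """1-D linear interpolation between samples p0 and p1, with 0 <= t <= 1."""
    return (1 - t) * p0 + t * p1


def cubic_kernel(x, a=-0.5):
    """Cubic convolution kernel; a = -0.5 is an assumed parameter value."""
    x = abs(x)
    if x <= 1:
        return (a + 2) * x ** 3 - (a + 3) * x ** 2 + 1
    if x < 2:
        return a * x ** 3 - 5 * a * x ** 2 + 8 * a * x - 4 * a
    return 0.0


def cubic(p_m1, p0, p1, p2, t):
    """1-D cubic convolution using the four samples at positions -1, 0, 1, 2."""
    samples = (p_m1, p0, p1, p2)
    return sum(p * cubic_kernel(t - k) for k, p in zip((-1, 0, 1, 2), samples))


print(linear(10, 20, 0.25))
print(cubic(10, 20, 30, 40, 0.5))
```

Linear interpolation needs two samples and one multiply-add pair per output, while cubic convolution needs four samples and four kernel weights, which is why mixing the two can reduce hardware cost relative to full bicubic interpolation.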

Selecting a Synthesizable RISC-V Processor Core for Low-cost Hardware Devices

Gookyi, Dennis Agyemanh Nana;Ryoo, Kwangki
- Journal of Information Processing Systems / v.15 no.6 / pp.1406-1421 / 2019
The Internet-of-Things (IoT) has been deployed in almost every facet of our day-to-day activities. This is made possible because sensing and data collection devices have been given computing and communication capabilities. The devices implement System-on-Chips (SoCs) that incorporate many functionalities, yet they are severely constrained in terms of memory capacity, hardware area, and power consumption. With the increase in the functionalities of sensing devices, there is a need for low-cost synthesizable processors to handle control, interfacing, and error processing. The first step in selecting a synthesizable processor core for low-cost devices is to examine the hardware resource utilization to make sure that it fulfills the requirements of the device. This paper gives an analysis of the hardware resource usage of ten synthesizable processors that implement the Reduced Instruction Set Computer Five (RISC-V) Instruction Set Architecture (ISA). All ten processors are synthesized using Vivado v2018.02. The maximum frequency, area, and power reports are extracted, and a comparison is made to determine which processor is ideal for low-cost hardware devices.
https://doi.org/10.3745/JIPS.03.0129

Implementation of Genetic Algorithm Processor based on Hardware Optimization for Evolvable Hardware (진화형 하드웨어를 위한 하드웨어 최적화된 유전자 알고리즘 프로세서의 구현)

Kim, Jin-Jeong;Jeong, Deok-Jin
- The Transactions of the Korean Institute of Electrical Engineers D / v.49 no.3 / pp.133-144 / 2000
Genetic Algorithm (GA) is known as a method for solving large-scale optimization problems with complex constraints in various applications. Since a major drawback of GA is its long computation time, recent studies have focused on hardware implementations of Genetic Algorithm Processors (GAP). In this paper, a hardware-oriented GA is proposed in order to save hardware resources and reduce the execution time of the GAP. Based on the steady-state model among continuous generation models, the proposed GA uses modified tournament selection as well as a special survival condition, in which replacement occurs whenever the offspring's fitness is better than the worse-fit parent's. The proposed algorithm shows more than 30% improvement in convergence speed over the conventional algorithm in simulation. Finally, by employing efficient pipeline parallelization and a handshaking protocol in the proposed GAP, a computation speed-up of more than 30% can be achieved over the survival-based GA, which runs one million crossovers per second (1 MHz), when device speed and application size are taken into account on the prototype. It could be used for high-speed processing such as the central processor of evolvable hardware, robot control, and many optimization problems.
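The steady-state scheme described in this abstract (tournament selection, with the offspring replacing the worse-fit parent only when it is fitter) can be sketched in software. The OneMax fitness function, population size, single-point crossover, one-bit mutation, and step count are assumptions for illustration; the paper's modified tournament and its pipelined hardware are not reproduced.

```python
import random


def fitness(bits):
    # OneMax: count of 1 bits (a toy objective assumed for this sketch).
    return sum(bits)


def tournament(pop, rng, k=2):
    """Return the index of the fittest of k randomly drawn individuals."""
    contenders = rng.sample(range(len(pop)), k)
    return max(contenders, key=lambda idx: fitness(pop[idx]))


def evolve(pop_size=20, n_bits=32, steps=2000, seed=0):
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(n_bits)] for _ in range(pop_size)]
    initial_best = max(map(fitness, pop))
    for _ in range(steps):
        i, j = tournament(pop, rng), tournament(pop, rng)
        cut = rng.randrange(1, n_bits)           # single-point crossover
        child = pop[i][:cut] + pop[j][cut:]
        child[rng.randrange(n_bits)] ^= 1        # flip one random bit
        worse = i if fitness(pop[i]) <= fitness(pop[j]) else j
        # Survival condition: the child replaces the worse-fit parent
        # only if it is strictly fitter than that parent.
        if fitness(child) > fitness(pop[worse]):
            pop[worse] = child
    return initial_best, max(map(fitness, pop))


start, end = evolve()
print(start, end)
```

Because only one individual changes per step, there is no separate generation buffer, which is one reason steady-state models map well onto compact hardware.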

Closed-loop controller design, stability analysis and hardware implementation for fractional neutron point kinetics model

Vyawahare, Vishwesh A.;Datkhile, G.;Kadam, P.;Espinosa-Paredes, G.
- Nuclear Engineering and Technology / v.53 no.2 / pp.688-694 / 2021
The aim of this work is the analysis, design and hardware implementation of the fractional-order point kinetics (FNPK) model along with its closed-loop controller. The stability and closed-loop control of FNPK models are critical issues. The closed-loop stability of the controller-plant structure is established. Further, the designed PI/PD controllers are implemented in real-time on a DSP processor. The simulation and real-time hardware studies confirm that the designed PI/PD controllers result in a damped stable closed-loop response.
https://doi.org/10.1016/j.net.2020.07.026

Design of a Hardware Resource Sharable Camera Control Processor for Low-Cost and Low-Power Camera Cell Phones (저비용, 저전력 카메라 폰 구현을 위한 하드웨어 자원 공유가 가능한 카메라 제어 프로세서의 설계)

Lim, Kyu-Sam;Baek, Kwang-Hyun;Kim, Su-Ki
- Journal of the Institute of Electronics Engineers of Korea SD / v.47 no.3 / pp.35-40 / 2010
In this paper, we propose a hardware resource sharable camera control processor (CCP) for low-cost and low-power camera cell phones. The main idea behind the proposed architecture is to add direct access paths in the CCP to share its hardware resources, so that the baseband processor expands its capabilities and boosts its performance by utilizing the CCP's hardware resources. In addition, we applied a module-grain clock-gating method to reduce power dissipation. Hence, the CCP can realize low-power and low-cost camera cell phones with greater hardware efficiency. The chip was fabricated in a 0.18 µm CMOS process with an active area of $3.8\,\mathrm{mm} \times 3.8\,\mathrm{mm}$.

