통합 검색 | Korea Science

AE32000B: a Fully Synthesizable 32-Bit Embedded Microprocessor Core

Kim, Hyun-Gyu;Jung, Dae-Young;Jung, Hyun-Sup;Choi, Young-Min;Han, Jung-Su;Min, Byung-Gueon;Oh, Hyeong-Cheol
- ETRI Journal
- /
- 제25권5호
- /
- pp.337-344
- /
- 2003
In this paper, we introduce a fully synthesizable 32-bit embedded microprocessor core called the AE32000B. The AE32000B core is based on the extendable instruction set computer architecture, so it has high code density and a low memory access rate. In order to improve the performance of the core, we developed and adopted various design options, including the load extension register instruction (LERI) folding unit, a high performance multiply and accumulate (MAC) unit, various DSP units, and an efficient coprocessor interface. The instructions per cycle count of the Dhrystone 2.1 benchmark for the designed core is about 0.86. We verified the synthesizability and the area and time performances of our design using two CMOS standard cell libraries: a 0.35-${\mu}m$ library and a 0.18-${\mu}m$ library. With the 0.35-${\mu}m$ library, the core can be synthesized with about 47,000 gates and operate at 70 MHz or higher, while it can be synthesized with about 53,000 gates and operate at 120 MHz or higher with the 0.18-${\mu}m$ library.
PDF

CATV 망용 고속 비대칭 기저대역 모뎀 ASIC 칩 설계 (Design of a High Speed Asymmetric Baseband MODEM ASIC Chip for CATV Network)

박기혁
- 한국통신학회논문지
- /
- 제25권9A호
- /
- pp.1332-1339
- /
- 2000
본 논문에서는 MCNS(Multimedia Cable N$\xi$twork System)의 DOCSIS(Data Over Cable Service Interface S Specification) 표준안의 물리계층을 지원하는 비대칭형 기저대역 모댐 ASIC 칩의 아키텍쳐와 설계에 대해 기술한다. 구현한 모뎀 칩은 크게 QPSK/16-QAM 방식의 상향 스트림용 송신부와 64/256-QAM 방식의 하향 스트림용 수신부로 구성되어 있으며, 심볼 타이밍 복구회로, 반송파 복구회로. MMA(Multi Modulus Algorithm)와 LMS(Least Mean Square) 알고리즘을 적용한 결정 궤환 구조의 블라인드 등화기를 포함한다. 구현한 모뎀 칩은 64/256-QAM 변복조 방식에서 각각 48Mbps, 64Mbps의 데이터 전송률을 지원하고, 심볼 전송률은 기존의 QAM 수신기들보다 빠른 8MBaud를 갖는다. 구현한 칩은 $0.35\mu\textrm{m}$ 표준 셀(Standard Cell) 라이브러리를 사용하여 논리합성을 수행하였으며, 총 게이트 수는 약 29만 게이트이며, 현재 ASIC 칩으후 제작중이다.
PDF

SHA-1과 HAS-160과 의사 난수 발생기를 구현한 해쉬 프로세서 설계 (Design of Hash Processor for SHA-1, HAS-160, and Pseudo-Random Number Generator)

전신우;김남영;정용진
- 한국통신학회논문지
- /
- 제27권1C호
- /
- pp.112-121
- /
- 2002
본 논문에서는 미국과 한국의 해쉬 함수 표준인 SHA-1과 HAS-160 해쉬 알고리즘, 그리고 SHA-1을 이용한 의사 난수 발생기를 구현한 프로세서를 설계하였다. SHA-1과 HAS-160이 동일한 단계 연산을 가지므로, 한 단계 연산만을 구현하여 공유함으로써 하드웨어 리소스를 감소시켰다. 그리고 메시지 변수의 사전 계산과 단계 연산을 두 단계의 파이프라인 구조로 구현함으로써 한 개의 클럭으로 한 단계 연산을 수행하는 방식보다 최장지연경로는 1/2로 줄고, 총 단계 연산에 필요한 클럭 수는 하나만 증가하므로 성능은 약 2배 향상되었다. 그 결과, 설계한 해쉬 프로세서는 삼성 0.5 um CMOS 스탠다드 셀 라이브러리를 근거로 산출할 때, 100 MHz의 동작 주파수에서 약 624 Mbps의 성능을 얻을 수 있다. 그리고 의사 난수 발생기로 사용될 때는 약 195 Mbps의 난수 발생 성능을 가진다. 이러한 성능은 지금까지 상용화된 국내외의 어느 해쉬 프로세서보다 빠른 처리 시간을 가지는 것으로 판단된다.
PDF KSCI

Generation of Embryonic Stem Cell-derived Transgenic Mice by using Tetraploid Complementation

Park, Sun-Mi;Song, Sang-Jin;Choi, Ho-Jun;Uhm, Sang-Jun;Cho, Ssang-Goo;Lee, Hoon-Taek
- 한국발생생물학회:학술대회논문집
- /
- 한국발생생물학회 2003년도 제3회 국제심포지움 및 학술대회
- /
- pp.121-121
- /
- 2003
The standard protocol for the production of transgenic mouse from ES-injected embryo has to process via chimera producing and several times breeding steps, In contrast, tetraploid-ES cell complementation method allows the immediate generation of targeted murine mutants from genetically modified ES cell clones. The advantage of this advanced technique is a simple and efficient without chimeric intermediates. Recently, this method has been significantly improved through the discovery that ES cells derived from hybrid strains support the development of viable ES mice more efficiently than inbred ES cells do. Therefore, the objective of this study was to generate transgenic mice overexpressing human resistin gene by using tetrapioid-ES cell complementation method. Human resistin gene was amplified from human fetal liver cDNA library by PCR and cloned into pCR 2.1 TOPO T-vector and constructed in pCMV-Tag4C vector. Human resistin mammalian expression plasmid was transfected into D3-GL ES cells by lipofectamine 2000, and then after 8~10 days of transfection, the human resistin-expressing cells were selected with G418. In order to produce tetraploid embryos, blastomeres of diploid embryos at the two-cell stage were fused with two times of electric pulse using 60 V 30 $\mu$sec. (fusion rate : 93.5%) and cultured upto the blastocyst stage (development rate : 94.6%). The 15~20 previously G418-selected ES cells were injected into tetraploid blastocysts, and then transferred into the uterus of E2.5d pseudopregnant recipient mice. To investigate the gestation progress, two El9.5d fetus were recovered by Casarean section and one fetus was confirmed to contain human resistin gene by genomic DNA-PCR. Therefore, this finding demonstrates that tetraploid-ES mouse technology can be considered as a useful tool to produce transgenic mouse for the rapid analysis of gene function in vivo.
PDF

MPEG-4 AVC를 위한 고속 다해상도 움직임 추정기의 하드웨어 구현 (Hardware Implementation of Past Multi-resolution Motion Estimator for MPEG-4 AVC)

임영훈;정용진
- 한국통신학회논문지
- /
- 제29권11C호
- /
- pp.1541-1550
- /
- 2004
본 논문에서는 다해상도 움직임 추정 알고리즘을 이용하여 모션 리터를 검색하는 고속 다해상도 움직임 추정기에 대한 하드웨어 구조를 제안한다. 동영상 압축기술인 MPEG-4 AVC 전체 구성 중에서 핵심 부분인 움직임 추정 모듈을 하드웨어로 설계하기 위하여 기본적인 구조를 구성하고 높은 화질로 실시간 부호화를 할 수 있도록 고속 움직임 검색을 위해 특수하게 설계된 램 구주 메모리 공유, 4화소x4화소 Motion Vector 추출 등과 같은 기술들을 사용하여 전체 움직임 검색기를 구현하였다. 구현된 전체 모듈은 Altera(사)의 Excalibur 디바이스를 이용한 FPGA 구성을 통해 검증하고 최종적으로 Samsung STD130 0.18um CMOS Cell Library를 이용하며 합성 및 검증을 하였다. 이렇게 검증된 구조의 성능은 ASIC으로 구현할 경우 최대 동작 주파수가 약 140MHz이며 QCIF(176화소x144화소) 사이즈 기준으로 초당 약 1100프레임, 4CIF(704화소x576화구 사이즈 기준으로 초당 약 70프레임의 움직임을 검색할 수 있다 본 성능은 하드웨어 기반의 MPEG-4 AVC 실시간 부호화기를 설계하기에 적합한 구조임을 보여준다.
PDF KSCI

AES Rijndael 블록 암호 알고리듬의 효율적인 하드웨어 구현 (An Efficient Hardware Implementation of AES Rijndael Block Cipher Algorithm)

안하기;신경욱
- 정보보호학회논문지
- /
- 제12권2호
- /
- pp.53-64
- /
- 2002
This paper describes a design of cryptographic processor that implements the AES (Advanced Encryption Standard) block cipher algorithm, "Rijndael". An iterative looping architecture using a single round block is adopted to minimize the hardware required. To achieve high throughput rate, a sub-pipeline stage is added by dividing the round function into two blocks, resulting that the second half of current round function and the first half of next round function are being simultaneously operated. The round block is implemented using 32-bit data path, so each sub-pipeline stage is executed for four clock cycles. The S-box, which is the dominant element of the round block in terms of required hardware resources, is designed using arithmetic circuit computing multiplicative inverse in GF($2^8$) rather than look-up table method, so that encryption and decryption can share the S-boxes. The round keys are generated by on-the-fly key scheduler. The crypto-processor designed in Verilog-HDL and synthesized using 0.25-$\mu\textrm{m}$ CMOS cell library consists of about 23,000 gates. Simulation results show that the critical path delay is about 8-ns and it can operate up to 120-MHz clock Sequency at 2.5-V supply. The designed core was verified using Xilinx FPGA board and test system.
https://doi.org/10.13089/JKIISC.2002.12.2.53 인용 PDF KSCI HTML

Design of Encoder and Decoder for LDPC Codes Using Hybrid H-Matrix

Lee, Chan-Ho
- ETRI Journal
- /
- 제27권5호
- /
- pp.557-562
- /
- 2005
Low-density parity-check (LDPC) codes have recently emerged due to their excellent performance. However, the parity check (H) matrices of the previous works are not adequate for hardware implementation of encoders or decoders. This paper proposes a hybrid parity check matrix which is efficient in hardware implementation of both decoders and encoders. The hybrid H-matrices are constructed so that both the semi-random technique and the partly parallel structure can be applied to design encoders and decoders. Using the proposed methods, the implementation of encoders can become practical while keeping the hardware complexity of the partly parallel decoder structures. An encoder and a decoder are designed using Verilog-HDL and are synthesized using a $0.35 {\mu}m$ CMOS standard cell library.
PDF

CMA 알고리즘을 이용한 고속 DFE 등화기의 ASIC 칩 설계 (Design of a High-speed Decision Feedback Equalizer ASIC chip using the Constant-Modulus Algorithm)

신대교;홍석희;선우명훈
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 2000년도 하계종합학술대회 논문집(2)
- /
- pp.238-241
- /
- 2000
This paper describes an equalizer using the DFE (Decision Feedback Equalizer) structure, CMA. (Constant Modulus Algorithm) and LMS (Least Mean Square) algorithms. We employ high speed multipliers, square logics and many CSAs (Carry Save Adder) for high speed operations. We have developed floating-point models and fixed-point models using the COSSAP$\^$TM/ CAD tool and developed VHDL models. We have peformed logic synthesis using the SYNOPSYS$\^$TM/ CAD tool and the SAMSUNG 0.5 $\mu\textrm{m}$ standard cell library (STD80). The total number of gates is about 130,000.
PDF

Design of High-Performance Intra Prediction Circuit for H.264 Video Decoder

Yoo, Ji-Hye;Lee, Seon-Young;Cho, Kyeong-Soon
- JSTS:Journal of Semiconductor Technology and Science
- /
- 제9권4호
- /
- pp.187-191
- /
- 2009
This paper proposes a high-performance architecture of the H.264 intra prediction circuit. The proposed architecture uses the 4-input and 2-input common computation units and common registers for fast and efficient prediction operations. It avoids excessive power consumption by the efficient control of the external and internal memories. The implemented circuit based on the proposed architecture can process more than 60 HD ($1,920{\times}1,088$) image frames per second at the maximum operating frequency of 101 MHz by using 130 nm standard cell library.
https://doi.org/10.5573/JSTS.2009.9.4.187 인용 PDF KSCI

Design of High-Performance Unified Circuit for Linear and Non-Linear SVM Classifications

Kim, Soo-Jin;Lee, Seon-Young;Cho, Kyeong-Soon
- JSTS:Journal of Semiconductor Technology and Science
- /
- 제12권2호
- /
- pp.162-167
- /
- 2012
This paper describes the design of a high-performance unified SVM classifier circuit. The proposed circuit supports both linear and non-linear SVM classifications. In order to ensure efficient classification, a 48x96 or 64x64 sliding window with 20 window strides is used. We reduced the circuit size by sharing most of the resources required for both types of classification. We described the proposed unified SVM classifier circuit using the Verilog HDL and synthesized the gate-level circuit using 65nm standard cell library. The synthesized circuit consists of 661,261 gates, operates at the maximum operating frequency of 152 MHz and processes up to 33.8 640x480 image frames per second.
https://doi.org/10.5573/JSTS.2012.12.2.162 인용 PDF KSCI

검색결과 196건 처리시간 0.029초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)