Search | Korea Science

OpenGL ES 2.0 based Shader Compilation Method for the Instruction-Level Parallelism (OpenGL ES 2.0 기반 셰이더 명령어 병렬 처리를 위한 컴파일 기법)

Kim, Jong-Ho;Kim, Tae-Young
- Journal of Korea Game Society, v.8 no.2, pp.69-76, 2008
In this paper, we present the architecture of a graphics processor for mobile devices and its instruction format. In addition, we introduce the shader data structure for on-line/off-line compilation based on OpenGL ES 2.0 and a new optimization method based on instruction-level parallelism (ILP). We show that, on a processor with the same core clock, the shader instructions produced by the proposed compilation structure and method run approximately 1.5 to 2 times faster than code based on single instructions.

A Design of Symbol Timing Recovery for DVB-RCS (DVB-RCS에서 심볼 타이밍 복원에 관한 연구)

Mo, Kyoung-Ha;Song, Hyoung-Kyu
- The Journal of Korean Institute of Communications and Information Sciences, v.27 no.8A, pp.771-778, 2002
We investigate the design of an interpolation filter for an MF-TDMA demodulator applied to DVB-RCS. If sampling is not synchronized to the data symbols, timing adjustment in a digital receiver must be performed by interpolation. In practice, the coefficients of a conventional sinc interpolation filter cannot extend to infinity. We propose a Kaiser window interpolation filter and a sinc interpolation filter using the Kaiser window. Simulation results show that employing the proposed interpolation filter improves performance.
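As a rough illustration of truncating a sinc interpolator with a Kaiser window, the following Python sketch computes windowed-sinc taps for a fractional timing offset. It is not the filter from the paper: the function names, the tap count (`half_len`), and the window parameter `beta` are assumptions chosen only for illustration.

```python
import math


def bessel_i0(x, terms=25):
    """Zeroth-order modified Bessel function of the first kind (power series)."""
    total, term = 1.0, 1.0
    for k in range(1, terms):
        term *= (x / (2.0 * k)) ** 2
        total += term
    return total


def kaiser_sinc_taps(mu, half_len=4, beta=6.0):
    """Windowed-sinc taps for interpolating at fractional offset mu (0 <= mu < 1).

    half_len and beta are illustrative assumptions, not values from the paper.
    """
    taps = []
    for n in range(-half_len + 1, half_len + 1):
        t = n - mu  # distance from the interpolation instant to sample n
        sinc = 1.0 if t == 0 else math.sin(math.pi * t) / (math.pi * t)
        r = t / half_len  # normalised position inside the window, |r| <= 1
        window = bessel_i0(beta * math.sqrt(max(0.0, 1.0 - r * r))) / bessel_i0(beta)
        taps.append(sinc * window)
    return taps


def interpolate(samples, base, mu, half_len=4, beta=6.0):
    """Estimate the signal at position base + mu from neighbouring samples."""
    taps = kaiser_sinc_taps(mu, half_len, beta)
    offsets = range(-half_len + 1, half_len + 1)
    return sum(h * samples[base + n] for h, n in zip(taps, offsets))
```

With `mu = 0` the taps collapse to a single unit tap, so the interpolator returns the existing sample; a nonzero `mu` spreads the weight over the neighbouring samples, and the window forces the taps to decay to a finite length.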

Code synchronization technique for spread spectrum transmission based on DVB-RCS +M standard (DVB-RCS +M 표준기반의 대역확산기술 부호동기기법)

Kim, Pan-Soo;Chang, Dae-Ig;Lee, Ho-Jin
- Journal of Satellite, Information and Communications, v.4 no.2, pp.39-45, 2009
This paper proposes a specific code synchronization technique for DS-SS (Direct Sequence-Spread Spectrum) transmission in the DVB-RCS +M standard. DS-SS performs better than multi-carrier transmission over a nonlinear channel but imposes a long acquisition time. To improve synchronization, a robust correlation structure is introduced for acquisition and a nonlinear delay lock loop is used for tracking. MAT (Mean Acquisition Time) performance is shown to validate its superiority. In addition, code tracking and jitter performance are evaluated for a code tracking algorithm based on 2 oversamples, which is not influenced by sampling clock timing offset or carrier frequency offset.

VLSI Design for Folded Wavelet Transform Processor using Multiple Constant Multiplication (MCM과 폴딩 방식을 적용한 웨이블릿 변환 장치의 VLSI 설계)

Kim, Ji-Won;Son, Chang-Hoon;Kim, Song-Ju;Lee, Bae-Ho;Kim, Young-Min
- Journal of Korea Multimedia Society, v.15 no.1, pp.81-86, 2012
This paper presents a VLSI design for a lifting-based discrete wavelet transform (DWT) 9/7 filter using a multiplierless multiple constant multiplication (MCM) architecture. The proposed design is based on the lifting scheme and uses pattern search for a folded architecture. Shift-add operations are adopted to optimize the multiplication process. The conventional serial operations of the lifting data flow can be optimized into parallel ones by employing parallelization and pipelining techniques. The optimized design has a simple hardware architecture and requires less computation without performance degradation. Furthermore, hardware utilization reaches 100%, and the number of registers required is significantly reduced. To compare our work with previous methods, we implemented the architecture in Verilog HDL and ran simulations based on logic synthesis using 0.18 μm CMOS standard cells. The proposed architecture shows hardware reductions of up to 60.1% and 44.1%, respectively, at a 200 MHz clock compared to previous works. These implementation results indicate that the proposed design is efficient in hardware cost, area, and power consumption.
https://doi.org/10.9717/kmms.2012.15.1.081
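The shift-add idea behind multiplierless constant multiplication can be sketched in a few lines of Python. This is only an illustration of the general technique, not the paper's architecture: the function names and the constants 5 and 13 are hypothetical and are not the 9/7 lifting coefficients.

```python
def shift_add_multiply(x, constant):
    """Multiply x by a non-negative integer constant using only shifts and adds."""
    result = 0
    shift = 0
    while constant:
        if constant & 1:          # this bit of the constant contributes x << shift
            result += x << shift
        constant >>= 1
        shift += 1
    return result


def mcm_5_and_13(x):
    """Compute 5*x and 13*x together, reusing 5*x as a shared partial product.

    The constants are hypothetical; they only illustrate sharing in MCM.
    """
    x5 = (x << 2) + x      # 5x = 4x + x
    x13 = (x << 3) + x5    # 13x = 8x + 5x, reusing the 5x term
    return x5, x13
```

Computing each product separately would need three adders (one for 5x, two for 13x); sharing the 5x term brings the total down to two, which is the kind of saving an MCM block looks for across all of its constants.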

Low Power Symbol Detector for MIMO Communication Systems (MIMO 통신 시스템을 위한 저전력 심볼 검출기 설계 연구)

Hwang, You-Sun;Jang, Soo-Hyun;Jung, Yun-Ho
- Journal of Advanced Navigation Technology, v.14 no.2, pp.220-226, 2010
In this paper, a low-power symbol detector is proposed for a MIMO communication system with two transmit and two receive antennas. The proposed symbol detector supports both the spatial multiplexing (SM) mode and the spatial diversity (SD) mode of MIMO transmission and achieves optimal maximum likelihood (ML) performance. In addition, by sharing hardware blocks and using a dedicated clock for each MIMO mode, the power of the proposed architecture is dramatically decreased. The proposed symbol detector was designed in a hardware description language (HDL) and synthesized to logic gates using a 0.13 μm CMOS standard cell library. The power consumption, estimated with Synopsys Power Compiler, is reduced by up to 85% compared with the conventional architecture.

A realization of up/down-stream transmission on an optical subscriber network with the PON structure (PON구조의 광가입자망에서 상/하향전송 구현)

김효중;이찬구;강성수;이만섭
- The Journal of Korean Institute of Communications and Information Sciences, v.21 no.3, pp.795-806, 1996
A structure that transmits and receives the packet data of 3 subscribers through an optical splitter node in a PON optical subscriber network was suggested and realized. All the functions necessary to use the PON structure, such as the 155.52 Mb/s multiplexer/demultiplexer for the down-stream signal, the 25.92 Mb/s multiplexer/demultiplexer for the up-stream signal, the frame synchronizer, and the phase aligner for aligning the phase of the clock and the received data, have been integrated within 2 ICs using CMOS technology. TDMA technology was suggested for up-stream transmission. In the optical up-stream transmission of burst signals, transmission quality degradation caused by the APC and AGC of the optical transceiver was observed. By compensating for the degradation, a link budget of more than 17 dB, which is enough to satisfy the 11.2 dB required in SWAN of the PON architecture, has been obtained with an optical transceiver whose power and sensitivity are -17 dBm and -34 dBm, respectively.
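The link-budget figures quoted in the abstract can be checked with simple decibel arithmetic. The Python sketch below assumes the budget is just transmit power minus receiver sensitivity; the function and variable names are hypothetical.

```python
def link_budget_db(tx_power_dbm, rx_sensitivity_dbm):
    """Available link budget in dB: transmit power minus receiver sensitivity."""
    return tx_power_dbm - rx_sensitivity_dbm


# Figures quoted in the abstract.
TX_POWER_DBM = -17.0
RX_SENSITIVITY_DBM = -34.0
REQUIRED_BUDGET_DB = 11.2

budget_db = link_budget_db(TX_POWER_DBM, RX_SENSITIVITY_DBM)  # 17 dB
margin_db = budget_db - REQUIRED_BUDGET_DB                    # about 5.8 dB
```

Under this assumption the quoted transceiver figures give a 17 dB budget, leaving roughly 5.8 dB of margin over the 11.2 dB requirement.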

Design of Low-Power and Low-Complexity MIMO-OFDM Baseband Processor for High Speed WLAN Systems (고속 무선 LAN 시스템을 위한 저전력/저면적 MIMO-OFDM 기저대역 프로세서 설계)

Im, Jun-Ha;Cho, Mi-Suk;Jung, Yun-Ho;Kim, Jae-Seok
- The Journal of Korean Institute of Communications and Information Sciences, v.33 no.11C, pp.940-948, 2008
This paper presents the low-power, low-complexity design and implementation results of a high-speed multiple-input multiple-output orthogonal frequency division multiplexing (MIMO-OFDM) wireless LAN (WLAN) baseband processor. The proposed processor is composed of a physical layer convergence procedure (PLCP) processor and a physical medium dependent (PMD) processor, both optimized for a low-power, reduced-complexity architecture. It was designed in a hardware description language (HDL) and synthesized to gate-level circuits using a 0.18 μm CMOS standard cell library. As a result, the proposed TX-PLCP processor reduces power consumption by as much as 81% compared with the bit-level operation architecture. In addition, the proposed MIMO symbol detector reduces hardware complexity by 18% compared with the conventional SQRD-based architecture with division circuits and square root operations.

A 125 MHz CMOS Phase-Locked Loop with 51-phase Output Clock (51-위상 출력 클럭을 가지는 125 MHz CMOS 위상 고정 루프)

Lee, Pil-Ho;Jang, Young-Chan
- Proceedings of the Korean Institute of Information and Communication Sciences Conference, 2013.10a, pp.343-345, 2013
This paper describes a phase-locked loop (PLL) that generates a 51-phase clock at an operating frequency of 125 MHz. To generate the 51-phase clock at 125 MHz, the proposed PLL uses three voltage-controlled oscillators (VCOs) connected by resistors. Each VCO consists of 17 delay cells. A resistor averaging scheme, which connects the three VCOs with each other, makes it possible to generate a 51-phase clock with equal phase differences. The proposed PLL is designed in a 65 nm CMOS process with a 1.0 V supply. At the operating frequency of 125 MHz, the simulated DNL and peak-to-peak jitter are +0.0016/-0.0020 LSB and 1.07 ps, respectively. The area and power consumption of the implemented PLL are 290 × 260 μm² and 2.5 mW, respectively.
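The phase count and the ideal phase spacing implied by these figures can be worked out directly. The Python sketch below assumes each delay cell contributes one output phase and that the phases are uniformly spaced over one clock period; the variable names are hypothetical.

```python
NUM_VCOS = 3            # three coupled VCOs (from the abstract)
CELLS_PER_VCO = 17      # delay cells per VCO (from the abstract)
CLOCK_HZ = 125e6        # operating frequency (from the abstract)

# Assumption: one output phase per delay cell, uniformly spaced.
num_phases = NUM_VCOS * CELLS_PER_VCO           # 51 phases
period_ps = 1e12 / CLOCK_HZ                     # 8000 ps clock period
phase_step_ps = period_ps / num_phases          # ideal spacing in time
phase_step_deg = 360.0 / num_phases             # ideal spacing in degrees
```

Under these assumptions the ideal spacing is about 156.9 ps, or about 7.06 degrees, between adjacent phases.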

An efficient interconnection network topology in dual-link CC-NUMA systems (이중 연결 구조 CC-NUMA 시스템의 효율적인 상호 연결망 구성 기법)

Suh, Hyo-Joong
- The KIPS Transactions:PartA, v.11A no.1, pp.49-56, 2004
The performance of multiprocessor systems is limited by several factors: processor speed, memory delay, and interconnection network bandwidth and latency. With the evolution of semiconductor technology, off-the-shelf microprocessor speeds have passed the GHz mark, and processors can be scaled up to multiprocessor systems by connecting them through interconnection networks. In this situation, system performance is bounded by the latency and bandwidth of the interconnection network. SCI, Myrinet, and Gigabit Ethernet are widely adopted as high-speed interconnection network links for high-performance cluster systems. The performance of the interconnection network can be improved by extending bandwidth and minimizing latency. Raising the operating clock speed is a simple way to improve both, but the physical distance of the links makes it difficult to attain a high-frequency clock. Hence system performance and scalability suffer from the limitations of the interconnection network. Duplicating the links of the interconnection network is one solution to this bottleneck in scalable systems; the dual-ring SCI link structure is an example. In this paper, I propose a network topology and a transaction path algorithm that optimize latency and efficiency under duplicated links. In simulation, the proposed structure shows 1.05 to 1.11 times better latency and 1.42 to 2.1 times faster execution than dual-ring systems.
https://doi.org/10.3745/KIPSTA.2004.11A.1.049

Design and Implementation of Co-Verification Environments based-on SystemVerilog & SystemC (SystemVerilog와 SystemC 기반의 통합검증환경 설계 및 구현)

You, Myoung-Keun;Song, Gi-Yong
- Journal of the Institute of Convergence Signal Processing, v.10 no.4, pp.274-279, 2009
The flow of a universal system-level design methodology consists of system specification, system-level hardware/software partitioning, co-design, co-verification using a virtual or physical prototype, and system integration. In this paper, two verification environments based on SystemVerilog and SystemC are implemented: a native-code co-verification environment, which makes prompt functional verification possible, and a SystemVerilog layered testbench, which makes clock-level verification possible. In native-code co-verification, the HW and SW parts of an SoC are designed with SystemVerilog and SystemC, respectively, after HW/SW partitioning using SystemC, and the functional interaction between the HW and SW parts is then carried out as one simulation process. The SystemVerilog layered testbench is a verification environment that includes corner-case testing of the DUT through randomly generated test vectors. We adopt SystemC to design a component of the verification environment that has multiple inheritance, and we combine the SystemC design unit with the SystemVerilog layered testbench using SystemVerilog DPI and a ModelSim macro. As multiple inheritance is useful for creating class types that combine the properties of two or more class types, the design of the verification environment adopting SystemC in this paper can increase code reusability.
