• Title/Summary/Keyword: Vector Architecture

Search Result 263, Processing Time 0.03 seconds

Trends in AI Computing Processor Semiconductors Including ETRI's Autonomous Driving AI Processor (인공지능 컴퓨팅 프로세서 반도체 동향과 ETRI의 자율주행 인공지능 프로세서)

  • Yang, J.M.;Kwon, Y.S.;Kang, S.W.
    • Electronics and Telecommunications Trends
    • /
    • v.32 no.6
    • /
    • pp.57-65
    • /
    • 2017
  • Neural network based AI computing is a promising technology that reflects the recognition and decision operation of human beings. Early AI computing processors were composed of GPUs and CPUs; however, the dramatic increment of a floating point operation requires an energy efficient AI processor with a highly parallelized architecture. In this paper, we analyze the trends in processor architectures for AI computing. Some architectures are still composed using GPUs. However, they reduce the size of each processing unit by allowing a half precision operation, and raise the processing unit density. Other architectures concentrate on matrix multiplication, and require the construction of dedicated hardware for a fast vector operation. Finally, we propose our own inAB processor architecture and introduce domestic cutting-edge processor design capabilities.

Computation of Flow around a Container Ship with Twin-Skegs using the CFD (CFD를 이용한 쌍축 컨테이너선 주위의 유동계산)

  • Kim, Hee-Taek;Kim, Hyoung-Tae
    • Journal of the Society of Naval Architects of Korea
    • /
    • v.44 no.4
    • /
    • pp.370-378
    • /
    • 2007
  • In this study. a numerical analysis has been performed for the turbulent flow around a 15,000TEU twin-skeg container ship using a commercial CFD code. FLUENT. The computed results have been compared with the model test data from MOERI. We investigated viscous resistance coefficient. wake distribution and characteristics of the shear flow according to the grid numbers. Although the free surface is approximated by the plane of symmetry in this work. the calculated axial velocity and transverse vector show a good agreement with the MOERI experimental data except for the region of 0.9 level of axial velocity at the propeller plane. The numerical analysis show that commercial CFD code is useful tool for the evaluation of complex hull form with twin-skegs.

Adaptive Neural PLL for Grid-connected DFIG Synchronization

  • Bechouche, Ali;Abdeslam, Djaffar Ould;Otmane-Cherif, Tahar;Seddiki, Hamid
    • Journal of Power Electronics
    • /
    • v.14 no.3
    • /
    • pp.608-620
    • /
    • 2014
  • In this paper, an adaptive neural phase-locked loop (AN-PLL) based on adaptive linear neuron is proposed for grid-connected doubly fed induction generator (DFIG) synchronization. The proposed AN-PLL architecture comprises three stages, namely, the frequency of polluted and distorted grid voltages is tracked online; the grid voltages are filtered, and the voltage vector amplitude is detected; the phase angle is estimated. First, the AN-PLL architecture is implemented and applied to a real three-phase power supply. Thereafter, the performances and robustness of the new AN-PLL under voltage sag and two-phase faults are compared with those of conventional PLL. Finally, an application of the suggested AN-PLL in the grid-connected DFIG-decoupled control strategy is conducted. Experimental results prove the good performances of the new AN-PLL in grid-connected DFIG synchronization.

Design and Implementation of V-BLAST for MIMO-OFDM Systems (MIMO-OFDM 시스템을 위한 V-BLAST의 설계 및 구현)

  • Choi Yong-Woo;Park In-Cheol
    • Proceedings of the IEEK Conference
    • /
    • 2004.06b
    • /
    • pp.415-418
    • /
    • 2004
  • This paper describes a VLSI implementation of BLAST detection for MIMO-OFDM systems. To achieve high speed requirement, we propose the fully pipeline architecture for BLAST structure. This design is implemented using $0.18{\mu}m$ CMOS technology. For a 4-transmit and 4-receive antennas system, it takes $7.5{\mu}s$ to calculate nulling vector and detection order from 48 channel matrixes.

  • PDF

A VLSI Implementation of Real-time 8$\times$8 2-D DCT Processor for the Subprimary Rate Video Codec (저 전송률 비디오 코덱용 실시간 8$\times$8 이차원 DCT 처리기의 VLSI 구현)

  • 권용무;김형곤
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.15 no.1
    • /
    • pp.58-70
    • /
    • 1990
  • This paper describes a VLSI implementation of real-time two dimensional DCT processor for the subprimary rate video codec system. The proposed architecture exploits the parallelism and concurrency of the distributes architecture for vector inner product operation of DCT and meets the CCITT performance requirements of video codec for full CSIF 30 frames/sec. It is also shown that this architecture satisfies all the CCITT IDCT accuracy specification by simulating the suggested architecture in bit level. The efficient VLSI disign methodology to design suggested architecture is considered and the module generator oriented design environments are constructed based on SUN 3/150C workstation. Using the constructed design environments. the suggensted architecture have been designed by double metal 2micron CMOS technology. The chip area fo designed 8x8 2-D DA-DCT (Distributed Arithmetic DCT) processor is about 3.9mmx4.8mm.

  • PDF

Design and Implementation of Object Reusing Methods for Mobile Vector Map Services (모바일 벡터 지도 서비스를 위한 객체 재사용 기법의 설계 및 구현)

  • Kim, Jin-Deog;Choi, Jin-Oh
    • The KIPS Transactions:PartD
    • /
    • v.10D no.3
    • /
    • pp.359-366
    • /
    • 2003
  • Although the reuse of the cached data for scrolling the map reduces the amount of passed data between client and server, it needs the conversions of data coordinates, selective deletion of objects, cache compaction and object structuring step in the clients. The conversion is a time- intensive operation due to limited resources of mobile phones such as low computing power, small memory. Therefore, in order to control the map efficiently in the vector map service based mobile phones, it is necessary to study the methods which reuse cached objects for reducing wireless network bandwidth and overwhelming the limited resources of mobile phones as well. This paper proposes the methods of reusing pre-received spatial objects for map control in the mobile vector map service system based on client-server architecture. The experiments conducted on the Web GIS systems with real data show that the proposed method is appropriate to map services for mobile phone. We also analyze the advantages and drawbacks between the reuse of cached data and transmission of raw data respectively.

Implementation of Efficient Power Method on CUDA GPU (CUDA 기반 GPU에서 효율적인 Power Method의 구현)

  • Kim, Jung-Hwan;Kim, Jin-Soo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.16 no.2
    • /
    • pp.9-16
    • /
    • 2011
  • GPU computing is emerging in high performance application area since it can easily exploit massive parallelism in a way of cost-effective computing. The power method which finds the eigen vector of a given matrix is widely used in various applications such as PageRank for calculating importance of web pages. In this research we made the power method efficiently parallelized on GPU and also suggested how it can be improved to enhance its performance. The power method mainly consists of matrix-vector product and it can be easily parallelized. However, it should decide the convergence of the eigen vector and need scaling of the vector subsequently. Such operations incur several calls to GPU kernels and data movement between host and GPU memories. We improved the performance of the power method by means of reduced calls to GPU kernels, optimized thread allocation and enhanced decision operation for the convergence.

An Effective Multiple Transition Pattern Generation Method for Signal Integrity Test on Interconnections (Signal Integrity 연결선 테스트용 다중천이 패턴 생성방안)

  • Kim, Yong-Joon;Kang, Sung-Ho
    • Journal of the Institute of Electronics Engineers of Korea SD
    • /
    • v.45 no.10
    • /
    • pp.39-44
    • /
    • 2008
  • Scan architecture is very effective design-for-testability technique that is widely used for high testability, however, it requires so much test time due to test vector shifting time. In this paper, an efficient scan test method is presented that is based on the Illinois scan architecture. The proposed method maximizes the common input effect via a scan chain selection scheme. Experimental results show the proposed method requires very short test time and small data volume by increasing the efficiency of common input effect.

Calculation of ice clearing resistance using normal vector of hull form and direct calculation of buoyancy force under the hull

  • Park, Kyung-Duk;Kim, Moon-Chan;Kim, Hyun-Soo
    • International Journal of Naval Architecture and Ocean Engineering
    • /
    • v.7 no.4
    • /
    • pp.699-707
    • /
    • 2015
  • The ice-resistance estimation technique for icebreaking ships had been studied intensively over recent years to meet the needs of designing Arctic vessels. Before testing in the ice model basin, the estimation of a ship's ice resistance with high reliability is very important to decide the delivered power necessary for level ice operation. The main idea of previous studies came from several empirical formulas, such as Poznyak and Ionov (1981), Enkvist (1972) and Shimansky (1938) methods, in which ice resistance components such as icebreaking, buoyancy and clearing resistances were represented by the integral equations along the Design Load Water Line (DLWL). The current study proposes a few modified methods not only considering the DLWL shape, but also the hull shape under the DLWL. In the proposed methodology, the DLWL shape for icebreaking resistance and the hull shape under the DLWL for buoyancy and clearing resistances can be directly considered in the calculation. Especially, when calculating clearing resistance, the flow pattern of ice particles under the DLWL of ship is assumed to be in accordance with the ice flow observed during ice model testing. This paper also deals with application examples for a few ship designs and its ice model testing programs at the AARC ice model basin. From the comparison of results of the model test and the estimation, the reliability of this estimation technique has been discussed.