• Title/Summary/Keyword: General purpose computing

Essential Computational Tools for High-Fidelity Aerodynamic Simulation and Design (고 정밀 항공우주 유동해석 및 설계를 위한 공력계산 툴)

  • Kim, Chong-Am
    • 유체기계공업학회:학술대회논문집 / 2006.08a / pp.33-36 / 2006
  • As computing environments rapidly improve, interest in CFD is increasingly focused on large-scale computation over complex geometries. Keeping pace with this trend, this work examines essential computational tools for solving complex aerospace flow analysis and design problems, and presents accurate and efficient flow analysis and design codes for large-scale aerospace problems. With regard to original numerical schemes for flow analysis, high-fidelity flux schemes such as RoeM and AUSMPW+ and higher-order interpolation schemes such as MLP (Multi-dimensional Limiting Process) are presented. Concerning grid representation, a general-purpose base code that can handle multi-block and overset grid systems simultaneously is constructed. With respect to design optimization, the importance of turbulent sensitivity is investigated, and design tools that accurately predict highly turbulent flows and their sensitivities by fully differentiating the turbulent transport equations are presented. In particular, a new sensitivity analysis treatment and a geometric representation method that resolve the basic flow characteristics are presented. Using these tools, the capability of the proposed approach to handle complex aerospace simulation and design problems is tested on several flow analysis and design problems.

Building a Dynamic Analyzer for CUDA based System.

  • SALAH T. ALSHAMMARI
    • International Journal of Computer Science & Network Security / v.23 no.8 / pp.77-84 / 2023
  • The use of GPUs in general-purpose computing is rising because of their increasing programmability and growing performance requirements. Tools such as NVIDIA's CUDA let programmers code algorithms in a C-like language for execution on graphics processing units (GPUs). Unfortunately, parallel programs are prone to many performance and correctness bugs, and CUDA tool support for such programs has not yet been realized. A dynamic analyzer that finds performance and correctness bugs in CUDA programs makes it practical to run sophisticated workloads under modern computing requirements. Race condition bugs affect program correctness, while eliminating shared memory bank conflicts improves overall performance. The technique instruments programs to track the memory locations accessed by different threads as well as to check for bugs in the program's code. The instrumented source code is run directly in CUDA's device emulation mode and reports all errors to the user. This degree of automation helps programmers resolve subtle bugs in programs that are highly complex or cannot be analyzed manually.
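
The two checks named in this abstract can be illustrated with a minimal Python sketch. It is not the paper's analyzer: the access-log format, the function names `find_races` and `bank_conflict_degree`, and the bank layout (32 banks of 4-byte words) are all assumptions made for illustration, and real bank counts vary by GPU generation.

```python
from collections import defaultdict


def find_races(accesses):
    """Return addresses with a potential data race.

    accesses: iterable of (thread_id, address, is_write) tuples, assumed to be
    logged by instrumentation between two synchronization barriers.
    """
    by_addr = defaultdict(list)
    for tid, addr, is_write in accesses:
        by_addr[addr].append((tid, is_write))

    races = []
    for addr, entries in sorted(by_addr.items()):
        threads = {tid for tid, _ in entries}
        has_write = any(is_write for _, is_write in entries)
        # A race needs at least two distinct threads and at least one write.
        if len(threads) > 1 and has_write:
            races.append(addr)
    return races


def bank_conflict_degree(byte_addresses, num_banks=32, word_bytes=4):
    """Largest number of distinct words that fall in the same bank.

    1 means conflict-free; accesses to the same word count once (broadcast).
    The bank layout is an assumed model, not a property of every GPU.
    """
    words_per_bank = defaultdict(set)
    for addr in byte_addresses:
        word = addr // word_bytes
        words_per_bank[word % num_banks].add(word)
    return max((len(words) for words in words_per_bank.values()), default=0)
```

For example, `find_races([(0, 0, True), (1, 0, False)])` flags address 0 because one thread writes what another reads, and a stride of 128 bytes across four threads maps every access to bank 0 under the assumed layout.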

A Fully Programmable Shader Processor for Low Power Mobile Devices (저전력 모바일 장치를 위한 완전 프로그램 가능형 쉐이더 프로세서)

  • Jeong, Hyung-Ki;Lee, Joo-Sock;Park, Tae-Ryong;Lee, Kwang-Yeob
    • Journal of IKEEE / v.13 no.2 / pp.253-259 / 2009
  • In this paper, we propose a novel architecture for a general graphics shader processor without dedicated hardware. Mobile devices now require graphics processors that are high-performance yet small and low-power. The proposed shader processor is a GP-GPU (General-Purpose computing on Graphics Processing Units) that executes the whole OpenGL ES 2.0 graphics pipeline using shader instructions. Because it is fully programmable, it does not require separate dedicated hardware for stages such as rasterization. The fully programmable 3D graphics shader processor can therefore eliminate much of the graphics hardware. The chip size of the designed shader processor is 60% smaller than that of previous processors.

Parallel Computation of FDTD algorithm using CUDA (CUDA를 이용한 FDTD 알고리즘의 병렬처리)

  • Lee, Ho-Young;Park, Jong-Hyun;Kim, Jun-Seong
    • Journal of the Institute of Electronics Engineers of Korea CI / v.47 no.4 / pp.82-87 / 2010
  • Modern GPUs (Graphics Processing Units) provide higher computing capability than general CPUs (Central Processing Units). With support for a programmable graphics pipeline, GP-GPU (General-Purpose computation on GPU) has gained much attention and expanded its application area. This paper compares sequential and massively parallel implementations of the FDTD (Finite-Difference Time-Domain) algorithm using CUDA (Compute Unified Device Architecture). Experimental results show up to a 45X speedup over conventional CPU execution.
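
To make the sequential baseline concrete, here is a minimal one-dimensional FDTD sketch in Python. It is not the paper's code: the grid size, the Courant number of 0.5, the Gaussian soft source, and the name `fdtd_1d` are assumptions chosen for illustration.

```python
import math


def fdtd_1d(n_cells=200, n_steps=100, courant=0.5):
    """Sequential 1D FDTD in normalized units (illustrative parameters)."""
    ex = [0.0] * n_cells  # electric field samples
    hy = [0.0] * n_cells  # magnetic field samples, staggered half a cell
    src = n_cells // 2

    for t in range(n_steps):
        # E update: every cell reads only H values from the previous half step,
        # so the iterations of this loop are independent of one another.
        for k in range(1, n_cells):
            ex[k] += courant * (hy[k - 1] - hy[k])

        # Soft Gaussian source in the middle of the grid (assumed pulse shape).
        ex[src] += math.exp(-0.5 * ((t - 40) / 12) ** 2)

        # H update: likewise independent per cell once E has been updated.
        for k in range(n_cells - 1):
            hy[k] += courant * (ex[k] - ex[k + 1])

    return ex, hy
```

Because each inner loop touches every cell independently, a GPU version can plausibly assign one thread per cell and run one kernel launch per field update. That mapping is a common approach, not a detail confirmed by the abstract.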

Analysis of Impact of Correlation Between Hardware Configuration and Branch Handling Methods Executing General Purpose Applications (범용 응용프로그램 실행 시 하드웨어 구성과 분기 처리 기법에 따른 GPU 성능 분석)

  • Choi, Hong Jun;Kim, Cheol Hong
    • The Journal of the Korea Contents Association / v.13 no.3 / pp.9-21 / 2013
  • Thanks to the increased computing power and flexibility of GPUs, recent GPUs execute general-purpose parallel applications as well as graphics applications. Programmers can use GPGPU through the APIs provided by GPU vendors. Unfortunately, the computational resources of a GPU are not fully utilized when executing general-purpose applications because of frequent branch instructions. To handle the branch problem, several warp formation techniques have been proposed. Intuitively, we expect warp formations that provide higher computational resource utilization to show higher performance. Contrary to this expectation, our simulation results show that the warp formation providing better utilization performs worse than the one providing worse utilization. This is because a warp formation that provides high utilization causes a serious memory bottleneck due to increased memory requests. Therefore, a warp formation providing high computational utilization cannot guarantee high performance without proper hardware resources. For this reason, we analyze the correlation between hardware configuration and warp formation. Our simulation results provide guidelines for solving the underutilization problem caused by branch instructions when designing recent GPUs.
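
The underutilization caused by branches can be illustrated with a small Python model. This sketch only models the baseline case in which a diverged warp executes both branch paths one after the other with inactive lanes masked off. It does not implement any of the warp formation techniques the paper evaluates, and the name `simd_utilization` and the cycle counts are assumptions.

```python
def simd_utilization(taken, then_cycles, else_cycles):
    """Fraction of lane-cycles doing useful work for one branch in one warp.

    taken: list of booleans, one per thread in the warp (True = branch taken).
    then_cycles / else_cycles: assumed cost of each branch path in cycles.
    """
    warp_size = len(taken)
    n_taken = sum(taken)
    n_not_taken = warp_size - n_taken

    # A path costs time only if at least one thread follows it.
    cycles = (then_cycles if n_taken else 0) + (else_cycles if n_not_taken else 0)
    if cycles == 0:
        return 1.0

    useful = n_taken * then_cycles + n_not_taken * else_cycles
    return useful / (warp_size * cycles)
```

With a 32-thread warp split evenly and both paths costing 10 cycles, `simd_utilization([True] * 16 + [False] * 16, 10, 10)` returns 0.5: half of the lanes sit idle on each path.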

Economic Impact of HEMOS-Cloud Services for M&S Support (M&S 지원을 위한 HEMOS-Cloud 서비스의 경제적 효과)

  • Jung, Dae Yong;Seo, Dong Woo;Hwang, Jae Soon;Park, Sung Uk;Kim, Myung Il
    • KIPS Transactions on Computer and Communication Systems / v.10 no.10 / pp.261-268 / 2021
  • Cloud computing is a computing paradigm in which users consume computing resources in a pay-as-you-go manner. In a cloud system, resources can be scaled up and down dynamically on user demand, reducing the total cost of ownership. Modeling and Simulation (M&S) is a well-known simulation-based method for obtaining engineering analysis results through CAE software without physical experiments. In general, M&S is used in Finite Element Analysis (FEA), Computational Fluid Dynamics (CFD), Multibody Dynamics (MBD), and optimization. The M&S workflow is divided into pre-processing, analysis, and post-processing steps. Pre- and post-processing are GPU-intensive jobs that consist of 3D modeling in CAE software, whereas analysis is CPU- or GPU-intensive. Because a general-purpose desktop needs a long time to analyze complicated 3D models, CAE software requires a workstation with a high-end CPU and GPU to run smoothly. In other words, executing M&S requires high-performance computing resources. To mitigate the cost of acquiring such resources, we propose the HEMOS-Cloud service, an integrated cloud and cluster computing environment. The HEMOS-Cloud service provides CAE software and computing resources to users in business or academia who want to use M&S. In this paper, the economic ripple effect of the HEMOS-Cloud service is analyzed using industry-related analysis. Estimated with expert-guided coefficients, the results are a production inducement effect of KRW 7.4 billion, a value-added effect of KRW 4.1 billion, and an employment-inducing effect of 50 persons per KRW 1 billion.

A Study on the Quality Determinant Factors of User-Support Service under Web-based Information System (웹정보시스템(WIS) 사용지원 서비스의 품질결정요인에 관한 연구)

  • 정상철;임형수
    • Journal of Information Technology Application / v.2 no.1 / pp.25-53 / 2000
  • As information technology has developed, the structure of information systems used in organizations has been changing from centralized computing to distributed computing. The role of the information department has expanded: it must not only develop and maintain systems but also provide usage-support service to end users. If an organization supports its end users properly, it gains many benefits; if it does not, it wastes many resources. The purpose of this study is therefore to identify the quality determinant factors, and the types of information, that provide users with effective service under a web-based information system. The results of this study are as follows. First, when SERVQUAL is used, the quality determinant factors of usage-support service fall into three categories: responsibility and assurance, empathy and tangibility, and reliability. When SERVPERF is used, there are five determinant factors, but assurance should be used as a factor with caution. Second, among the determinant factors, reliability and tangibility affect overall service quality, and tangibility is the most important factor. Third, when users are in distributed locations, overall service quality does not differ whether a formal or an informal information center provides the usage-support service. This study suggests the following practical implications. First, as users become more proficient with the information system, the importance of tangibility decreases. As users become individualized and improve their ability to use the information system, empathy will no longer be an important factor, so reliability is expected to become the most important factor. Second, if organizations promote not only formal information centers but also informal information centers, they may support end users more effectively. However, this study has the following limitations. First, it is difficult to generalize the results of this study. Second, the service quality determinant factors used in this study do not fully explain the influence on overall service quality. Third, this study analyzes only a simple relation between the service quality determinant factors and overall service quality. Finally, this study does not distinguish between information system service and information support service.

Enhancing GPU Performance by Efficient Hardware-Based and Hybrid L1 Data Cache Bypassing

  • Huangfu, Yijie;Zhang, Wei
    • Journal of Computing Science and Engineering / v.11 no.2 / pp.69-77 / 2017
  • Recent GPUs have adopted cache memory to benefit general-purpose GPU (GPGPU) programs. However, unlike CPU programs, GPGPU programs typically have considerably less temporal/spatial locality. Moreover, the L1 data cache is used by many threads that access a data size typically considerably larger than the L1 cache, making it critical to bypass L1 data cache intelligently to enhance GPU cache performance. In this paper, we examine GPU cache access behavior and propose a simple hardware-based GPU cache bypassing method that can be applied to GPU applications without recompiling programs. Moreover, we introduce a hybrid method that integrates static profiling information and hardware-based bypassing to further enhance performance. Our experimental results reveal that hardware-based cache bypassing can boost performance for most benchmarks, and the hybrid method can achieve performance comparable to state-of-the-art compiler-based bypassing with considerably less profiling cost.
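
Why bypassing can help is easiest to see in a toy model. The Python sketch below simulates a small fully associative LRU cache and lets accesses from selected instructions skip it. It is only an illustration under stated assumptions: the per-instruction bypass set, the trace format, and the name `simulate_l1` are hypothetical and do not reproduce the paper's hardware-based or hybrid mechanisms.

```python
from collections import OrderedDict


def simulate_l1(trace, capacity, bypass=frozenset()):
    """Count L1 hits for a trace of (pc, line_address) accesses.

    Accesses whose pc is in `bypass` neither probe nor allocate in the cache
    (an assumed simplification); all others use LRU replacement.
    """
    cache = OrderedDict()  # keys are cached line addresses, oldest first
    hits = 0
    for pc, line in trace:
        if pc in bypass:
            continue  # request goes straight to the next memory level
        if line in cache:
            cache.move_to_end(line)  # mark as most recently used
            hits += 1
        else:
            cache[line] = True
            if len(cache) > capacity:
                cache.popitem(last=False)  # evict the least recently used line
    return hits
```

If a trace interleaves a small reused working set with a stream of lines that are never touched again, the stream evicts the reused lines before they are accessed a second time. Listing the streaming instruction in `bypass` keeps the working set resident.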

On the Opimal Decision Making using the Eigenvector Methods (고유벡터 법을 이용한 최적 의사결정에 관한 연구)

  • Chung Soon-Suk
    • Proceedings of the Safety Management and Science Conference / 2006.04a / pp.123-131 / 2006
  • Multi-criteria decision making derives the relative importance of each decision criterion and measures the degree of preference for each alternative against a series of lower-ranking criteria; a choice is then made by synthesizing these measurements systematically. In general, the fundamental problem a decision maker must solve in multi-criteria decision making is to evaluate a set of candidate activities logically, and this evaluation is carried out and synthesized using the various value criteria that the activities hold in common. In this paper, we use eigenvector methods to calculate the weights. To make an optimal decision, data on five different car models are used. For the computation, we used the Visual Numerica Version 1.0 software package.
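
A common form of the eigenvector method takes a pairwise comparison matrix and uses its normalized principal eigenvector as the weight vector. The Python sketch below computes that vector by power iteration. The pairwise-comparison input, the iteration count, and the name `eigenvector_weights` are assumptions; the abstract does not say which variant or algorithm the paper uses.

```python
def eigenvector_weights(matrix, iterations=100):
    """Approximate the principal eigenvector of a positive square matrix.

    matrix[i][j] is assumed to express how strongly item i is preferred to
    item j. The result is normalized so that the weights sum to 1.
    """
    n = len(matrix)
    weights = [1.0 / n] * n  # start from equal weights
    for _ in range(iterations):
        # One power-iteration step: multiply by the matrix, then renormalize.
        product = [sum(matrix[i][j] * weights[j] for j in range(n)) for i in range(n)]
        total = sum(product)
        weights = [value / total for value in product]
    return weights
```

For a perfectly consistent matrix, where every entry equals the ratio of two underlying weights, the iteration recovers those weights. For inconsistent judgments it returns the principal eigenvector as a compromise.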

Real-time Vehicle License Plate Recognition Method using Vehicle-loaded Camera (차량 탑재용 카메라를 이용한 실시간 차량 번호판 인식 기법)

  • Chang, Jae-Khun
    • Journal of Internet Computing and Services / v.6 no.3 / pp.147-158 / 2005
  • Vehicle information in complex traffic environments is increasingly needed, not only for managing traffic flow but also for identifying vehicles involved in traffic violations. Vehicle information can be obtained by recognizing the vehicle license plate. This paper proposes a new license plate recognition mechanism that uses a moving, vehicle-mounted camera. The method is a real-time processing system that uses multi-step image processing and recognition to recognize the plates of general vehicles and special-purpose vehicles. Experimental results on real-environment images using the proposed method are shown.
