• Title/Summary/Keyword: Code Optimization

Search Result 589, Processing Time 0.034 seconds

A Study on Scalability of Profiling Method Based on Hardware Performance Counter for Optimal Execution of Supercomputer (슈퍼컴퓨터 최적 실행 지원을 위한 하드웨어 성능 카운터 기반 프로파일링 기법의 확장성 연구)

  • Choi, Jieun;Park, Guenchul;Rho, Seungwoo;Park, Chan-Yeol
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.9 no.10
    • /
    • pp.221-230
    • /
    • 2020
  • Supercomputer that shares limited resources to multiple users needs a way to optimize the execution of application. For this, it is useful for system administrators to get prior information and hint about the applications to be executed. In most high-performance computing system operations, system administrators strive to increase system productivity by receiving information about execution duration and resource requirements from users when executing tasks. They are also using profiling techniques that generates the necessary information using statistics such as system usage to increase system utilization. In a previous study, we have proposed a scheduling optimization technique by developing a hardware performance counter-based profiling technique that enables characterization of applications without further understanding of the source code. In this paper, we constructed a profiling testbed cluster to support optimal execution of the supercomputer and experimented with the scalability of the profiling method to analyze application characteristics in the built cluster environment. Also, we experimented that the profiling method can be utilized in actual scheduling optimization with scalability even if the application class is reduced or the number of nodes for profiling is minimized. Even though the number of nodes used for profiling was reduced to 1/4, the execution time of the application increased by 1.08% compared to profiling using all nodes, and the scheduling optimization performance improved by up to 37% compared to sequential execution. In addition, profiling by reducing the size of the problem resulted in a quarter of the cost of collecting profiling data and a performance improvement of up to 35%.

Sustainable Design Method of Reinforced Concrete Beam Using Embodied Energy Optimization Technique (내재에너지 최적화를 통한 철근 콘크리트 보의 지속가능 설계법)

  • Yoon, Young-Cheol;Kim, Kyeong-Hwan;Yeo, DongHun;Lee, Sang-Ho
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.34 no.4
    • /
    • pp.1053-1063
    • /
    • 2014
  • This study presents a sustainable design method that optimizes the embodied energy of concrete beam based on the concept of sustainable development that effectively utilizes natural resource and energy within the range that our succeeding generation can afford to utilize. In order to get the flexural strength carrying the ultimate load, concrete beam sections are designed by optimization that consists of the embodied energy as a objective function and the requirements of design code as constrained conditions. The sustainable design can be used to minimize the embodied energy consumed in material production, construction, operation, demolition of the infrastructure. As a result of comparison of the cost and the embodied energy optimizations based on practical beam sections, it is shown that 20% embodied energy saving and 35% $CO_2$ emission saving are achieved by sacrificing 10% cost increase. The sustainable design method provides a new effective methodology that manages the strength design concept based on cost minimization together with economic feasibility and sustainability. In addition, the method is expected to be applied to more various structural design practices.

Shape Optimum Design of Pultruded FRP Bridge Decks (인발성형된 FRP 바닥판의 형상 최적설계)

  • 조효남;최영민;김희성;김형열;이종순
    • Journal of the Computational Structural Engineering Institute of Korea
    • /
    • v.17 no.3
    • /
    • pp.319-332
    • /
    • 2004
  • Due to their high strength to weight ratios and excellent durability, fiber reinforced polymer(FRP) is widely used in construction industries. In this paper, a shape optimum design of FRP bridge decks haying pultruded cellular cross-section is presented. In the problem formulation, an objective function is selected to minimize the volumes. The cross-sectional dimensions and material properties of the deck of FRP bridges are used as the design variables. On the other hand, deflection limits in the design code, material failure criteria, buckling load, minimum height, and stress are selected as the design constraints to enhance the structural performance of FRP decks. In order to efficiently treat the optimization process, the cross-sectional shape of bridge decks is assumed to be a tube shape. The optimization process utilizes an improved Genetic Algorithms incorporating indexing technique. For the structural analysis using a three-dimensional finite element, a commercial package(ABAQUS) is used. Using a computer program coded for this study, an example problem is solved and the results are presented with sensitivity analysis. The bridge consists of a deck width of 12.14m and is supported by five 40m long steel girders spaced at 2.5m. The bridge is designed to carry a standard DB-24 truck loading according to the Standard Specifications for Highway Bridges in Korea. Based on the optimum design, viable cross-sectional dimensions for FRP decks, suitable for pultrusion process are proposed.

Development of Decision Support System for the Design of Steel Frame Structure (강 프레임 구조물 설계를 위한 의사 결정 지원 시스템의 개발)

  • Choi, Byoung Han
    • Journal of Korean Society of Steel Construction
    • /
    • v.19 no.1
    • /
    • pp.29-41
    • /
    • 2007
  • Structural design, like other complex decision problems, involves many trade-offs among competing criteria. Although mathematical programming models are becoming increasingly realistic, they often have design limitations, that is, there are often relevant issues that cannot be easily captured. From the understanding of these limitations, a decision-support system is developed that can generate some useful alternatives as well as a single optimum value in the optimization of steel frame structures. The alternatives produced using this system are "good" with respect to modeled objectives, and yet are "different," and are often better, with respect to interesting objectives not present in the model. In this study, we created a decision-support system for designing the most cost-effective moment-resisting steel frame structures for resisting lateral loads without compromising overall stability. The proposed approach considers the cost of steel products and the cost of connections within the design process. This system makes use of an optimization formulation, which was modified to generate alternatives of optimum value, which is the result of the trade-off between the number of moment connections and total cost. This trade-off was achieved by reducing the number of moment connections and rearranging them, using the combination of analysis based on the LRFD code and optimization scheme based on genetic algorithms. To evaluate the usefulness of this system, the alternatives were examined with respect to various design aspects.

Array Bounds Check Elimination using Ineguality Graph in Java Just-in-Time Compiler (대소관계 그래프를 이용한 Just-in-Time 컴파일 환경에서의 배열 경계 검사 제거)

  • Choi Sun-il;Moon Soo-mook
    • Journal of KIISE:Software and Applications
    • /
    • v.32 no.12
    • /
    • pp.1283-1291
    • /
    • 2005
  • One of the problems in boosting Java performance using a Just-in-Time (JIT) compiler is removing redundant array bound checks. In conventional static compilers, many powerful algorithms have been developed, yet they are not directly applicable to JIT compilation where the compilation time is part of the whole running time. In the current JIT compilers, we tan use either a naive algorithm that is not powerful enough or an aggressive algorithm which requires the transformation into a static single assignment (SSA) form of programs (and back to the original form after optimization), thus causing too much overhead not appropriate for JIT compilation This paper proposes a new algorithm based on an inequality graph which can eliminate array bounds check codes aggressively without resorting to the SSA form. When we actually perform this type of optimization, there are many constraints in code motion caused by the precise exception rule in Java specification, which would cause the algorithm to miss many opportunities for eliminating away bound checks. We also propose a new method to overcome these constraints.

Three-dimensional thermal-hydraulics/neutronics coupling analysis on the full-scale module of helium-cooled tritium-breeding blanket

  • Qiang Lian;Simiao Tang;Longxiang Zhu;Luteng Zhang;Wan Sun;Shanshan Bu;Liangming Pan;Wenxi Tian;Suizheng Qiu;G.H. Su;Xinghua Wu;Xiaoyu Wang
    • Nuclear Engineering and Technology
    • /
    • v.55 no.11
    • /
    • pp.4274-4281
    • /
    • 2023
  • Blanket is of vital importance for engineering application of the fusion reactor. Nuclear heat deposition in materials is the main heat source in blanket structure. In this paper, the three-dimensional method for thermal-hydraulics/neutronics coupling analysis is developed and applied for the full-scale module of the helium-cooled ceramic breeder tritium breeding blanket (HCCB TBB) designed for China Fusion Engineering Test Reactor (CFETR). The explicit coupling scheme is used to support data transfer for coupling analysis based on cell-to-cell mapping method. The coupling algorithm is realized by the user-defined function compiled in Fluent. The three-dimensional model is established, and then the coupling analysis is performed using the paralleled Coupling Analysis of Thermal-hydraulics and Neutronics Interface Code (CATNIC). The results reveal the relatively small influence of the coupling analysis compared to the traditional method using the radial fitting function of internal heat source. However, the coupling analysis method is quite important considering the nonuniform distribution of the neutron wall loading (NWL) along the poloidal direction. Finally, the structure optimization of the blanket is carried out using the coupling method to satisfy the thermal requirement of all materials. The nonlinear effect between thermal-hydraulics and neutronics is found during the blanket structure optimization, and the tritium production performance is slightly reduced after optimization. Such an adverse effect should be thoroughly evaluated in the future work.

Design of Heat and Fluid Flow in Cold Container Using CFD Simulation (CFD 시뮬레이션을 이용한 냉장컨테이너의 열유동 설계)

  • Yun, Hong-Sun;Kwon, Jin-Kyung;Jeong, Hoon;Lee, Hyun-Dong;Kim, Young-Keun
    • Journal of Biosystems Engineering
    • /
    • v.33 no.6
    • /
    • pp.396-403
    • /
    • 2008
  • Because thermal non-uniformity of transported agricultural products is mainly affected by cooling air flow pattern in the cold transport equipment, the analysis and control of flowfield is key to optimization of cold transport equipment. The objectives of this study were to estimate the effects of geometric and operating parameters of cold container on the air flow and heat transfer, and find the optimum design parameters for the low temperature level and its uniformity in given cold container with CFD simulations. Existences of ducts, gaps between pallets and geometries of exit as geometric parameters and fan blowing velocity as operating parameter were investigated. CFD simulations were carried out with the FLUENT 6.2 code. The result showed that optimum design condition was bulk loading with no duct, wall exit and 8.0 m/s of fan blowing velocity.

An Efficient Architecture Exploration Method for Optimal ASIP Design (Application에 최적의 ASIP 설계를 위한 효율적인 Architecture Exploration 방법)

  • Lee, Sung-Rae;Hwang, Sun-Young
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.32 no.9C
    • /
    • pp.913-921
    • /
    • 2007
  • Retargetable compiler which generates executable code for a target processor and performance profiler are required to design a processor optimized for a specific application. This paper presents an architecture exploration methodology based on ADL (Architecture Description Language). We synthesized instruction set and optimized processor structure using information extracted from application program. The information of operation sequences executed frequently and register usage are used for processor optimization. Architecture exploration has been performed for JPEG encoder to show the effectiveness of the system. The ASIP designed using the proposed method shows 1.97 times better performance.

A Study of Feedrate Optimization for Tolerance Error of NC Machining (NC가공에서 허용오차를 고려한 가공속도 최적화에 관한 연구)

  • Lee, Hee-Seung;Lee, Cheol-Soo;Kim, Jong-Min;Heo, Eun-Young
    • Journal of the Korean Society of Manufacturing Technology Engineers
    • /
    • v.22 no.5
    • /
    • pp.852-858
    • /
    • 2013
  • In numerical control (NC) machining, a machining error in equipment generally occurs for a variety of reasons. If there is a change in direction in the NC code, the characteristics of the automatic acceleration or deceleration function cause an overlap of each axis of the acceleration and deceleration zones, which in turn causes a shift in the actual processing path. Many studies have been conducted for error calibration of the edge as caused by automatic acceleration or deceleration in NC machining. This paper describes a geometric interpretation of the shape and processing characteristics of the operating NC device. The paper then describes a way to determine a feedrate that achieves the desired tolerance by using linear and parabolic profiles. Experiments were conducted by the validate equations using a three-axis NC machine. The results show that the machining errors were smaller than the machine resolution. The results also clearly demonstrate that the NC machine with the developed system can successfully predict machining errors induced with a change in direction.

Optimization of Blind Adaptive Decorrelating PIC Detector Performance in DS-CDMA System

  • Sirijiamrat, S.;Benjangkaprasert, C.;Sangaroon, O.
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2004.08a
    • /
    • pp.1962-1965
    • /
    • 2004
  • In this paper, the new algorithm for blind adaptive decorrelating parallel interference canceller detector in direct-sequence code division multiple access (DS-CDMA) synchronous communication systems is proposed. The goal of this paper is to improve the performance of the blind adaptive decorrelating parallel interference cancellation detector (BAD/PIC). The proposed blind adaptive decorrelating detector is using optimum step-size technique bootstrap algorithm as an initial stage of PIC, which does not require a training sequence. Therefore, this algorithm has a superior view of utilizing bandwidth and reduces the complexity of computation of inversion cross-correlation matrix. The computer simulation results show that the bit error rate performance of the proposed algorithm for the new structure of detector is better than that of the other detectors such as matched filters, the conventional PIC, and the blind adaptive decorrelating PIC detector.

  • PDF