• Title/Summary/Keyword: SUN Tachyon

Search Result 5, Processing Time 0.024 seconds

Improving the Job Success Rate through Analysis of User Logs in HPC (HPC 환경에서 사용자 로그 분석을 통한 작업 성공률 개선)

  • Yoon, JunWeon;Hong, TaeYoung;Kong, Ki-Sik;Park, ChanYeol
    • Journal of Digital Contents Society
    • /
    • v.16 no.5
    • /
    • pp.691-697
    • /
    • 2015
  • Supercomputers are used for many different areas including new product design of industries as well as state-of-the-art science and technology for large amount of computational needs. Tachyon is a 4th supercomputer built at KISTI that is a high-performance parallel computing system with 3,200 computing nodes and infrastructures. This system is currently about 10,000 users and over 170 organizations are used, the number of jobs they are performing work in batch type form through a scheduler. Also, this system logs lots of job scripts, execution environment, library, job status from the job submit to end. In this paper, we analyzed batch jobs information from Sun Grid Engine, that use as a scheduler in Tachyon system, and job executed information in Tachyon System. In particular, we distinguished the fail jobs from the all tasks that users perform and we analyzed the cause of failure. Among them, we can extracted some of jobs that can be regarded as normal jobs through the improvement in those works logged as all of fail jobs.

Fault Management System for Interconnection Network in HPC Environment (HPC 환경에서 인터커넥션 네트워크 장애관리 시스템 구축)

  • Hong, TaeYeong;Yoon, JunWeon
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2017.04a
    • /
    • pp.68-70
    • /
    • 2017
  • KISTI 슈퍼컴퓨터 4호기 Tachyon2는 SUN Blade 6275 시스템을 기반으로 구성된 초병렬 컴퓨팅 시스템으로 이론최고성능(Rpeak) 300TFlops를 보이고 있으며 3,200대의 컴퓨팅 노드와 인프라 노드로 구분된다. Tachyon2 시스템은 국내 산학연 연구자들을 위한 공공 목적의 시스템으로 만여 명의 사용자와 200여개의 기관이 사용 중에 있다. 이런 슈퍼컴퓨터와 같은 대형 HPC 환경에서는 대규모의 사용자 작업을 원활하게 수행하기 위해서는 IB의 안정성이 우선적으로 보장되어야 한다. 본 논문에서는 Tachyon2 시스템에서 발생하는 IB 상태를 파악하고 관리하기 위한 자동화 도구를 개발하였다. 이로써 인터커넥션의 상태를 주기적으로 모니터링 할 수 있고, 장애내역 또한 신속하게 파악할 수 있다.

Shielding Design of Electron Beam Accelerators Using Supercomputer (슈퍼컴을 이용한 전자빔가속기의 차폐설계)

  • Kang, Won Gu;Kim, In Soo;Kuk, Sung Han;Kim, Jin Kyu;Han, Bum Soo;Jeong, Kwang Young;Kang, Chang Mu
    • Journal of Radiation Industry
    • /
    • v.4 no.1
    • /
    • pp.33-38
    • /
    • 2010
  • The MCNP5 neutron, electron, photon Monte Carlo transport program was installed on the KISTI's SUN Tachyon computer using the parallel programming. Electron beam accelerators were modeled and shielding calculations were performed in order to investigate the reduction of computation time in the supercomputer environment. It was observed that a speedup of 40 to 80 of computation time can be obtained using 64 CPUs compared to an IBM PC.

Design and Implementation of an Index Manager for a Main Memory DBMS (주기억장치 DBMS를 위한 인덱스 관리자의 설계 및 구현)

  • Kim, Sang-Wook;Yeom, Sang-Min;Kim, Yun-Ho;Lee, Seung-Sun;Choi, Wan
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.25 no.4B
    • /
    • pp.661-674
    • /
    • 2000
  • The main memory DBMS(MMDBMS) efficiently supports various database applications that require high performance since it employs main memory rather than disk as a primary storage. In this paper, we discuss theexperiences obtained in developing the index manager of the Tachyon, a next-generation MMDBMS. The indexmanager is an essential sub-component of the DBMS used to speed up the retrieval of objects from a largevolume of a database in response to a certain search condition. Previous research efforts on indexing proposed various index structures. However, they hardly dealt with the practical issues occured in implementating an index manager on a target DBMS. In this paper, we touch these issues and present our experiences in developing the index manager on the Tachyon as solutions. The main issues touched are (1) compact representation of an indexentry, (2) support of variable-length keys, (3) support of multiple-attribute keys, (4) support of duplicated keys,(5) definition of external APls, (6) concurrency control, and (7) backup and recovery. We believe that ourcontribution would help MMDBMS developers highly reduce their trial-and-errors.

  • PDF

Benchmarking techniques to evaluate single computing node of HPC (슈퍼컴퓨터 단일 컴퓨팅 노드 성능 측정을 위한 벤치마크 기법)

  • Kwon, Min-Woo;Yoon, JunWeon;Hong, TaeYoung
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2017.04a
    • /
    • pp.571-572
    • /
    • 2017
  • 한국과학기술정보연구원에서 운영 중인 슈퍼컴퓨터 4호기인 Tachyon 2차 시스템은 이론최고성능 300TFlops인 SUN Blade 6275 시스템을 기반으로 구성되어있다. 로그인 노드 4대와 컴퓨팅 노드 3200대로 구성되어 있으며 컴퓨팅 노드 중에 24대는 디버깅 노드로 사용되고 있다. 3200대의 컴퓨팅 노드가 동일한 하드웨어로 구성이 되어 있으므로 Tachyon 2차 시스템의 전체 계산 성능을 결정하는 가장 중요한 요소가 단일 컴퓨팅 노드의 성능이 되겠다. 본 논문에서는 다양한 벤치마크 기법을 통해 단일 노드의 성능을 측정하여 분석하였다.