• Title/Summary/Keyword: WLCG

Search Result 4, Processing Time 0.019 seconds

Design and Construction of WLCG/EGEE Tier-2 (WLCG/EGEE 티어-2의 설계 및 구축)

  • Kwak, Jae-Hyuck;Lee, Pillwoo
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2007.11a
    • /
    • pp.255-258
    • /
    • 2007
  • WLCG/EGEE project has a main goal of providing application scientists with access to a geographically distributed computing Grid infrastructure. Currently, WLCG/EGEE infrastructure is the world's largest Grid infrastructure, providing over 20,000 CPUs and about 3 PB disk at 200 international sites and used by over 9 application domains including LHC experiments from high-energy physics. This paper addresses the design and construction of WLCG/EGEE tier-2 propelled by international collaboration with EGEE-II project.

  • PDF

An Empirical Analysis of Worldwide Cyberinfrastructure

  • Cho, Manhyung
    • Asian Journal of Innovation and Policy
    • /
    • v.4 no.3
    • /
    • pp.381-396
    • /
    • 2015
  • Cyberinfrastructure is a research infrastructure that provides an environment in which research communities can get access to distributed resources and collaborate at unprecedented levels of computation, storage, and network capacity. The Worldwide LHC Computing Grid (WLCG) is a global collaborative project of computing or data centers that enables access to scientific data generated by the Large Hadron Collider (LHC) experiments at CERN. This case study analyzes the WLCG as a model of cyberinfrastructure in research collaboration. WLCG provides a useful case of how cyberinfrastructure can work in providing an infrastructure for collaborative researches under data-intensive paradigm. Cyberinfrastructure plays the critical role of facilitating collaboration of diverse and widely separated communities of researchers. Data-intensive science requires new strategies for research support and significant development of cyberinfrastructure. The sustainability of WLCG depends on the resources of partner organizations and virtual organizations at international levels, essential for research collaboration.

Prediction Model of CMS Job Failures using C4.5 (C4.5를 이용한 CMS 잡 오류 예측 모델)

  • Xu, Zhenshun;Zuo, Shangsu;Choi, Heesu;Park, Daihee;Chung, Yongwha;Cho, Choong-ho
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2015.04a
    • /
    • pp.773-775
    • /
    • 2015
  • 복잡한 그리드 컴퓨팅 환경에서 수행한 잡의 성패율을 낮추는 것은 그리드 환경의 효율성과 선순환을 위한 필수적인 요건이다. 본 논문에서는 데이터마이닝의 대표적인 방법인 결정트리의 C4.5 알고리즘을 이용하여 WLCG에서 수행한 CMS 잡 모니터링 결과에 대한 오류를 예측하는 모델을 설계하고 구현하였다. 제안한 예측 모델은, 1) CMS 대시보드에서 모니터링 결과 데이터를 추출하여 오라클 테이블에 로딩한다. 2) 결정트리인 C4.5 알고리즘을 기반으로 Oracle Data Miner에서 예측 모델링을 수행한다. 3) C4.5의 파라미터를 조절하여 적절한 예측결과 값을 도용한다.

A Disk-based Archival Storage System Using the EOS Erasure Coding Implementation for the ALICE Experiment at the CERN LHC

  • Ahn, Sang Un;Betev, Latchezar;Bonfillou, Eric;Han, Heejune;Kim, Jeongheon;Lee, Seung Hee;Panzer-Steindel, Bernd;Peters, Andreas-Joachim;Yoon, Heejun
    • Journal of Information Science Theory and Practice
    • /
    • v.10 no.spc
    • /
    • pp.56-65
    • /
    • 2022
  • Korea Institute of Science and Technology Information (KISTI) is a Worldwide LHC Computing Grid (WLCG) Tier-1 center mandated to preserve raw data produced from A Large Ion Collider Experiment (ALICE) experiment using the world's largest particle accelerator, the Large Hadron Collider (LHC) at European Organization for Nuclear Research (CERN). Physical medium used widely for long-term data preservation is tape, thanks to its reliability and least price per capacity compared to other media such as optical disk, hard disk, and solid-state disk. However, decreasing numbers of manufacturers for both tape drives and cartridges, and patent disputes among them escalated risk of market. As alternative to tape-based data preservation strategy, we proposed disk-only erasure-coded archival storage system, Custodial Disk Storage (CDS), powered by Exascale Open Storage (EOS), an open-source storage management software developed by CERN. CDS system consists of 18 high density Just-Bunch-Of-Disks (JBOD) enclosures attached to 9 servers through 12 Gbps Serial Attached SCSI (SAS) Host Bus Adapter (HBA) interfaces via multiple paths for redundancy and multiplexing. For data protection, we introduced Reed-Solomon (RS) (16, 4) Erasure Coding (EC) layout, where the number of data and parity blocks are 12 and 4 respectively, which gives the annual data loss probability equivalent to 5×10-14. In this paper, we discuss CDS system design based on JBOD products, performance limitations, and data protection strategy accommodating EOS EC implementation. We present CDS operations for ALICE experiment and long-term power consumption measurement.