Search | Korea Science

High-performance computing for SARS-CoV-2 RNAs clustering: a data science-based genomics approach

Oujja, Anas;Abid, Mohamed Riduan;Boumhidi, Jaouad;Bourhnane, Safae;Mourhir, Asmaa;Merchant, Fatima;Benhaddou, Driss
- Genomics & Informatics
- /
- v.19 no.4
- /
- pp.49.1-49.11
- /
- 2021
Nowadays, Genomic data constitutes one of the fastest growing datasets in the world. As of 2025, it is supposed to become the fourth largest source of Big Data, and thus mandating adequate high-performance computing (HPC) platform for processing. With the latest unprecedented and unpredictable mutations in severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the research community is in crucial need for ICT tools to process SARS-CoV-2 RNA data, e.g., by classifying it (i.e., clustering) and thus assisting in tracking virus mutations and predict future ones. In this paper, we are presenting an HPC-based SARS-CoV-2 RNAs clustering tool. We are adopting a data science approach, from data collection, through analysis, to visualization. In the analysis step, we present how our clustering approach leverages on HPC and the longest common subsequence (LCS) algorithm. The approach uses the Hadoop MapReduce programming paradigm and adapts the LCS algorithm in order to efficiently compute the length of the LCS for each pair of SARS-CoV-2 RNA sequences. The latter are extracted from the U.S. National Center for Biotechnology Information (NCBI) Virus repository. The computed LCS lengths are used to measure the dissimilarities between RNA sequences in order to work out existing clusters. In addition to that, we present a comparative study of the LCS algorithm performance based on variable workloads and different numbers of Hadoop worker nodes.
https://doi.org/10.5808/gi.21056 인용 PDF KSCI

The Technology Trend of Interconnection Network for High Performance Computing (고성능 컴퓨팅을 위한 인터커넥션 네트워크 기술 동향)

Cho, Hyeyoung;Jun, Tae Joon;Han, Jiyong
- Journal of the Korea Convergence Society
- /
- v.8 no.8
- /
- pp.9-15
- /
- 2017
With the development of semiconductor integration technology, central processing units and storage devices have been miniaturized and performance has been rapidly developed, interconnection network technology is becoming a more important factor in terms of the performance of high performance computing system. In this paper, we analyze the trend of interconnection network technology used in high performance computing. Interconnect technology, which is the most widely used in the Supercomputer Top 500(2017. 06.), is an Infiniband. Recently, Ethernet is the second highest share after InfiniBand due to the emergence of 40/100Gbps Gigabit Ethernet technology. Gigabit Ethernet, where latency performance is lower than InfiniBand, is preferred in cost-effective medium-sized data centers. In addition, top-end HPC systems that demand high performance are devoting themselves from Ethernet and InfiniBand technologies and are attempting to maximize system performance by introducing their own interconnect networks. In the future, high-performance interconnects are expected to utilize silicon-based optical communication technology to exchange data with light.
https://doi.org/10.15207/JKCS.2017.8.8.009 인용 PDF KSCI

An Analysis of the 2016 Cyber Attack Detection Data in NISN HPC Service Environment (NISN HPC 서비스 환경에서 2016년 사이버 공격 탐지 데이터 분석)

Lee, Jae-Kook;Kim, Sung-Jun;Hong, Teayoung
- Proceedings of the Korea Information Processing Society Conference
- /
- 2017.04a
- /
- pp.190-191
- /
- 2017
정보통신기술(ICT)의 발전으로 원격지에서 고속으로 HPC(High Performance Computing) 서비스를 이용할 수 있게 되었지만, HPC 서비스 환경을 대상으로 하는 사이버 공격도 끊이지 않고 발생하고 있다. 본 논문에서는 슈퍼컴퓨터 4호기 서비스 환경에서 탐지/차단된 사이버 공격 증가 추이를 살펴보고, 2016년 사이버 공격 탐지 데이터와 슈퍼컴퓨팅서비스 네트워크 내부로 유입된 트래픽 데이터를 분석하여 급격히 증가한 공격지 IP 주소의 분포 및 특징을 확인한다.
https://doi.org/10.3745/PKIPS.y2017m04a.190 인용 PDF

Scallop-free TSV, Copper Pillar and Hybrid Bonding for 3D Packaging (3D 패키징을 위한 Scallop-free TSV와 Cu Pillar 및 하이브리드 본딩)

Jang, Ye Jin;Jung, Jae Pil
- Journal of the Microelectronics and Packaging Society
- /
- v.29 no.4
- /
- pp.1-8
- /
- 2022
High-density packaging technologies, including Through-Si-Via (TSV) technologies, are considered important in many fields such as IoT (internet of things), 6G/5G (generation) communication, and high-performance computing (HPC). Achieving high integration in two dimensional packaging has confronted with physical limitations, and hence various studies have been performed for the three-dimensional (3D) packaging technologies. In this review, we described about the causes and effects of scallop formation in TSV, the scallop-free etching technique for creating smooth sidewalls, Cu pillar and Cu-SiO₂ hybrid bonding in TSV. These technologies are expected to have effects on the formation of high-quality TSVs and the development of 3D packaging technologies.
https://doi.org/10.6117/kmeps.2022.29.4.001 인용 PDF KSCI

Design and Implementation of HPC Job Management Framework for Computational Scientific Simulation (계산과학 시뮬레이션을 위한 HPC 작업 관리 프레임워크의 설계 및 구현)

Yu, Jung-Lok;Kim, Han-Gi;Byun, Hee-Jung
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2016.05a
- /
- pp.554-557
- /
- 2016
Recently, supercomputer has been increasingly adopted as a computing environment for scientific simulation as well as education, healthcare and national defence. Especially, supercomputing system with heterogeneous computing resources is gaining resurgence of interest as a next-generation problem solving environment, allowing theoretical and/or experimental research in various fields to be free of time and spatial limits. However, traditional supercomputing services have only been handled through a simple form of command-line based console, which leads to the critical limit of accessibility and usability of heterogeneous computing resources. To address this problem, in this paper, we provide the design and implementation of web-based HPC (High Performance Computing) job management framework for computational scientific simulation. The proposed framework has highly extensible design principles, providing the abstraction interfaces of job scheduler (as well as bundle scheduler plug-ins for LoadLeveler, Sun Grid Engine, OpenPBS scheduler) in order to easily incorporate the broad spectrum of heterogeneous computing resources such as cluster, computing cloud and grid. We also present the detailed specification of HTTP standard based RESTful endpoints, which manage simulation job's life-cycles such as job creation, submission, control and status monitoring, etc., enabling various 3rd-party applications to be newly created on top of the proposed framework.
PDF

The Design of A HPC based System For Responding Complex Disaster (복합재난 대응을 위한 HPC 기반 시스템 설계)

Kang, Kyung-woo;Kang, Yun-hee
- Journal of Platform Technology
- /
- v.6 no.4
- /
- pp.49-58
- /
- 2018
Complex disasters make greater damage and higher losses unexpected than the past. To prevent these disasters, it needs to prepare a plan for handling unexpected results. Especially an accident at a facility like an atomic power plant makes a big problem cause of climate change. A simulation needs to do preliminary researches based on diverse situations. In this research we define the basic component techniques to design and implement the disaster management system. Basically a hierarchical system design method is to build on the resources provided from high performance computing(HPC) and large-scale storage systems. To develop the system, it is considered middleware as well as application studies, data studies and decision making services in convergence areas.

Performance Optimization of Numerical Ocean Modeling on Cloud Systems (클라우드 시스템에서 해양수치모델 성능 최적화)

JUNG, KWANGWOOG;CHO, YANG-KI;TAK, YONG-JIN
- The Sea:JOURNAL OF THE KOREAN SOCIETY OF OCEANOGRAPHY
- /
- v.27 no.3
- /
- pp.127-143
- /
- 2022
Recently, many attempts to run numerical ocean models in cloud computing environments have been tried actively. A cloud computing environment can be an effective means to implement numerical ocean models requiring a large-scale resource or quickly preparing modeling environment for global or large-scale grids. Many commercial and private cloud computing systems provide technologies such as virtualization, high-performance CPUs and instances, ether-net based high-performance-networking, and remote direct memory access for High Performance Computing (HPC). These new features facilitate ocean modeling experimentation on commercial cloud computing systems. Many scientists and engineers expect cloud computing to become mainstream in the near future. Analysis of the performance and features of commercial cloud services for numerical modeling is essential in order to select appropriate systems as this can help to minimize execution time and the amount of resources utilized. The effect of cache memory is large in the processing structure of the ocean numerical model, which processes input/output of data in a multidimensional array structure, and the speed of the network is important due to the communication characteristics through which a large amount of data moves. In this study, the performance of the Regional Ocean Modeling System (ROMS), the High Performance Linpack (HPL) benchmarking software package, and STREAM, the memory benchmark were evaluated and compared on commercial cloud systems to provide information for the transition of other ocean models into cloud computing. Through analysis of actual performance data and configuration settings obtained from virtualization-based commercial clouds, we evaluated the efficiency of the computer resources for the various model grid sizes in the virtualization-based cloud systems. We found that cache hierarchy and capacity are crucial in the performance of ROMS using huge memory. The memory latency time is also important in the performance. Increasing the number of cores to reduce the running time for numerical modeling is more effective with large grid sizes than with small grid sizes. Our analysis results will be helpful as a reference for constructing the best computing system in the cloud to minimize time and cost for numerical ocean modeling.
https://doi.org/10.7850/jkso.2022.27.3.127 인용 PDF KSCI

A Study on the Improvement of High Performance Computing Education in Computational Science (계산과학분야의 고성능컴퓨팅 교육 개선을 위한 탐색적 연구)

Yoon, Heejun;Ahn, Seongjin
- Journal of Digital Convergence
- /
- v.16 no.12
- /
- pp.21-31
- /
- 2018
In order to utilize HPC in Computational science, It is necessary to learn the knowledge and skills of computer science such as programming, algorithms and data structure. In this paper, we investigate IT education status in Computational science and propose policy directions to improve the HPC education through user survey. To do this, we surveyed the current state of IT subjects among major subjects in physics, chemistry, life sciences, and earth science in domestic universities and surveyed the users' Recognition of HPC education. As a result, the ratio of IT subjects in Computational science was very lower than the ratio of major domain subjects. Despite the high educational needs of universities, the educational level of universities was the lowest. Most users have learned the necessary knowledge and skills through self-study. We recognized the role of the university is the most urgent and important, and the role of professional institutions and online education is also important.
https://doi.org/10.14400/JDC.2018.16.12.021 인용 PDF KSCI HTML

Simulation of Wood Crib Burning Behaviors by Using FDS (FDS를 이용한 소화모형 화재거동의 시뮬레이션)

Kwon, Seong-Pil;Yoon, Hun-Ju;Kim, Hyeong-Gweon;Ra, Yong-Woon;SaKong, Seong-Ho;Shin, Dong-Il
- Proceedings of the Korea Institute of Fire Science and Engineering Conference
- /
- 2008.11a
- /
- pp.76-79
- /
- 2008
In this work wood crib burning behaviors have been simulated by using the FDS(Fire Dynamic Simulator) program. Wood cribs are regularly stacked arrays of wood sticks, and available for the performance rating of fire-extinguishers. On the basis of an angle iron supporter 26 layers of wood sticks have been stacked up. Each layer consists of 5 or 6 wood sticks which are placed in parallel, with a constant distance, and in alternating rows. They are laid between the horizontally adjacent sticks at the before last layer. The wood crib is ignited instantaneously by an amount of burning gasoline below. A comprehensive simulation of such a practical sophisticated combustion is still too difficult to realize with any currently available computer, although the performance of modern processors is getting better everyday. We could carry it out here through parallel computing on the HPC(High Performance Computing) cluster as the feasible alternative. At last the validation has been executed by means of temperature distribution data measured by the thermal video camera.
PDF

Performance Analysis and Characterization of Multi-Core Servers (멀티-코어 서버의 성능 분석 및 특성화)

Lee, Myung-Ho;Kang, Jun-Suk
- The KIPS Transactions:PartA
- /
- v.15A no.5
- /
- pp.259-268
- /
- 2008
Multi-Core processors have become main-stream microprocessors in recent years. Servers based on these multi-core processors are widely adopted in High Performance Computing (HPC) and commercial business applications as well. These servers provide increased level of parallelism, thus can potentially boost the performance for applications. However, the shared resources among multiple cores on the same chip can become hot spots and act as performance bottlenecks. Therefore it is essential to optimize the use of shared resources for high performance and scalability for the multi-core servers. In this paper, we conduct experimental studies to analyze the positive and negative effects of the resource sharing on the performance of HPC applications. Through the analyses we also characterize the performance of multi-core servers.
https://doi.org/10.3745/KIPSTA.2008.15-A.5.259 인용 PDF KSCI

Search Result 65, Processing Time 0.023 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)