Big Data Astronomy : Let's "PySpark" the Universe

빅데이터 천문학 : PySpark를 이용한 천문자료 분석

  • Published : 2018.05.08

Abstract

The modern large-scale surveys and state-of-the-art cosmological simulations produce various kinds of big data composed of millions and billions of galaxies. Inevitably, we need to adopt modern Big Data platforms to properly handle such large-scale data sets. In my talk, I will briefly introduce the de facto standard of modern Big Data platform, Apache Spark, and present some examples to demonstrate how Apache Spark can be utilized for solving data-driven astronomical problems.

Keywords