Browse > Article
http://dx.doi.org/10.9708/jksci.2021.26.10.117

CANVAS: A Cloud-based Research Data Analytics Environment and System  

Kim, Seongchan (Dept. of Machine Learning Data Research, KISTI, Dept. of Data & High Performance Computing Science, UST-KISTI)
Song, Sa-kwang (Research Data Commons Team, KISTI, Dept. of Data & High Performance Computing Science, UST-KISTI)
Abstract
In this paper, we propose CANVAS (Creative ANalytics enVironment And System), an analytics system of the National Research Data Platform (DataON). CANVAS is a personalized analytics cloud service for researchers who need computing resources and tools for research data analysis. CANVAS is designed in consideration of scalability based on micro-services architecture and was built on top of open-source software such as eGovernment Standard framework (Spring framework), Kubernetes, and JupyterLab. The built system provides personalized analytics environments to multiple users, enabling high-speed and large-capacity analysis by utilizing high-performance cloud infrastructure (CPU/GPU). More specifically, modeling and processing data is possible in JupyterLab or GUI workflow environment. Since CANVAS shares data with DataON, the research data registered by users or downloaded data can be directly processed in the CANVAS. As a result, CANVAS enhances the convenience of data analysis for users in DataON and contributes to the sharing and utilization of research data.
Keywords
CANVAS; Cloud Analysis Environment; Korea Research Data Platform; DataON; Workflow; JupyterLab;
Citations & Related Records
연도 인용수 순위
  • Reference
1 M. Cho, H. Yim, S. Kim, J. Kim, S.-K. Song, "National Research Data Platform Service Strategy for Pandemic Response," Proceedings of the Korea Computer Congress. pp. 1034-1035, Dec. 2020.
2 National Science and Technology Council, "Strategies for sharing and utilizing research data to promote innovative growth," Jan. 2018.
3 National Science and Technology Research Society (NST), "Science and technology contribution (research) development plan," Jan. 2018.
4 M. Choi, and S. Lee, "Research Data Management Status of Science and Technology Research Institutes in Korea," Data Science Journal, Vol. 19, No. 1, pp. 1-11, Aug. 2020. DOI: 10.5334/dsj-2020-029   DOI
5 M. Cho, S. Kim, J. Kim, S.-K. Song, "Development of Pilot Service for Sharing Research Data," Proceedings of the Korea Computer Congress, pp. 1132-1134. Korea, Dec. 2019.
6 National Information Socient Agency (NIA), AI Hub, https://aihub.or.kr/
7 J.-H. Kim, "Development of Clinical Analytics Platform for the Sharing and Analysis of Clinical Cases," The Journal of Next-Generation Convergence Technology Association, Vol. 4, No. 1, pp. 5-14, Feb. 2020. DOI: 10.33097/jncta.2020.04.01.5   DOI
8 National Information Socient Agency (NIA), Public Data Portal, https://www.data.go.kr/
9 KOREA Data Agency, Data Store, https://www.datastore.or.kr/
10 KDX. Korea Data Exchange, https://kdx.kr/main
11 Seoul Metropolitan Government, Seoul Open Data Plaza, https://data.seoul.go.kr/
12 J.-S. Lim, S. Kim, S. H. Bae, K.-H. Kim, D.-K. Won, "Research on regional spatial information analysis platform about NTIS raw data," Journal of Cadastre & Land InformatiX, Vol. 50, No. 2, pp. 21-35, Dec. 2020. DOI: 10.22640/LXSIRI.2020.50.2.21   DOI
13 European Open Science Cloud (EOSC), https://eosc-portal.eu/
14 Austrailian Research Data Commons (ARDC), https://ardc.edu.au/