• Title/Summary/Keyword: National Research Data Platform(DataON)

An Implementation of Web-Enabled OLAP Server in Korean HealthCare BigData Platform (한국 보건의료 빅데이터 플랫폼에서 웹 기반 OLAP 서버 구현)

  • Ly, Pichponreay;Kim, Jin-hyuk;Jung, Seung-hyun;Lee, Kyung-hee;Cho, Wan-sup
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2017.05a
    • /
    • pp.33-34
    • /
    • 2017
  • In 2015, the Ministry of Health and Welfare of Korea announced a research and development plan to use Korean healthcare data to support decision making, reduce costs, and improve treatment. The project relies on BigData technologies such as Apache Hadoop and Apache Spark to store and process healthcare data from various institutions. Here we present the design and implementation of an OLAP server in the Korean HealthCare BigData platform. This approach establishes a basis for promoting personalized healthcare research for decision making, disease forecasting, and the development of customized diagnosis and treatment.

Semantic-based Mashup Platform for Contents Convergence

  • Yongju Lee;Hongzhou Duan;Yuxiang Sun
    • International journal of advanced smart convergence
    • /
    • v.12 no.2
    • /
    • pp.34-46
    • /
    • 2023
  • The growing number of large-scale knowledge graphs raises the issue of how knowledge graph data can be organized, discovered, and integrated efficiently. We present a novel semantic-based mashup platform for contents convergence, which consists of acquisition, RDF storage, ontology learning, and mashup subsystems. This platform serves as a basis for developing other, more sophisticated applications required in the area of knowledge big data. Moreover, this paper proposes an entity matching method using graph convolutional network techniques as preliminary work for automatic classification and discovery on knowledge big data. Using the real DBP15K and SRPRS datasets, the performance of our method is compared with some existing entity matching methods. The experimental results show that the proposed method outperforms existing methods by increasing accuracy and reducing training time.

Development of Big-data Management Platform Considering Docker Based Real Time Data Connecting and Processing Environments (도커 기반의 실시간 데이터 연계 및 처리 환경을 고려한 빅데이터 관리 플랫폼 개발)

  • Kim, Dong Gil;Park, Yong-Soon;Chung, Tae-Yun
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.16 no.4
    • /
    • pp.153-161
    • /
    • 2021
  • Handling continuous, unstructured data requires real-time access and management that stays flexible under dynamic conditions. A platform can be built to collect, store, and process data on a single local server or on multiple servers. The former, centralized method is easy to control but creates an overload problem because all processing runs in one unit; the latter, distributed method performs parallel processing, so it responds quickly and scales system capacity easily, but its design is complex. This paper provides data collection and processing on one platform, built in the distributed manner, to derive significant insights from the various data held by an enterprise or agency; the results are available intuitively on dashboards, and Spark is used to improve distributed processing performance. All services are distributed and managed with Docker. All of the data used in this study was collected from Kafka. For a 4.4-gigabyte file, data processing in Spark cluster mode took 2 minutes 15 seconds, about 3 minutes 19 seconds faster than in local mode.
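
The timing figures in this abstract imply a local-mode run time and a speedup that the abstract does not state directly. The short calculation below derives them from the quoted numbers only; the 1 GB = 1000 MB conversion and all variable names are assumptions made for illustration, not values from the paper.

```python
# Figures quoted in the abstract for a 4.4 GB input file.
cluster_s = 2 * 60 + 15          # Spark cluster mode: 2 min 15 s
saving_s = 3 * 60 + 19           # cluster mode is about 3 min 19 s faster
local_s = cluster_s + saving_s   # implied local-mode time, in seconds

# Derived quantities (not reported in the abstract itself).
speedup = local_s / cluster_s

file_mb = 4.4 * 1000             # assumption: 1 GB = 1000 MB
throughput_mb_s = file_mb / cluster_s

print(f"implied local-mode time: {local_s // 60} min {local_s % 60} s")
print(f"speedup of cluster over local mode: {speedup:.2f}x")
print(f"cluster-mode throughput: {throughput_mb_s:.1f} MB/s")
```

Under these assumptions, local mode would have taken roughly five and a half minutes, so cluster mode is about two and a half times faster on this file.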

Integrated Platform on the Basis of Heterogeneous Data to Support the Establishment of an Innovative Ecosystem for National High-Performance Computing: Focusing on Life Science & Public Health Area (국가 초고성능컴퓨팅 혁신 생태계 구축 지원을 위한 이종데이터 기반 통합 플랫폼: 생명·보건분야를 중심으로)

  • Do-Yeon Lee;Myoung-Ju Koh;Jae-Gyoon Hahm;Keun-Hwan Kim
    • Journal of the Korean Society of Industry Convergence
    • /
    • v.26 no.1
    • /
    • pp.1-14
    • /
    • 2023
  • To secure future national competitiveness, the Korean government announced the 『National Ultra-High Performance Computing (HPC) Innovation Strategy (2021.5.28.)』 and set three innovation strategy goals for establishing an innovation ecosystem. This study presented a heterogeneous data-based strategic support framework for understanding the current status of both domestic and foreign R&D areas and domestic industrial economy areas in strategic fields related to ultra-high performance computing, and conducted empirical research in the life science and public health area. The HPC innovation ecosystem platform presented in this study, based on the connection of heterogeneous data (domestic R&D project-technology-industry-overseas R&D project), provided useful and essential information for establishing a specific action plan for the national HPC innovation strategy and for vitalizing the innovation ecosystem. Since evidence-based policy assumes that a more reasonable consensus is reached through a non-biased decision-making process among stakeholders, the proposed platform may contribute to enhancing policy momentum by increasing the legitimacy of, and trust in, the planning of the national HPC strategy.

Design and Implementation of IoT-Based Intelligent Platform for Water Level Monitoring (IoT 기반 지능형 수위 모니터링 플랫폼 설계 및 구현)

  • Park, Jihoon;Kang, Moon Seong;Song, Jung-Hun;Jun, Sang Min
    • Journal of Korean Society of Rural Planning
    • /
    • v.21 no.4
    • /
    • pp.177-186
    • /
    • 2015
  • The main objective of this study was to assess the applicability of IoT (Internet of Things)-based flood management under climate change by developing an intelligent IoT-based water level monitoring platform. The Arduino Uno, an open-source electronics platform, was selected as the development board and was connected to an ultrasonic sensor, a temperature sensor, and a data logger shield to implement IoT. The Arduino IDE (Integrated Development Environment) was used to develop the intelligent algorithm that measures and calibrates the real-time water level automatically. The intelligent water level monitoring platform consists of water level measurement, temperature calibration, data calibration, stage-discharge relationship, and data logger algorithms. The water level measurement and temperature calibration algorithms corrected the bias inherent in the ultrasonic sensor. The data calibration algorithm analyzed and corrected outliers that occurred during measurement. The intelligent water level measurement algorithm was verified by comparing water levels measured with a tape and with the ultrasonic sensor at regular intervals up to the maximum level. The slope of the regression line and $R^2$ were 1.00 and 0.99, respectively, which were considered acceptable. The error was 0.0575 cm. The data calibration algorithm was verified by analyzing water levels containing all error codes in a time series graph. The intelligent platform developed in this study may contribute to public IoT services applicable to intelligent flood management under climate change.
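
The abstract mentions temperature calibration of an ultrasonic sensor but gives no formula. As a minimal sketch of how such a correction commonly works, the code below uses the standard linear approximation for the speed of sound in air and assumes a sensor mounted at a known height above the channel bed, pointing down at the water surface. The function names, the mounting geometry, and the example values are hypothetical and are not taken from the paper.

```python
def speed_of_sound_m_s(temp_c: float) -> float:
    """Approximate speed of sound in dry air (m/s) at temp_c degrees Celsius."""
    return 331.3 + 0.606 * temp_c


def water_level_cm(echo_time_s: float, temp_c: float, sensor_height_cm: float) -> float:
    """Water level from the round-trip echo time of a downward-facing sensor.

    sensor_height_cm is the assumed height of the sensor above the channel bed.
    """
    # The pulse travels to the surface and back, so halve the round-trip path.
    distance_cm = speed_of_sound_m_s(temp_c) * echo_time_s / 2 * 100
    return sensor_height_cm - distance_cm


# Hypothetical reading: the surface is 50 cm below a sensor mounted at 100 cm,
# and the air temperature is 20 degrees Celsius.
echo = 2 * 0.5 / speed_of_sound_m_s(20.0)
corrected = water_level_cm(echo, 20.0, 100.0)
uncorrected = water_level_cm(echo, 0.0, 100.0)   # ignores the real temperature
print(f"corrected level: {corrected:.2f} cm, uncorrected: {uncorrected:.2f} cm")
```

Ignoring air temperature in this hypothetical case biases the reading by more than a centimetre, which illustrates why a temperature sensor would be paired with the ultrasonic sensor.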

CANVAS: A Cloud-based Research Data Analytics Environment and System

  • Kim, Seongchan;Song, Sa-kwang
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.10
    • /
    • pp.117-124
    • /
    • 2021
  • In this paper, we propose CANVAS (Creative ANalytics enVironment And System), an analytics system of the National Research Data Platform (DataON). CANVAS is a personalized analytics cloud service for researchers who need computing resources and tools for research data analysis. CANVAS is designed for scalability on a micro-services architecture and is built on open-source software such as the eGovernment Standard framework (Spring framework), Kubernetes, and JupyterLab. The system provides personalized analytics environments to multiple users, enabling high-speed, large-capacity analysis on high-performance cloud infrastructure (CPU/GPU). More specifically, data can be modeled and processed in JupyterLab or in a GUI workflow environment. Since CANVAS shares data with DataON, research data registered or downloaded by users can be processed directly in CANVAS. As a result, CANVAS makes data analysis more convenient for DataON users and contributes to the sharing and utilization of research data.

Review of Artificial Intelligence Platform Policies and Strategies in South Korea, United States, China and the European Union Using National Innovation Capacity

  • Park, Mun-Su;Chang, Soonwoo Daniel
    • International Journal of Knowledge Content Development & Technology
    • /
    • v.12 no.3
    • /
    • pp.79-99
    • /
    • 2022
  • South Korea is at an important juncture in its history: it must decide whether to continue investing to become a first mover in artificial intelligence (A.I.) platform technology or remain a fast follower. This paper compares South Korea's A.I. platform capacity with that of the United States, China and the European Union by reviewing publicly available documents and reports on A.I. platform strategies and policies using the three elements of national innovation capacity: common innovation infrastructure, cluster-specific conditions, and quality of linkages. This paper found three major areas on which the South Korean government can focus in the A.I. platform industry. First, South Korea needs to increase its investment in the A.I. field and expand its public-private collaboration activities. Second, unlike the U.S. and the U.K., South Korea lacks data protection policies. Third, South Korea needs to build a high-performance system and environment to experiment with artificial intelligence technology and big data.

A Study on the Improvement of 3D Building Data Format for Spatial Information Open Platform (공간정보 오픈플랫폼 3차원 건물데이터 포맷 개선방안 연구)

  • Kim, Hyeon Deok;Kang, Ji Hun;Kim, Hak Joon
    • Journal of Korean Society for Geospatial Information Science
    • /
    • v.25 no.1
    • /
    • pp.63-70
    • /
    • 2017
  • The spatial information open platform releases national spatial data to provide services that the public can use freely. Recently, demand for high-quality 3D geospatial information and indoor spatial information has been increasing. However, the open platform cannot provide seamless service because indoor and outdoor spatial data use different formats and storage structures. In addition, the 3D data format used in the current service does not reflect recent changes in the service environment or new technology. Therefore, in this study, we propose a new 3D data format for the service to improve the interoperability and service of open platform 3D data. The proposed format is lighter than the existing format and improves rendering speed.

Analysis of Computational Science and Engineering SW Data Format for Multi-physics and Visualization

  • Ryu, Gimyeong;Kim, Jaesung;Lee, Jongsuk Ruth
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.14 no.2
    • /
    • pp.889-906
    • /
    • 2020
  • Analysis of multi-physics systems and the visualization of simulation data are crucial and difficult in computational science and engineering. In Korea, the Korea Institute of Science and Technology Information (KISTI) developed EDISON, a web-based computational science simulation platform, and the service is now in its ninth year. Hitherto, the EDISON platform has focused on providing a robust simulation environment and various computational science analysis tools. However, owing to the increasing issues in collaborative research, data format standardization has become more important. In addition, as the visualization of simulation data becomes more important for users' understanding, the need to analyze the input/output data information of each software package has increased. Therefore, it is necessary to organize the data format and metadata for the representative software provided by EDISON. In this paper, we analyzed computational fluid dynamics (CFD) and computational structural dynamics (CSD) simulation software in the field of mechanical engineering, in which several physical phenomena (fluids, solids, etc.) interact in complex ways. Additionally, to visualize various simulation result data, we used existing web visualization tools developed by third parties. In conclusion, the analysis of these data formats provides a foundation for multi-physics and a web-based visualization environment, which will enable users to focus on simulation more conveniently.

Development of Platform for Connection of Electronic Power Backbone based on D-TRS (D-TRS 기반 전력기간망 접속을 위한 게이트웨이 플랫폼 개발)

  • Song, Byeong-Kwon;Lee, Sang-Hun;Jeong, Tae-Eui;Kim, Gun-Woong;Kim, Jin-Chul;Kim, Young-Eok
    • Proceedings of the KIEE Conference
    • /
    • 2008.11a
    • /
    • pp.382-384
    • /
    • 2008
  • D-TRS is a wireless communication method in which multiple users share several frequency channels. TETRA, a D-TRS technology, is not a leased network, so using a TETRA network has a cost advantage over a leased CDMA network. The master server of a SCADA (Supervisory Control And Data Acquisition) system supervises and controls the control system or RTUs (Remote Terminal Units) and acquires data from them in real time. This paper develops and proposes a gateway platform for the electric power backbone network based on D-TRS. The gateway converts DNP3.0 messages into TETRA PDUs and TETRA PDUs back into DNP3.0 messages. Using this gateway platform, the master server and FRTUs can send and receive DNP3.0 messages over the TETRA network.
