• Title/Summary/Keyword: Data-driven Modeling

Search Result 166, Processing Time 0.026 seconds

A review on urban inundation modeling research in South Korea: 2001-2022 (도시침수 모의 기술 국내 연구동향 리뷰: 2001-2022)

  • Lee, Seungsoo;Kim, Bomi;Choi, Hyeonjin;Noh, Seong Jin
    • Journal of Korea Water Resources Association
    • /
    • v.55 no.10
    • /
    • pp.707-721
    • /
    • 2022
  • In this study, a state-of-the-art review on urban inundation simulation technology was presented summarizing major achievements and limitations, and future research recommendations and challenges. More than 160 papers published in major domestic academic journals since the 2000s were analyzed. After analyzing the core themes and contents of the papers, the status of technological development was reviewed according to simulation methodologies such as physically-based and data-driven approaches. In addition, research trends for application purposes and advances in overseas and related fields were analyzed. Since more than 60% of urban inundation research used Storm Water Management Model (SWMM), developing new modeling techniques for detailed physical processes of dual drainage was encouraged. Data-based approaches have become a new status quo in urban inundation modeling. However, given that hydrological extreme data is rare, balanced research development of data and physically-based approaches was recommended. Urban inundation analysis technology, actively combined with new technologies in other fields such as artificial intelligence, IoT, and metaverse, would require continuous support from society and holistic approaches to solve challenges from climate risk and reduce disaster damage.

Analysis on Strategies for Modeling the Wave Equation with Physics-Informed Neural Networks (물리정보신경망을 이용한 파동방정식 모델링 전략 분석)

  • Sangin Cho;Woochang Choi;Jun Ji;Sukjoon Pyun
    • Geophysics and Geophysical Exploration
    • /
    • v.26 no.3
    • /
    • pp.114-125
    • /
    • 2023
  • The physics-informed neural network (PINN) has been proposed to overcome the limitations of various numerical methods used to solve partial differential equations (PDEs) and the drawbacks of purely data-driven machine learning. The PINN directly applies PDEs to the construction of the loss function, introducing physical constraints to machine learning training. This technique can also be applied to wave equation modeling. However, to solve the wave equation using the PINN, second-order differentiations with respect to input data must be performed during neural network training, and the resulting wavefields contain complex dynamical phenomena, requiring careful strategies. This tutorial elucidates the fundamental concepts of the PINN and discusses considerations for wave equation modeling using the PINN approach. These considerations include spatial coordinate normalization, the selection of activation functions, and strategies for incorporating physics loss. Our experimental results demonstrated that normalizing the spatial coordinates of the training data leads to a more accurate reflection of initial conditions in neural network training for wave equation modeling. Furthermore, the characteristics of various functions were compared to select an appropriate activation function for wavefield prediction using neural networks. These comparisons focused on their differentiation with respect to input data and their convergence properties. Finally, the results of two scenarios for incorporating physics loss into the loss function during neural network training were compared. Through numerical experiments, a curriculum-based learning strategy, applying physics loss after the initial training steps, was more effective than utilizing physics loss from the early training steps. In addition, the effectiveness of the PINN technique was confirmed by comparing these results with those of training without any use of physics loss.

Implementation of the Ensemble Kalman Filter to a Double Gyre Ocean and Sensitivity Test using Twin Experiments (Double Gyre 모형 해양에서 앙상블 칼만필터를 이용한 자료동화와 쌍둥이 실험들을 통한 민감도 시험)

  • Kim, Young-Ho;Lyu, Sang-Jin;Choi, Byoung-Ju;Cho, Yang-Ki;Kim, Young-Gyu
    • Ocean and Polar Research
    • /
    • v.30 no.2
    • /
    • pp.129-140
    • /
    • 2008
  • As a preliminary effort to establish a data assimilative ocean forecasting system, we reviewed the theory of the Ensemble Kamlan Filter (EnKF) and developed practical techniques to apply the EnKF algorithm in a real ocean circulation modeling system. To verify the performance of the developed EnKF algorithm, a wind-driven double gyre was established in a rectangular ocean using the Regional Ocean Modeling System (ROMS) and the EnKF algorithm was implemented. In the ideal ocean, sea surface temperature and sea surface height were assimilated. The results showed that the multivariate background error covariance is useful in the EnKF system. We also tested the sensitivity of the EnKF algorithm to the localization and inflation of the background error covariance and the number of ensemble members. In the sensitivity tests, the ensemble spread as well as the root-mean square (RMS) error of the ensemble mean was assessed. The EnKF produces the optimal solution as the ensemble spread approaches the RMS error of the ensemble mean because the ensembles are well distributed so that they may include the true state. The localization and inflation of the background error covariance increased the ensemble spread while building up well-distributed ensembles. Without the localization of the background error covariance, the ensemble spread tended to decrease continuously over time. In addition, the ensemble spread is proportional to the number of ensemble members. However, it is difficult to increase the ensemble members because of the computational cost.

Changes in Teaching Practices of Elementary School Teachers in Scientific Modeling Classes: Focused on Modeling Pedagogical Content Knowledge (PCK) (과학 모델링 수업에서 나타난 초등 교사의 수업 실행 변화 -모델링 PCK를 중심으로-)

  • Uhm, Janghee;Kim, Heui-Baik
    • Journal of The Korean Association For Science Education
    • /
    • v.40 no.5
    • /
    • pp.543-563
    • /
    • 2020
  • This study explores how the teaching practices of two teachers changed during scientific modeling classes. It also aims to understand these changes in terms of the teachers' modeling pedagogical content knowledge (PCK) development. The study participants were two elementary school teachers and their fifth-grade students. The teachers taught eight lessons of scientific modeling classes about the human body. The data analysis was conducted for lessons 1-2 and 7-8, which best showed the change in teaching practice. The two teachers' teaching practices were analyzed in terms of feedback frequency, feedback content, and the time allocated for each stage of model generation, evaluation, and modification. Teacher A led the evaluation and modification stages in a teacher-driven way throughout the classes. In terms of feedback, teacher A mainly used answer evaluation feedback in lesson 1-2; however, in lesson 7-8, the feedback content changed to thought-provoking feedback. Meanwhile, teacher B mostly led a teacher-driven model evaluation and modification in lesson 1-2; however, in lesson 7-8, she let her students lead the model evaluation and modification stages and helped them develop models through various feedbacks. The analysis shows that these teaching changes were related to the development of modeling PCK components. Furthermore, the two teachers' modeling PCK differed in teaching orientation, in understanding the modeling stages, and in recognizing the value of modeling, suggesting the importance of these in modeling teaching practice. This study can help improve the understanding of modeling classes by revealing the relationship between teaching practices and modeling PCK.

A Partition Technique of UML-based Software Models for Multi-Processor Embedded Systems (멀티프로세서용 임베디드 시스템을 위한 UML 기반 소프트웨어 모델의 분할 기법)

  • Kim, Jong-Phil;Hong, Jang-Eui
    • The KIPS Transactions:PartD
    • /
    • v.15D no.1
    • /
    • pp.87-98
    • /
    • 2008
  • In company with the demand of powerful processing units for embedded systems, the method to develop embedded software is also required to support the demand in new approach. In order to improve the resource utilization and system performance, software modeling techniques have to consider the features of hardware architecture. This paper proposes a partitioning technique of UML-based software models, which focus the generation of the allocatable software components into multiprocessor architecture. Our partitioning technique, at first, transforms UML models to CBCFGs(Constraint-Based Control Flow Graphs), and then slices the CBCFGs with consideration of parallelism and data dependency. We believe that our proposition gives practical applicability in the areas of platform specific modeling and performance estimation in model-driven embedded software development.

Comparison of event tree/fault tree and convolution approaches in calculating station blackout risk in a nuclear power plant

  • Man Cheol Kim
    • Nuclear Engineering and Technology
    • /
    • v.56 no.1
    • /
    • pp.141-146
    • /
    • 2024
  • Station blackout (SBO) risk is one of the most significant contributors to nuclear power plant risk. In this paper, the sequence probability formulas derived by the convolution approach are compared with those derived by the conventional event tree/fault tree (ET/FT) approach for the SBO situation in which emergency diesel generators fail to start. The comparison identifies what makes the ET/FT approach more conservative and raises the issue regarding the mission time of a turbine-driven auxiliary feedwater pump (TDP), which suggests a possible modeling improvement in the ET/FT approach. Monte Carlo simulations with up-to-date component reliability data validate the convolution approach. The sequence probability of an alternative alternating current diesel generator (AAC DG) failing to start and the TDP failing to operate owing to battery depletion contributes most to the SBO risk. The probability overestimation of the scenario in which the AAC DG fails to run and the TDP fails to operate owing to battery depletion contributes most to the SBO risk overestimation determined by the ET/FT approach. The modification of the TDP mission time renders the sequence probabilities determined by the ET/FT approach more consistent with those determined by the convolution approach.

Development of an integrated Web-based system with a pile load test database and pre-analyzed data

  • Chen, Yit-Jin;Liao, Ming-Ru;Lin, Shiu-Shin;Huang, Jen-Kai;Marcos, Maria Cecilia M.
    • Geomechanics and Engineering
    • /
    • v.7 no.1
    • /
    • pp.37-53
    • /
    • 2014
  • A Web-based pile load test (WBPLT) system was developed and implemented in this study. Object-oriented and concept-based software design techniques were adopted to integrate the pile load test database into the system. A total of 673 case histories of pile load test were included in the database. The data consisted of drilled shaft and driven precast concrete pile axial load tests in drained, undrained, and gravel loading conditions as well as pre-analyzed data and back-calculated design parameters. Unified modeling language, a standard software design tool, was utilized to design the WBPLT system architecture with five major concept-based components. These components provide the static structure and dynamic behavior of system message flows in a visualized manner. The open-source Apache Web server is the building block of the WBPLT system, and PHP Web programming language implements the operation of the WBPLT components, particularly the automatic translation of user query into structured query language. A simple search and inexpensive query can be implemented through the Internet browser. The pile load test database is helpful, and data can be easily retrieved and utilized worldwide for research and advanced applications.

Development of Dam Inflow Simulation Method Based on Bayesian Autoregressive Exogenous Stochastic Volatility (ARXSV) model

  • Fabian, Pamela Sofia;Kim, Ho-Jun;Kim, Ki-Chul;Kwon, Hyun-Han
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2022.05a
    • /
    • pp.437-437
    • /
    • 2022
  • The prediction of dam inflow rate is crucial for the management of the largest multi-purpose dam in South Korea, the Soyang Dam. The main issue associated with the management of water resources is the stochastic nature of the reservoir inflow leading to an increase in uncertainty associated with the inflow prediction. The Autoregressive (AR) model is commonly used to provide the simulation and forecast of hydrometeorological data. However, because its estimation is based solely on the time-series data, it has the disadvantage of being unable to account for external variables such as climate information. This study proposes the use of the Autoregressive Exogenous Stochastic Volatility (ARXSV) model within a Bayesian modeling framework for increased predictability of the monthly dam inflow by addressing the exogenous and stochastic factors. This study analyzes 45 years of hydrological input data of the Soyang Dam from the year 1974 to 2019. The result of this study will be beneficial to strengthen the potential use of data-driven models for accurate inflow predictions and better reservoir management.

  • PDF

Discovery to Human Disease Research: Proteo-Metabolomics Analysis

  • Minjoong Joo;Jeong-Hun Mok;Van-An Duong;Jong-Moon Park;Hookeun Lee
    • Mass Spectrometry Letters
    • /
    • v.15 no.2
    • /
    • pp.69 -78
    • /
    • 2024
  • The advancement of high-throughput omics technologies and systems biology is essential for understanding complex biological mechanisms and diseases. The integration of proteomics and metabolomics provides comprehensive insights into cellular functions and disease pathology, driven by developments in mass spectrometry (MS) technologies, including electrospray ionization (ESI). These advancements are crucial for interpreting biological systems effectively. However, integrating these technologies poses challenges. Compared to genomic, proteomics and metabolomics have limitations in throughput, and data integration. This review examines developments in MS equipped electrospray ionization (ESI), and their importance in the effective interpretation of biological mechanisms. The review also discusses developments in sample preparation, such as Simultaneous Metabolite, Protein, Lipid Extraction (SIMPLEX), analytical techniques, and data analysis, highlighting the application of these technologies in the study of cancer or Huntington's disease, underscoring the potential for personalized medicine and diagnostic accuracy. Efforts by the Clinical Proteomic Tumor Analysis Consortium (CPTAC) and integrative data analysis methods such as O2PLS and OnPLS extract statistical similarities between metabolomic and proteomic data. System modeling techniques that mathematically explain and predict system responses are also covered. This practical application also shows significant improvements in cancer research, diagnostic accuracy and therapeutic targeting for diseases like pancreatic ductal adenocarcinoma, non-small cell lung cancer, and Huntington's disease. These approaches enable researchers to develop standardized protocols, and interoperable software and databases, expanding multi-omics research application in clinical practice.

Uncertainty Sequence Modeling Approach for Safe and Effective Autonomous Driving (안전하고 효과적인 자율주행을 위한 불확실성 순차 모델링)

  • Yoon, Jae Ung;Lee, Ju Hong
    • Smart Media Journal
    • /
    • v.11 no.9
    • /
    • pp.9-20
    • /
    • 2022
  • Deep reinforcement learning(RL) is an end-to-end data-driven control method that is widely used in the autonomous driving domain. However, conventional RL approaches have difficulties in applying it to autonomous driving tasks due to problems such as inefficiency, instability, and uncertainty. These issues play an important role in the autonomous driving domain. Although recent studies have attempted to solve these problems, they are computationally expensive and rely on special assumptions. In this paper, we propose a new algorithm MCDT that considers inefficiency, instability, and uncertainty by introducing a method called uncertainty sequence modeling to autonomous driving domain. The sequence modeling method, which views reinforcement learning as a decision making generation problem to obtain high rewards, avoids the disadvantages of exiting studies and guarantees efficiency, stability and also considers safety by integrating uncertainty estimation techniques. The proposed method was tested in the OpenAI Gym CarRacing environment, and the experimental results show that the MCDT algorithm provides efficient, stable and safe performance compared to the existing reinforcement learning method.