• Title/Summary/Keyword: Data-driven Modeling

Search Result 162, Processing Time 0.041 seconds

A Systems Engineering Approach to Predict the Success Window of FLEX Strategy under Extended SBO Using Artificial Intelligence

  • Alketbi, Salama Obaid;Diab, Aya
    • Journal of the Korean Society of Systems Engineering
    • /
    • v.16 no.2
    • /
    • pp.97-109
    • /
    • 2020
  • On March 11, 2011, an earthquake followed by a tsunami caused an extended station blackout (SBO) at the Fukushima Dai-ichi NPP Units. The accident was initiated by a total loss of both onsite and offsite electrical power resulting in the loss of the ultimate heat sink for several days, and a consequent core melt in some units where proper mitigation strategies could not be implemented in a timely fashion. To enhance the plant's coping capability, the Diverse and Flexible Strategies (FLEX) were proposed to append the Emergency Operation Procedures (EOPs) by relying on portable equipment as an additional line of defense. To assess the success window of FLEX strategies, all sources of uncertainties need to be considered, using a physics-based model or system code. This necessitates conducting a large number of simulations to reflect all potential variations in initial, boundary, and design conditions as well as thermophysical properties, empirical models, and scenario uncertainties. Alternatively, data-driven models may provide a fast tool to predict the success window of FLEX strategies given the underlying uncertainties. This paper explores the applicability of Artificial Intelligence (AI) to identify the success window of FLEX strategy for extended SBO. The developed model can be trained and validated using data produced by the lumped parameter thermal-hydraulic code, MARS-KS, as best estimate system code loosely coupled with Dakota for uncertainty quantification. A Systems Engineering (SE) approach is used to plan and manage the process of using AI to predict the success window of FLEX strategies under extended SBO conditions.

Application of POD reduced-order algorithm on data-driven modeling of rod bundle

  • Kang, Huilun;Tian, Zhaofei;Chen, Guangliang;Li, Lei;Wang, Tianyu
    • Nuclear Engineering and Technology
    • /
    • v.54 no.1
    • /
    • pp.36-48
    • /
    • 2022
  • As a valid numerical method to obtain a high-resolution result of a flow field, computational fluid dynamics (CFD) have been widely used to study coolant flow and heat transfer characteristics in fuel rod bundles. However, the time-consuming, iterative calculation of Navier-Stokes equations makes CFD unsuitable for the scenarios that require efficient simulation such as sensitivity analysis and uncertainty quantification. To solve this problem, a reduced-order model (ROM) based on proper orthogonal decomposition (POD) and machine learning (ML) is proposed to simulate the flow field efficiently. Firstly, a validated CFD model to output the flow field data set of the rod bundle is established. Secondly, based on the POD method, the modes and corresponding coefficients of the flow field were extracted. Then, an deep feed-forward neural network, due to its efficiency in approximating arbitrary functions and its ability to handle high-dimensional and strong nonlinear problems, is selected to build a model that maps the non-linear relationship between the mode coefficients and the boundary conditions. A trained surrogate model for modes coefficients prediction is obtained after a certain number of training iterations. Finally, the flow field is reconstructed by combining the product of the POD basis and coefficients. Based on the test dataset, an evaluation of the ROM is carried out. The evaluation results show that the proposed POD-ROM accurately describe the flow status of the fluid field in rod bundles with high resolution in only a few milliseconds.

Long-term ecological monitoring in South Korea: progress and perspectives

  • Jeong Soo Park;Seung Jin Joo;Jaseok Lee;Dongmin Seo;Hyun Seok Kim;Jihyeon Jeon;Chung Weon Yun;Jeong Eun Lee;Sei-Woong Choi;Jae-Young Lee
    • Journal of Ecology and Environment
    • /
    • v.47 no.4
    • /
    • pp.264-271
    • /
    • 2023
  • Environmental crises caused by climate change and human-induced disturbances have become urgent challenges to the sustainability of human beings. These issues can be addressed based on a data-driven understanding and forecasting of ecosystem responses to environmental changes. In this study, we introduce a long-term ecological monitoring system in Korean Long-Term Ecological Research (KLTER), and a plan for the Korean Ecological Observatory Network (KEON). KLTER has been conducted since 2004 and has yielded valuable scientific results. However, the KLTER approach has limitations in data integration and coordinated observations. To overcome these limitations, we developed a KEON plan focused on multidisciplinary monitoring of the physiochemical, meteorological, and biological components of ecosystems to deepen process-based understanding of ecosystem functions and detect changes. KEON aims to answer nationwide and long-term ecological questions by using a standardized monitoring approach. We are preparing three types of observatories: two supersites depending on the climate-vegetation zones, three local sites depending on the ecosystem types, and two mobile deployment platforms to act on urgent ecological issues. The main observation topics were species diversity, population dynamics, biogeochemistry (carbon, methane, and water cycles), phenology, and remote sensing. We believe that KEON can address environmental challenges and play an important role in ecological observations through partnerships with international observatories.

A Study on the Research Trends for Smart City using Topic Modeling (토픽 모델링을 활용한 스마트시티 연구동향 분석)

  • Park, Keon Chul;Lee, Chi Hyung
    • Journal of Internet Computing and Services
    • /
    • v.20 no.3
    • /
    • pp.119-128
    • /
    • 2019
  • This study aims to analyze the research trends on Smart City and to present implications to policy maker, industry professional, and researcher. Cities around globe have undergone the rapid progress in urbanization and the consequent dramatic increase in urban dwellings over the past few decades, and faced many urban problems in such areas as transportation, environment and housing. Cities around the globe are in a hurry to introduce Smart City to pursue a common goal of solving these urban problems and improving the quality of their lives. However, various conceptual approaches to smart city are causing uncertainty in setting policy goals and establishing direction for implementation. The study collected 11,527 papers titled "Smart City(cities)" from the Scopus DB and Springer DB, and then analyze research status, topic, trends based on abstracts and publication date(year) information using the LDA based Topic Modeling approaches. Research topics are classified into three categories(Services, Technologies, and User Perspective) and eight regarding topics. Out of eight topics, citizen-driven innovation is the most frequently referred. Additional topic network analysis reveals that data and privacy/security are the most prevailing topics affecting others. This study is expected to helps understand the trends of Smart City researches and predict the future researches.

Analysis of Research Trends in Korean English Education Journals Using Topic Modeling (토픽 모델링을 활용한 한국 영어교육 학술지에 나타난 연구동향 분석)

  • Won, Yongkook;Kim, Youngwoo
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.4
    • /
    • pp.50-59
    • /
    • 2021
  • To understand the research trends of English education in Korea for the last 20 years from 2000 to 2019, 12 major academic journals in Korea in the field of English education were selected, and bibliographic information of 7,329 articles published in these journals were collected and analyzed. The total number of articles increased from the 2000s to the first half of the 2010s, but decreased somewhat in the late 2010s and the number of publications by journal has become similar. These results show that the overall influence of English education journals has decreased and then leveled in terms of quantity. Next, 34 topics were extracted by applying latent Dirichlet allocation (LDA) topic modeling using the English abstract of the articles. Teacher, word, culture/media, and grammar appeared as topics that were highly studied. Topics such as word, vocabulary, and testing and evaluation appeared through unique keywords, and various topics related to learner factors emerged, becoming topics of interest in English education research. Then, topics were analyzed to determine which ones were rising or falling in frequency. As a result of this analysis, qualitative research, vocabulary, learner factor, and testing were found to be rising topics, while falling topics included CALL, language, teaching, and grammar. This change in research topics shows that research interests in the field of English education are shifting from static research topics to data-driven and dynamic research topics.

Prediction of Correct Answer Rate and Identification of Significant Factors for CSAT English Test Based on Data Mining Techniques (데이터마이닝 기법을 활용한 대학수학능력시험 영어영역 정답률 예측 및 주요 요인 분석)

  • Park, Hee Jin;Jang, Kyoung Ye;Lee, Youn Ho;Kim, Woo Je;Kang, Pil Sung
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.4 no.11
    • /
    • pp.509-520
    • /
    • 2015
  • College Scholastic Ability Test(CSAT) is a primary test to evaluate the study achievement of high-school students and used by most universities for admission decision in South Korea. Because its level of difficulty is a significant issue to both students and universities, the government makes a huge effort to have a consistent difficulty level every year. However, the actual levels of difficulty have significantly fluctuated, which causes many problems with university admission. In this paper, we build two types of data-driven prediction models to predict correct answer rate and to identify significant factors for CSAT English test through accumulated test data of CSAT, unlike traditional methods depending on experts' judgments. Initially, we derive candidate question-specific factors that can influence the correct answer rate, such as the position, EBS-relation, readability, from the annual CSAT practices and CSAT for 10 years. In addition, we drive context-specific factors by employing topic modeling which identify the underlying topics over the text. Then, the correct answer rate is predicted by multiple linear regression and level of difficulty is predicted by classification tree. The experimental results show that 90% of accuracy can be achieved by the level of difficulty (difficult/easy) classification model, whereas the error rate for correct answer rate is below 16%. Points and problem category are found to be critical to predict the correct answer rate. In addition, the correct answer rate is also influenced by some of the topics discovered by topic modeling. Based on our study, it will be possible to predict the range of expected correct answer rate for both question-level and entire test-level, which will help CSAT examiners to control the level of difficulties.

Twitter Issue Tracking System by Topic Modeling Techniques (토픽 모델링을 이용한 트위터 이슈 트래킹 시스템)

  • Bae, Jung-Hwan;Han, Nam-Gi;Song, Min
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.2
    • /
    • pp.109-122
    • /
    • 2014
  • People are nowadays creating a tremendous amount of data on Social Network Service (SNS). In particular, the incorporation of SNS into mobile devices has resulted in massive amounts of data generation, thereby greatly influencing society. This is an unmatched phenomenon in history, and now we live in the Age of Big Data. SNS Data is defined as a condition of Big Data where the amount of data (volume), data input and output speeds (velocity), and the variety of data types (variety) are satisfied. If someone intends to discover the trend of an issue in SNS Big Data, this information can be used as a new important source for the creation of new values because this information covers the whole of society. In this study, a Twitter Issue Tracking System (TITS) is designed and established to meet the needs of analyzing SNS Big Data. TITS extracts issues from Twitter texts and visualizes them on the web. The proposed system provides the following four functions: (1) Provide the topic keyword set that corresponds to daily ranking; (2) Visualize the daily time series graph of a topic for the duration of a month; (3) Provide the importance of a topic through a treemap based on the score system and frequency; (4) Visualize the daily time-series graph of keywords by searching the keyword; The present study analyzes the Big Data generated by SNS in real time. SNS Big Data analysis requires various natural language processing techniques, including the removal of stop words, and noun extraction for processing various unrefined forms of unstructured data. In addition, such analysis requires the latest big data technology to process rapidly a large amount of real-time data, such as the Hadoop distributed system or NoSQL, which is an alternative to relational database. We built TITS based on Hadoop to optimize the processing of big data because Hadoop is designed to scale up from single node computing to thousands of machines. Furthermore, we use MongoDB, which is classified as a NoSQL database. In addition, MongoDB is an open source platform, document-oriented database that provides high performance, high availability, and automatic scaling. Unlike existing relational database, there are no schema or tables with MongoDB, and its most important goal is that of data accessibility and data processing performance. In the Age of Big Data, the visualization of Big Data is more attractive to the Big Data community because it helps analysts to examine such data easily and clearly. Therefore, TITS uses the d3.js library as a visualization tool. This library is designed for the purpose of creating Data Driven Documents that bind document object model (DOM) and any data; the interaction between data is easy and useful for managing real-time data stream with smooth animation. In addition, TITS uses a bootstrap made of pre-configured plug-in style sheets and JavaScript libraries to build a web system. The TITS Graphical User Interface (GUI) is designed using these libraries, and it is capable of detecting issues on Twitter in an easy and intuitive manner. The proposed work demonstrates the superiority of our issue detection techniques by matching detected issues with corresponding online news articles. The contributions of the present study are threefold. First, we suggest an alternative approach to real-time big data analysis, which has become an extremely important issue. Second, we apply a topic modeling technique that is used in various research areas, including Library and Information Science (LIS). Based on this, we can confirm the utility of storytelling and time series analysis. Third, we develop a web-based system, and make the system available for the real-time discovery of topics. The present study conducted experiments with nearly 150 million tweets in Korea during March 2013.

Priority for the Investment of Artificial Rainfall Fusion Technology (인공강우 융합기술 개발을 위한 R&D 투자 우선순위 도출)

  • Lim, Jong Yeon;Kim, KwangHoon;Won, DongKyu;Yeo, Woon-Dong
    • The Journal of the Korea Contents Association
    • /
    • v.19 no.3
    • /
    • pp.261-274
    • /
    • 2019
  • This paper aims to develop an appropriate methodology for establishing an investment strategy for 'demonstration of artificial rainfall technology using UAV' and that include establishment of a technology classification, set of indicators for technology evaluation, suggestion of final key technology as a whole study area. It is designed to complement the latest research trend analysis results and expert committee opinions using quantitative analysis. The key indicators for technology evaluation consisted of three major items (activity, technology, marketability) and 10 detailed indicators. The AHP questionnaire was conducted to analyze the importance of indicators. As a result, it was analyzed that the attribute of the technology itself is most important, and the order of closeness to the implementation of the core function (centrality), feasibility (feasibility). Among the 16 technology groups, top investment priority groups were analyzed as ground seeding, artificial rainfall verification, spreading and diffusion of seeding material, artificial rainfall numerical modeling, and UAV sensor technology.

A Study on Data-driven Modeling Employing Stratification-related Physical Variables for Reservoir Water Quality Prediction (취수원 수질예측을 위한 성층 물리변수 활용 데이터 기반 모델링 연구)

  • Hyeon June Jang;Ji Young Jung;Kyung Won Joo;Choong Sung Yi;Sung Hoon Kim
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2023.05a
    • /
    • pp.143-143
    • /
    • 2023
  • 최근 대청댐('17), 평림댐('19) 등 광역 취수원에서 망간의 먹는 물 수질기준(0.05mg/L 이하) 초과 사례가 발생되어, 다수의 민원이 제기되는 등 취수원의 망간 관리 중요성이 부각되고 있다. 특히, 동절기 전도(Turn-over)시기에 고농도 망간이 발생되는 경우가 많은데, 현재 정수장에서는 망간을 처리하기 위해 유입구간에 필터를 설치하고 주기적으로 교체하는 방식으로 처리하고 있다. 그러나 단기간에 고농도 망간 다량 유입 시 처리용량의 한계 등 정수장에서의 공정관리가 어려워지므로 사전 예측에 의한 대응 체계 고도화가 필요한 실정이다. 본 연구는 광역취수원인 주암댐을 대상으로 망간 예측의 정확도 향상 및 예측기간 확대를 위해 다양한 머신러닝 기법들을 적용하여 비교 분석하였으며, 독립변수 및 초매개변수 최적화를 진행하여 모형의 정확도를 개선하였다. 머신러닝 모형은 수심별 탁도, 저수위, pH, 수온, 전기전도도, DO, 클로로필-a, 기상, 수문 자료 등의 독립변수와 화순정수장에 유입된 망간 농도를 종속변수로 각 변수에 해당하는 실측치를 학습데이터로 사용하였다. 그리고 데이터기반 모형의 정확도를 개선하기 위해서 성층의 수준을 판별하는 지표로서 PEA(Potential Energy Anomaly)를 도입하여 데이터 분석에 활용하고자 하였다. 분석 결과, 망간 유입률은 계절 주기에 따라 농도가 달라지는 것을 확인하였고 동절기 전도시점과 하절기 장마기간 난류생성 시기에 저층의 고농도 망간이 유입이 되는 것을 분석하였다. 또한, 두 시기의 망간 농도의 변화 패턴이 상이하므로 예측 모델은 각 계절별로 구축해 학습을 진행함으로써 예측의 정확도를 향상할 수 있었다. 다양한 머신러닝 모델을 구축하여 성능 비교를 진행한 결과, 동절기에는 Gradient Boosting Machine, 하절기에는 eXtreme Gradient Boosting의 기법이 우수하여 추론 모델로 활용하고자 하였다. 선정 모델을 통한 단기 수질예측 결과, 전도현상 발생 시기에 대한 추종 및 예측력이 기존의 데이터 모형만 적용했을 경우대비 약 15% 이상 예측 효율이 향상된 것으로 나타났다. 본 연구는 머신러닝 모델을 활용한 망간 농도 예측으로 정수장의 신속한 대응 체계 마련을 지원하고, 수처리 공정의 효율성을 높이는 데 기여할 것으로 기대되며, 후속 연구로 과거 시계열 자료 활용 및 물리모형과의 연결 등을 통해 모델의 신뢰성을 제고 할 계획이다.

  • PDF

A Review of Urban Flooding: Causes, Impacts, and Mitigation Strategies (도시 홍수: 원인, 영향 및 저감 전략 고찰)

  • Jin-Yong Lee
    • The Journal of Engineering Geology
    • /
    • v.33 no.3
    • /
    • pp.489-502
    • /
    • 2023
  • Urban floods pose significant challenges to cities worldwide, driven by the interplay between urbanization and climate change. This review examines recent studies of urban floods to understand their causes, impacts, and potential mitigation strategies. Urbanization, with its increase in impermeable surfaces and altered drainage patterns, disrupts natural water flow, exacerbating surface runoff during intense rainfall events. The impacts of urban floods are far-reaching, affecting lives, infrastructure, the economy, and the environment. Loss of life, property damage, disruptions to critical services, and environmental consequences underscore the urgency of effective urban flood management. To mitigate urban floods, integrated flood management strategies are crucial. Sustainable urban planning, green infrastructure, and improved drainage systems play pivotal roles in reducing flood vulnerabilities. Early warning systems, emergency response planning, and community engagement are essential components of flood preparedness and resilience. Looking to the future, climate change projections indicate increased flood risks, necessitating resilience and adaptation measures. Advances in research, data collection, and modeling techniques will enable more accurate flood predictions, thus guiding decision-making. In conclusion, urban flooding demands urgent attention and comprehensive strategies to protect lives, infrastructure, and the economy.