• Title/Summary/Keyword: 기계학습식 모형

Search Result 10, Processing Time 0.036 seconds

Application of AI technology for various disaster analysis (다양한 재해분석을 위한 AI 기술적용 사례 소개)

  • Giha Lee;Xuan-Hien Le;Van-Giang Nguyen;Van-Linh Ngyen;Sungho Jung
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2023.05a
    • /
    • pp.97-97
    • /
    • 2023
  • 최근 재해분야에서 인공신경망(ANN), 기계학습(ML), 딥러닝(DL) 등 AI 기술이 활용성이 점차 증가하고 있으며, 센싱정보와 연계한 시설물 안전관리, 원격탐사와 연계한 재해감시(녹조, 산사태, 산불 등), 수문시계열(수위, 유량 등) 예측, 레이더·위성강수 자료의 보정과 예측, 상하수도 관망누수예측 등 다양한 분야에서 AI 기술이 적용되고 그 활용성이 검증된 바 있다. 본 연구에서는 ML, DL, 물리기반신경망(Pysics-informed Neural Networks, PINNs)을 이용한 다양한 재해분석 사례를 소개하고, 그 활용성과 한계에 대해서 논의하고자 한다. 주요사례로는 (1) SAR영상과 기계학습을 이용한 재해피해지역(울진 산불) 감지, (2) 국가 디지털 정보를 이용한 산사태 위험지역 판별(인제 산사태) (3) 기계학습 및 딥러닝 기법을 이용한 위성강수 자료의 보정·예측 및 유출해석, (4) 수리해석을 위한 수치해석분야에서의 PINNs의 적용성(1차원 Saint-Venant 식 해석) 평가 연구결과를 공유한다. 특히, 자료의 입·출력 자료만으로 학습된 인공신경망 모형 대신 지배방정식(물리방정식)을 만족하도록 강제한 PINNs의 경우, 인공신경망 모형보다 우수한 모의능력을 보여주었으며, 향후 복잡한 수리모델링 등 수치해석분야에서 그 활용가능성이 매우 높을 것으로 판단된다.

  • PDF

A Study on the Predictions of Wave Breaker Index in a Gravel Beach Using Linear Machine Learning Model (선형기계학습모델을 이용한 자갈해빈상에서의 쇄파지표 예측)

  • Eul-Hyuk Ahn;Young-Chan Lee;Do-Sam Kim;Kwang-Ho Lee
    • Journal of Korean Society of Coastal and Ocean Engineers
    • /
    • v.36 no.2
    • /
    • pp.37-49
    • /
    • 2024
  • To date, numerous empirical formulas have been proposed through hydraulic model experiments to predict the wave breaker index, including wave height and depth of wave breaking, due to the inherent complexity of generation mechanisms. Unfortunately, research on the characteristics of wave breaking and the prediction of the wave breaker index for gravel beaches has been limited. This study aims to forecast the wave breaker index for gravel beaches using representative linear-based machine learning techniques known for their high predictive performance in regression or classification problems across various research fields. Initially, the applicability of existing empirical formulas for wave breaker indices to gravel seabeds was assessed. Various linear-based machine learning algorithms were then employed to build prediction models, aiming to overcome the limitations of existing empirical formulas in predicting wave breaker indices for gravel seabeds. Among the developed machine learning models, a new calculation formula for easily computable wave breaker indices based on the model was proposed, demonstrating high predictive performance for wave height and depth of wave breaking on gravel beaches. The study validated the predictive capabilities of the proposed wave breaker indices through hydraulic model experiments and compared them with existing empirical formulas. Despite its simplicity as a polynomial, the newly proposed empirical formula for wave breaking indices in this study exhibited exceptional predictive performance for gravel beaches.

A Comparison of Machine Learning Species Distribution Methods for Habitat Analysis of the Korea Water Deer (Hydropotes inermis argyropus) (고라니 서식지 분석을 위한 기계학습식 종분포모형 비교)

  • Song, Won-Kyong;Kim, Eun-Young
    • Korean Journal of Remote Sensing
    • /
    • v.28 no.1
    • /
    • pp.171-180
    • /
    • 2012
  • The field of wildlife habitat conservation research has attracted attention as integrated biodiversity management strategies. Considering the status of the species surveying data and the environmental variables in Korea, the GARP and Maxent models optimized for presence-only data could be one of the most suitable models in habitat modeling. For make sure applicability in the domestic environment we applied the machine learning species distribution model for analyzing habitats of the Korea water deer($Hydropotes$ $inermis$ $argyropus$) in the $Sapgyocheon$ watershed, $Chungcheong$ province. We used the $3^{rd}$ National Natural Environment Survey data and 10 environment variables by literature review for the modelling. Analysis results showed that habitats for the Korea water deer were predicted 16.3%(Maxent) and 27.1%(GARP), respectively. In terms of accuracy(training/test) the Maxent(0.85/0.69) was higher than the GARP(0.65/0.61), and the Spearman's rank correlation coefficient result of the Maxent(${\rho}$=0.71, p<0.01) was higher than the result of GARP(${\rho}$=0.55, p<0.05). However results could be depended on sites and target species, therefore selection of the appropriate model considering on the situation will be important to analyzing habitats.

Development of Suspended Sediment Concentration Measurement Technique Based on Hyperspectral Imagery with Optical Variability (분광 다양성을 고려한 초분광 영상 기반 부유사 농도 계측 기법 개발)

  • Kwon, Siyoon;Seo, Il Won
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2021.06a
    • /
    • pp.116-116
    • /
    • 2021
  • 자연 하천에서의 부유사 농도 계측은 주로 재래식 채집방식을 활용한 직접계측 방식에 의존하여 비용과 시간이 많이 소요되며 점 계측 방식으로 고해상도의 시공간 자료를 측정하기엔 한계가 존재한다. 이러한 한계점을 극복하기 위해 최근 위성영상과 드론을 활용하여 촬영된 다분광 혹은 초분광 영상을 통해 고해상도의 부유사 농도 시공간분포를 측정하는 기법에 대한 연구가 활발히 진행되고 있다. 하지만, 다른 하천 물리량 계측에 비해 부유사 계측 연구는 하천에 따라 부유사가 비균질적으로 분포하여 원격탐사를 통해 정확하고 전역적인 농도 분포를 재현하기는 어려운 실정이다. 이러한 부유사의 비균질성은 부유사의 입도분포, 광물특성, 침강성 등이 하천에서 다양하게 분포하기 때문이며 이로 인해 부유사는 지역별로 다양한 분광특성을 가지게 된다. 따라서, 본 연구에서는 이러한 영향을 고려한 전역적인 부유사 농도 예측 모형을 개발하기 위해 실내 실험을 통해 부유사 특성별 고유 분광 라이브러리를 구축하고 실규모 수로에서 다양한 부유사 조건에 대한 초분광 스펙트럼과 부유사 농도를 측정하는 실험을 수행하였다. 실제 부유사 농도는 광학 기반 센서인 LISST-200X와 샘플링을 통한 실험실 분석을 통해 계측되었으며, 초분광 스펙트럼 자료는 초분광 카메라를 통해 촬영한 영상에서 부유사 계측 지점에 대한 픽셀의 스펙트럼을 추출하여 구축하였다. 이렇게 생성된 자료들의 분광 다양성을 주성분 분석(Principle Component Analysis; PCA)를 통해 분석하였으며, 부유사의 입도 분포, 부유사 종류, 수온 등과의 상관관계를 통해 분광 특성과 가장 상관관계가 높은 물리적 인자를 규명하였다. 더불어 구축된 자료를 바탕으로 기계학습 기반 주요 특징 선택 알고리즘인 재귀적 특징 제거법 (Recursive Feature Elimination)과 기계학습기반 회귀 모형인 Support Vector Regression을 결합하여 초분광 영상 기반 부유사 농도 예측 모형을 개발하였으며, 이 결과를 원격탐사 계측 연구에서 일반적으로 사용되어 오던 최적 밴드비 분석 (Optimal Band Ratio Analysis; OBRA) 방법으로 도출된 회귀식과 비교하였다. 그 결과, 기존의 OBRA 기반 방법은 비선형성을 증가시켜도 좁은 영역의 파장대만을 고려하는 한계점으로 인해 부유사의 다양한 분광 특성을 반영하지 못하였으며, 본 연구에서 제시한 기계학습 기반 예측 모형은 420 nm~1000 nm에 걸쳐 폭 넓은 파장대를 고려함과 동시에 높은 정확도를 산출하였다. 최종적으로 개발된 모형을 적용해 다양한 유사 조건에 대한 부유사 시공간 분포를 매핑한 결과, 시공간적으로 고해상도의 부유사 농도 분포를 산출하는 것으로 밝혀졌다.

  • PDF

Development of suspended solid concentration measurement technique based on multi-spectral satellite imagery in Nakdong River using machine learning model (기계학습모형을 이용한 다분광 위성 영상 기반 낙동강 부유 물질 농도 계측 기법 개발)

  • Kwon, Siyoon;Seo, Il Won;Beak, Donghae
    • Journal of Korea Water Resources Association
    • /
    • v.54 no.2
    • /
    • pp.121-133
    • /
    • 2021
  • Suspended Solids (SS) generated in rivers are mainly introduced from non-point pollutants or appear naturally in the water body, and are an important water quality factor that may cause long-term water pollution by being deposited. However, the conventional method of measuring the concentration of suspended solids is labor-intensive, and it is difficult to obtain a vast amount of data via point measurement. Therefore, in this study, a model for measuring the concentration of suspended solids based on remote sensing in the Nakdong River was developed using Sentinel-2 data that provides high-resolution multi-spectral satellite images. The proposed model considers the spectral bands and band ratios of various wavelength bands using a machine learning model, Support Vector Regression (SVR), to overcome the limitation of the existing remote sensing-based regression equations. The optimal combination of variables was derived using the Recursive Feature Elimination (RFE) and weight coefficients for each variable of SVR. The results show that the 705nm band belonging to the red-edge wavelength band was estimated as the most important spectral band, and the proposed SVR model produced the most accurate measurement compared with the previous regression equations. By using the RFE, the SVR model developed in this study reduces the variable dependence compared to the existing regression equations based on the single spectral band or band ratio and provides more accurate prediction of spatial distribution of suspended solids concentration.

Clustering of sediment characteristics in South Korean rivers and its expanded application strategy to H-ADCP based suspended sediment concentration monitoring technique (한국 하천의 지역별 유사특성의 군집화와 H-ADCP 기반 부유사 농도 관측 기법에의 활용 방안)

  • Noh, Hyoseob;Son, GeunSoo;Kim, Dongsu;Park, Yong Sung
    • Journal of Korea Water Resources Association
    • /
    • v.55 no.1
    • /
    • pp.43-57
    • /
    • 2022
  • Advances in measurement techniques have reduced measurement costs and enhanced safety resulting in less uncertainty. For example, an acoustic doppler current profiler (ADCP) based suspended sediment concentration (SSC) measurement technique is being accepted as an alternative to the conventional data collection method. In Korean rivers, horizontal ADCPs (H-ADCPs) are mounted on the automatic discharge monitoring stations, where SSC can be measured using the backscatter of ADCPs. However, automatic discharge monitoring stations and sediment monitoring stations do not always coincide which hinders the application of the new techniques that are not feasible to some stations. This work presents and analyzes H-ADCP-SSC models for 9 discharge monitoring stations in Korean rivers. In application of the Gaussian mixture model (GMM) to sediment-related variables (catchment area, particle size distributions of suspended sediment and bed material, water discharge-sediment discharge curves) from 44 sediment monitoring stations, it is revealed that those characteristics can distinguish sediment monitoring stations regionally. Linking the two results, we propose a protocol determining the H-ADCP-SSC model where no H-ADCP-SSC model is available.

Literature Review of AI Hallucination Research Since the Advent of ChatGPT: Focusing on Papers from arXiv (챗GPT 등장 이후 인공지능 환각 연구의 문헌 검토: 아카이브(arXiv)의 논문을 중심으로)

  • Park, Dae-Min;Lee, Han-Jong
    • Informatization Policy
    • /
    • v.31 no.2
    • /
    • pp.3-38
    • /
    • 2024
  • Hallucination is a significant barrier to the utilization of large-scale language models or multimodal models. In this study, we collected 654 computer science papers with "hallucination" in the abstract from arXiv from December 2022 to January 2024 following the advent of Chat GPT and conducted frequency analysis, knowledge network analysis, and literature review to explore the latest trends in hallucination research. The results showed that research in the fields of "Computation and Language," "Artificial Intelligence," "Computer Vision and Pattern Recognition," and "Machine Learning" were active. We then analyzed the research trends in the four major fields by focusing on the main authors and dividing them into data, hallucination detection, and hallucination mitigation. The main research trends included hallucination mitigation through supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF), inference enhancement via "chain of thought" (CoT), and growing interest in hallucination mitigation within the domain of multimodal AI. This study provides insights into the latest developments in hallucination research through a technology-oriented literature review. This study is expected to help subsequent research in both engineering and humanities and social sciences fields by understanding the latest trends in hallucination research.

A Study on the Prediction Model of Stock Price Index Trend based on GA-MSVM that Simultaneously Optimizes Feature and Instance Selection (입력변수 및 학습사례 선정을 동시에 최적화하는 GA-MSVM 기반 주가지수 추세 예측 모형에 관한 연구)

  • Lee, Jong-sik;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.4
    • /
    • pp.147-168
    • /
    • 2017
  • There have been many studies on accurate stock market forecasting in academia for a long time, and now there are also various forecasting models using various techniques. Recently, many attempts have been made to predict the stock index using various machine learning methods including Deep Learning. Although the fundamental analysis and the technical analysis method are used for the analysis of the traditional stock investment transaction, the technical analysis method is more useful for the application of the short-term transaction prediction or statistical and mathematical techniques. Most of the studies that have been conducted using these technical indicators have studied the model of predicting stock prices by binary classification - rising or falling - of stock market fluctuations in the future market (usually next trading day). However, it is also true that this binary classification has many unfavorable aspects in predicting trends, identifying trading signals, or signaling portfolio rebalancing. In this study, we try to predict the stock index by expanding the stock index trend (upward trend, boxed, downward trend) to the multiple classification system in the existing binary index method. In order to solve this multi-classification problem, a technique such as Multinomial Logistic Regression Analysis (MLOGIT), Multiple Discriminant Analysis (MDA) or Artificial Neural Networks (ANN) we propose an optimization model using Genetic Algorithm as a wrapper for improving the performance of this model using Multi-classification Support Vector Machines (MSVM), which has proved to be superior in prediction performance. In particular, the proposed model named GA-MSVM is designed to maximize model performance by optimizing not only the kernel function parameters of MSVM, but also the optimal selection of input variables (feature selection) as well as instance selection. In order to verify the performance of the proposed model, we applied the proposed method to the real data. The results show that the proposed method is more effective than the conventional multivariate SVM, which has been known to show the best prediction performance up to now, as well as existing artificial intelligence / data mining techniques such as MDA, MLOGIT, CBR, and it is confirmed that the prediction performance is better than this. Especially, it has been confirmed that the 'instance selection' plays a very important role in predicting the stock index trend, and it is confirmed that the improvement effect of the model is more important than other factors. To verify the usefulness of GA-MSVM, we applied it to Korea's real KOSPI200 stock index trend forecast. Our research is primarily aimed at predicting trend segments to capture signal acquisition or short-term trend transition points. The experimental data set includes technical indicators such as the price and volatility index (2004 ~ 2017) and macroeconomic data (interest rate, exchange rate, S&P 500, etc.) of KOSPI200 stock index in Korea. Using a variety of statistical methods including one-way ANOVA and stepwise MDA, 15 indicators were selected as candidate independent variables. The dependent variable, trend classification, was classified into three states: 1 (upward trend), 0 (boxed), and -1 (downward trend). 70% of the total data for each class was used for training and the remaining 30% was used for verifying. To verify the performance of the proposed model, several comparative model experiments such as MDA, MLOGIT, CBR, ANN and MSVM were conducted. MSVM has adopted the One-Against-One (OAO) approach, which is known as the most accurate approach among the various MSVM approaches. Although there are some limitations, the final experimental results demonstrate that the proposed model, GA-MSVM, performs at a significantly higher level than all comparative models.

A Study on Spatial Downscaling of Satellite-based Soil Moisture Data (토양수분 위성자료의 공간상세화에 관한 연구)

  • Shin, Dae Yun;Lee, Yang Won;Park, Mun Sung
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2017.05a
    • /
    • pp.414-414
    • /
    • 2017
  • 토양수분은 지면환경에서 일어나는 수문 및 에너지 순환을 이해하는 데 있어 중요한 기상인자이다. 토양수분 현장관측은 땅속에 매설된 센서에 의해 상당히 정확하게 이루어지만, 관측점 수가 충분치 않아 공간적 연속성을 확보하지 못하는 어려움이 존재한다. 이에 광역적 및 연속적 관측이 가능한 마이크로파 위성센서가 토양수분 정보 획득을 위한 보조수단으로서 그 중요성이 부각되고 있다. 마이크로파 위성센서는 구름 등 기상조건의 제약을 받지 않으며, 1978년 이래 현재까지 여러 위성에 의해 25 km 및 10 km 해상도의 전지구 토양수분자료가 생산되어 왔다. 마이크로파 센서를 이용한 토양수분자료는 동일지점에 대하여 하루 2회 정도 산출되므로 적절한 시간분해능을 가지지만, 공간해상도가 최고 10 km로서 지역규모의 수문분석에 적용하기에는 충분치 않다. 이러한 토양수분자료의 공간해상도 문제 해결을 위하여 다양한 지면환경요소를 활용한 통계적 다운스케일링이 대안으로 제시되었다. 최근의 선행연구들은 대부분 방정식을 이용한 결합모형을 통해 통계적 다운스케일링을 수행하였는데, 회귀식과 같은 선형결합뿐 아니라 신경망이나 기계학습 등의 비선형결합에서도, 불가피하게 발생할 수밖에 없는 잔차(residual)로 인하여 다운스케일링 전후의 공간분포 패턴이 달라져버리는 문제를 안고 있었다. 회귀분석에 잔차의 공간내삽을 결합시킨 회귀크리깅(regression kriging)은 잔차보정을 통해 이러한 문제를 해결함으로써 다운스케일링 전후의 공간분포 일관성을 보장하는 기법이다. 이 연구에서는 회귀크리깅을 이용하여 일자별 AMSR2(Advanced Microwave Scanning Radiometer 2) 토양수분 자료를 10 km에서 1 km 해상도로 다운스케일링하고, 다운스케일링 전후의 자료패턴 일관성을 평가한다. 지면온도(LST), 지면온도상승률(RR), 식생온도건조지수(TVDI)는 일자별로 DB를 구축하였고, 식생지수(NDVI), 수분지수(NDWI), 지면알베도(SA)는 8일 간격으로 DB를 구축하였다. 이러한 8일 간격의 자료를 일자별로 변환하기 위하여 큐빅스플라인(cubic spline)을 이용하여 시계열내삽을 수행하였다. 또한 상이한 공간해상도의 자료는 최근린법을 이용하여 다운스케일링 목표해상도인 1 km에 맞도록 변환하였다. 우선 저해상도 스케일에서 추정치를 산출하기 위해서는 저해상도 픽셀별로 이에 해당하는 복수의 고해상도 픽셀을 평균화하여 대응시켜야 하며, 이를 통해 6개의 설명변수(LST, RR, TVDI, NDVI, NDWI, SA)와 AMSR2 토양수분을 반응변수로 하는 다중회귀식을 도출하였다. 이식을 고해상도 스케일의 설명변수들에 적용하면 고해상도 토양수분 추정치가 산출되는데, 이때 추정치와 원자료의 차이에 해당하는 잔차에 대한 보정이 필요하다. 저해상도 스케일로 존재하는 잔차를 크리깅 공간내삽을 통해 고해상도로 변환한 후 이를 고해상도 추정치에 부가해주는 방식으로 잔차보정이 이루어짐으로써, 다운스케일링 전후의 자료패턴 일관성이 유지되는(r>0.95) 공간상세화된 토양수분 자료를 생산할 수 있다.

  • PDF

Modeling the Spatial Distribution of Roe Deer (Capreolus pygargus) in Jeju Island (제주 노루(Capreolus pygargus)의 서식지 선호도 분석)

  • KIM, A-Reum;LEE, Jae-Min;JANG, Gab-Sue
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.20 no.4
    • /
    • pp.139-151
    • /
    • 2017
  • The habitat preference of roe deers(Capreolus pygargus) in Jeju island, South Korea was analyzed by using their occurrence probability in MaxEnt model in this study. Totally 490 surveying data were gathered and 15 environmental variables were chosen for the model in which 6 variables out of 15 ones were filtered and finally removed because of there being higher correlation(over 0.7 in correlation coefficient). According to the modeling, roe deers were known to prefer the area ranging from 200 to 700 meter and over 1,500 meter in sea level, where there were not many dominant tree and/or dominant vegetation with low density so that understory vegetation can grow well with plentiful sunlight and can be used as a food of herbivore like roe deers. Otherwise, the region ranging from 700 to 1,500 meter was mostly covered with high density vegetation which cut off sunlight trying to penetrate through the dominant vegetation. It can cause a lower density of vegetation on surface, which can not attract to roe deers.