• Title/Summary/Keyword: Spatial random forest

Generation of Daily High-resolution Sea Surface Temperature for the Seas around the Korean Peninsula Using Multi-satellite Data and Artificial Intelligence (다종 위성자료와 인공지능 기법을 이용한 한반도 주변 해역의 고해상도 해수면온도 자료 생산)

  • Jung, Sihun;Choo, Minki;Im, Jungho;Cho, Dongjin
    • Korean Journal of Remote Sensing / v.38 no.5_2 / pp.707-723 / 2022
  • Although satellite-based sea surface temperature (SST) is advantageous for monitoring large areas, spatiotemporal data gaps frequently occur due to various environmental or mechanical causes. Thus, it is crucial to fill in the gaps to maximize its usability. In this study, daily SST composite fields with a resolution of 4 km were produced through a two-step machine learning approach using polar-orbiting and geostationary satellite SST data. The first step was SST reconstruction based on the Data INterpolating Convolutional Auto-Encoder (DINCAE) using multi-satellite-derived SST data. The second step corrected the reconstructed SST toward in situ measurements using a light gradient boosting machine (LGBM) to produce the final daily SST composite fields. The DINCAE model was validated using random masks for 50 days, whereas the LGBM model was evaluated using leave-one-year-out cross-validation (LOYOCV). The SST reconstruction accuracy was high, with an R2 of 0.98 and a root-mean-square error (RMSE) of 0.97℃. The accuracy gain from the second step was also substantial when compared to in situ measurements, with an RMSE decrease of 0.21-0.29℃ and a mean absolute error (MAE) decrease of 0.17-0.24℃. The SST composite fields generated using all in situ data in this study were comparable with existing data-assimilated SST composite fields. In addition, the LGBM model in the second step greatly reduced the overfitting that was reported as a limitation in a previous study that used random forest. The spatial distribution of the corrected SST was similar to those of existing high-resolution SST composite fields, revealing that spatial details of oceanic phenomena such as fronts, eddies, and SST gradients were well simulated. This research demonstrated the potential to produce high-resolution seamless SST composite fields using multi-satellite data and artificial intelligence.
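
The leave-one-year-out cross-validation (LOYOCV) and the RMSE/MAE metrics mentioned in the abstract above can be illustrated roughly as follows. This is a minimal sketch, not the authors' code: the function names, the toy `years` list, and the sample values are hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from math import sqrt


def leave_one_year_out(years):
    """Yield (held_out_year, train_idx, test_idx) once per distinct year."""
    by_year = defaultdict(list)
    for i, y in enumerate(years):
        by_year[y].append(i)
    for held_out in sorted(by_year):
        test_idx = by_year[held_out]
        # Every sample from any other year goes into the training fold.
        train_idx = sorted(
            i for y, idx in by_year.items() if y != held_out for i in idx
        )
        yield held_out, train_idx, test_idx


def rmse(obs, pred):
    """Root-mean-square error between observations and predictions."""
    return sqrt(sum((o - p) ** 2 for o, p in zip(obs, pred)) / len(obs))


def mae(obs, pred):
    """Mean absolute error between observations and predictions."""
    return sum(abs(o - p) for o, p in zip(obs, pred)) / len(obs)


# Hypothetical toy data: the acquisition year of each sample.
years = [2019, 2019, 2020, 2020, 2021]
folds = list(leave_one_year_out(years))
```

Each fold holds out all samples from one year, so the model is always scored on a year it never saw during training.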

Predicting the Effects of Rooftop Greening and Evaluating CO2 Sequestration in Urban Heat Island Areas Using Satellite Imagery and Machine Learning (위성영상과 머신러닝 활용 도시열섬 지역 옥상녹화 효과 예측과 이산화탄소 흡수량 평가)

  • Minju Kim;Jeong U Park;Juhyeon Park;Jisoo Park;Chang-Uk Hyun
    • Korean Journal of Remote Sensing / v.39 no.5_1 / pp.481-493 / 2023
  • In high-density urban areas, the urban heat island effect increases urban temperatures, leading to negative impacts such as worsened air pollution, increased cooling energy consumption, and increased greenhouse gas emissions. In urban environments where it is difficult to secure additional green spaces, rooftop greening is an efficient greenhouse gas reduction strategy. In this study, we not only analyzed the current status of the urban heat island effect but also utilized high-resolution satellite data and spatial information to estimate the available rooftop greening area within the study area. We evaluated the mitigation effect of the urban heat island phenomenon and carbon sequestration capacity through temperature predictions resulting from rooftop greening. To achieve this, we utilized WorldView-2 satellite data to classify land cover in the urban heat island areas of Busan city. We developed a prediction model for temperature changes before and after rooftop greening using machine learning techniques. To assess the degree of urban heat island mitigation due to changes in rooftop greening areas, we constructed a temperature change prediction model with temperature as the dependent variable using the random forest technique. In this process, we built a multiple regression model to derive high-resolution land surface temperatures for training data using Google Earth Engine, combining Landsat-8 and Sentinel-2 satellite data. Additionally, we evaluated carbon sequestration based on rooftop greening areas using a carbon absorption capacity per plant. The results of this study suggest that the developed satellite-based urban heat island assessment and temperature change prediction technology using Random Forest models can be applied to urban heat island-vulnerable areas with potential for expansion.

Spatial Distribution of Epilithic Diatom Communities in the Estuary of Korean Peninsula (한반도 하구역 부착돌말류의 공간적 분포)

  • Kim, Ha-Kyung;Cho, In-Hwan;Kim, Young-Hyo;Lee, Min-Hyuk;Kim, Yong-Jae;Won, Du-Hee;Hwang, Su-Ok;Byun, Jung-Hwan;Hwang, Soon-Jin;Kim, Baik-Ho
    • Korean Journal of Ecology and Environment / v.51 no.1 / pp.1-15 / 2018
  • The distributional characteristics of epilithic diatom communities, together with land use (cover) and water quality, were studied using 193 samples collected from estuaries of the Korean Peninsula between 2015 and 2016. Of the 394 taxa classified, Nitzschia perminuta (19.6%) and N. inconspicua (14.0%) were the first and second most dominant species. Using cluster analysis, the epilithic diatom communities of Korean estuaries were divided into four groups (G1-G4). The ecological characteristics of each group were as follows: G1 was located in estuaries of the East Sea and was characterized by high forest land-use, high DO, and low nutrients; G2 was in the eastern part of the South Sea and was characterized by low turbidity and nutrients; G3 was in the western part of the South Sea and was characterized by high agriculture, low electric conductivity, and low salinity; G4 was in the Yellow Sea and was characterized by high nutrients. The environmental factors significantly correlated with diatom distributions were as follows: TN for G1, turbidity for G2, agriculture for G3, and TP for G4. Moreover, the important factors affecting the occurrence of indicator species were forest land-use for Fragilaria construens var. venter in G1, turbidity for Rhoicosphenia abbreviata in G2, urban land-use and total phosphorus (TP) for Bacillaria paradoxa and Hantzschia amphioxys in G3, and TP and turbidity for N. ovalis and Stephanodiscus invistatus in G4. These results collectively indicate that the distribution of epilithic diatom communities in the Korean Peninsula was largely affected by water quality and land cover/use.

Development of a Classification Method for Forest Vegetation on the Stand Level, Using KOMPSAT-3A Imagery and Land Coverage Map (KOMPSAT-3A 위성영상과 토지피복도를 활용한 산림식생의 임상 분류법 개발)

  • Song, Ji-Yong;Jeong, Jong-Chul;Lee, Peter Sang-Hoon
    • Korean Journal of Environment and Ecology / v.32 no.6 / pp.686-697 / 2018
  • Due to advances in remote sensing technology, it has become easier to frequently obtain high-resolution imagery for detecting subtle changes over extensive areas, particularly forests, which are not readily sub-classified. Time-series analysis of high-resolution images requires collecting an extensive amount of ground truth data. In this study, the potential of a land coverage map as ground truth data was tested in classifying high-resolution imagery. The study site was Wonju-si in Gangwon-do, South Korea, which has a mix of urban and natural areas. KOMPSAT-3A imagery taken in March 2015 and a land coverage map published in 2017 were used as source data. Two pixel-based classification algorithms, Support Vector Machine (SVM) and Random Forest (RF), were selected for the analysis. Forest-only classification was compared with classification of the whole study area except wetland. Confusion matrices from the classification showed that overall accuracies for both targets were higher with the RF algorithm than with SVM. While the overall accuracy of the forest-only analysis with the RF algorithm was 18.3% higher than with SVM, the difference in the whole-region analysis was smaller, at 5.5%. For the SVM algorithm, adding the Majority analysis process yielded a marginal improvement of about 1% over the normal SVM analysis. The RF algorithm was more effective at identifying broad-leaved forest within the forest, whereas the SVM algorithm was more effective for the other classes. As only two pixel-based classification algorithms were tested here, future classification is expected to improve overall accuracy and reliability by introducing time-series analysis and an object-based algorithm. This approach is expected to contribute to improving large-scale land planning by providing an effective land classification method at higher spatial and temporal scales.

Discriminant analysis of grain flours for rice paper using fluorescence hyperspectral imaging system and chemometric methods

  • Seo, Youngwook;Lee, Ahyeong;Kim, Bal-Geum;Lim, Jongguk
    • Korean Journal of Agricultural Science / v.47 no.3 / pp.633-644 / 2020
  • Rice paper is an element of Vietnamese cuisine that can be used to wrap vegetables and meat. Rice and starch are the main ingredients of rice paper and their mixing ratio is important for quality control. In a commercial factory, assessment of food safety and quantitative supply is a challenging issue. A rapid and non-destructive monitoring system is therefore necessary in commercial production systems to ensure the food safety of rice and starch flour for the rice paper wrap. In this study, fluorescence hyperspectral imaging technology was applied to classify grain flours. Using the 3D hyper cube of fluorescence hyperspectral imaging (fHSI, 420 - 730 nm), spectral and spatial data and chemometric methods were applied to detect and classify flours. Eight flours (rice: 4, starch: 4) were prepared and hyperspectral images were acquired in a 5 (L) × 5 (W) × 1.5 (H) cm container. Linear discriminant analysis (LDA), partial least square discriminant analysis (PLSDA), support vector machine (SVM), classification and regression tree (CART), and random forest (RF) with a few preprocessing methods (multivariate scatter correction [MSC], 1st and 2nd derivative and moving average) were applied to classify grain flours and the accuracy was compared using a confusion matrix (accuracy and kappa coefficient). LDA with moving average showed the highest accuracy at A = 0.9362 (K = 0.9270). 1D convolutional neural network (CNN) demonstrated a classification result of A = 0.94 and showed improved classification results between mimyeon flour (MF)1 and MF2 of 0.72 and 0.87, respectively. In this study, the potential of non-destructive detection and classification of grain flours using fHSI technology and machine learning methods was demonstrated.
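
The abstract above compares classifiers by accuracy (A) and kappa coefficient (K) derived from a confusion matrix. A minimal sketch of how both values are computed, assuming rows hold the true classes and columns the predicted classes; the 2×2 matrix is hypothetical and is not data from the study.

```python
def accuracy_and_kappa(cm):
    """Return (overall accuracy, Cohen's kappa) for a square confusion matrix."""
    k = len(cm)
    n = sum(sum(row) for row in cm)
    # Observed agreement: the fraction of samples on the diagonal.
    observed = sum(cm[i][i] for i in range(k)) / n
    row_tot = [sum(row) for row in cm]
    col_tot = [sum(cm[i][j] for i in range(k)) for j in range(k)]
    # Agreement expected by chance, from the row and column marginals.
    expected = sum(r * c for r, c in zip(row_tot, col_tot)) / (n * n)
    kappa = (observed - expected) / (1 - expected)
    return observed, kappa


# Hypothetical two-class confusion matrix (e.g. rice vs. starch flour).
cm = [[45, 5],
      [10, 40]]
acc, kappa = accuracy_and_kappa(cm)
```

Kappa discounts the agreement that would occur by chance, which is why it is lower than the raw accuracy whenever the classifier is imperfect.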

Effects of a Newly Designed Pelvic Belt Orthosis on Functional Mobility of Adults with Post-Stroke Hemiparesis

  • Cho, Byeong-Mo;Zarayeneh, Neda;Suh, Sang C.
    • Journal of The Korean Society of Integrative Medicine / v.8 no.4 / pp.125-131 / 2020
  • Purpose : Lower extremity orthoses have been used as conservative methods to recover the gait of stroke patients. The purpose of this study is to examine how a newly designed pelvic belt orthosis can improve the gait ability and dynamic balance of adults with hemiparesis after stroke. Methods : Twenty-two patients who had hemiparesis after stroke participated in this study. Two groups were randomly created by assigning 10 subjects to the experimental group and the remaining 12 subjects to the control group. The control group was treated with conventional physical therapy and occupational therapy. Identical therapy protocols were used to treat the experimental group, who were assigned to wear the pelvic belt orthosis during post measurement. This study has a group of independent variables including group, gender, age, height, MAS, lesion side, and cause, and a group of dependent variables including gait speed, cadence, step length, stride length, and dynamic balance. The GAITRite system was used to measure spatial-temporal gait parameters, and the Balance System SD was used to measure dynamic balance. The data were analyzed using R version 3.3.1. Random forest, a boosting algorithm, and a MANOVA test were conducted to determine the effects of the independent variables on the dependent variables. Results : The independent variable "group" had the highest importance value, approximately 25.42 (%IncMSE), about three times greater than that of the second most important predictor, "height." Conclusion : The results support the hypothesis that the pelvic belt orthosis can be used effectively to improve the gait ability and balance of patients with post-stroke hemiparesis.

Estimation of Chlorophyll-a via harmonized landsat sentinel-2 (HLS) datasets (Harmonized Landsat Sentinel-2 (HLS) 위성자료를 활용한 클로로필-a 추정)

  • Jongmin Park
    • Proceedings of the Korea Water Resources Association Conference / 2023.05a / pp.400-400 / 2023
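  • As rapid climate change steadily increases solar radiation, land surface temperature, and carbon dioxide concentration, it is causing an imbalance in the hydrological cycle, and water quality in rivers and lakes is also deteriorating. In Korea in particular, eutrophication is increasing because of falling water levels and rising water temperatures in rivers and lakes caused by climate change and anthropogenic factors, which in turn raises the frequency of harmful algal blooms. Korea currently monitors major water quality parameters through manned and automated water quality observation systems, but these are limited in capturing spatiotemporal variability. To overcome this limitation, studies on developing algorithms that estimate water quality parameters from optical satellites are under way in Korea and abroad. Accordingly, this study estimates chlorophyll-a (Chl-a) using Harmonized Landsat Sentinel-2 (HLS) satellite data, which combine Landsat-8 data provided by NASA with Sentinel-2 data provided by ESA. To this end, the study used 1) simple regression analysis, 2) optimized regression analysis based on the Akaike information criterion (AIC), and 3) random forest (RF). In addition, to evaluate the applicability of the HLS data, chlorophyll-a observations collected from 2000 to 2021 at about 130 medium- and large-sized lakes in the state of Ohio, USA, were used. Before validating the accuracy of the two water quality estimation models, the temporal variability of chlorophyll-a within Ohio was analyzed. Overall, Chl-a showed a steadily increasing trend from 2000 to 2016 and a decreasing trend thereafter. On this basis, statistical validation was performed on the Chl-a estimates obtained from each method. The coefficient of determination of Chl-a estimated by simple regression was 0.34, whereas it improved to 0.82 and 0.92 with the AIC-based model and the RF model, respectively. Accuracy was also analyzed with respect to the spatial and temporal windows and lake size. The temporal window had the greatest influence on accuracy, and accuracy decreased as lake size decreased. Based on these results, a future evaluation of the applicability of these models to lakes in Korea is expected to lead to an efficient water quality monitoring system.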

Comparison of Machine Learning Techniques in Urban Weather Prediction using Air Quality Sensor Data (실외공기측정기 자료를 이용한 도심 기상 예측 기계학습 모형 비교)

  • Jong-Chan Park;Heon Jin Park
    • The Journal of Bigdata / v.6 no.2 / pp.39-49 / 2021
  • Recently, large and diverse weather data are being collected by sensors from various sources. Efforts to predict the concentration of fine dust through machine learning are being made everywhere, and this study intends to compare PM10 and PM2.5 prediction models using data from 840 outdoor air meters installed throughout the city. Information can be provided in real time by predicting the concentration of fine dust after 5 minutes, and can be the basis for model development after 10 minutes, 30 minutes, and 1 hour. Data preprocessing was performed, such as noise removal and missing value replacement, and a derived variable that considers temporal and spatial variables was created. The parameters of the model were selected through the response surface method. XGBoost, Random Forest, and Deep Learning (Multilayer Perceptron) are used as predictive models to check the difference between fine dust concentration and predicted values, and to compare the performance between models.

Spatial Downscaling of Ocean Colour-Climate Change Initiative (OC-CCI) Forel-Ule Index Using GOCI Satellite Image and Machine Learning Technique (GOCI 위성영상과 기계학습 기법을 이용한 Ocean Colour-Climate Change Initiative (OC-CCI) Forel-Ule Index의 공간 상세화)

  • Sung, Taejun;Kim, Young Jun;Choi, Hyunyoung;Im, Jungho
    • Korean Journal of Remote Sensing / v.37 no.5_1 / pp.959-974 / 2021
  • The Forel-Ule Index (FUI) is an index that classifies the colors of natural inland and sea waters into 21 grades ranging from indigo blue to cola brown. FUI has been analyzed in connection with the eutrophication, water quality, and light characteristics of water systems in many studies, and its potential as a new water quality index that simultaneously contains optical information on water quality parameters has been suggested. In this study, Ocean Colour-Climate Change Initiative (OC-CCI) based 4 km FUI was spatially downscaled to a resolution of 500 m using Geostationary Ocean Color Imager (GOCI) data and Random Forest (RF) machine learning. Then, the RF-derived FUI was examined in terms of its correlation with various water quality parameters measured in coastal areas and its spatial distribution and seasonal characteristics. The results showed that the RF-derived FUI had higher accuracy (Coefficient of Determination (R2)=0.81, Root Mean Square Error (RMSE)=0.7784) than the GOCI-derived FUI estimated by Pitarch's OC-CCI FUI algorithm (R2=0.72, RMSE=0.9708). The RF-derived FUI showed a high correlation with five water quality parameters, Total Nitrogen, Total Phosphorus, Chlorophyll-a, Total Suspended Solids, and Transparency, with correlation coefficients of 0.87, 0.88, 0.97, 0.65, and -0.98, respectively. The temporal pattern of the RF-derived FUI reflected the physical relationship with various water quality parameters well, with strong seasonality. The research findings suggest the potential of high-resolution FUI for coastal water quality management in the Korean Peninsula.

Machine Learning Based MMS Point Cloud Semantic Segmentation (머신러닝 기반 MMS Point Cloud 의미론적 분할)

  • Bae, Jaegu;Seo, Dongju;Kim, Jinsoo
    • Korean Journal of Remote Sensing / v.38 no.5_3 / pp.939-951 / 2022
  • The most important factor in designing autonomous driving systems is to recognize the exact location of the vehicle within the surrounding environment. To date, various sensors and navigation systems have been used for autonomous driving systems; however, all have limitations. Therefore, the need for high-definition (HD) maps that provide high-precision infrastructure information for safe and convenient autonomous driving is increasing. HD maps are drawn using three-dimensional point cloud data acquired through a mobile mapping system (MMS). However, this process requires manual work due to the large numbers of points and drawing layers, increasing the cost and effort associated with HD mapping. The objective of this study was to improve the efficiency of HD mapping by segmenting semantic information in an MMS point cloud into six classes: roads, curbs, sidewalks, medians, lanes, and other elements. Segmentation was performed using various machine learning techniques including random forest (RF), support vector machine (SVM), k-nearest neighbor (KNN), and gradient-boosting machine (GBM), and 11 variables including geometry, color, intensity, and other road design features. MMS point cloud data for a 130-m section of a five-lane road near Minam Station in Busan, were used to evaluate the segmentation models; the average F1 scores of the models were 95.43% for RF, 92.1% for SVM, 91.05% for GBM, and 82.63% for KNN. The RF model showed the best segmentation performance, with F1 scores of 99.3%, 95.5%, 94.5%, 93.5%, and 90.1% for roads, sidewalks, curbs, medians, and lanes, respectively. The variable importance results of the RF model showed high mean decrease accuracy and mean decrease gini for XY dist. and Z dist. variables related to road design, respectively. Thus, variables related to road design contributed significantly to the segmentation of semantic information. The results of this study demonstrate the applicability of segmentation of MMS point cloud data based on machine learning, and will help to reduce the cost and effort associated with HD mapping.
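
The per-class and average F1 scores reported in the abstract above can be derived from a confusion matrix along these lines. This is a sketch under stated assumptions: rows are true classes, columns are predicted classes, the average is an unweighted (macro) mean, and the three-class matrix is hypothetical. The abstract does not state which averaging scheme the authors used.

```python
def per_class_f1(cm):
    """Return one F1 score per class for a square confusion matrix."""
    k = len(cm)
    scores = []
    for c in range(k):
        tp = cm[c][c]
        fn = sum(cm[c]) - tp                       # true class c, predicted otherwise
        fp = sum(cm[r][c] for r in range(k)) - tp  # predicted c, true class otherwise
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return scores


# Hypothetical three-class confusion matrix (e.g. road, curb, sidewalk).
cm = [[8, 2, 0],
      [1, 9, 0],
      [0, 0, 10]]
f1 = per_class_f1(cm)
macro_f1 = sum(f1) / len(f1)
```

A macro average weights every class equally, so a rare class such as lane markings affects the summary score as much as the dominant road class.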