• Title/Summary/Keyword: 평균 모델

Search Result 3,604, Processing Time 0.035 seconds

Estimating Fine Particulate Matter Concentration using GLDAS Hydrometeorological Data (GLDAS 수문기상인자를 이용한 초미세먼지 농도 추정)

  • Lee, Seulchan;Jeong, Jaehwan;Park, Jongmin;Jeon, Hyunho;Choi, Minha
    • Korean Journal of Remote Sensing
    • /
    • v.35 no.6_1
    • /
    • pp.919-932
    • /
    • 2019
  • Fine particulate matter (PM2.5) is not only affected by anthropogenic emissions, but also intensifies, migrates, decreases by hydrometeorological factors. Therefore, it is essential to understand relationships between the hydrometeorological factors and PM2.5 concentration. In Korea, PM2.5 concentration is measured at the ground observatories and estimated data are given to locations where observatories are not present. In this way, the data is not suitable to represent an area, hence it is impossible to know accurate concentration at such locations. In addition, it is hard to trace migration, intensification, reduction of PM2.5. In this study, we analyzed the relationships between hydrometeorological factors, acquired from Global Land Data Assimilation System (GLDAS), and PM2.5 by means of Bayesian Model Averaging (BMA). By BMA, we also selected factors that have meaningful relationship with the variation of PM2.5 concentration. 4 PM2.5 concentration models for different seasons were developed using those selected factors, with Aerosol Optical Depth (AOD) from MODerate resolution Imaging Spectroradiometer (MODIS). Finally, we mapped the result of the model, to show spatial distribution of PM2.5. The model correlated well with the observed PM2.5 concentration (R ~0.7; IOA ~0.78; RMSE ~7.66 ㎍/㎥). When the models were compared with the observed PM2.5 concentrations at different locations, the correlation coefficients differed (R: 0.32-0.82), although there were similarities in data distribution. The developed concentration map using the models showed its capability in representing temporal, spatial variation of PM2.5 concentration. The result of this study is expected to be able to facilitate researches that aim to analyze sources and movements of PM2.5, if the study area is extended to East Asia.

A Simulation-Based Investigation of an Advanced Traveler Information System with V2V in Urban Network (시뮬레이션기법을 통한 차량 간 통신을 이용한 첨단교통정보시스템의 효과 분석 (도시 도로망을 중심으로))

  • Kim, Hoe-Kyoung
    • Journal of Korean Society of Transportation
    • /
    • v.29 no.5
    • /
    • pp.121-138
    • /
    • 2011
  • More affordable and available cutting-edge technologies (e.g., wireless vehicle communication) are regarded as a possible alternative to the fixed infrastructure-based traffic information system requiring the expensive infrastructure investments and mostly implemented in the uninterrupted freeway network with limited spatial system expansion. This paper develops an advanced decentralized traveler information System (ATIS) using vehicle-to-vehicle (V2V) communication system whose performance (drivers' travel time savings) are enhanced by three complementary functions (autonomous automatic incident detection algorithm, reliable sample size function, and driver behavior model) and evaluates it in the typical $6{\times}6$ urban grid network with non-recurrent traffic state (traffic incident) with the varying key parameters (traffic flow, communication radio range, and penetration ratio), employing the off-the-shelf microscopic simulation model (VISSIM) under the ideal vehicle communication environment. Simulation outputs indicate that as the three key parameters are increased more participating vehicles are involved for traffic data propagation in the less communication groups at the faster data dissemination speed. Also, participating vehicles saved their travel time by dynamically updating the up-to-date traffic states and searching for the new route. Focusing on the travel time difference of (instant) re-routing vehicles, lower traffic flow cases saved more time than higher traffic flow ones. This is because a relatively small number of vehicles in 300vph case re-route during the most system-efficient time period (the early time of the traffic incident) but more vehicles in 514vph case re-route during less system-efficient time period, even after the incident is resolved. Also, normally re-routings on the network-entering links saved more travel time than any other places inside the network except the case where the direct effect of traffic incident triggers vehicle re-routings during the effective incident time period and the location and direction of the incident link determines the spatial distribution of re-routing vehicles.

A Study on the Effect of Using Sentiment Lexicon in Opinion Classification (오피니언 분류의 감성사전 활용효과에 대한 연구)

  • Kim, Seungwoo;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.1
    • /
    • pp.133-148
    • /
    • 2014
  • Recently, with the advent of various information channels, the number of has continued to grow. The main cause of this phenomenon can be found in the significant increase of unstructured data, as the use of smart devices enables users to create data in the form of text, audio, images, and video. In various types of unstructured data, the user's opinion and a variety of information is clearly expressed in text data such as news, reports, papers, and various articles. Thus, active attempts have been made to create new value by analyzing these texts. The representative techniques used in text analysis are text mining and opinion mining. These share certain important characteristics; for example, they not only use text documents as input data, but also use many natural language processing techniques such as filtering and parsing. Therefore, opinion mining is usually recognized as a sub-concept of text mining, or, in many cases, the two terms are used interchangeably in the literature. Suppose that the purpose of a certain classification analysis is to predict a positive or negative opinion contained in some documents. If we focus on the classification process, the analysis can be regarded as a traditional text mining case. However, if we observe that the target of the analysis is a positive or negative opinion, the analysis can be regarded as a typical example of opinion mining. In other words, two methods (i.e., text mining and opinion mining) are available for opinion classification. Thus, in order to distinguish between the two, a precise definition of each method is needed. In this paper, we found that it is very difficult to distinguish between the two methods clearly with respect to the purpose of analysis and the type of results. We conclude that the most definitive criterion to distinguish text mining from opinion mining is whether an analysis utilizes any kind of sentiment lexicon. We first established two prediction models, one based on opinion mining and the other on text mining. Next, we compared the main processes used by the two prediction models. Finally, we compared their prediction accuracy. We then analyzed 2,000 movie reviews. The results revealed that the prediction model based on opinion mining showed higher average prediction accuracy compared to the text mining model. Moreover, in the lift chart generated by the opinion mining based model, the prediction accuracy for the documents with strong certainty was higher than that for the documents with weak certainty. Most of all, opinion mining has a meaningful advantage in that it can reduce learning time dramatically, because a sentiment lexicon generated once can be reused in a similar application domain. Additionally, the classification results can be clearly explained by using a sentiment lexicon. This study has two limitations. First, the results of the experiments cannot be generalized, mainly because the experiment is limited to a small number of movie reviews. Additionally, various parameters in the parsing and filtering steps of the text mining may have affected the accuracy of the prediction models. However, this research contributes a performance and comparison of text mining analysis and opinion mining analysis for opinion classification. In future research, a more precise evaluation of the two methods should be made through intensive experiments.

Modeling of Vegetation Phenology Using MODIS and ASOS Data (MODIS와 ASOS 자료를 이용한 식물계절 모델링)

  • Kim, Geunah;Youn, Youjeong;Kang, Jonggu;Choi, Soyeon;Park, Ganghyun;Chun, Junghwa;Jang, Keunchang;Won, Myoungsoo;Lee, Yangwon
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.5_1
    • /
    • pp.627-646
    • /
    • 2022
  • Recently, the seriousness of climate change-related problems caused by global warming is growing, and the average temperature is also rising. As a result, it is affecting the environment in which various temperature-sensitive creatures and creatures live, and changes in the ecosystem are also being detected. Seasons are one of the important factors influencing the types, distribution, and growth characteristics of creatures living in the area. Among the most popular and easily recognized plant seasonal phenomena among the indicators of the climate change impact evaluation, the blooming day of flower and the peak day of autumn leaves were modeled. The types of plants used in the modeling were forsythia and cherry trees, which can be seen as representative plants of spring, and maple and ginkgo, which can be seen as representative plants of autumn. Weather data used to perform modeling were temperature, precipitation, and solar radiation observed through the ASOS Observatory of the Korea Meteorological Administration. As satellite data, MODIS NDVI was used for modeling, and it has a correlation coefficient of about -0.2 for the flowering date and 0.3 for the autumn leaves peak date. As the model used, the model was established using multiple regression models, which are linear models, and Random Forest, which are nonlinear models. In addition, the predicted values estimated by each model were expressed as isopleth maps using spatial interpolation techniques to express the trend of plant seasonal changes from 2003 to 2020. It is believed that using NDVI with high spatio-temporal resolution in the future will increase the accuracy of plant phenology modeling.

Machine Learning Based MMS Point Cloud Semantic Segmentation (머신러닝 기반 MMS Point Cloud 의미론적 분할)

  • Bae, Jaegu;Seo, Dongju;Kim, Jinsoo
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.5_3
    • /
    • pp.939-951
    • /
    • 2022
  • The most important factor in designing autonomous driving systems is to recognize the exact location of the vehicle within the surrounding environment. To date, various sensors and navigation systems have been used for autonomous driving systems; however, all have limitations. Therefore, the need for high-definition (HD) maps that provide high-precision infrastructure information for safe and convenient autonomous driving is increasing. HD maps are drawn using three-dimensional point cloud data acquired through a mobile mapping system (MMS). However, this process requires manual work due to the large numbers of points and drawing layers, increasing the cost and effort associated with HD mapping. The objective of this study was to improve the efficiency of HD mapping by segmenting semantic information in an MMS point cloud into six classes: roads, curbs, sidewalks, medians, lanes, and other elements. Segmentation was performed using various machine learning techniques including random forest (RF), support vector machine (SVM), k-nearest neighbor (KNN), and gradient-boosting machine (GBM), and 11 variables including geometry, color, intensity, and other road design features. MMS point cloud data for a 130-m section of a five-lane road near Minam Station in Busan, were used to evaluate the segmentation models; the average F1 scores of the models were 95.43% for RF, 92.1% for SVM, 91.05% for GBM, and 82.63% for KNN. The RF model showed the best segmentation performance, with F1 scores of 99.3%, 95.5%, 94.5%, 93.5%, and 90.1% for roads, sidewalks, curbs, medians, and lanes, respectively. The variable importance results of the RF model showed high mean decrease accuracy and mean decrease gini for XY dist. and Z dist. variables related to road design, respectively. Thus, variables related to road design contributed significantly to the segmentation of semantic information. The results of this study demonstrate the applicability of segmentation of MMS point cloud data based on machine learning, and will help to reduce the cost and effort associated with HD mapping.

Evaluation of Temperature and Precipitation over CORDEX-EA Phase 2 Domain using Regional Climate Model HadGEM3-RA (HadGEM3-RA 지역기후모델을 이용한 CORDEX 동아시아 2단계 지역의 기온과 강수 모의 평가)

  • Byon, Jae-Young;Kim, Tae-Jun;Kim, Jin-Uk;Kim, Do-Hyun
    • Journal of the Korean earth science society
    • /
    • v.43 no.3
    • /
    • pp.367-385
    • /
    • 2022
  • This study evaluates the temperature and precipitation results in East Asia simulated from the Hadley Centre Global Environmental Model version 3 regional climate model (HadGEM3-RA) developed by the UK Met Office. The HadGEM3-RA is conducted in the Coordinated Regional climate Downscaling Experiment-East Asia (CORDEX-EA) Phase II domain for 15 year (2000-2014). The spatial distribution of rainbands produced from the HadGEM3-RA by the summer monsoon is in good agreement with the Asian Precipitation Highly Resolved Observational Data Integration Towards Evaluation of water resources (APRODITE) data over the East Asia. But, precipitation amount is overestimated in Southeast Asia and underestimated over the Korean Peninsula. In particular, the simulated summer rainfall and APRODITE data show the least correlation coefficient and the maximum value of root mean square error in South Korea. Prediction of temperature in Southeast Asia shows underestimation with a maximum error during winter season, while it appears the largest underestimation in South Korea during spring season. In order to evaluate local predictability, the time series of temperature and precipitation compared to the ASOS data of the Seoul Meteorological Station is similar to the spatial average verification results in which the summer precipitation and winter temperature underestimate. Especially, the underestimation of the rainfall increases when the amounts of precipitation increase in summer. The winter temperature tends to underestimate at low temperature, while it overestimates at high temperature. The results of the extreme climate index comparison show that heat wave is overestimated and heavy rainfall is underestimated. The HadGEM3-RA simulated with a horizontal resolution of 25 km shows limitations in the prediction of mesoscale convective system and topographic precipitation. This study indicates that improvement of initial data, horizontal resolution, and physical process are necessary to improve predictability of regional climate model.

Estimation of Structural Deterioration of Sewer using Markov Chain Model (마르코프 연쇄 모델을 이용한 하수관로의 구조적 노후도 추정)

  • Kang, Byong Jun;Yoo, Soon Yu;Zhang, Chuanli;Park, Kyoo Hong
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.43 no.4
    • /
    • pp.421-431
    • /
    • 2023
  • Sewer deterioration models can offer important information on prediction of future condition of the asset to decision makers in their implementing sewer pipe networks management program. In this study, Markov chain model was used to estimate sewer deterioration trend based on the historical structural condition assessment data obtained by CCTV inspection. The data used in this study were limited to Hume pipe with diameter of 450 mm and 600 mm in three sub-catchment areas in city A, which were collected by CCTV inspection projects performed in 1998-1999 and 2010-2011. As a result, it was found that sewers in sub-catchment area EM have deteriorated faster than those in other two sub-catchments. Various main defects were to generate in 29% of 450 mm sewers and 38% of 600 mm in 35 years after the installation, while serious failure in 62% of 450 mm sewers and 74% of 600 mm in 100 years after the installation in sub-catchment area EM. In sub-catchment area SN, main defects were to generate in 26% of 450 mm sewers and 35% of 600 mm in 35 years after the installation, while in sub-catchment area HK main defects were to generate in 27% of 450 mm sewers and 37% of 600 mm in 35 years after the installation. Larger sewer pipes of 600 mm were found to deteriorate faster than smaller sewer pipes of 450 mm by about 12 years. Assuming that the percentage of main defects generation could be set as 40% to estimate the life expectancy of the sewers, it was estimated as 60 years in sub-catchment area SN, 42 years in sub-catchment area EM, 59 years in sub-catchment area HK for 450 mm sewer pipes, respectively. For 600 mm sewer pipes, on the other hand, it was estimated as 43 years, 34 years, 39 years in sub-catchment areas SN, EM, and HK, respectively.

Retrieval of Hourly Aerosol Optical Depth Using Top-of-Atmosphere Reflectance from GOCI-II and Machine Learning over South Korea (GOCI-II 대기상한 반사도와 기계학습을 이용한 남한 지역 시간별 에어로졸 광학 두께 산출)

  • Seyoung Yang;Hyunyoung Choi;Jungho Im
    • Korean Journal of Remote Sensing
    • /
    • v.39 no.5_3
    • /
    • pp.933-948
    • /
    • 2023
  • Atmospheric aerosols not only have adverse effects on human health but also exert direct and indirect impacts on the climate system. Consequently, it is imperative to comprehend the characteristics and spatiotemporal distribution of aerosols. Numerous research endeavors have been undertaken to monitor aerosols, predominantly through the retrieval of aerosol optical depth (AOD) via satellite-based observations. Nonetheless, this approach primarily relies on a look-up table-based inversion algorithm, characterized by computationally intensive operations and associated uncertainties. In this study, a novel high-resolution AOD direct retrieval algorithm, leveraging machine learning, was developed using top-of-atmosphere reflectance data derived from the Geostationary Ocean Color Imager-II (GOCI-II), in conjunction with their differences from the past 30-day minimum reflectance, and meteorological variables from numerical models. The Light Gradient Boosting Machine (LGBM) technique was harnessed, and the resultant estimates underwent rigorous validation encompassing random, temporal, and spatial N-fold cross-validation (CV) using ground-based observation data from Aerosol Robotic Network (AERONET) AOD. The three CV results consistently demonstrated robust performance, yielding R2=0.70-0.80, RMSE=0.08-0.09, and within the expected error (EE) of 75.2-85.1%. The Shapley Additive exPlanations(SHAP) analysis confirmed the substantial influence of reflectance-related variables on AOD estimation. A comprehensive examination of the spatiotemporal distribution of AOD in Seoul and Ulsan revealed that the developed LGBM model yielded results that are in close concordance with AERONET AOD over time, thereby confirming its suitability for AOD retrieval at high spatiotemporal resolution (i.e., hourly, 250 m). Furthermore, upon comparing data coverage, it was ascertained that the LGBM model enhanced data retrieval frequency by approximately 8.8% in comparison to the GOCI-II L2 AOD products, ameliorating issues associated with excessive masking over very illuminated surfaces that are often encountered in physics-based AOD retrieval processes.

A Study on Medical Waste Generation Analysis during Outbreak of Massive Infectious Diseases (대규모 감염병 발병에 따른 의료폐기물 발생량 예측에 관한 연구)

  • Sang-Min Kim;Jin-Kyu Park;In-Beom Ko;Byung-Sun Lee;Sang-Ryong Shin;Nam-Hoon Lee
    • Journal of the Korea Organic Resources Recycling Association
    • /
    • v.31 no.4
    • /
    • pp.29-39
    • /
    • 2023
  • In this study, an analysis of medical waste generation characteristics was conducted, differentiating between ordinary situation and the outbreaks of massive infectious diseases. During ordinary situation, prediction models for medical waste quantities by type, general medical waste(G-MW), hazardous medical waste(H-MW), infectious medical waste(I-MW), were established through regression analysis, with all significance values (p) being <0.0001, indicating statistical significance. The determination coefficient(R2) values for prediction models of each category were analyzed as follows : I-MW(R2=0.9943) > G-MW(R2=0.9817) > H-MW(R2=0.9310). Additionally, factors such as GDP(G-MW), the number of medical institutions (H-MW), and the elderly population ratio(I-MW), utilized as influencing factors and consistent with previous literature, showed high correlations. The total MW generation, evaluated by combining each model, had an MAE of 2,615 and RMSE of 3,353. This indicated accuracy levels similar to the medical waste models of H-MW(2,491, 2,890) and I-MW(2,291, 3,267). Due to limitations in accurately estimating the quantity of medical waste during the rapid and outbreaks of massive infectious diseases, the generation unit of I-MW was derived to analyze its characteristics. During the early unstable stage of infectious disease outbreaks, the generation unit was 8.74 kg/capita·day, 2.69 kg/capita·day during the stable stage, and an average of 0.08 kg/capita·day during the reduction stage. Correlation analysis between generation unit of I-MW and lethality rates showed +0.99 in the unstable stage, +0.52 in the stable stage, and +0.96 in the reduction period, demonstrating a very high positive correlation of +0.95 or higher throughout the entire outbreaks of massive infectious diseases. The results derived from this study are expected to play a useful role in establishing an effective medical waste management system in the field of health care.

The effect of climate change on hydroelectric power generation of multipurpose dams according to SSP scenarios (SSP 시나리오에 따른 기후변화가 다목적댐 수력발전량에 미치는 영향 분석)

  • Wang, Sizhe;Kim, Jiyoung;Kim, Yongchan;Kim, Dongkyun;Kim, Tae-Woong
    • Journal of Korea Water Resources Association
    • /
    • v.57 no.7
    • /
    • pp.481-491
    • /
    • 2024
  • Recent droughts make hydroelectric power generation (HPG) decreasing. Due to climate change in the future, the frequency and intensity of drought are expected to increase, which will increase uncertainty of HPG in multi-purpose dams. Therefore, it is necessary to estimate the amount of HPG according to climate change scenarios and analyze the effect of drought on the amount of HPG. This study analyzed the future HPG of the Soyanggang Dam and Chungju Dam according to the SSP2-4.5 and SSP5-8.5 scenarios. Regression equations for HPG were developed based on the observed data of power generation discharge and HPG in the past provided by My Water, and future HPGs were estimated according to the SSP scenarios. The effect of drought on the amount of HPG was investigated based on the drought severity calculated using the standardized precipitation index (SPI). In this study, the future SPIs were calculated using precipitation data based on four GCM models (CanESM5, ACCESS-ESM1-5, INM-CM4-8, IPSL-CM6A) provided through the environmental big data platform. Overall results show that climate change had significant effects on the amount of HPG. In the case of Soyanggang Dam, the amount of HPG decreased in the SSP2-4.5 and SSP5-8.5 scenarios. Under the SSP2-4.5 scenario the CanESM model showed a 65% reduction in 2031, and under the SSP5-8.5 scenario the ACCESS-ESM1-5 model showed a 54% reduction in 2029. In the case of Chungju Dam, under the SSP2-4.5 and SSP5-8.5 scenarios the average monthly HPG compared to the reference period showed a decreasing trend except for INM-CM4 model.