• Title/Summary/Keyword: Error Estimation

Search Result 4,264, Processing Time 0.029 seconds

Developing a Traffic Accident Prediction Model for Freeways (고속도로 본선에서의 교통사고 예측모형 개발)

  • Mun, Sung-Ra;Lee, Young-Ihn;Lee, Soo-Beom
    • Journal of Korean Society of Transportation
    • /
    • v.30 no.2
    • /
    • pp.101-116
    • /
    • 2012
  • Accident prediction models have been utilized to predict accident possibilities in existing or projected freeways and to evaluate programs or policies for improving safety. In this study, a traffic accident prediction model for freeways was developed for the above purposes. When selecting variables for the model, the highest priority was on the ease of both collecting data and applying them into the model. The dependent variable was set as the number of total accidents and the number of accidents including casualties in the unit of IC(or JCT). As a result, two models were developed; the overall accident model and the casualty-related accident model. The error structure adjusted to each model was the negative binomial distribution and the Poisson distribution, respectively. Among the two models, a more appropriate model was selected by statistical estimation. Major nine national freeways were selected and five-year dada of 2003~2007 were utilized. Explanatory variables should take on either a predictable value such as traffic volumes or a fixed value with respect to geometric conditions. As a result of the Maximum Likelihood estimation, significant variables of the overall accident model were found to be the link length between ICs(or JCTs), the daily volumes(AADT), and the ratio of bus volume to the number of curved segments between ICs(or JCTs). For the casualty-related accident model, the link length between ICs(or JCTs), the daily volumes(AADT), and the ratio of bus volumes had a significant impact on the accident. The likelihood ratio test was conducted to verify the spatial and temporal transferability for estimated parameters of each model. It was found that the overall accident model could be transferred only to the road with four or more than six lanes. On the other hand, the casualty-related accident model was transferrable to every road and every time period. In conclusion, the model developed in this study was able to be extended to various applications to establish future plans and evaluate policies.

Estimation of Reference Crop Evapotranspiration Using Backpropagation Neural Network Model (역전파 신경망 모델을 이용한 기준 작물 증발산량 산정)

  • Kim, Minyoung;Choi, Yonghun;O'Shaughnessy, Susan;Colaizzi, Paul;Kim, Youngjin;Jeon, Jonggil;Lee, Sangbong
    • Journal of The Korean Society of Agricultural Engineers
    • /
    • v.61 no.6
    • /
    • pp.111-121
    • /
    • 2019
  • Evapotranspiration (ET) of vegetation is one of the major components of the hydrologic cycle, and its accurate estimation is important for hydrologic water balance, irrigation management, crop yield simulation, and water resources planning and management. For agricultural crops, ET is often calculated in terms of a short or tall crop reference, such as well-watered, clipped grass (reference crop evapotranspiration, $ET_o$). The Penman-Monteith equation recommended by FAO (FAO 56-PM) has been accepted by researchers and practitioners, as the sole $ET_o$ method. However, its accuracy is contingent on high quality measurements of four meteorological variables, and its use has been limited by incomplete and/or inaccurate input data. Therefore, this study evaluated the applicability of Backpropagation Neural Network (BPNN) model for estimating $ET_o$ from less meteorological data than required by the FAO 56-PM. A total of six meteorological inputs, minimum temperature, average temperature, maximum temperature, relative humidity, wind speed and solar radiation, were divided into a series of input groups (a combination of one, two, three, four, five and six variables) and each combination of different meteorological dataset was evaluated for its level of accuracy in estimating $ET_o$. The overall findings of this study indicated that $ET_o$ could be reasonably estimated using less than all six meteorological data using BPNN. In addition, it was shown that the proper choice of neural network architecture could not only minimize the computational error, but also maximize the relationship between dependent and independent variables. The findings of this study would be of use in instances where data availability and/or accuracy are limited.

The GOCI-II Early Mission Ocean Color Products in Comparison with the GOCI Toward the Continuity of Chollian Multi-satellite Ocean Color Data (천리안해양위성 연속자료 구축을 위한 GOCI-II 임무 초기 주요 해색산출물의 GOCI 자료와 비교 분석)

  • Park, Myung-Sook;Jung, Hahn Chul;Lee, Seonju;Ahn, Jae-Hyun;Bae, Sujung;Choi, Jong-Kuk
    • Korean Journal of Remote Sensing
    • /
    • v.37 no.5_2
    • /
    • pp.1281-1293
    • /
    • 2021
  • The recent launch of the GOCI-II enables South Korea to have the world's first capability in deriving the ocean color data at geostationary satellite orbit for about 20 years. It is necessary to develop a consistent long-term ocean color time-series spanning GOCI to GOCI-II mission and improve the accuracy through validation using in situ data. To assess the GOCI-II's early mission performance, the objective of this study is to compare the GOCI-II Chlorophyll-a concentration (Chl-a), Colored Dissolved Organic Matter (CDOM), and remote sensing reflectances (Rrs) through comparison with the GOCI data. Overall, the distribution of GOCI-II Chl-a corresponds with that of the GOCI over the Yellow Sea, Korea Strait, and the Ulleung Basin. In particular, a smaller RMSE value (0.07) between GOCI and GOCI-II over the summer Ulleung Basin confirms the GOCI-II data's reliability. However, despite the excellent correlation, the GOCI-II tends to overestimate Chl-a than the GOCI over the Yellow Sea and Korea Strait. The similar over-estimation bias of the GOCI-II is also notable in CDOM. Whereas no significant bias or error is found for Rrs at 490 nm and 550 nm (RMSE~0), the underestimation of Rrs at 443 nm contributes to the overestimation of GOCI-II Chl-a and CDOM over the Yellow Sea and the Korea Strait. Also, we show over-estimation of GOCI-II Rrs at 660 nm relative to GOCI to cause a possible bias in Total suspended sediment. In conclusion, this study confirms the initial reliability of the GOCI-II ocean color products, and upcoming update of GOCI-II radiometric calibration will lessen the inconsistency between GOCI and GOCI-II ocean color products.

Study on the Concentration Estimation Equation of Nitrogen Dioxide using Hyperspectral Sensor (초분광센서를 활용한 이산화질소 농도 추정식에 관한 연구)

  • Jeon, Eui-Ik;Park, Jin-Woo;Lim, Seong-Ha;Kim, Dong-Woo;Yu, Jae-Jin;Son, Seung-Woo;Jeon, Hyung-Jin;Yoon, Jeong-Ho
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.20 no.6
    • /
    • pp.19-25
    • /
    • 2019
  • The CleanSYS(Clean SYStem) is operated to monitor air pollutants emitted from specific industrial complexes in Korea. So the industrial complexes without the system are directly monitored by the control officers. For efficient monitoring, studies using various sensors have been conducted to monitor air pollutants emitted from industrial complex. In this study, hyperspectral sensors were used to model and verify the equations for estimating the concentration of $NO_2$(nitrogen dioxide) in air pollutants emitted. For development of the equations, spectral radiance were observed for $NO_2$ at various concentrations with different SZA(Solar Zenith Angle), VZA(Viewing Zenith Angle), and RAA(Relative Azimuth Angle). From the observed spectral radiance, the calculated value of the difference between the values of the specific wavelengths was taken as an absorption depth, and the equations were developed using the relationship between the depth and the $NO_2$ concentration. The spectral radiance mixed gas of $NO_2$ and $SO_2$(sulfur dioxide) was used to verify the equations. As a result, the $R^2$(coefficient of determination) and RMSE(Root Mean Square Error) were different from 0.71~0.88 and 72~23 ppm according to the form of the equation, and $R^2$ of the exponential form was the highest among the equations. Depending on the type of the equations, the accuracy of the estimated concentration with varying concentrations is not constant. However, if the equations are advanced in the future, hyperspectral sensors can be used to monitor the $NO_2$ emitted from the industrial complex.

Estimation of Soil Moisture Using Sentinel-1 SAR Images and Multiple Linear Regression Model Considering Antecedent Precipitations (선행 강우를 고려한 Sentinel-1 SAR 위성영상과 다중선형회귀모형을 활용한 토양수분 산정)

  • Chung, Jeehun;Son, Moobeen;Lee, Yonggwan;Kim, Seongjoon
    • Korean Journal of Remote Sensing
    • /
    • v.37 no.3
    • /
    • pp.515-530
    • /
    • 2021
  • This study is to estimate soil moisture (SM) using Sentinel-1A/B C-band SAR (synthetic aperture radar) images and Multiple Linear Regression Model(MLRM) in the Yongdam-Dam watershed of South Korea. Both the Sentinel-1A and -1B images (6 days interval and 10 m resolution) were collected for 5 years from 2015 to 2019. The geometric, radiometric, and noise corrections were performed using the SNAP (SentiNel Application Platform) software and converted to backscattering coefficient of VV and VH polarization. The in-situ SM data measured at 6 locations using TDR were used to validate the estimated SM results. The 5 days antecedent precipitation data were also collected to overcome the estimation difficulty for the vegetated area not reaching the ground. The MLRM modeling was performed using yearly data and seasonal data set, and correlation analysis was performed according to the number of the independent variable. The estimated SM was verified with observed SM using the coefficient of determination (R2) and the root mean square error (RMSE). As a result of SM modeling using only BSC in the grass area, R2 was 0.13 and RMSE was 4.83%. When 5 days of antecedent precipitation data was used, R2 was 0.37 and RMSE was 4.11%. With the use of dry days and seasonal regression equation to reflect the decrease pattern and seasonal variability of SM, the correlation increased significantly with R2 of 0.69 and RMSE of 2.88%.

Estimation of the Lodging Area in Rice Using Deep Learning (딥러닝을 이용한 벼 도복 면적 추정)

  • Ban, Ho-Young;Baek, Jae-Kyeong;Sang, Wan-Gyu;Kim, Jun-Hwan;Seo, Myung-Chul
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.66 no.2
    • /
    • pp.105-111
    • /
    • 2021
  • Rice lodging is an annual occurrence caused by typhoons accompanied by strong winds and strong rainfall, resulting in damage relating to pre-harvest sprouting during the ripening period. Thus, rapid estimations of the area of lodged rice are necessary to enable timely responses to damage. To this end, we obtained images related to rice lodging using a drone in Gimje, Buan, and Gunsan, which were converted to 128 × 128 pixels images. A convolutional neural network (CNN) model, a deep learning model based on these images, was used to predict rice lodging, which was classified into two types (lodging and non-lodging), and the images were divided in a 8:2 ratio into a training set and a validation set. The CNN model was layered and trained using three optimizers (Adam, Rmsprop, and SGD). The area of rice lodging was evaluated for the three fields using the obtained data, with the exception of the training set and validation set. The images were combined to give composites images of the entire fields using Metashape, and these images were divided into 128 × 128 pixels. Lodging in the divided images was predicted using the trained CNN model, and the extent of lodging was calculated by multiplying the ratio of the total number of field images by the number of lodging images by the area of the entire field. The results for the training and validation sets showed that accuracy increased with a progression in learning and eventually reached a level greater than 0.919. The results obtained for each of the three fields showed high accuracy with respect to all optimizers, among which, Adam showed the highest accuracy (normalized root mean square error: 2.73%). On the basis of the findings of this study, it is anticipated that the area of lodged rice can be rapidly predicted using deep learning.

Estimation of Stem Taper Equations and Stem Volume Table for Phyllostachys pubescens Mazel in South Korea (맹종죽의 수간곡선식 및 수간재적표 추정)

  • Eun-Ji, Bae;Yeong-Mo, Son;Jin-Taek, Kang
    • Journal of Korean Society of Forest Science
    • /
    • v.111 no.4
    • /
    • pp.622-629
    • /
    • 2022
  • The study aim was to derive a stem taper equation for Phyllostachys pubescens, a type of bamboo in South Korea, and to develop a stem volume table. To derive the stem taper equation, three stem taper models (Max & Burkhart, Kozak, and Lee) were used. Since bamboo stalks are hollow because of its woody characteristics, the outer and inner diameters of the tree were calculated, and connecting them enabled estimating the tree curves. The results of the three equations for estimating the outer and inner diameters led to selection of the Kozak model for determining the optimal stem taper because it had the highest fitness index and lowest error and bias. We used the Kozak model to estimate the diameter of Phyllostachys pubescens by stem height, which proved optimal, and drew the stem curve. After checking the residual degree in the stem taper equation, all residuals were distributed around "0", which proved the suitability of the equation. To calculate the stem volume of Phyllostachys pubescens, a rotating cube was created by rotating the stem curve with the outer diameter at 360°, and the volume was calculated by applying Smalian's method. The volume of Phyllostachys pubescens was calculated by deducting the inner diameter calculated volume from the outer diameter calculated volume. The volume of Phyllostachys pubescens was only 20~30% of the volume of Larix kaempferi, which is a general species. However, considering the current trees/ha of Phyllostachys pubescens and the amount of bamboo shoots generated every year, the individual tree volume was predicted to be small, but the volume/ha was not very different or perhaps more. The significance of this study is the stem taper equation and stem volume table for Phyllostachys pubescens developed for the first time in South Korea. The results are expected to be used as basic data for bamboo trading that is in increasing public and industrial demand and carbon absorption estimation.

Rice Yield Estimation Using Sentinel-2 Satellite Imagery, Rainfall and Soil Data (Sentinel-2 위성영상과 강우 및 토양자료를 활용한 벼 수량 추정)

  • KIM, Kyoung-Seop;CHOUNG, Yun-Jae;JUN, Byong-Woon
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.25 no.1
    • /
    • pp.133-149
    • /
    • 2022
  • Existing domestic studies on estimating rice yield were mainly implemented at the level of cities and counties in the entire nation using MODIS satellite images with low spatial resolution. Unlike previous studies, this study tried to estimate rice yield at the level of eup-myon-dong in Gimje-si, Jeollabuk-do using Sentinel-2 satellite images with medium spatial resolution, rainfall and soil data, and then to evaluate its accuracy. Five vegetation indices such as NDVI, LAI, EVI2, MCARI1 and MCARI2 derived from Sentinel-2 images of August 1, 2018 for Gimje-si, Jeollabuk-do, rainfall and paddy soil-type data were aggregated by the level of eup-myon-dong and then rice yield was estimated with gamma generalized linear model, an expanded variant of multi-variate regression analysis to solve the non-normality problem of dependent variable. In the rice yield model finally developed, EVI2, rainfall days in September, and saline soils ratio were used as significant independent variables. The coefficient of determination representing the model fit was 0.68 and the RMSE for showing the model accuracy was 62.29kg/10a. This model estimated the total rice production in Gimje-si in 2018 to be 96,914.6M/T, which was very close to 94,470.3M/T the actual amount specified in the Statistical Yearbook with an error of 0.46%. Also, the rice production per unit area of Gimje-si was amounted to 552kg/10a, which was almost consistent with 550kg/10a of the statistical data. This result is similar to that of the previous studies and it demonstrated that the rice yield can be estimated using Sentinel-2 satellite images at the level of cities and counties or smaller districts in Korea.

Validation of Sea Surface Wind Speeds from Satellite Altimeters and Relation to Sea State Bias - Focus on Wind Measurements at Ieodo, Marado, Oeyeondo Stations (인공위성 고도계 해상풍 검증과 해상상태편차와의 관련성 - 이어도, 마라도, 외연도 해상풍 관측치를 중심으로 -)

  • Choi, Do-Young;Woo, Hye-Jin;Park, Kyung-Ae;Byun, Do-Seong;Lee, Eunil
    • Journal of the Korean earth science society
    • /
    • v.39 no.2
    • /
    • pp.139-153
    • /
    • 2018
  • The sea surface wind field has long been obtained from satellite scatterometers or passive microwave radiometers. However, the importance of satellite altimeter-derived wind speed has seldom been addressed because of the outstanding capability of the scatterometers. Satellite altimeter requires the accurate wind speed data, measured simultaneously with sea surface height observations, to enhance the accuracy of sea surface height through the correction of sea state bias. This study validates the wind speeds from the satellite altimeters (GFO, Jason-1, Envisat, Jason-2, Cryosat-2, SARAL) and analyzes characteristics of errors. In total, 1504 matchup points were produced using the wind speed data of Ieodo Ocean Research Station (IORS) and of Korea Meteorological Administration (KMA) buoys at Marado and Oeyeondo stations for 10 years from December 2007 to May 2016. The altimeter wind speed showed a root mean square error (RMSE) of about $1.59m\;s^{-1}$ and a negative bias of $-0.35m\;s^{-1}$ with respect to the in-situ wind speed. Altimeter wind speeds showed characteristic biases that they were higher (lower) than in-situ wind speeds at low (high) wind speed ranges. Some tendency was found that the difference between the maximum and minimum value gradually increased with distance from the buoy stations. For the improvement of the accuracy of altimeter wind speed, an equation for correction was derived based on the characteristics of errors. In addition, the significance of altimeter wind speed on the estimation of sea surface height was addressed by presenting the effect of the corrected wind speeds on the sea state bias values of Jason-1.

A Study on Sample Allocation for Stratified Sampling (층화표본에서의 표본 배분에 대한 연구)

  • Lee, Ingue;Park, Mingue
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.6
    • /
    • pp.1047-1061
    • /
    • 2015
  • Stratified random sampling is a powerful sampling strategy to reduce variance of the estimators by incorporating useful auxiliary information to stratify the population. Sample allocation is the one of the important decisions in selecting a stratified random sample. There are two common methods, the proportional allocation and Neyman allocation if we could assume data collection cost for different observation units equal. Theoretically, Neyman allocation considering the size and standard deviation of each stratum, is known to be more effective than proportional allocation which incorporates only stratum size information. However, if the information on the standard deviation is inaccurate, the performance of Neyman allocation is in doubt. It has been pointed out that Neyman allocation is not suitable for multi-purpose sample survey that requires the estimation of several characteristics. In addition to sampling error, non-response error is another factor to evaluate sampling strategy that affects the statistical precision of the estimator. We propose new sample allocation methods using the available information about stratum response rates at the designing stage to improve stratified random sampling. The proposed methods are efficient when response rates differ considerably among strata. In particular, the method using population sizes and response rates improves the Neyman allocation in multi-purpose sample survey.