• Title/Summary/Keyword: Estimation error

Search Result 4,252, Processing Time 0.03 seconds

Simultaneous estimation of fatty acids contents from soybean seeds using fourier transform infrared spectroscopy and gas chromatography by multivariate analysis (적외선 분광스펙트럼 및 기체크로마토그라피 분석 데이터의 다변량 통계분석을 이용한 대두 종자 지방산 함량예측)

  • Ahn, Myung Suk;Ji, Eun Yee;Song, Seung Yeob;Ahn, Joon Woo;Jeong, Won Joong;Min, Sung Ran;Kim, Suk Weon
    • Journal of Plant Biotechnology
    • /
    • v.42 no.1
    • /
    • pp.60-70
    • /
    • 2015
  • The aim of this study was to investigate whether fourier transform infrared (FT-IR) spectroscopy can be applied to simultaneous determination of fatty acids contents in different soybean cultivars. Total 153 lines of soybean (Glycine max Merrill) were examined by FT-IR spectroscopy. Quantification of fatty acids from the soybean lines was confirmed by quantitative gas chromatography (GC) analysis. The quantitative spectral variation among different soybean lines was observed in the amide bond region ($1,700{\sim}1,500cm^{-1}$), phosphodiester groups ($1,500{\sim}1,300cm^{-1}$) and sugar region ($1,200{\sim}1,000cm^{-1}$) of FT-IR spectra. The quantitative prediction modeling of 5 individual fatty acids contents (palmitic acid, stearic acid, oleic acid, linoleic acid, linolenic acid) from soybean lines were established using partial least square regression algorithm from FT-IR spectra. In cross validation, there were high correlations ($R^2{\geq}0.97$) between predicted content of 5 individual fatty acids by PLS regression modeling from FT-IR spectra and measured content by GC. In external validation, palmitic acid ($R^2=0.8002$), oleic acid ($R^2=0.8909$) and linoleic acid ($R^2=0.815$) were predicted with good accuracy, while prediction for stearic acid ($R^2=0.4598$), linolenic acid ($R^2=0.6868$) had relatively lower accuracy. These results clearly show that FT-IR spectra combined with multivariate analysis can be used to accurately predict fatty acids contents in soybean lines. Therefore, we suggest that the PLS prediction system for fatty acid contents using FT-IR analysis could be applied as a rapid and high throughput screening tool for the breeding for modified Fatty acid composition in soybean and contribute to accelerating the conventional breeding.

The PRISM-based Rainfall Mapping at an Enhanced Grid Cell Resolution in Complex Terrain (복잡지형 고해상도 격자망에서의 PRISM 기반 강수추정법)

  • Chung, U-Ran;Yun, Kyung-Dahm;Cho, Kyung-Sook;Yi, Jae-Hyun;Yun, Jin-I.
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.11 no.2
    • /
    • pp.72-78
    • /
    • 2009
  • The demand for rainfall data in gridded digital formats has increased in recent years due to the close linkage between hydrological models and decision support systems using the geographic information system. One of the most widely used tools for digital rainfall mapping is the PRISM (parameter-elevation regressions on independent slopes model) which uses point data (rain gauge stations), a digital elevation model (DEM), and other spatial datasets to generate repeatable estimates of monthly and annual precipitation. In the PRISM, rain gauge stations are assigned with weights that account for other climatically important factors besides elevation, and aspects and the topographic exposure are simulated by dividing the terrain into topographic facets. The size of facet or grid cell resolution is determined by the density of rain gauge stations and a $5{\times}5km$ grid cell is considered as the lowest limit under the situation in Korea. The PRISM algorithms using a 270m DEM for South Korea were implemented in a script language environment (Python) and relevant weights for each 270m grid cell were derived from the monthly data from 432 official rain gauge stations. Weighted monthly precipitation data from at least 5 nearby stations for each grid cell were regressed to the elevation and the selected linear regression equations with the 270m DEM were used to generate a digital precipitation map of South Korea at 270m resolution. Among 1.25 million grid cells, precipitation estimates at 166 cells, where the measurements were made by the Korea Water Corporation rain gauge network, were extracted and the monthly estimation errors were evaluated. An average of 10% reduction in the root mean square error (RMSE) was found for any months with more than 100mm monthly precipitation compared to the RMSE associated with the original 5km PRISM estimates. This modified PRISM may be used for rainfall mapping in rainy season (May to September) at much higher spatial resolution than the original PRISM without losing the data accuracy.

Parameterization and Application of a Forest Landscape Model by Using National Forest Inventory and Long Term Ecological Research Data (국가산림자원조사와 장기생태연구 자료를 활용한 산림경관모형의 모수화 및 적용성 평가)

  • Cho, Wonhee;Lim, Wontaek;Kim, Eun-Sook;Lim, Jong-Hwan;Ko, Dongwook W.
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.22 no.3
    • /
    • pp.215-231
    • /
    • 2020
  • Forest landscape models (FLMs) can be used to investigate the complex interactions of various ecological processes and patterns, which makes them useful tools to evaluate how environmental and anthropogenic variables can influence forest ecosystems. However, due to the large spatio-temporal scales in FLMs studies, parameterization and validation can be extremely challenging when applying to new study areas. To address this issue, we focused on the parameterization and application of a spatially explicit forest landscape model, LANDIS-II, to Mt. Gyebang, South Korea, with the use of the National Forest Inventory (NFI) and long-term ecological research (LTER) site data. In this study, we present the followings for the biomass succession extension of LANDIS-II: 1) species-specific and spatial parameters estimation for the biomass succession extension of LANDIS-II, 2) calibration, and 3) application and validation for Mt. Gyebang. For the biomass succession extension, we selected 14 tree species, and parameterized ecoregion map, initial community map, species growth characteristics. We produced ecoregion map using elevation, aspect, and topographic wetness index based on digital elevation model. Initial community map was produced based on NFI and sub-alpine survey data. Tree species growth parameters, such as aboveground net primary production and maximum aboveground biomass, were estimated from PnET-II model based on species physiological factors and environmental variables. Literature data were used to estimate species physiological factors, such as FolN, SLWmax, HalfSat, growing temperature, and shade tolerance. For calibration and validation purposes, we compared species-specific aboveground biomass of model outputs and NFI and sub-alpine survey data and calculated coefficient of determination (R2) and root mean square error (RMSE). The final model performed very well, with 0. 98 R2 and 8. 9 RMSE. This study can serve as a foundation for the use of FLMs to other applications such as comparing alternative forest management scenarios and natural disturbance effects.

Agroclimatology of North Korea for Paddy Rice Cultivation: Preliminary Results from a Simulation Experiment (생육모의에 의한 북한지방 시ㆍ군별 벼 재배기후 예비분석)

  • Yun Jin-Il;Lee Kwang-Hoe
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.2 no.2
    • /
    • pp.47-61
    • /
    • 2000
  • Agroclimatic zoning was done for paddy rice culture in North Korea based on a simulation experiment. Daily weather data for the experiment were generated by 3 steps consisting of spatial interpolation based on topoclimatological relationships, zonal summarization of grid cell values, and conversion of monthly climate data to daily weather data. Regression models for monthly climatological temperature estimation were derived from a statistical procedure using monthly averages of 51 standard weather stations in South and North Korea (1981-1994) and their spatial variables such as latitude, altitude, distance from the coast, sloping angle, and aspect-dependent field of view (openness). Selected models (0.4 to 1.6$^{\circ}C$ RMSE) were applied to the generation of monthly temperature surface over the entire North Korean territory on 1 km$\times$l km grid spacing. Monthly precipitation data were prepared by a procedure described in Yun (2000). Solar radiation data for 27 North Korean stations were reproduced by applying a relationship found in South Korea ([Solar Radiation, MJ m$^{-2}$ day$^{-1}$ ] =0.344 + 0.4756 [Extraterrestrial Solar Irradiance) + 0.0299 [Openness toward south, 0 - 255) - 1.307 [Cloud amount, 0 - 10) - 0.01 [Relative humidity, %), $r^2$=0.92, RMSE = 0.95 ). Monthly solar irradiance data of 27 points calculated from the reproduced data set were converted to 1 km$\times$1 km grid data by inverse distance weighted interpolation. The grid cell values of monthly temperature, solar radiation, and precipitation were summed up to represent corresponding county, which will serve as a land unit for the growth simulation. Finally, we randomly generated daily maximum and minimum temperature, solar irradiance and precipitation data for 30 years from the monthly climatic data for each county based on a statistical method suggested by Pickering et a1. (1994). CERES-rice, a rice growth simulation model, was tuned to accommodate agronomic characteristics of major North Korean cultivars based on observed phenological and yield data at two sites in South Korea during 1995~1998. Daily weather data were fed into the model to simulate the crop status at 183 counties in North Korea for 30 years. Results were analyzed with respect to spatial and temporal variation in yield and maturity, and used to score the suitability of the county for paddy rice culture.

  • PDF

A Study on the Stock Assessment and Management Implications of the Korean Aucha perch (Coreoperca herzi) in Freshwater: (1) Estimation of Population Ecological Characteristics of Coreoperca herzi in the Mid-Upper System of the Seomjin River (담수산 어류 꺽지 (Coreoperca herzi)의 자원 평가 및 관리 방안 연구: 섬진강 중.상류 수계에서 꺽지의 개체군 생태학적 특성치 추정 (1))

  • Jang, Sung-Hyun;Ryu, Hui-Seong;Lee, Jung-Ho
    • Korean Journal of Ecology and Environment
    • /
    • v.43 no.1
    • /
    • pp.82-90
    • /
    • 2010
  • The ecological characteristics of the Korean Aucha perch, Coreoperca herzi, were determined in order to estimate stock of the mid-upper system of the Seomjin River. The age was determined by counting the otolith annuli. The oldest fish observed in this study was 5 years old. Relationships between body length (BL) and body weight (BW) were $BW=0.0195BL^{3.08}$ ($R^2=0.966$) (p<0.01). Relationships between the otolith radius (R) and body length (BL) were BL=3.882R+1.66 ($R^2=0.944$). The von Bertalanffy growth parameters estimated from a non-linear regression method were $L_{\infty}=19.68\;cm$, $W_{\infty}=188.64\;g$, $K=0.17\;year^{-1}$ and $t_0=-1.46$ year. Therefore, growth in length of the fish was expressed by the von Bertalanffy's growth equation as $L_t=19.68$ ($1-e^{-0.17(t+1.46)}$) ($R^2=0.997$). The annual survival rate (S) was estimated to be $0.666\;year^{-1}$. The instantaneous coefficient of natural mortality (M) of estimated from the Zhang and Megrey method was $0.346\;year^{-1}$, and instantaneous coefficient of fishing mortality (F) was calculated $0.061\;year^{-1}$. From the estimates of survival rate (S), the instantaneous coefficient of total mortality(Z) was estimated to be $0.407\;year^{-1}$.

Methodological Comparison of the Quantification of Total Carbon and Organic Carbon in Marine Sediment (해양 퇴적물내 총탄소 및 유기탄소의 분석기법 고찰)

  • Kim, Kyeong-Hong;Son, Seung-Kyu;Son, Ju-Won;Ju, Se-Jong
    • Journal of the Korean Society for Marine Environment & Energy
    • /
    • v.9 no.4
    • /
    • pp.235-242
    • /
    • 2006
  • The precise estimation of total and organic carbon contents in sediments is fundamental to understand the benthic environment. To test the precision and accuracy of CHN analyzer and the procedure to quantify total and organic carbon contents(using in-situ acidification with sulfurous acid($H_2SO_3$)) in the sediment, the reference material s such as Acetanilide($C_8H_9NO$), Sulfanilammide($C_6H_8N_2O_2S$), and BCSS-1(standard estuary sediment) were used. The results indicate that CHN analyzer to quantify carbon and nitrogen content has high precision(percent error=3.29%) and accuracy(relative standard deviation=1.26%). Additionally, we conducted the instrumental comparison of carbon values analyzed using CHN analyzer and Coulometeric Carbon Analyzer. Total carbon contents measured from two different instruments were highly correlated($R^2=0.9993$, n=84, p<0.0001) with a linear relationship and show no significant differences(paired t-test, p=0.0003). The organic carbon contents from two instruments also showed the similar results with a significant linear relationship($R^2=0.8867$, n=84, p<0.0001) and no significant differences(paired t-test, p<0.0001). Although it is possible to overestimate organic carbon contents for some sediment types having high inorganic carbon contents(such as calcareous ooze) due to procedural and analytical errors, analysis of organic carbon contents in sediments using CHN Analyzer and current procedures seems to provide the best estimates. Therefore, we recommend that this method can be applied to measure the carbon content in normal any sediment samples and are considered to be one of the best procedure far routine analysis of total and organic carbon.

  • PDF

Analysis and Prediction of Sewage Components of Urban Wastewater Treatment Plant Using Neural Network (대도시 하수종말처리장 유입 하수의 성상 평가와 인공신경망을 이용한 구성성분 농도 예측)

  • Jeong, Hyeong-Seok;Lee, Sang-Hyung;Shin, Hang-Sik;Song, Eui-Yeol
    • Journal of Korean Society of Environmental Engineers
    • /
    • v.28 no.3
    • /
    • pp.308-315
    • /
    • 2006
  • Since sewage characteristics are the most important factors that can affect the biological reactions in wastewater treatment plants, a detailed understanding on the characteristics and on-line measurement techniques of the influent sewage would play an important role in determining the appropriate control strategies. In this study, samples were taken at two hour intervals during 51 days from $1^{st}$ October to $21^{st}$ November 2005 from the influent gate of sewage treatment plant. Then the characteristics of sewage were investigated. It was found that the daily values of flow rate and concentrations of sewage components showed a defined profile. The highest and lowest peak values were observed during $11:00{\sim}13:00$ hours and $05:00{\sim}07:00$ hours, respectively. Also, it was shown that the concentrations of sewage components were strongly correlated with the absorbance measured at 300 nm of UV. Therefore, the objective of the paper is to develop on-line estimation technique of the concentration of each component in the sewage using accumulated profiles of sewage, absorbance, and flow rate which can be measured in real time. As a first step, regression analysis was performed using the absorbance and component concentration data. Then a neural network trained with the input of influent flow rate, absorbance, and inflow duration was used. Both methods showed remarkable accuracy in predicting the resulting concentrations of the individual components of the sewage. In case of using the neural network, the predicted value md of the measurement were 19.3 and 14.4 for TSS, 26.7 and 25.1 for TCOD, 5.4 and 4.1 for TN, and for TP, 0.45 to 0.39, respectively.

Evaluation of the Satellite-based Air Temperature for All Sky Conditions Using the Automated Mountain Meteorology Station (AMOS) Records: Gangwon Province Case Study (산악기상관측정보를 이용한 위성정보 기반의 전천후 기온 자료의 평가 - 강원권역을 중심으로)

  • Jang, Keunchang;Won, Myoungsoo;Yoon, Sukhee
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.19 no.1
    • /
    • pp.19-26
    • /
    • 2017
  • Surface air temperature ($T_{air}$) is a key variable for the meteorology and climatology, and is a fundamental factor of the terrestrial ecosystem functions. Satellite remote sensing from the Moderate Resolution Imaging Spectroradiometer (MODIS) provides an opportunity to monitor the $T_{air}$. However, the several problems such as frequent cloud cover and mountainous region can result in substantial retrieval error and signal loss in MODIS $T_{air}$. In this study, satellite-based $T_{air}$ was estimated under both clear and cloudy sky conditions in Gangwon Province using Aqua MODIS07 temperature profile product (MYD07_L2) and GCOM-W1 Advanced Microwave Scanning Radiometer 2 (AMSR2) brightness temperature ($T_b$) at 37 GHz frequency, and was compared with the measurements from the Automated Mountain Meteorology Stations (AMOS). The application of ambient temperature lapse rate was performed to improve the retrieval accuracy in mountainous region, which showed the improvement of estimation accuracy approximately 4% of RMSE. A simple pixel-wise regression method combining synergetic information from MYD07_L2 $T_{air}$ and AMSR2 $T_b$ was applied to estimate surface $T_{air}$ for all sky conditions. The $T_{air}$ retrievals showed favorable agreement in comparison with AMOS data (r=0.80, RMSE=7.9K), though the underestimation was appeared in winter season. Substantial $T_{air}$ retrievals were estimated 61.4% (n=2,657) for cloudy sky conditions. The results presented in this study indicate that the satellite remote sensing can produce the surface $T_{air}$ at the complex mountainous region for all sky conditions.

Downscaling of Sunshine Duration for a Complex Terrain Based on the Shaded Relief Image and the Sky Condition (하늘상태와 음영기복도에 근거한 복잡지형의 일조시간 분포 상세화)

  • Kim, Seung-Ho;Yun, Jin I.
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.18 no.4
    • /
    • pp.233-241
    • /
    • 2016
  • Experiments were carried out to quantify the topographic effects on attenuation of sunshine in complex terrain and the results are expected to help convert the coarse resolution sunshine duration information provided by the Korea Meteorological Administration (KMA) into a detailed map reflecting the terrain characteristics of mountainous watershed. Hourly shaded relief images for one year, each pixel consisting of 0 to 255 brightness value, were constructed by applying techniques of shadow modeling and skyline analysis to the 3m resolution digital elevation model for an experimental watershed on the southern slope of Mt. Jiri in Korea. By using a bimetal sunshine recorder, sunshine duration was measured at three points with different terrain conditions in the watershed from May 15, 2015 to May 14, 2016. The brightness values of the 3 corresponding pixel points on the shaded relief map were extracted and regressed to the measured sunshine duration, resulting in a brightness-sunshine duration response curve for a clear day. We devised a method to calibrate this curve equation according to sky condition categorized by cloud amount and used it to derive an empirical model for estimating sunshine duration over a complex terrain. When the performance of this model was compared with a conventional scheme for estimating sunshine duration over a horizontal plane, the estimation bias was improved remarkably and the root mean square error for daily sunshine hour was 1.7hr, which is a reduction by 37% from the conventional method. In order to apply this model to a given area, the clear-sky sunshine duration of each pixel should be produced on hourly intervals first, by driving the curve equation with the hourly shaded relief image of the area. Next, the cloud effect is corrected by 3-hourly 'sky condition' of the KMA digital forecast products. Finally, daily sunshine hour can be obtained by accumulating the hourly sunshine duration. A detailed sunshine duration distribution of 3m horizontal resolution was obtained by applying this procedure to the experimental watershed.

A Case Study on Forecasting Inbound Calls of Motor Insurance Company Using Interactive Data Mining Technique (대화식 데이터 마이닝 기법을 활용한 자동차 보험사의 인입 콜량 예측 사례)

  • Baek, Woong;Kim, Nam-Gyu
    • Journal of Intelligence and Information Systems
    • /
    • v.16 no.3
    • /
    • pp.99-120
    • /
    • 2010
  • Due to the wide spread of customers' frequent access of non face-to-face services, there have been many attempts to improve customer satisfaction using huge amounts of data accumulated throughnon face-to-face channels. Usually, a call center is regarded to be one of the most representative non-faced channels. Therefore, it is important that a call center has enough agents to offer high level customer satisfaction. However, managing too many agents would increase the operational costs of a call center by increasing labor costs. Therefore, predicting and calculating the appropriate size of human resources of a call center is one of the most critical success factors of call center management. For this reason, most call centers are currently establishing a department of WFM(Work Force Management) to estimate the appropriate number of agents and to direct much effort to predict the volume of inbound calls. In real world applications, inbound call prediction is usually performed based on the intuition and experience of a domain expert. In other words, a domain expert usually predicts the volume of calls by calculating the average call of some periods and adjusting the average according tohis/her subjective estimation. However, this kind of approach has radical limitations in that the result of prediction might be strongly affected by the expert's personal experience and competence. It is often the case that a domain expert may predict inbound calls quite differently from anotherif the two experts have mutually different opinions on selecting influential variables and priorities among the variables. Moreover, it is almost impossible to logically clarify the process of expert's subjective prediction. Currently, to overcome the limitations of subjective call prediction, most call centers are adopting a WFMS(Workforce Management System) package in which expert's best practices are systemized. With WFMS, a user can predict the volume of calls by calculating the average call of each day of the week, excluding some eventful days. However, WFMS costs too much capital during the early stage of system establishment. Moreover, it is hard to reflect new information ontothe system when some factors affecting the amount of calls have been changed. In this paper, we attempt to devise a new model for predicting inbound calls that is not only based on theoretical background but also easily applicable to real world applications. Our model was mainly developed by the interactive decision tree technique, one of the most popular techniques in data mining. Therefore, we expect that our model can predict inbound calls automatically based on historical data, and it can utilize expert's domain knowledge during the process of tree construction. To analyze the accuracy of our model, we performed intensive experiments on a real case of one of the largest car insurance companies in Korea. In the case study, the prediction accuracy of the devised two models and traditional WFMS are analyzed with respect to the various error rates allowable. The experiments reveal that our data mining-based two models outperform WFMS in terms of predicting the amount of accident calls and fault calls in most experimental situations examined.