• Title/Summary/Keyword: Prediction accuracy

Analysis and Prediction of Sewage Components of Urban Wastewater Treatment Plant Using Neural Network (대도시 하수종말처리장 유입 하수의 성상 평가와 인공신경망을 이용한 구성성분 농도 예측)

  • Jeong, Hyeong-Seok;Lee, Sang-Hyung;Shin, Hang-Sik;Song, Eui-Yeol
    • Journal of Korean Society of Environmental Engineers / v.28 no.3 / pp.308-315 / 2006
  • Since sewage characteristics are the most important factors affecting the biological reactions in wastewater treatment plants, a detailed understanding of the characteristics of the influent sewage, together with on-line measurement techniques, plays an important role in determining appropriate control strategies. In this study, samples were taken at two-hour intervals over 51 days, from 1 October to 21 November 2005, at the influent gate of a sewage treatment plant, and the characteristics of the sewage were investigated. The daily values of flow rate and of the concentrations of sewage components showed a well-defined profile. The highest and lowest values were observed during 11:00-13:00 and 05:00-07:00, respectively. The concentrations of sewage components were also strongly correlated with the UV absorbance measured at 300 nm. The objective of the paper is therefore to develop an on-line technique for estimating the concentration of each sewage component from accumulated sewage profiles together with absorbance and flow rate, both of which can be measured in real time. As a first step, regression analysis was performed using the absorbance and component concentration data. Then a neural network trained with influent flow rate, absorbance, and inflow duration as inputs was used. Both methods predicted the concentrations of the individual sewage components with remarkable accuracy. With the neural network, the predicted and measured values were 19.3 and 14.4 for TSS, 26.7 and 25.1 for TCOD, 5.4 and 4.1 for TN, and 0.45 and 0.39 for TP, respectively.

An Empirical Model for Forecasting Alternaria Leaf Spot in Apple (사과 점무늬낙엽병(斑點落葉病)예찰을 위한 한 경험적 모델)

  • Kim, Choong-Hoe;Cho, Won-Dae;Kim, Seung-Chul
    • Korean journal of applied entomology / v.25 no.4 s.69 / pp.221-228 / 1986
  • An empirical model to predict the initial occurrence and subsequent progress of Alternaria leaf spot was constructed from modified degree-day temperature and rainfall frequency observed in three years of field experiments. Climatic factors were analyzed on a 10-day basis from April 20 to the end of August and were used as variables for model construction. The cumulative degree portion (CDP), the accumulated portion of daily average temperature above $10^{\circ}C$, was used as a parameter to determine the relationship between temperature and initial disease occurrence. A CDP of about 160 was needed to initiate disease incidence, and this value was taken as the temperature threshold. After 160 CDP was reached, the time of initial occurrence was determined by rainfall frequency: at least four rainfall events had to accumulate after the temperature threshold was passed before the disease first occurred. Disease progress after initial incidence generally followed the pattern of rainfall frequency accumulated over the corresponding periods. The apparent infection rate ($r$) in the general differential equation $dx/dt=xr(1-x)$ for individual epidemics, where $x$ is the disease proportion and $t$ is time, was a linear function of the accumulation rate of rainfall frequency ($R_c$) and could be estimated directly from the equation $r=1.06R_c-0.11$ ($R^2=0.993$). Disease severity ($x$) after time $t$ could be predicted using the exponential equation $x/(1-x)=[x_0/(1-x_0)]e^{(b_0+b_1R_c)t}$ derived from the differential equation, where $x_0$ is the initial disease proportion and $b_0$ and $b_1$ are constants. There was a significant linear relationship between disease progress and the cumulative number of air-borne conidia of Alternaria mali. However, when the cumulative number of air-borne conidia was used as the independent variable to predict disease severity, prediction accuracy was poor, with $R^2=0.3328$.
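
The relationship between rainfall frequency and disease progress in the abstract above can be illustrated with a short worked example. This is a minimal sketch, assuming that $b_0=-0.11$ and $b_1=1.06$ (so that $b_0+b_1R_c$ equals the fitted $r$) and that $t$ is in the same time unit used to estimate $r$; the abstract does not state that unit. The function names and the input values in the usage note are hypothetical.

```python
import math


def infection_rate(rc):
    """Apparent infection rate from the abstract's fit: r = 1.06*Rc - 0.11."""
    return 1.06 * rc - 0.11


def severity(x0, r, t):
    """Solve dx/dt = x*r*(1 - x) in closed form.

    Uses x/(1-x) = [x0/(1-x0)] * exp(r*t), then converts the odds
    back to a proportion between 0 and 1.
    """
    odds = x0 / (1.0 - x0) * math.exp(r * t)
    return odds / (1.0 + odds)


def predict_severity(x0, rc, t):
    """Disease proportion after time t for a given rainfall accumulation rate."""
    return severity(x0, infection_rate(rc), t)
```

For instance, `predict_severity(0.01, 0.5, 10)` evaluates the curve for a hypothetical initial proportion of 1% and $R_c=0.5$. At $t=0$ the function returns the initial proportion unchanged, which is a quick sanity check on the closed form.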

Personalized Exhibition Booth Recommendation Methodology Using Sequential Association Rule (순차 연관 규칙을 이용한 개인화된 전시 부스 추천 방법)

  • Moon, Hyun-Sil;Jung, Min-Kyu;Kim, Jae-Kyeong;Kim, Hyea-Kyeong
    • Journal of Intelligence and Information Systems / v.16 no.4 / pp.195-211 / 2010
  • An exhibition is a market event held for a specific duration to present exhibitors' main product range to business or private visitors, and it plays a key role as an effective marketing channel. Because visitors' opinions after the exhibition directly affect sales and company image, exhibition organizers must consider the various needs of visitors. To meet these needs, ubiquitous technologies have been applied in some exhibitions. However, despite the development of ubiquitous technologies, their services cannot always reflect visitors' preferences because they generate information only when visitors request it. As a result, they have reached their limit in meeting visitors' needs, which may lead to lost marketing opportunities. Recommendation systems can overcome these limitations: they recommend booths that match visitors' preferences and thereby help visitors who have difficulty choosing in an exhibition environment. One of the most successful and widely used technologies for building recommender systems is Collaborative Filtering. Traditional recommender systems, however, use only neighbors' evaluations or behaviors for a personalized prediction. Therefore, they cannot reflect visitors' dynamic preferences and lack accuracy in an exhibition environment. Although a ubiquitous environment offers much useful information for inferring visitors' preferences (e.g., visitors' current location and booth visit path), they use only limited information for recommendation. In this study, we propose a booth recommendation methodology using Sequential Association Rule, which considers the sequence of visits. Recent studies of Sequential Association Rule use constraints to improve performance. However, since traditional Sequential Association Rule considers all rules for recommendation, it has a scalability problem when applied to a large exhibition. To solve this problem, our methodology builds a confidence database before the recommendation process. To build it, we first search for preceding rules whose frequency is above a threshold. Next, we compute the confidence of each preceding rule for each booth not contained in that rule. The confidence database therefore holds two kinds of information: the preceding rules and their confidence for each booth. In the recommendation process, we simply generate the preceding rules of the target visitor from the visit records and recommend booths according to the confidence database. Through these steps, we expect to reduce the time spent on the recommendation process. To evaluate the proposed methodology, we use real booth visit records collected by RFID technology at an IT exhibition. The booth visit records also contain the visit sequence of each visitor. We compare the performance of the proposed methodology with a traditional Collaborative Filtering system. The proposed methodology generally shows higher performance than traditional Collaborative Filtering. The experimental results also reveal some of its features. First, it shows the highest performance when recommending one booth. It detects preceding rules shared by some portion of the visitors. Because it is trained on all visitors, it cannot give a correct recommendation to a visitor whose path differs greatly from that of the other visitors, even if the number of recommendations is increased. Second, the performance of general recommendation systems increases as time passes, whereas our methodology shows higher performance with limited information such as one or two time periods. Therefore, it can recommend even when little information is available in the target visitor's booth visit records, and it uses only a small amount of information in the recommendation process. We expect that it can give real-time recommendations in an exhibition environment. Overall, our methodology shows higher performance than traditional Collaborative Filtering systems, and we expect that it could be applied in booth recommendation systems to satisfy visitors in an exhibition environment.
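
The two-phase idea described above (precompute a confidence database, then look up the target visitor's preceding rule) might be sketched as follows. This is not the authors' implementation: it assumes that a preceding rule is a run of `rule_len` consecutively visited booths and that confidence is the share of visitors showing that run who later visited a given booth. The function names, parameters, and example data are hypothetical.

```python
from collections import Counter, defaultdict


def build_confidence_db(sequences, rule_len=2, min_support=2):
    """Precompute confidence(rule -> booth) for frequent preceding rules."""
    rule_count = Counter()               # visitors whose path contains the rule
    follow_count = defaultdict(Counter)  # rule -> booth -> visitors who went there later
    for seq in sequences:
        seen = set()
        for i in range(len(seq) - rule_len + 1):
            rule = tuple(seq[i:i + rule_len])
            if rule in seen:             # count each rule once per visitor
                continue
            seen.add(rule)
            rule_count[rule] += 1
            # booths visited after the rule that are not part of the rule itself
            for booth in set(seq[i + rule_len:]) - set(rule):
                follow_count[rule][booth] += 1
    return {
        rule: {b: c / n for b, c in follow_count[rule].items()}
        for rule, n in rule_count.items()
        if n >= min_support              # keep only rules above the threshold
    }


def recommend(db, visited, rule_len=2, top_n=1):
    """Look up the visitor's latest preceding rule and rank unvisited booths."""
    scores = db.get(tuple(visited[-rule_len:]), {})
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [b for b, _ in ranked if b not in visited][:top_n]


visits = [["A", "B", "C"], ["A", "B", "C", "D"], ["A", "B", "D"], ["A", "B", "C"]]
db = build_confidence_db(visits)
print(recommend(db, ["A", "B"]))
```

Because the recommendation step is a dictionary lookup plus a sort over one rule's candidate booths, its cost does not grow with the number of stored rules, which is the scalability benefit the abstract points to.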

The NCAM Land-Atmosphere Modeling Package (LAMP) Version 1: Implementation and Evaluation (국가농림기상센터 지면대기모델링패키지(NCAM-LAMP) 버전 1: 구축 및 평가)

  • Lee, Seung-Jae;Song, Jiae;Kim, Yu-Jung
    • Korean Journal of Agricultural and Forest Meteorology / v.18 no.4 / pp.307-319 / 2016
  • A Land-Atmosphere Modeling Package (LAMP) for supporting agricultural and forest management was developed at the National Center for AgroMeteorology (NCAM). The package comprises two components: the Weather Research and Forecasting modeling system (WRF) coupled with the Noah-Multiparameterization options (Noah-MP) Land Surface Model (LSM), and an offline one-dimensional LSM. The objective of this paper is to briefly describe the two components of the NCAM-LAMP and to evaluate their initial performance. The coupled WRF/Noah-MP system is configured with a parent domain over East Asia and three nested domains, the finest of which has a horizontal grid size of 810 m. The innermost domain covers the two Gwangneung deciduous and coniferous KoFlux sites (GDK and GCK). The model is integrated for about 8 days with initial and boundary conditions taken from the National Centers for Environmental Prediction (NCEP) Final Analysis (FNL) data. The verification variables for the WRF/Noah-MP coupled system are 2-m air temperature, 10-m wind, 2-m humidity, and surface precipitation. Skill scores are calculated for each domain and for two dynamic vegetation options from the difference between data observed by the Korea Meteorological Administration (KMA) and data simulated by the WRF/Noah-MP coupled system. The accuracy of the precipitation simulation is examined using two scores derived from a contingency table, the Probability of Detection (POD) and the Equitable Threat Score (ETS). The standalone LSM simulation is run for one year with the original settings and is compared with KoFlux site observations of net radiation, sensible heat flux, latent heat flux, and soil moisture. According to the results, the innermost domain (810 m resolution) showed the minimum root mean square error among all domains for 2-m air temperature, 10-m wind, and 2-m humidity. Turning on the dynamic vegetation tended to reduce 10-m wind simulation errors in all domains. The first nested domain (7,290 m resolution) showed the highest precipitation score but gained little from the dynamic vegetation. The offline one-dimensional Noah-MP LSM simulation, on the other hand, captured the pattern and magnitude of the radiative fluxes and soil moisture observed at the sites, and it left room for further improvement by supplementing the leaf area index input and finding a proper combination of model physics.
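
The two precipitation scores named above are computed from the four cells of a 2x2 contingency table. The sketch below uses the standard textbook definitions of POD and ETS; it does not reproduce the paper's thresholds or data, and the function names and the counts in the usage note are hypothetical.

```python
def pod(hits, misses):
    """Probability of Detection: share of observed events that were forecast."""
    return hits / (hits + misses)


def ets(hits, misses, false_alarms, correct_negatives):
    """Equitable Threat Score: threat score corrected for hits expected by chance."""
    total = hits + misses + false_alarms + correct_negatives
    # hits expected from a random forecast with the same event frequencies
    hits_random = (hits + misses) * (hits + false_alarms) / total
    return (hits - hits_random) / (hits + misses + false_alarms - hits_random)
```

With hypothetical counts of 50 hits, 10 misses, 20 false alarms, and 120 correct negatives, `pod(50, 10)` is 50/60 and `ets(50, 10, 20, 120)` is 29/59. A perfect forecast scores 1 on both measures, and a forecast no better than chance has an ETS of 0.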

Research about feature selection that use heuristic function (휴리스틱 함수를 이용한 feature selection에 관한 연구)

  • Hong, Seok-Mi;Jung, Kyung-Sook;Chung, Tae-Choong
    • The KIPS Transactions:PartB / v.10B no.3 / pp.281-286 / 2003
  • A large number of features are collected for real-life problem solving, but using all of them is difficult, and it is not easy to collect correct data for every feature. If all collected data are used for learning, a complicated learning model is created and good performance cannot be obtained. Interrelationships or hierarchical relations also exist among the features. The number of features can be reduced by analyzing the relations among them using heuristic knowledge or statistical methods. A heuristic technique refers to learning through repeated trial and error and experience. Experts can approach the relevant problem domain through a process of collecting opinions based on experience. These properties can be used to reduce the number of features used in learning: experts generate a new, highly abstract feature from raw data. This paper describes a machine learning model that reduces the number of features used in learning by means of a heuristic function and uses the abstracted feature as the input value of a neural network. We applied this model to win/lose prediction in professional baseball games. The results show that the model combining the two techniques not only reduces the complexity of the neural network model but also significantly improves classification accuracy compared with using the neural network and the heuristic model separately.

The study of quantitative analytical method for pH and moisture of Hanji record paper using non-destructive FT-NIR spectroscopy (비파괴 분석 방법인 푸리에 변환 근적외선 분광 분석을 이용한 한지 기록물의 산성도 및 함수율 정량 분석 연구)

  • Shin, Yong-Min;Park, Soung-Be;Lee, Chang-Yong;Kim, Chan-Bong;Lee, Seong-Uk;Cho, Won-Bo;Kim, Hyo-Jin
    • Analytical Science and Technology / v.25 no.2 / pp.121-126 / 2012
  • It is essential to evaluate the quality of Hanji record paper without the damage caused by conventional destructive methods. The samples were Hanji record papers produced in the 1900s. A near-infrared (NIR) spectrometer was used as a non-destructive method for evaluating the quality of the record papers. A Fourier transform (FT) spectrometer covering the 12,500 to 4,000 $cm^{-1}$ wavenumber range was used for quantitative analysis because of its high accuracy and good signal-to-noise ratio. The acidity and moisture content of Hanji record paper were measured in diffuse reflectance mode with an integrating sphere. The acidity (pH), a chemical factor used to evaluate the quality of Hanji, was correlated with the NIR spectrum. The NIR spectrum was pretreated to obtain the optimum correlation coefficients. Multiplicative scatter correction (MSC) and the Savitzky-Golay first derivative were used as pretreatment methods. The optimum correlation coefficients were calculated by partial least squares regression (PLSR). Without pretreatment, the correlation coefficient ($R^2$) for acidity was 0.92 and the standard error of prediction (SEP) for pH was 0.24. With pretreatment, the NIR spectra gave a better correlation coefficient ($R^2$ = 0.98) and an SEP of 0.19 for pH. For moisture content, the linear correlation without pretreatment was higher than with pretreatment (MSC, first derivative); the best result was an $R^2$ of 0.99 and an SEP of 0.45. This indicates that an integrating sphere and an FT-NIR analyzer are well suited to evaluating the quality of Hanji record papers rapidly and non-destructively.
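
One of the pretreatments named above, multiplicative scatter correction, can be sketched in a few lines. This is the common textbook form of MSC, assuming the mean spectrum is the reference; the abstract does not say which reference or software the authors used. The function name and the toy spectra are hypothetical.

```python
def msc(spectra):
    """Multiplicative scatter correction against the mean spectrum.

    Each spectrum s is fitted as s = intercept + slope * reference by
    least squares and then corrected to (s - intercept) / slope.
    Assumes every spectrum has a non-zero fitted slope.
    """
    n, m = len(spectra), len(spectra[0])
    ref = [sum(s[j] for s in spectra) / n for j in range(m)]
    ref_mean = sum(ref) / m
    sxx = sum((r - ref_mean) ** 2 for r in ref)
    corrected = []
    for s in spectra:
        s_mean = sum(s) / m
        slope = sum((r - ref_mean) * (v - s_mean) for r, v in zip(ref, s)) / sxx
        intercept = s_mean - slope * ref_mean
        corrected.append([(v - intercept) / slope for v in s])
    return ref, corrected


# Two toy spectra that differ only by an offset and a scale factor
base = [1.0, 2.0, 3.0, 4.0]
scattered = [2.0 * v + 1.0 for v in base]
reference, fixed = msc([base, scattered])
```

In this toy case both corrected spectra collapse onto the reference, because they differ only by the additive and multiplicative effects that MSC is designed to remove. Real spectra also carry chemical differences, which remain after the correction.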

Estimation of Precipitable Water from the GMS-5 Split Window Data (GMS-5 Split Window 자료를 이용한 가강수량 산출)

  • 손승희;정효상;김금란;이정환
    • Korean Journal of Remote Sensing / v.14 no.1 / pp.53-68 / 1998
  • Observing the behavior of hydrometeors in the atmosphere is important for understanding weather and climate. Conventional observations give the distribution of water vapor at only a limited number of points on the earth. In this study, precipitable water was estimated from GMS-5 split window channel data using the technique developed by Chesters et al. (1983). To retrieve the precipitable water, a water vapor absorption parameter that depends on the filter function of the sensor was derived by regression analysis between the split window channel data and radiosonde data observed at the Osan, Pohang, Kwangju and Cheju stations over 4 months. The 700 hPa air temperature from the Global Spectral Model of the Korea Meteorological Administration (GSM/KMA) was used as the mean air temperature for the single-layer radiation model. The precipitable water retrieved for the period from August 1996 through December 1996 was compared with radiosonde data. The root mean square differences between the radiosonde observations and the GMS-5 retrievals range from 0.65 g/$cm^2$ to 1.09 g/$cm^2$, with a correlation coefficient of 0.46 on an hourly basis. The monthly distribution of precipitable water from GMS-5 is represented reasonably well at large scales. Precipitable water is produced 4 times a day at the Korea Meteorological Administration as grid point data with a resolution of 0.5 degrees in latitude and longitude. The data can be used in the objective analysis for numerical weather prediction and to increase the accuracy of humidity analysis, especially under clear sky conditions. The data are also a useful complement to existing data sets for climatological research. However, a higher correlation between the radiosonde observations and the GMS-5 retrievals is needed for operational applications.

Validation of Satellite Scatterometer Sea-Surface Wind Vectors (MetOp-A/B ASCAT) in the Korean Coastal Region (한반도 연안해역에서 인공위성 산란계(MetOp-A/B ASCAT) 해상풍 검증)

  • Kwak, Byeong-Dae;Park, Kyung-Ae;Woo, Hye-Jin;Kim, Hee-Young;Hong, Sung-Eun;Sohn, Eun-Ha
    • Journal of the Korean earth science society / v.42 no.5 / pp.536-555 / 2021
  • Sea-surface wind is an important variable in ocean-atmosphere interactions, driving changes in ocean surface currents and circulation, mixed layers, and heat flux. With the development of satellite technology, sea-surface wind data retrieved from scatterometer observations have been used for various purposes. In a complex marine environment such as the coast of the Korean Peninsula, scatterometer-observed sea-surface wind is an important factor for analyzing ocean and atmospheric phenomena, so validation results for wind accuracy can be used in diverse applications. In this study, the sea-surface winds derived from ASCAT (Advanced SCATterometer) mounted on MetOp-A/B (METeorological Operational Satellite-A/B) were validated against in-situ wind measurements at 16 marine buoy stations around the Korean Peninsula from January to December 2020. The buoy winds measured at a height of 4-5 m above the sea surface were converted to 10-m neutral winds using the LKB (Liu-Katsaros-Businger) model. The matchup procedure produced 5,544 and 10,051 collocation points for MetOp-A and MetOp-B, respectively. For wind speed, the root mean square errors (RMSE) were 1.36 and 1.28 m s$^{-1}$ and the biases were 0.44 and 0.65 m s$^{-1}$ for MetOp-A and MetOp-B, respectively. The wind directions of both scatterometers exhibited negative biases of -8.03° and -6.97° and RMSE values of 32.46° and 36.06° for MetOp-A and MetOp-B, respectively. These errors were likely associated with the stratification and dynamics of the marine-atmospheric boundary layer. In the seas around the Korean Peninsula, the ASCAT sea-surface winds tended to be overestimated relative to the in-situ wind speeds, particularly at weak wind speeds. In addition, the errors grew larger closer to the coast. The present results could contribute to the development of prediction models by providing improved input data, and to the understanding of air-sea interaction and the impact of typhoons in the coastal regions around the Korean Peninsula.
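
The speed and direction statistics quoted above can be computed as in the sketch below. It assumes that errors are defined as satellite minus buoy and that direction differences are wrapped into [-180°, 180°) before averaging; the abstract does not spell out either convention. The function names and sample values are hypothetical.

```python
import math


def wrap_deg(diff):
    """Wrap an angular difference in degrees into the range [-180, 180)."""
    return (diff + 180.0) % 360.0 - 180.0


def bias_rmse(errors):
    """Mean error (bias) and root mean square error of a list of errors."""
    n = len(errors)
    bias = sum(errors) / n
    rmse = math.sqrt(sum(e * e for e in errors) / n)
    return bias, rmse


def speed_errors(sat, buoy):
    """Satellite-minus-buoy wind speed differences."""
    return [s - b for s, b in zip(sat, buoy)]


def direction_errors(sat, buoy):
    """Satellite-minus-buoy direction differences, wrapped across north."""
    return [wrap_deg(s - b) for s, b in zip(sat, buoy)]


# A satellite direction of 350 deg against a buoy direction of 10 deg is -20 deg, not 340 deg
print(direction_errors([350.0, 5.0], [10.0, 355.0]))
```

Without the wrap, a pair of directions on either side of north would contribute an error near 360° and inflate both the bias and the RMSE.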

Comparison of Wind Vectors Derived from GK2A with Aeolus/ALADIN (위성기반 GK2A의 대기운동벡터와 Aeolus/ALADIN 바람 비교)

  • Shin, Hyemin;Ahn, Myoung-Hwan;KIM, Jisoo;Lee, Sihye;Lee, Byung-Il
    • Korean Journal of Remote Sensing / v.37 no.6_1 / pp.1631-1645 / 2021
  • This research aims to characterize wind data from the world's first active lidar sensor, the Atmospheric Laser Doppler Instrument (ALADIN), and Atmospheric Motion Vector (AMV) data from the Geostationary Korea Multi Purpose Satellite 2A (GK2A) by comparing the two. In a comparison of data from September 2019 to August 1, 2020, the total number of collocated data points for the AMV (using the IR channel) and the Mie channel ALADIN data is 177,681, giving a Root Mean Square Error (RMSE) of 3.73 m/s and a correlation coefficient of 0.98. In a more detailed comparison by altitude and latitude, the Normalized Root Mean Squared Error (NRMSE) is 0.2-0.3 at most latitude bands. However, the NRMSE of the upper and middle layers at lower latitudes and of the lower layer in the southern hemisphere exceeds 0.4 at specific latitudes. These results are the same for the water vapor channel and the visible channel regardless of season, and channel-specific and seasonal characteristics do not appear prominently. Furthermore, an analysis of the cloud distribution in the latitude bands with a large difference between the two wind data sets shows that cirrus and cumulus clouds, which can lower the accuracy of AMV height assignment, are more common there than in other latitude bands. Accordingly, it is suggested that ALADIN wind data in the southern hemisphere and the low-latitude band, where the AMV error is large, can have a positive effect on numerical forecast models.

Calculation Method of Oil Slick Area on Sea Surface Using High-resolution Satellite Imagery: M/V Symphony Oil Spill Accident (고해상도 광학위성을 이용한 해상 유출유 면적 산출: 심포니호 기름유출 사고 사례)

  • Kim, Tae-Ho;Shin, Hye-Kyeong;Jang, So Yeong;Ryu, Joung-Mi;Kim, Pyeongjoong;Yang, Chan-Su
    • Korean Journal of Remote Sensing / v.37 no.6_1 / pp.1773-1784 / 2021
  • To minimize damage from oil spill accidents in the ocean, it is essential to determine the spill area as soon as possible, and satellite-based remote sensing is a powerful means of detecting oil spills in the ocean. With the recent rapid increase in the number of available satellites, it has become possible to generate a status report on a marine oil spill soon after the accident. In this study, the oil spill area was calculated from various satellite images of the Symphony oil spill accident that occurred off the coast of Qingdao Port, China, on April 27, 2021. In particular, high-resolution commercial satellite images with a spatial resolution of 2 m were used to improve the accuracy of the oil spill area determination. Sentinel-1, Sentinel-2, LANDSAT-8, GEO-KOMPSAT-2B (GOCI-II) and Skysat satellite images were collected from April 27 to May 13, but only five images were usable given the weather conditions. The spilled oil had spread northeastward, toward the coastal region of China. This trend was confirmed in the Skysat image and was similar to the predicted movement of oil particles from the accident location. Based on this result, the look-alike patch observed to the north in the Sentinel-1A image (2021.05.01) was classified as a false alarm. Over the survey period, the spilled oil area tended to increase linearly after the accident. This study showed that high-resolution optical satellites can be used to calculate the distribution area of spilled oil more accurately and can contribute to establishing efficient response strategies for oil spill accidents.