Browse > Article

A Study on Statistical Forecasting Models of PM10 in Pohang Region by the Variable Transformation  

Lee, Yung-Seop (Department of statistics, Dongguk University)
Kim, Hyun-Goo (Korea Institute of Energy Research)
Park, Jong-Seok (Department of statistics, Dongguk University)
Kim, Hee-Kyung (Department of statistics, Dongguk University)
Publication Information
Journal of Korean Society for Atmospheric Environment / v.22, no.5, 2006 , pp. 614-626 More about this Journal
Abstract
Using the data of three environmental monitoring sites in Pohang area(KME112, KME113, and KME114), statistical forecasting models of the daily maximum and mean values of PM10 have been developed. Since the distributions of the daily maximum and mean PM10 values are skewed, which are similar to the Weibull distribution, these values were log-transformed to increase prediction accuracy by approximating the normal distribution. Three statistical forecasting models, which are regression, neural networks(NN) and support vector regression(SVR), were built using the log-transformed response variables, i.e., log(max(PM10)) or log(mean (PM10)). Also, the forecasting models were validated by the measure of RMSE, CORR, and IOA for the model comparison and accuracy. The improvement rate of IOA before and after the log-transformation in the daily maximum PM10 prediction was 12.7% for the regression and 22.5% for NN. In particular, 42.7% was improved for SVR method. In the case of the daily mean PM10 prediction, IOA value was improved by 5.1% for regression, 6.5% for NN, and 6.3% for SVR method. As a conclusion, SVR method was found to be performed better than the other methods in the point of the model accuracy and fitness views.
Keywords
PM10 forecast; Regression; Neural network; Support vector regression; Log-transformation;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 구윤서, 윤원정, 권희용, 양재문, 최종혁, 윤희영 (2005) 전일미세먼지 (PM10) 예보시스템 개발, 한국대기환경학회 춘계학술대회논문집, 403-404
2 김현구, 이영섭, 구자문, 고유나 (2006) 사회통계조사에 의한 대기환경 체감지수 개발에 관한 연구, 한국대기환경학회 춘계학술대회논문집, 142-143
3 이화운, 정우식, 김현구, 이순환 (2004) 대기오염 확산해석을 위한 포항지역 기상장 연구-바람장 수치해석, 한국대기환경학회지, 20(1), 1-15
4 포항산업과학연구원 (2002) 포항시 환경보전 종합계획2002-2011, 포항시청
5 Draper, N. and H. Smith (1998) Applied Regression Analysis (3rd ed), John Wiley & Sons, New York
6 김운수(2004) 서울시 미세먼지 배출량 조사.분석 및 관리방안 연구, 시정연 2004-R-22, 서울시정개발연구원
7 환경부 (2005a) 연도별 대기오염도 변화추이 (측정소별), 환경부 대기환경연보
8 Scholkopf, B. and A.J. Smola (2001) Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond, MIT Press
9 환경부 (2005b) 미세먼지 농도별 행동요령, 환경부 2005. 1. 26 보도자료, http://dust.seoul.go.kr
10 구윤서, 권희용, 윤희영(2003) 통계모델을 이용한 실시간 오염도 예보 시스템 개발(PM-10), 한국대기환경학회 추계학술대회논문집, 445-446
11 Vapnik, V. (1998) Statistical Learning Theory, John Wiley & Sons, New York
12 Ripley, B.D. (1996) Pattern Recognition and Neural Networks, Cambridge University Press
13 정은희, 김현구, 전희동 (2005) 포항지역 대기오염물질 배출량 산정(2003년도), 한국대기환경학회 춘계학술대회논문집, 356-357
14 김현구 (2005) 기상조건별 비산먼지 관리체계 최적화 연구, 한국대기환경학회지, 21(6), 573-583   과학기술학회마을