Browse > Article
http://dx.doi.org/10.9713/kcer.2021.59.2.191

Prediction of Distillation Column Temperature Using Machine Learning and Data Preprocessing  

Lee, Yechan (Green Materials and Processes R&D Group, Korea Institute of Industrial Technology)
Choi, Yeongryeol (Green Materials and Processes R&D Group, Korea Institute of Industrial Technology)
Cho, Hyungtae (Green Materials and Processes R&D Group, Korea Institute of Industrial Technology)
Kim, Junghwan (Green Materials and Processes R&D Group, Korea Institute of Industrial Technology)
Publication Information
Korean Chemical Engineering Research / v.59, no.2, 2021 , pp. 191-199 More about this Journal
Abstract
A distillation column, which is a main facility of the chemical process, separates the desired product from a mixture by using the difference of boiling points. The distillation process requires the optimization and the prediction of operation because it consumes much energy. The target process of this study is difficult to operate efficiently because the composition of feed flow is not steady according to the supplier. To deal with this problem, we could develop a data-driven model to predict operating conditions. However, data preprocessing is essential to improve the predictive performance of the model because the raw data contains outlier and noise. In this study, after optimizing the predictive model based long-short term memory (LSTM) and Random forest (RF), we used a low-pass filter and one-class support vector machine for data preprocessing and compared predictive performance according to the method and range of the preprocessing. The performance of the predictive model and the effect of the preprocessing is compared by using R2 and RMSE. In the case of LSTM, R2 increased from 0.791 to 0.977 by 23.5%, and RMSE decreased from 0.132 to 0.029 by 78.0%. In the case of RF, R2 increased from 0.767 to 0.938 by 22.3%, and RMSE decreased from 0.140 to 0.050 by 64.3%.
Keywords
Data preprocessing; low-pass filter; one-class support vector machine; distillation column; machine learning; random forests; Long-short term memory;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Giap, V., Pineda, I. T., Lee, J. Y., Lee, D. K., Kim, Y. S., Ahn, K. Y. and Lee, Y. D., "Performance Prediction Model of Solid Oxide Fuel Cell Stack Using Deep Neural Network Technique, Trans," Korean Hydrog. Energy Soc., 31(5), 436-443(2020).   DOI
2 Vijaya Raghavan, S. R., Radhakrishnan, T. K. and Srinivasan, K., "Soft Sensor Based Composition Estimation and Controller Design for an Ideal Reactive Distillation Column," ISA Trans., 50(1), 61-70(2011).   DOI
3 Sharma, N. and Singh, K., "Neural Network and Support Vector Machine Predictive Control of Tert-amyl Methyl Ether Reactive Distillation Column," Syst. Sci. Control Eng., 2(1), 512-526(2014).   DOI
4 Lee, D. W. and Lee, S. W., "Hourly Prediction of Particulate Matter (PM2.5) Concentration Using Time Series Data and Random Forest," Trans. Softw. Data Eng., 9(4), 129-136(2020).
5 Kim, J., Park, N. S., Yun, S., Chae, S. H. and Yoon, S., "Application of Isolation Forest Technique for Outlier Detection in Water Quality Data," J. Korean Soc. Environ. Eng., 40(12), 473-480(2018).   DOI
6 Howsalya Devi, R. D., Bai, A. and Nagarajan, N., "A Novel Hybrid Approach for Diagnosing Diabetes Mellitus Using Farthest First and Support Vector Machine Algorithms," Obes. Med., 17, 100152(2020)   DOI
7 Erkus, E. C. and Purutcuoglu, V., "Outlier Detection and Quasiperiodicity Optimization Algorithm: Frequency Domain Based Outlier Detection (FOD)," Eur. J. Oper. Res, 291(2), 560-574(2020).   DOI
8 Zhang, R., Zhang, S. and Muthuraman, S, J. J. "One Class Support Vector Machine for Anomaly Detection in the Communication Network Performance Data," 5th WSEAS Int. Conf. Appl. Electromagn. Wirel. Opt. Commun. Tenerife, Spain, December 14-16 (2007).
9 Guinon, J. L., Ortega, E., Garcia-Anton, J. and Perez-herranz, V., "Moving Average and Savitzki-Golay Smoothing Filters Using Mathcad," International Conference on Engineering Education, July, Coimbra, 1-4(2007).
10 Guo, H., Yu, M., Liu, J. and Ning, J., "Butterworth Low-pass Filter for Processing Inertial Navigation System Raw Data," J. Surv. Eng., 130(4), 175-178(2004).   DOI
11 Panigrahi, S., Karali, Y. S. and Behera, H., "Time Series Forecasting Using Evolutionary Neural Network," Int. J. Comput. Appl., 75(10), 13-17(2013).   DOI
12 Zhang, Z., Wu, Z., Rincon, D. and Christofides, P. D., "Real-time Optimization and Control of Nonlinear Processes Using Machine Learning," Mathematics, 7(10), 1-25(2019).
13 Kim, T. J. and Hong, J. S., "Classification of Parent Company's Downward Business Clients Using Random Forest: Focused on Value Chain at the Industry of Automobile Parts," J. Soc. E-bus. Stud., 23(1), 1-22(2018).   DOI
14 Oh, K., Kwon, H., Roh, J., Choi, Y., Park, H., Cho, H. and Kim, J., "Development of Machine Learning-Based Platform for Distillation Column," Korean. Chem. Eng. Res., 23(4), 565-572(2020).
15 Kwon, H., Oh, K., Chung, Y. G., Cho, H. and Kim, J., "Development of Machine Learning Model for Prediction Distillation Column Temperature," Appl. Chem. Eng., 31(5), 520-525(2020).   DOI
16 Kartal, F. and Ozveren, U., "A Deep Learning Approach for Prediction of Syngas Lower Heating Value from CFB Gasifier in Aspen Plus®," Energy, 209, 118457(2020).   DOI
17 Sneesby, M. G., Tade, M. O., Datta, R. and Smith, T. N., "ETBE Synthesis via Reactive Distillation. 1. Steady-State Simulation and Design Aspects," Ind. Eng. Chem. Res., 36(5), 1855-1869 (1997).   DOI
18 Kwon, H., Oh, K., Choi, Y., CHung, Y. G, Kim, J., "Development and Application of Machine Learning-based Prediction Model for Distillation Column, Int. J. Intell. Syst., 36, 1970-1997(2021).   DOI
19 Kim, D. W., Lee, S. C., Kim, M. J., Lee, E. J. and Yoo, C. K., "Development of QSAR Model Based on the Key Molecular Descriptors Selection and Computational Toxicology for Prediction of Toxicity of PCBs," Korean Chem. Eng. Res., 54(5), 621-629(2016).   DOI