Browse > Article

A Method for Prediction of Quality Defects in Manufacturing Using Natural Language Processing and Machine Learning  

Roh, Jeong-Min (고려사이버대학교 융합정보대학원)
Kim, Yongsung (고려사이버대학교 창의공학부 소프트웨어공학과)
Publication Information
Journal of Platform Technology / v.9, no.3, 2021 , pp. 52-62 More about this Journal
Abstract
Quality control is critical at manufacturing sites and is key to predicting the risk of quality defect before manufacturing. However, the reliability of manual quality control methods is affected by human and physical limitations because manufacturing processes vary across industries. These limitations become particularly obvious in domain areas with numerous manufacturing processes, such as the manufacture of major nuclear equipment. This study proposed a novel method for predicting the risk of quality defects by using natural language processing and machine learning. In this study, production data collected over 6 years at a factory that manufactures main equipment that is installed in nuclear power plants were used. In the preprocessing stage of text data, a mapping method was applied to the word dictionary so that domain knowledge could be appropriately reflected, and a hybrid algorithm, which combined n-gram, Term Frequency-Inverse Document Frequency, and Singular Value Decomposition, was constructed for sentence vectorization. Next, in the experiment to classify the risky processes resulting in poor quality, k-fold cross-validation was applied to categorize cases from Unigram to cumulative Trigram. Furthermore, for achieving objective experimental results, Naive Bayes and Support Vector Machine were used as classification algorithms and the maximum accuracy and F1-score of 0.7685 and 0.8641, respectively, were achieved. Thus, the proposed method is effective. The performance of the proposed method were compared and with votes of field engineers, and the results revealed that the proposed method outperformed field engineers. Thus, the method can be implemented for quality control at manufacturing sites.
Keywords
Manufacturing; Natural Language Processing; Machine Learning; Prediction of Quality Defects; Sentence Vectorization;
Citations & Related Records
연도 인용수 순위
  • Reference
1 B. Park and W. Lee. "Use Cases of Machine Learning Techniques for Manufacturing Process Data: Comparative Analysis of CART, Random Forest, and TreeNet," Quality Management of Korea, 2021(0), pp. 40-40, 2021
2 J. Yoon, H. An and Y. Choi. "A Machine Learning Based Facility Error Pattern Extraction Framework for Smart Manufacturing," Society for E-Business Studies 23(2), pp. 97-110, 2018
3 K. Arif-Uz-Zaman, M. E. Cholette, L. Ma and A. Karim. "Extracting failure time data from industrial maintenance records using text mining." Advanced Engineering Informatics, 33, pp. 388-396, 2017   DOI
4 M. Lim and S. Kang. "Comparison Between Optimal Features of Korean and Chinese for Text Classification," International Journal of Fuzzy Logic and Intelligent Systems, 25(4), pp. 386-391, 2015
5 J. Camacho-Collados and M. T. Pilehvar. Cornell University (Ver. 3) https://arxiv.org/abs/1707.01780v3 (downloaded:2020. 9. 20)
6 G. Li, Y. Cao, S. Zhao, Y. Bao and C. Yu. "Research on Text Data Pool of Intelligent Manufacturing for Plate Parts." Journal of Physics: Conference Series, 1575(012199), 2020   DOI
7 S. An and S. Jo. "Stock Prediction Using News Text Mining and Time Series Analysis," Korea Computer Congress 2010, 37(1), pp. 364-369, 2010
8 Y. Kim and S. Lee. "Combinations of Text Preprocessing and Word Embedding Suitable for Neural Network Models for Document Classification," Journal of KIISE, 45(7), pp. 690-700, 2018   DOI
9 A. I. Kadhim, Y. Cheah and N. H. Ahamed. "Text Document preprocessing and Dimension Reduction Techniques for Text Document Clustering." 2014 4th International Conference on Artificial Intelligence with Applications in Engineering and Technology, pp. 69-73, 2014
10 C. S. Pavan Kumar and L. D. Dhinesh Babu. "Novel Text Preprocessing Framework for Sentiment Analysis. Smart Intelligent Computing and Applications. Smart Innovation," Systems and Technologies, pp. 105, 2019
11 J. Perkins. Python 3 Text Processing with NLTK 3 Cookbook, 2nd Ed., pp. 36. (Packt Publishing Ltd., Birmingham)
12 T. Wuest, D. Weimer, C. Irgens and K. D. Thoben. "Machine learning in manufacturing: advantages, challenges, and applications." Production & Manufacturing Research, 4(1), pp. 23-45, 2016   DOI
13 Jing Gao and Jun Zhang. "Clustered SVD strategies in latent semantic indexing," Information Processing & Management 41(5), pp. 1051-1063, 2005   DOI
14 B. Kang, S. Park. "Integrated machine learning approaches for complementing statistical process control procedures." Decision Support Systems, 29(1), pp. 59-72, 2000   DOI
15 R. Fazai, K. Abodayeh, M. Mansouri, M. Trabelsi, H. Nounou, M. Nounou and G.E. Georghiou. "Machine learning-based statistical testing hypothesis for fault detection in photovoltaic systems." Solar Energy, 190, pp. 405-413, 2019   DOI