Classification of Unstructured Customer Complaint Text Data for Potential Vehicle Defect Detection

  • Ju Hyun Jo (Department of Industrial System Engineering, Ajou University)
  • Chang Su Ok (Department of Industrial Data Engineering, Hongik University)
  • Jae Il Park (Department of Industrial Engineering, Ajou University)
  • Received : 2023.03.30
  • Accepted : 2023.05.09
  • Published : 2023.06.30

Abstract

This research proposes an approach to categorizing unstructured customer complaints in the automotive industry. The goal is to identify potential vehicle defects from the output of our algorithm, which can help automakers mitigate the significant losses and reputational damage caused by mass claims. To this end, our model applies the Word2Vec method to large volumes of unstructured customer complaint data from the National Highway Traffic Safety Administration (NHTSA). We develop a score dictionary for eight pre-selected criteria and calculate a score for each complaint, which allows the algorithm to categorize complaints efficiently and to identify patterns and correlations that may indicate potential defects in a vehicle. A key benefit of this approach is its ability to handle large volumes of unstructured data, which is challenging for traditional methods. The insights extracted from customer complaints can help automakers prioritize and address potential defects before they become widespread, improving product quality and customer satisfaction. Further studies can build on this approach to explore other applications and extend it to other industries.
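The abstract does not spell out how the score dictionary is built or what the eight criteria are, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes that each criterion starts from a seed term, that words whose embedding is close to the seed (by cosine similarity) enter that criterion's dictionary with the similarity as their score, and that a complaint is assigned to the criterion with the highest summed score. The toy vectors, the two criteria, the threshold, and all function names are hypothetical stand-ins for trained Word2Vec vectors and the paper's actual criteria.

```python
import math
import re

# Hypothetical 3-dimensional vectors standing in for trained Word2Vec embeddings.
TOY_VECTORS = {
    "brake": (0.9, 0.1, 0.0),
    "pedal": (0.8, 0.2, 0.1),
    "stall": (0.1, 0.9, 0.1),
    "engine": (0.0, 1.0, 0.0),
    "radio": (0.0, 0.1, 0.9),
}

# Hypothetical criteria, each anchored by one seed term (the paper uses eight criteria).
SEED_TERMS = {"braking": "brake", "engine": "engine"}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def build_score_dictionary(seeds, vectors, threshold=0.8):
    """Map each criterion to {word: score} for words similar enough to its seed."""
    score_dict = {}
    for criterion, seed in seeds.items():
        seed_vec = vectors[seed]
        score_dict[criterion] = {
            word: sim
            for word, vec in vectors.items()
            if (sim := cosine(seed_vec, vec)) >= threshold
        }
    return score_dict


def score_complaint(text, score_dict):
    """Sum the dictionary scores of the complaint's tokens, per criterion."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return {
        criterion: sum(weights.get(token, 0.0) for token in tokens)
        for criterion, weights in score_dict.items()
    }


def classify(text, score_dict):
    """Return the highest-scoring criterion, or None if no dictionary word matched."""
    scores = score_complaint(text, score_dict)
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


SCORE_DICT = build_score_dictionary(SEED_TERMS, TOY_VECTORS)
```

With these toy vectors, `classify("The brake pedal went soft while driving", SCORE_DICT)` returns `"braking"`, and a complaint containing no dictionary words returns `None`. In practice the vectors would come from a Word2Vec model trained on the complaint corpus, and the threshold would need tuning against labeled examples.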

Acknowledgement

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2021R1F1A1062194).
