[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.23093/FSI.2021.54.3.160

AI-based system for automatically detecting food risk information from news data

Baek, Yujin (Graduate School of AI, KAIST)
Lee, Jihyeon (Graduate School of AI, KAIST)
Kim, Nam Hee (N Kim lab)
Lee, Hunjoo (Chem.I.Net Inc.)
Choo, Jaegul (Graduate School of AI, KAIST)

Publication Information

Food Science and Industry, v.54, no.3, 2021, pp. 160-170

Abstract

Recent advances in communication technologies have accelerated the spread of food safety issues once the news media report them. To respond to such issues in a timely manner, it is important to automatically detect the related information in news data. This work presents an AI-based system that detects risk information within food-related news articles. Food safety experts labeled the risk information in these articles, yielding 43,527 articles annotated with food names and risk information. Given a news document, the system automatically detects food names and risk information by analyzing similarities between the words in the text using learned word embedding vectors. The AI-based system achieves higher detection accuracy than a non-AI rule-based system, with absolute F1 gains of +32.94% for the food name category and +41.53% for the risk information category.
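The abstract does not spell out how word similarity is computed, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes cosine similarity over word vectors and a fixed threshold; the tiny 3-dimensional `EMBEDDINGS` table, the seed words, the `detect` helper, and the 0.9 threshold are all hypothetical values chosen for illustration.

```python
import math

# Toy 3-dimensional embeddings, invented for illustration only.
# A real system would load vectors learned from a large news corpus.
EMBEDDINGS = {
    "milk":       [0.9, 0.1, 0.0],
    "cheese":     [0.8, 0.2, 0.1],
    "salmonella": [0.1, 0.9, 0.1],
    "listeria":   [0.2, 0.8, 0.0],
    "recalled":   [0.1, 0.3, 0.9],
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def detect(tokens, seeds, threshold=0.9):
    """Return tokens whose vector is close to any seed word's vector.

    The seed list and threshold are assumptions for this sketch.
    """
    hits = []
    for tok in tokens:
        vec = EMBEDDINGS.get(tok)
        if vec is None:
            continue  # skip out-of-vocabulary tokens
        if any(cosine(vec, EMBEDDINGS[s]) >= threshold for s in seeds):
            hits.append(tok)
    return hits


tokens = "cheese recalled after listeria found".split()
food_names = detect(tokens, seeds=["milk"])        # category: food name
risk_terms = detect(tokens, seeds=["salmonella"])  # category: risk information
print(food_names, risk_terms)
```

With these toy vectors, "cheese" lies close to the food seed and "listeria" close to the risk seed, so each is picked up for its category while the remaining tokens are ignored.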

Keywords

artificial intelligence; word embedding; risk information; food safety issues

