Browse > Article
http://dx.doi.org/10.3837/tiis.2022.06.004

MSFM: Multi-view Semantic Feature Fusion Model for Chinese Named Entity Recognition  

Liu, Jingxin (School of Computer Science and Technology, Hainan University)
Cheng, Jieren (School of Computer Science and Technology, Hainan University)
Peng, Xin (School of Cyberspace Security, Hainan University)
Zhao, Zeli (School of Cyberspace Security, Hainan University)
Tang, Xiangyan (School of Computer Science and Technology, Hainan University)
Sheng, Victor S. (Department of Computer Science Texas Tech University TX)
Publication Information
KSII Transactions on Internet and Information Systems (TIIS) / v.16, no.6, 2022 , pp. 1833-1848 More about this Journal
Abstract
Named entity recognition (NER) is an important basic task in the field of Natural Language Processing (NLP). Recently deep learning approaches by extracting word segmentation or character features have been proved to be effective for Chinese Named Entity Recognition (CNER). However, since this method of extracting features only focuses on extracting some of the features, it lacks textual information mining from multiple perspectives and dimensions, resulting in the model not being able to fully capture semantic features. To tackle this problem, we propose a novel Multi-view Semantic Feature Fusion Model (MSFM). The proposed model mainly consists of two core components, that is, Multi-view Semantic Feature Fusion Embedding Module (MFEM) and Multi-head Self-Attention Mechanism Module (MSAM). Specifically, the MFEM extracts character features, word boundary features, radical features, and pinyin features of Chinese characters. The acquired font shape, font sound, and font meaning features are fused to enhance the semantic information of Chinese characters with different granularities. Moreover, the MSAM is used to capture the dependencies between characters in a multi-dimensional subspace to better understand the semantic features of the context. Extensive experimental results on four benchmark datasets show that our method improves the overall performance of the CNER model.
Keywords
Chinese Named Entity Recognition; Multi-head Self-Attention Mechanism; Multi-view Feature Fusion Embedding; Natural Language Processing; Semantic Feature Fusion;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 Y. Jin, J. Xie, W. Guo, C. Luo, D. Wu, and R. Wang, "LSTM-CRF Neural Network with Gated Self Attention for Chinese NER," IEEE Access, vol. 7, pp. 136694-136703, Sep. 2019.   DOI
2 A. Al-Besher, K. Kumar, M. Sangeetha and T. Butsa, "Bert for conversational question answering systems using semantic similarity estimation," Computers, Materials & Continua, vol. 70, no.3, pp. 4763-4780, Oct. 2022.   DOI
3 J. Liu, L. Gao, S. Guo, et al, "A hybrid deep-learning approach for complex biochemical named entity recognition" Knowledge-Based Systems, vol. 221, Jun. 2021.
4 Q. He, L. Wu, Y. Yin, et al, "Knowledge-graph augmented word representations for named entity recognition," in Proc. of AAAI, vol. 34(5), pp. 7919-7926, Apr. 2020.   DOI
5 F. Lei, J. Cheng, Y. Yang, X. Tang, V. Sheng and C. Huang, "Improving Heterogeneous Network Knowledge Transfer Based on the Principle of Generative Adversarial," Electronics, vol. 10, no. 13, pp.1525, Jun. 2021.   DOI
6 G. Levow, "The third international Chinese language processing bakeoff: Word segmentation and named entity recognition," in Proc. of the Fifth SIGHAN Workshop on Chinese Language Processing, pp. 108-117, Jul. 2006.
7 X. Tang, W. Tu, K.Li and J. Cheng, "DFFNet: An IoT-Perceptive Dual Feature Fusion Network for General Real-time Semantic Segmentation," Information Sciences, vol. 565, pp. 326-343, Jul. 2021.   DOI
8 J. Cheng, X. Peng, X. Tang, W. Tu, W. Xu, "MIFNet: A lightweight multiscale information fusion network," Int J Intell Syst, pp. 1- 26. Dec. 2021.
9 T. Li, Y. Hu, A. Ju and Z. Hu, "Adversarial active learning for named entity recognition in cybersecurity," Computers, Materials & Continua, vol. 66, no.1, pp. 407-420, Oct. 2021.
10 Y. Nie, Y. Tian, X. Wan, et al, "Named entity recognition for social media texts with semantic augmentation," in Proc. of EMNLP, pp. 1383-1391, Oct. 2020.
11 T. Gui, Y. Zou, Q. Zhang, et al, "A lexicon-based graph neural network for chinese ner," in Proc. of EMNLP-IJCNLP, pp. 1040-1050, Nov. 2019.
12 W. Liu, T. Xu, Q. Xu, et al, "An Encoding Strategy Based Word-Character," in Proc. of NAACL, pp. 2379-2389, Jun. 2019.
13 R. Ma, M. Peng, Q. Zhang, et al, "Simplify the usage of lexicon in Chinese NER," in Proc. of ACL, pp. 5951-5960, Oct. 2019.
14 R. Yin, Q. Wang, R. Li, P. Li, B. Wang, "Multi-Granularity Chinese Word Embedding," in Proc. of EMNLP-IJCNLP, pp.981-986, Nov. 2016.
15 N. Ye, X. Qin, L. Dong, X. Zhang, and K. Sun, "Chinese Named Entity Recognition Based on Character-Word Vector Fusion," Wireless Communications and Mobile Computing, vol. 2020, no. 3, pp. 1-7, Jul. 2020.
16 T. Mikolov, K. Chen, G. Corrado, et al, "Efficient Estimation of Word Representations in Vector Space," in Proc.of NIPS, Sep. 2013.
17 J. Li, K. Meng, "MFE-NER: Multi-feature Fusion Embedding for Chinese Named Entity Recognition," in Proc. of ICLR, Sep. 2021.
18 S. Yoo and O. Jeong, "Ep-bot: empathetic chatbot using auto-growing knowledge graph," Computers, Materials & Continua, vol. 67, no.3, pp. 2807-2817, Mar. 2021.   DOI
19 T. Gui, R. Ma, Q. Zhang, L. Zhao, Y. Jiang, and X. Huang, "CNN-Based Chinese NER with Lexicon Rethinking," in Proc. of IJCAI, pp. 4982-4988, Aug. 2019.
20 J. Li, A. Sun, J. Han, et al, "A survey on deep learning for named entity recognition," IEEE Transactions on Knowledge and Data Engineering, vol. 34, no. 1, pp. 50-70, Mar. 2020.
21 Y. Chen, L. Xu, K. Liu, D. Zeng, and J. Zhao, "Event extraction via dynamic multi-pooling convolutional neural networks," in Proc. of ACL-IJCNLP, pp. 167-176, Jul. 2015.
22 P. Virga, S. Khudanpur, "Transliteration of proper names in cross-lingual information retrieval," in Proc. of ACL, pp. 57-64, Jul. 2003.
23 X. Yao, B. Van Durme, "Information extraction over structured data: Question answering with freebase," in Proc. of ACL, pp. 956-966, Jun.2014.
24 D. Molla, M. Van Zaanen, D. Smith, "Named entity recognition for question answering," in Proc. of ALTW, pp. 51-58, Spe. 2006.
25 S. Gajendran, D. Manjula, V. Sugumaran, "Character level and word level embedding with bidirectional LSTM - Dynamic recurrent neural network for biomedical named entity recognition from literature," Journal of Biomedical Informatics, vol. 112, no. 445, pp. 103609, Dec. 2020.   DOI
26 S. Hochreiter, J. Schmidhuber, "Long Short-Term Memory," Neural Computation, vol. 9, no. 8 pp. 1735-1780, Nov. 1997.   DOI
27 A. Vaswani, N. Shazeer, N. Parmar, et al, "Attention Is All You Need," in Proc. of NIPS, Long Beach, CA, USA, Dec. 2017.
28 N. Peng, M. Dredze, "Named Entity Recognition for Chinese Social Media with Jointly Trained Embeddings," in Proc. of EMNLP, pp. 548-554, Sep. 2015.
29 S. Zhao, M. Hu, Z. Cai, Z., F. Liu, "Dynamic Modeling Cross-Modal Interactions in Two-Phase Prediction for Entity-Relation Extraction," IEEE Transactions on Neural Networks and Learning Systems, Aug. 2021.
30 J. Cheng, Y. Yang, X. Tang, N. Xiong, Y. Zhang and F. Lei, "Generative Adversarial Networks: A Literature Review," KSII Transactions on Internet and Information Systems, vol. 14, no. 12, pp. 4625-4647, Dec. 2020.   DOI
31 R. Bunescu, R. Mooney, "A shortest path dependency kernel for relation extraction," in Proc. of EMNLP, pp. 724-731, Oct. 2005.
32 J. Cheng, J. Liu, X. Xu, et al, "A review of Chinese named entity recognition," KSII Transactions on Internet and Information Systems (TIIS), vol. 15, no. 6, pp. 2012-2030, Jun. 2021.
33 B. Ji, R. Liu, S. Li, et al, "A hybrid approach for named entity recognition in Chinese electronic medical record," BMC medical informatics and decision making, vol. 19, no. 2, pp. 149-158, Apr. 2019.   DOI
34 Y. Zhang and J. Yang, "Chinese ner using lattice lstm," in Proc. of ACL, pp. 1554-1564, Jul. 2018.
35 Q. Zhao, D. Wang, J. Li, and F. Akhtar, "Exploiting the concept level feature for enhanced name entity recognition in Chinese EMRs," Supercomputing, vol. 76, no. 8, pp. 6399-6420, 2020.   DOI
36 S. Zhao, M. Hu, Z. Cai, Z. Zhang, T. Zhou, F. Liu. "Enhancing Chinese Character Representation With Lattice-Aligned Attention," IEEE Transactions on Neural Networks and Learning Systems, pp. 1-10, Oct. 2021.
37 N. Peng, M. Dredze, "Improving Named Entity Recognition for Chinese Social Media with Word Segmentation Representation Learning," in Proc. of ACL, pp. 149-155, Mar. 2016.
38 X. Li, H. Yan, X. Qiu, and X, Huang, "FLAT: Chinese NER Using Flat-Lattice Transformer," in Proc. of ACL, pp. 6838-6842, May. 2020.