Browse > Article
http://dx.doi.org/10.9708/jksci.2021.26.11.033

A study on the Extraction of Similar Information using Knowledge Base Embedding for Battlefield Awareness  

Kim, Sang-Min (Intelligent C4I Team, Hanwha Systems Co.)
Jin, So-Yeon (Intelligent C4I Team, Hanwha Systems Co.)
Lee, Woo-Sin (Intelligent C4I Team, Hanwha Systems Co.)
Abstract
Due to advanced complex strategies, the complexity of information that a commander must analyze is increasing. An intelligent service that can analyze battlefield is needed for the commander's timely judgment. This service consists of extracting knowledge from battlefield information, building a knowledge base, and analyzing the battlefield information from the knowledge base. This paper extract information similar to an input query by embedding the knowledge base built in the 2nd step. The transformation model is needed to generate the embedded knowledge base and uses the random-walk algorithm. The transformed information is embedding using Word2Vec, and Similar information is extracted through cosine similarity. In this paper, 980 sentences are generated from the open knowledge base and embedded as a 100-dimensional vector and it was confirmed that similar entities were extracted through cosine similarity.
Keywords
Knowledge Base; Embedding; Battlefield Awareness; Natural Language Processing; Artificial Intelligence;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Hyungsuc Kang, Janghoon Yang, "Performance Comparison of Word2vec and fastText Embedding Models", Journal of Digital Contents Society 21(7), 2020.7, 1335-1343 (9 pages)   DOI
2 Se-Hyeon Jo and Hack-Jun Kim, "A Study on Building Knowledge Base for Intelligent Battlefield Awareness Service," Journal of the Korea Society of Computer and Information 25(4), 2020.4, 11-17(7 pages)
3 M. Li, et al., "GAIA at SM-KBP 2019 - Amulti-media multi-lingual knowledge extractionand hypothesis generation system," TACSM-KMP, 2019
4 Andersen, Carl F. et al. "KB Construction and Hypothesis Generation Using SAMSON." TAC (2019).
5 So-yeon Jin and Woo-sin Lee. "A Study on Multiple Reasoning Technology for Intelligent Battlefield Situational Awareness" The Journal of Korean Institute of Communications and Information Sciences 45(6), 2020.6,1046-1055 (10 pages)   DOI
6 TAC(Text Analysis Conference) StreamingMultimedia Knowledge Base Population(SM-KBP) 2019 guidelines, Retrieved May 8,2020, from https://tac.nist.gov/tracks/SM-KBP/2019/ontologies/LDCOwlOntology.
7 Devlin, Jacob, et al. "Bert: Pre-training of deep bidirectional transformers for language understanding." arXiv preprint arXiv:1810.04805, 2018.
8 P. Bojanowski et al., "Enriching word vectors with subword information," Transactions of the Asociation for Computational Linguistics, No. 5, p. 135-146, 2017   DOI
9 J. Penington, R. Socher, & C. D. Maning, "Glove: Global vectors for word representation," In Procedings of the 2014 conference on empirical methods in naturalanguage procesing (EMNLP), p. 1532-1543, Oct. 2014.
10 Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 701-710. ACM, 2014.
11 Aditya Grover and Jure Leskovec. node2vec: Scalable feature learning for networks. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2016.
12 @inproceedings{pyrdf2vec, author = {Gilles Vandewiele and Bram Steenwinckel and Terencio Agozzino and Michael Weyns and Pieter Bonte and Femke Ongenae and Filip De Turck}, title = {{pyRDF2Vec: Python Implementation and Extension of RDF2Vec}}, organization = {IDLab}, year = {2020}, url = {https://github.com/IBCNServices/pyRDF2Vec}}
13 K. Clark et al., "ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators." in Int. Conf. Learning Representations, Addis Ababa, Ethiopia, May 2020.
14 KoELECTRA, https://github.com/monologg/KoELECTRA
15 T. Mikolov, K. Chen, G. Corrado, and J. Dean. "Efficient estimation of word representations in vector space.", In Proceedings of ICLR Workshops Track, 2013. arxiv.org/abs/1301. 3781.
16 Zhiheng Huang, Wei Xu, Kai Yu, "Bidirectional LSTM-CRF Models for Sequence Tagging, " arXiv preprint arXiv:1508.01991, 2015.
17 Petar Ristoski, Jessica Rosati, Tommaso Di Noia, Renato De Leone, and Heiko Paulheim. 2018. RDF2Vec: RDF Graph Embeddings and Their Applications. Semantic Web, 10(4):721-752.   DOI
18 KoBERT, https://github.com/SKTBrain/KoBERT
19 Hasim Sak, Andrew Senior, Franscois Beaufays "Long Short-Term Memory Recurrent Neural Network Architectures for Large Scale Acoustic Modeling," In proc. Interspeech.