Browse > Article
http://dx.doi.org/10.3837/tiis.2015.01.025

Building a Business Knowledge Base by a Supervised Learning and Rule-Based Method  

Shin, Sungho (Department of Computer Intelligent Research, Korea Institute of Science and Technology Information)
Jung, Hanmin (Department of Computer Intelligent Research, Korea Institute of Science and Technology Information)
Yi, Mun Yong (Department of Knowledge Service Engineering, Korea Advanced Institute of Science and Technology)
Publication Information
KSII Transactions on Internet and Information Systems (TIIS) / v.9, no.1, 2015 , pp. 407-420 More about this Journal
Abstract
Natural Language Question Answering (NLQA) and Prescriptive Analytics (PA) have been identified as innovative, emerging technologies in 2015 by the Gartner group. These technologies require knowledge bases that consist of data that has been extracted from unstructured texts. Every business requires a knowledge base for business analytics as it can enhance companies' competitiveness in their industry. Most intelligent or analytic services depend a lot upon on knowledge bases. However, building a qualified knowledge base is very time consuming and requires a considerable amount of effort, especially if it is to be manually created. Another problem that occurs when creating a knowledge base is that it will be outdated by the time it is completed and will require constant updating even when it is ready in use. For these reason, it is more advisable to create a computerized knowledge base. This research focuses on building a computerized knowledge base for business using a supervised learning and rule-based method. The method proposed in this paper is based on information extraction, but it has been specialized and modified to extract information related only to a business. The business knowledge base created by our system can also be used for advanced functions such as presenting the hierarchy of technologies and products, and the relations between technologies and products. Using our method, these relations can be expanded and customized according to business requirements.
Keywords
Information extraction; business knowledge base; structural support vector machine; named entity recognition; relation extraction;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 A. McCallum and W. Li, "Early Results for Named Entity Recognition with Conditional Random Fields, features Induction and Web-Enhanced Lexicons," in Proc. of Conference on Computational Natural Language Learning, May 31-June 1, 2003.
2 D. Nadeau and S. Sekine, "A Survey of Named Entity Recognition and Classification," Linguisticae Investigations, vol. 30, pp. 3-26. 2007.   DOI
3 N. Bach and S. Badaskar, "A survey on relation extraction," Language Technologies Institute, Carnegie Mellon University, 2007.
4 M. Mintz, S. Bills, R. Snow and D. Jurafsky, "Distant supervision for relation extraction without labeled data," in Proc. of the Association for Computational Linguistics, August 2-7, 2009.
5 C. Lee, P. M. Ryu and H. K. Kim, "Named Entity Recognition using a Modified Pegasos Algorithm," in Proc. of the 20th ACM International Conference on Information and Knowledge Management, October 24-28, 2011.
6 S. Shin, C. H. Jeong, D. Seo, S. P. Choi and H. Jung, "Improvement of the Performance in Rule-Based Knowledge Extraction by Modifying Rules," in Proc. of the 2nd International Workshop on Semantic Web-based Computer Intelligence with Big-data, November 9-11, 2013.
7 C. N. Seon, J. H. Yoo, H. Kim, J. H. Kim and J. Seo, "Lightweight Named Entity Extraction for Korean Short Message Service Text," KSII Transactions on Internet & Information Systems, vol. 5, no. 3, pp. 560-574, 2011.   DOI
8 M. Zhang, J. Zhang, J. Su and G. Zhou, "A composite kernel to extract relations between entities with both flat and structured features," in Proc. Of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics, pp. 825-832, July 17-21, 2006.
9 Z. Guodong, S. Jian, Z. Jie and Z. Min, "Exploring various knowledge in relation extraction," in Proc. of the 43rd Annual Meeting on Association for Computational Linguistics, pp. 427-434, June 25-30, 2005.
10 M. Mintz, S. Bills, R. Snow, D. Jurafsky, "Distant supervision for relation extraction without labeled data," in Proc. of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, August 2-7, 2009.
11 M. Collins, "Discriminative Training Methods for Hidden Markov Models: Theory and Experiments with Perceptron Algorithms," in Proc. of Empirical Methods in Natural Language Processing, July 6-7, 2002.
12 S. Shin, Y. S. Choi, S. K. Song, S. P. Choi and H. Jung, "Construction of Test Collection for Automatically Extracting Technological Knowledge," Journal of Korea Content Society, vol.12, no.7, 2012 (in Korean).