Document Clustering Technique by Domain Ontology

  • Received : 2016.04.08
  • Accepted : 2016.06.24
  • Published : 2016.06.30

Abstract

Document clustering lets us organize, manage, search, and process documents efficiently. Because documents contain many distinct terms, they are generally clustered in a high-dimensional feature space. In this paper, we propose a new method that clusters documents efficiently in a low-dimensional feature space by finding the core concepts in a domain ontology that corresponds to the documents' subject area. Our experiment shows that the proposed clustering method performs well.
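To make the idea of a low-dimensional concept space concrete, the following is a minimal sketch, not the method evaluated in the paper. It assumes a hypothetical toy ontology (`CORE_CONCEPTS`) in which each core concept lists the terms that express it, projects every document onto one axis per concept, and groups the resulting vectors with a simple greedy cosine-similarity pass. The concept names, term sets, threshold, and clustering rule are all illustrative assumptions.

```python
import math
from collections import Counter

# Hypothetical toy ontology: each core concept maps to terms that express it.
# A real domain ontology would supply these concepts and their lexical labels.
CORE_CONCEPTS = {
    "disease": {"disease", "cancer", "tumor", "infection"},
    "gene": {"gene", "dna", "genome", "mutation"},
    "protein": {"protein", "enzyme", "kinase"},
}
CONCEPT_ORDER = sorted(CORE_CONCEPTS)  # fixed axis order: disease, gene, protein


def concept_vector(text):
    """Project a document onto the concept axes and L2-normalize the result."""
    counts = Counter(text.lower().split())
    vec = [sum(counts[term] for term in CORE_CONCEPTS[c]) for c in CONCEPT_ORDER]
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm else vec


def cosine(a, b):
    """Cosine similarity of two already-normalized vectors (a dot product)."""
    return sum(x * y for x, y in zip(a, b))


def cluster(docs, threshold=0.5):
    """Greedy single pass: a document joins the first cluster whose seed
    vector is similar enough, otherwise it starts a new cluster."""
    clusters = []  # list of (seed_vector, member_indices)
    for i, doc in enumerate(docs):
        v = concept_vector(doc)
        for seed, members in clusters:
            if cosine(seed, v) >= threshold:
                members.append(i)
                break
        else:
            clusters.append((v, [i]))
    return [members for _, members in clusters]


if __name__ == "__main__":
    docs = [
        "the gene mutation alters dna",
        "cancer tumor growth and infection",
        "genome and dna sequencing",
        "tumor disease progression",
    ]
    print(cluster(docs))
```

Each document here becomes a three-dimensional vector regardless of vocabulary size, which is the point of clustering in concept space rather than term space. Any standard clustering algorithm could replace the greedy pass.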
