DOI QR코드

DOI QR Code

Intention Classification for Retrieval of Health Questions

  • Liu, Rey-Long (Department of Medical Informatics, Tzu Chi University Hualien)
  • Received : 2016.10.29
  • Accepted : 2017.02.26
  • Published : 2017.03.31

Abstract

Healthcare professionals have edited many health questions (HQs) and their answers for healthcare consumers on the Internet. The HQs provide both readable and reliable health information, and hence retrieval of those HQs that are relevant to a given question is essential for health education and promotion through the Internet. However, retrieval of relevant HQs needs to be based on the recognition of the intention of each HQ, which is difficult to be done by predefining syntactic and semantic rules. We thus model the intention recognition problem as a text classification problem, and develop two techniques to improve a learning-based text classifier for the problem. The two techniques improve the classifier by location-based and area-based feature weightings, respectively. Experimental results show that, the two techniques can work together to significantly improve a Support Vector Machine classifier in both the recognition of HQ intentions and the retrieval of relevant HQs.

Keywords

References

  1. Abbas, J., Schwartz, D. G., & Krause, R. (2010). Emergency medical residents' use of Google(R) for answering clinical questions in the emergency room. In Proc. of ASIST 2010.
  2. Cartright, M.-A., White, R. W., & Horvitz, E. (2011). Intentions and attention in exploratory health search. In Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval, 65-74.
  3. Casellas, N., Casanovas, P., Vallbe, J.-J., Poblet, M., Blazquez, M., Contreras, J., Lopez-Cobo, J.-M., & Richard, V. (2007). Semantic enhancement for legal information retrieval: IURISERVICE performance. In Proceedings of ICAIL, Palo Alto, CA USA.
  4. Cohen, W. W., & Singer, Y. (1996). Context-sensitive mining methods for text categorization. In Proceedings of the 19th annual international ACM SIGIR conference on research and development in information retrieval, 307-315, Zurich, Switzerland.
  5. Eysenbach, G., & Kohler, C. (2002). How do consumers search for and appraise health informationa on the world wide web? Qualitative study using focus groups, usability tests, and in-depth interviews, British Medical Journal, 324, 573-577. https://doi.org/10.1136/bmj.324.7337.573
  6. Huang, Z., Thint, M., & Qin, Z. (2008). Question classification using headwords and their hypernyms. Proc. of the 2008 Conference on Empirical Methods in Natural Language Processing, 927-936.
  7. Joachims, T. (1999). Making large-scale SVM learning practical. In advances in kernel methods - support vector learning, B. Scholkopf, C. Burges and A. Smola (ed.), MIT-Press.
  8. Krishnan, V., Das, S., & Chakrabarti, S. (2005). Enhanced answer type inference from questions using sequential models. Proc. of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, 315-322.
  9. Lee, C.-W, Day, M.-Y., Sung, C.-L., Lee, Y.-H., Jiang, T.-J., Wu, C.-W., Shih, C.-W., Chen, Y.-R., & Hsu, W.-L. (2008). Boosting chinese question answering with two lightweight methods: ABSPs and SCO-QAT. ACM Trans. Asian Lang. Inform. Process, Article 12.
  10. Lin, J. & Demner-Fushman, D. (2006). The role of knowledge in conceptual retrieval: A study in the domain of clinical medicine. In Proceedings of SIGIR' 06, Seattle, Washington, USA.
  11. Lin, X.-D., Peng, H., & Liu, B. (2006). Support Vector Machines for Text Categorization in Chinese Question Classification. Proc. of the 2006 IEEE/WIC/ACM International Conferenceon Web Intelligence.
  12. Liu, R.-L. (2010). Context-based term frequency assessment for text classification. Journal of the American Society for Information Science and Technology, 61(2), 300-309. https://doi.org/10.1002/asi.21260
  13. Liu, R.-L., & Lin, S.-L. (2012). A Conceptual model for retrieval of chinese frequently asked questions in healthcare, Proc. of the 8th Asia Information Retrieval Societies Conference, LNCS, Springer-Verlag Berlin Heidelberg, Tianjin, Mainland China.
  14. Liszka, H. A., Steyer, T. E., Hueston, W. J. (2006). Virtual medical care: How are our patients using online health information? Journal of Community Health, 31(5), 368-78. https://doi.org/10.1007/s10900-006-9019-3
  15. Mishra, M., Mishra, V. K., & Sharma, H. R. (2013). Question classification using Semantic, syntactic and lexical features. International Journal of Web & Semantic Technology, 4(3).
  16. Moschitti, A., Chu-Carroll, J., Patwardhan, S., Fan, J., & Riccardi, G. (2011). Using syntactic and semantic structural kernels for classifying definition questions in Jeopardy! Proc. of the 2011 Conference on Empirical Methods in Natural Language Processing, 712-724.
  17. Pan, Y., Tang, Y., Lin, L., & Luo, Y. (2008). Question classification with semantic tree kernel. Proc. of SIGIR'08, Singapore.
  18. Palotti, J. R. M., Stefanov, V., & Hanbury, A. (2014). User intent behind medical queries: An evaluation of entity mapping approaches with Metamap and Freebase, in Proceedings of the 5th Information Interaction in Context Symposium, 283-286.
  19. Peng, F. & Schuurmans, D. (2003). Combining naive bayes and n-Gram language models for text classification. In Proceedings of 25th European Conference on Information Retrieval Research (ECIR), 335-350. Pisa, Italy.
  20. Raghavi, K. C., Chinnakotla, M. K., & Shrivastava, M. (2015). "Answer ka type kya he?": learning to classify questions in code-mixed language. in Proceedings of the 24th International Conference on World Wide Web, 853-858.
  21. Rekha, V. S., Divya, N., & Bagavathi, P. S. (2014). A hybrid auto-tagging dystem for stackoverflow forum questions. in Proceedings of the 2014 International Conference on Interdisciplinary Advances in Applied Computing, Article No.56.
  22. Shuyler, K. S., & Knight, K. M. (2003). What are patients seeking when they turn to the Internet? Qualitative content analysis of questions asked by visitors to an orthopaedics web site, Journal of Medical Internet Research, 5(4), 24. https://doi.org/10.2196/jmir.5.4.e24
  23. Thomson, M. D., & Hoffman-Goetz, L. (2007). Readability and cultural sensitivity of web-based patient decision aids for cancer screening and treatment: A systematic review, Medical Informatics and the Internet in Medicine 32(4), 263-286. https://doi.org/10.1080/14639230701780408
  24. Wang, K., Ming, Z., & Chua, T.-S. (2009). A syntactic tree matching approach to finding similar questions in community-based QA services. In Proceedings of SIGIR, Boston, Massachusetts, USA.
  25. Wu, C.-H., Yeh, J.-F., & Lai, Y.-S. (2006). Semantic segment extraction and matching for Internet FAQ Retrieval. IEEE Transactions on Knowledge and Data Engineering, 18(7).
  26. Wu, C.-H., Yeh, J.-F., & Chen, M.-J. (2005). Domain-specific FAQ retrieval using independent aspects. ACM Transactions on Asian Language Information Processing, 4(1), 1-17. https://doi.org/10.1145/1066078.1066079
  27. Zeng, Q. T., Kogan, S., Plovnick, R. M., Crowell, J., Lacroix, E.-M., & Greenes, R. A. (2004). Positive attitudes and failed queries: an exploration of the conundrums of consumer health information retrieval. International Journal of Medical Informatics, 73, 45-55. https://doi.org/10.1016/j.ijmedinf.2003.12.015
  28. Zhang, C., Fan, W., Du, N., & Yu, P. S. (2016). Mining user intentions from medical queries: A neural network based heterogeneous jointly modeling approach, in Proceedings of the 25th International Conference on World Wide Web (WWW'16), 1373-1384.
  29. Zhang, D., & Lee, W. S. (2003). Question classification using support vector machines. Proc. of SIGIR'03, Toronto, Canada.