DOI QR코드

DOI QR Code

A Comparative Study between LSI and LDA in Constructing Traceability between Functional and Non-Functional Requirements

  • Byun, Sung-Hoon (Dept. of Computer engineering, Ajou University) ;
  • Lee, Seok-Won (Dept. of Software and Computer engineering, Ajou University)
  • Received : 2019.04.15
  • Accepted : 2019.06.06
  • Published : 2019.07.31

Abstract

Requirements traceability is regarded as one of the important quality attributes in software requirements engineering field. If requirements traceability is guaranteed then we can trace the requirements' life throughout all the phases, from the customers' needs in the early stage of the project to requirements specification, deployment, and maintenance phase. This includes not only tracking the development artifacts that accompany the requirements, but also tracking backwards from the development artifacts to the initial customer requirements associated with them. In this paper, especially, we dealt with the traceability between functional requirements and non-functional requirements. Among many Information Retrieval (IR) techniques, we decided to utilize Latent Semantic Indexing (LSI) and Latent Dirichlet Allocation (LDA) in our research. Ultimately, we conducted an experiment on constructing traceability by using two techniques and analyzed the experiment results. And then we provided a comparative study between two IR techniques in constructing traceability between functional requirements and non-functional requirements.

Keywords

CPTSCQ_2019_v24n7_19_f0001.png 이미지

Fig. 1. Singular Value Decomposition of t × d metrix

CPTSCQ_2019_v24n7_19_f0002.png 이미지

Fig. 2. Schematic model of LDA

CPTSCQ_2019_v24n7_19_f0003.png 이미지

Fig. 3. 897 × 97 Word-Document Matrix

CPTSCQ_2019_v24n7_19_f0004.png 이미지

Fig. 4. Process of LSI

CPTSCQ_2019_v24n7_19_f0005.png 이미지

Fig. 5. Result of LSI Similar calculation of functionalrequirement of ‘Accuracy’

CPTSCQ_2019_v24n7_19_f0006.png 이미지

Fig. 6. Word-by-topic distribution depending on K value

Table 1. Requirements Traceability Matrix

CPTSCQ_2019_v24n7_19_t0001.png 이미지

Table 2. Three Steps Level Similarity of LSI

CPTSCQ_2019_v24n7_19_t0002.png 이미지

Table 3. Accuracy of LSI

CPTSCQ_2019_v24n7_19_t0003.png 이미지

Table 4. Three Steps Level Similarity of LDA

CPTSCQ_2019_v24n7_19_t0004.png 이미지

Table 5. Accuracy of LDA

CPTSCQ_2019_v24n7_19_t0005.png 이미지

References

  1. Orelena C. Z. Gotel, Anthony C. W. Finkelstein, "An analysis of the requirements traceability problem", Proceedings of the First International Conference on Requirements Engineering, Colorado Springs, CO, USA, 1994.
  2. Vikash Katta, Christian Raspotnig, Peter Karpati, Tor Stalhane, "Requirements management in a combined process for safety and security assessments", Eighth International Conference on Availability, Reliability and Security (ARES), p.780-786, Regensburg, Germany September 2013.
  3. Vikash Katta, Tor Stalhane, "A conceptual model of traceability for safety systems", Electronic Proc. 2nd Complex Systems Design & Management Conference (CSD&M 2011), Poster-Session Paper, Paris, France, 7-9 December 2011.
  4. Kannenberg, Andrew, and Hossein Saiedian. "Why software requirements traceability remains a challenge." CrossTalk. The Journal of Defense Software Engineering 22.5 : 14-19, 2009
  5. Knethen AV, Paech B, "A survey on tracing approaches in practice and research", Research Report, ESE-Report, 095.01/E, Fraunhofer IESE, Kaiserslautern, 2002.
  6. Jane Cleland-Huang, Orlena C. Z. Gotel, Jane Huffman Hayes, et al, "Software traceability: trends and future directions" Proceedings of the on Future of Software Engineering, p. 55-69, Hyderabad, India , 2014.
  7. Asuncion, Hazeline U., Arthur U. Asuncion, and Richard N. Taylor. "Software traceability with topic modeling." Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering-Volume 1. p.95, Cape Town, South Africa, 2010.
  8. Thefor P.Williams, John F.Betak. "A Comparison of LSA and LDA for the Analysis of Railroad Accident text." Procedia Computer science, Vol 130, p. 98-102, 2018 https://doi.org/10.1016/j.procs.2018.04.017
  9. Dan Port, Jane Huffman Hayes, LiGuo Huang. "Text mining support for software requirements: Traceability assurance." System Sciences (HICSS), 2011 44th Hawaii International Conference on. IEEE, p.1-11, Kauai, HI, USA, 2011.
  10. Cysneiros, Luiz Marcio, and Julio Cesar Sampaio do Prado Leite. "Nonfunctional requirements: From elicitation to conceptual models." IEEE Transactions on Software Engineering Vol 30, Issue5, p.328-350, 04 May 2004 https://doi.org/10.1109/TSE.2004.10
  11. Gross, Daniel, and Eric Yu. "From non-functional requirements to design through patterns.", Requirements Engineering, Vol 6, Issue 1, p.18-36, 2001 https://doi.org/10.1007/s007660170013
  12. Jane Cleland-Huang, et al. "Goal-centric traceability for managing non-functional requirements." Proceedings of the 27th international conference on Software engineering. Saint Louis, MO, USA, 2005.
  13. Jane Cleland-Huang, "Toward improved traceability of non-functional requirements." TEFSE '05 Proceedings of the 3rd international workshop on Traceability in emerging forms of software engineering. p.14-19, Long Beach, California, 2005.
  14. Stefan Winkler, Jens von Pilgrim, "A survey of traceability in requirements engineering and model-driven development", Journal Software and Systems Modeling (SoSyM) archive, Volume 9, Issue 4, Pages 529-565, September 2010 https://doi.org/10.1007/s10270-009-0145-0
  15. Ismenia Galvao, Arda Goknil, "Survey of traceability approaches in model-driven engineering", 11th IEEE International Enterprise Distributed Object Computing Conference, Annapolis, MD, USA, 2007
  16. Stale Walderhaug, Ulrik Johansen, Erlend Stav, Jan Aagedal, "Towards a generic solution for traceability in MDD", ECMDA Traceability Workshop (ECMDA-TW), p. 41-50, 2006.
  17. Pedro Sanchez, Diego Alonso, Francisca Rosique, Barbara Alvarez, Juan A. Pastor, "Introducing safety requirements traceability support in model-driven development of robotic applications", IEEE Transactions on Computers, Vol 60, Issue 8, Aug. 2010.
  18. Kassab, Mohamad, Olga Ormandjieva, and Maya Daneva. "A traceability metamodel for change management of non-functional requirements." Software Engineering Research, Management and Applications, Prague, Czech Republic, 2008
  19. Hayes, Jane Huffman, Alex Dekhtyar, and James Osborne. "Improving requirements tracing via information retrieval." Requirements Engineering Conference, Monterey Bay, CA, USA, 2003
  20. Deerwester, S., Dumais, S. T., Furnas, G. W., Landauer, T. K., & Harshman, R. "Indexing by latent semantic analysis", journal of the American society for information science, Vol 41, Issue 6, 391, September 1990 https://doi.org/10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9
  21. Landauer, Thomas K., Peter W. Foltz, and Darrell Laham. "An introduction to latent semantic analysis.", Discourse processes, Vol 25, Issue 2-3 p. 259-284, 1998 https://doi.org/10.1080/01638539809545028
  22. Lormans, Marco, and Arie Van Deursen. "Can LSI help reconstructing requirements traceability in design and test?" Software Maintenance and Reengineering, Bari, Italy, 2006
  23. Bradford, Roger B. "An empirical study of required dimensionality for large-scale latent semantic indexing applications." Proceedings of the 17th ACM conference on Information and knowledge management. p. 153-162, Napa Valley, California, USA, 2008
  24. Yee Whye Teh, Michael I. Jordan, Matthew J. Beal and David M. Blei, "Hierarchical Dirichlet Processes", Journal of the American Statistical Association, Vol 101, No.476, p.1566-1581, 2006 https://doi.org/10.1198/016214506000000302
  25. Blei, David M., Andrew Y. Ng, and Michael I. Jordan. "Latent Dirichlet Allocation." Journal of Machine Learning Research 3, p.993-1022, Jan 2003
  26. Mei, Qiaozhu, Xuehua Shen, and ChengXiang Zhai. "Automatic labeling of multinomial topic models." Proceedings of the 13th ACM SIGKDD international conference onn Kowledge discovery and data mining, p.490-499, San Jose, California, USA, 2007
  27. De Lucia, Antoniol. "Information retrieval models for recovering traceability links between code and documentation." Proceedings 2000 International Conference on Software Maintenance, San Jose, CA, USA, 2000
  28. Marcus, Andrian, and Jonathan I. Maletic. "Recovering documentation-to-source-code traceability links using latent semantic indexing." Proceedings of the 25th International Conference on Software Engineering, Portland, Oregon, 2003
  29. Hayes, Jane Huffman, Alex Dekhtyar, and James Osborne. "Improving requirements tracing via information retrieval.", Proceedings. 11th International Requirements Engineering Conference, p.138-147, Monterey Bay, CA, USA, 2003.
  30. Giovanni Capobianco, Andrea De Lucia, et al. "On the role of the nouns in IR-based traceability recovery.", 2009 IEEE 17th International Conference on Program Comprehension, p. 148-157, Vancouver, BC, Canada, 2009.
  31. Oliveto, Rocco, et al. "On the equivalence of information retrieval methods for automated traceability link recovery.", 2010 IEEE 18th International Conference on Program Comprehension, p. 68-71, Braga, Minho, Portugal, 2010.
  32. Gethers, Malcom, et al. "On integrating orthogonal information retrieval methods to improve traceability recovery.", 2011 27th IEEE International Conference on Software Maintenance(ICSM), p. 133-142, , Williamsburg, VI, USA, 2011.
  33. Rehurek, Radim, and Petr Sojka. "Software framework for topic modelling with large corpora." In Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks, At Malta, May 2010.
  34. Stopword Lists, www.ranks.nl/stopwords