Mining software repositories for traceability links
Conference proceeding

H Kagdi, J.I Maletic and B Sharif
15th IEEE International Conference on Program Comprehension (ICPC '07), pp.145-154
06-2007

Abstract

Keywords: Data mining, Documentation, Frequency, History, Itemsets, Open source software, Software maintenance, Software systems, Software tools
An approach is presented for recovering or discovering traceability links between software artifacts by examining a software system's version history. A heuristic-based approach that uses sequential-pattern mining is applied to the commits in software repositories to uncover sets of artifacts that frequently change together (e.g., source code and documentation). If files of different types are committed together with high frequency, then there is a high probability that a traceability link exists between them. The approach is evaluated on a number of versions of the open source system KDE. As a validation step, the discovered links are used to predict similar changes in newer versions of the same system. The results show high-precision predictions for certain types of traceability links.
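
The co-change heuristic described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it replaces sequential-pattern mining with plain pairwise co-occurrence counting, and the names `DOC_EXTENSIONS`, `artifact_type`, `candidate_links`, the `min_support` threshold, and the sample file paths are all hypothetical, introduced only for illustration.

```python
from collections import Counter
from itertools import combinations
import os

# Hypothetical set of extensions treated as documentation (an assumption).
DOC_EXTENSIONS = {".txt", ".docbook", ".html", ".md"}


def artifact_type(path):
    """Classify a file as 'doc' or 'code' from its extension."""
    ext = os.path.splitext(path)[1].lower()
    return "doc" if ext in DOC_EXTENSIONS else "code"


def candidate_links(commits, min_support=2):
    """Return pairs of files of different types that were committed
    together at least `min_support` times, with their co-change counts."""
    pair_counts = Counter()
    for files in commits:
        # Sort so each pair has one canonical ordering across commits.
        for a, b in combinations(sorted(set(files)), 2):
            if artifact_type(a) != artifact_type(b):
                pair_counts[(a, b)] += 1
    return {pair: n for pair, n in pair_counts.items() if n >= min_support}


# Made-up commit history: each commit is the list of files it changed.
commits = [
    ["kmail/composer.cpp", "doc/kmail/using.docbook"],
    ["kmail/composer.cpp", "doc/kmail/using.docbook", "kmail/folder.cpp"],
    ["kmail/folder.cpp"],
]
links = candidate_links(commits)
```

Here the only pair that reaches the threshold is the documentation file and `kmail/composer.cpp`, which change together in two commits. A real sequential-pattern miner would additionally consider larger itemsets and the order of commits, which this sketch ignores.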
