Logo image
Mining sequences of changed-files from version histories
Conference proceeding

Mining sequences of changed-files from version histories

Huzefa Kagdi, Shehnaaz Yusuf and Jonathan Maletic
Proceedings of the 2006 international workshop on mining software repositories, pp.47-53
MSR '06
05-22-2006

Abstract

change sequences heuristics mining software repositories
Modern source-control systems, such as Subversion, preserve change-sets of files as atomic commits. However, the specific ordering information in which files were changed is typically not found in these source-code repositories. In this paper, a set of heuristics for grouping change-sets (i.e., log-entries) found in source-code repositories is presented. Given such groups of change-sets, sequences of files that frequently change together are uncovered. This approach not only gives the (unordered) sets of files but supplements them with (partial temporal) ordering information. The technique is demonstrated on a subset of KDE source-code repository. The results show that the approach is able to find sequences of changed-files.

Metrics

8 Record Views

Details

Logo image