What's a Typical Commit? A Characterization of Open Source Software Repositories
Conference proceeding

A. Alali, H. Kagdi, and J. I. Maletic
2008 16th IEEE International Conference on Program Comprehension, pp.182-191
International Conference on Program Comprehension
06-2008

Abstract

Keywords: commit analysis; Computer science; Control systems; History; mining software repositories; Open source software; Software systems; Solids; Vocabulary
The research examines the version histories of nine open source software systems to uncover trends and characteristics of how developers commit source code to version control systems (e.g., Subversion). The goal is to characterize what a typical or normal commit looks like with respect to the number of files, number of lines, and number of hunks committed together. The results for these three characteristics are presented, and the commits are categorized from extra small to extra large. The findings show that, for the systems examined, approximately 75% of commits are quite small along all three characteristics. Additionally, the commit messages are examined along with the characteristics. The most common words are extracted from the commit messages and correlated with the size categories of the commits. It is observed that the size categories can be indicative of the types of maintenance activities being performed.
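
As an illustration of the three size characteristics named in the abstract, the following minimal sketch counts files, changed lines, and hunks in a unified diff and then assigns a size label. It is not the authors' tooling. The names `measure`, `categorize`, `CommitSize`, and `SAMPLE` are hypothetical, "lines" is assumed to mean added plus removed lines, and the category cut points are placeholders supplied by the caller, since the abstract does not state the thresholds used in the paper.

```python
import re
from bisect import bisect_left
from typing import NamedTuple

# Matches a unified-diff hunk header such as "@@ -10,2 +10,3 @@".
# A missing count (e.g. "@@ -1 +1 @@") means a length of 1.
HUNK_RE = re.compile(r"^@@ -\d+(?:,(\d+))? \+\d+(?:,(\d+))? @@")

LABELS = ["extra-small", "small", "medium", "large", "extra-large"]


class CommitSize(NamedTuple):
    files: int
    lines: int  # assumption: added + removed lines
    hunks: int


def measure(diff_text: str) -> CommitSize:
    """Count files, changed lines, and hunks in a unified diff."""
    files = lines = hunks = 0
    old_left = new_left = 0  # lines still expected in the current hunk body
    for line in diff_text.splitlines():
        if old_left > 0 or new_left > 0:
            # Inside a hunk body: the header told us how many lines to expect,
            # so content that resembles "--- " or "+++ " is not misread.
            if line.startswith("+"):
                lines += 1
                new_left -= 1
            elif line.startswith("-"):
                lines += 1
                old_left -= 1
            elif line.startswith("\\"):
                pass  # "\ No newline at end of file" marker
            else:
                old_left -= 1  # context line belongs to both sides
                new_left -= 1
            continue
        match = HUNK_RE.match(line)
        if match:
            hunks += 1
            old_left = int(match.group(1)) if match.group(1) is not None else 1
            new_left = int(match.group(2)) if match.group(2) is not None else 1
        elif line.startswith("+++ "):
            files += 1  # one "+++ " header per changed file
    return CommitSize(files, lines, hunks)


def categorize(value: int, upper_bounds: list) -> str:
    """Map a measurement to a size label.

    upper_bounds holds four ascending, inclusive upper limits for the first
    four labels; anything larger is "extra-large". The limits are supplied
    by the caller and are NOT the thresholds from the paper.
    """
    return LABELS[bisect_left(upper_bounds, value)]


SAMPLE = """\
--- a/foo.py
+++ b/foo.py
@@ -1,3 +1,3 @@
 import os
-x = 1
+x = 2
 print(x)
@@ -10,2 +10,3 @@
 def f():
+    # note
     return 1
--- a/bar.txt
+++ b/bar.txt
@@ -1 +1 @@
-old
+new
"""

size = measure(SAMPLE)
print(size, categorize(size.files, [1, 2, 4, 10]))
```

In practice the cut points would be derived per system from the observed distribution of each characteristic (for example, from quantiles), so that the labels reflect what is typical for that repository.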
