Conference Proceeding

Comparison between SLOCs and number of files as size metrics for software evolution analysis

Grupo de Sistemas y Comunicaciones, Univ. Rey Juan Carlos
04/2006; DOI:10.1109/CSMR.2006.17 ISBN: 0-7695-2536-9 pp.8 pp. - 213 In proceeding of: Software Maintenance and Reengineering, 2006. CSMR 2006. Proceedings of the 10th European Conference on
Source: DBLP

ABSTRACT There are some concerns in the research community about the convenience of using low-level metrics (such as SLOC, source lines of code) for characterizing the evolution of software, instead of the more traditional higher lever metrics (such as the number of modules or files). This issue has been raised in particular after some studies that suggest that libre (free, open source) software evolves differently than 'traditional' software, and therefore it does not conform to Lehman's laws of software evolution. Since those studies on libre software evolution use SLOCs as the base metric, while Lehman's and other traditional studies use modules or files, it is difficult to compare both cases. To overcome this difficulty, and to explore the differences between SLOC and files/modules counts in libre software projects, we have selected a large sample of programs and have calculated both size metrics over time. Our study shows that in those cases the evolution patterns in both cases (counting SLOCs or files) is the same, and that some patterns not conforming to Lehman's laws are indeed apparent

0 0
 · 
0 Bookmarks
 · 
26 Views
  • Source
    Conference Proceeding: Towards a Theoretical Model for Software Growth
    [show abstract] [hide abstract]
    ABSTRACT: Software growth (and more broadly, software evolution) is usually considered in terms of size or complexity of source code. However in different studies, usually different metrics are used, which make it difficult to compare approaches and results. In addition, not all metrics are equally easy to calculate for a given source code, which leads to the question of which one is the easiest to calculate without losing too much information. To address both issues, in this paper present a comprehensive study, based on the analysis of about 700,000 C source code files, calculating several size and complexity metrics for all of them. For this sample, we have found double Pareto statistical distributions for all metrics considered, and a high correlation between any two of them. This would imply that any model addressing software growth should produce this Pareto distributions, and that analysis based on any of the considered metrics should show a similar pattern, provided the sample of files considered is large enough.
    Mining Software Repositories, 2007. ICSE Workshops MSR '07. Fourth International Workshop on; 06/2007
  • Source
    Article: Software evolution in open source projects - a large-scale investigation.
    Journal of Software Maintenance. 01/2007; 19:361-382.
  • Source
    Conference Proceeding: On the nature of commits.
    [show abstract] [hide abstract]
    ABSTRACT: Information contained in versioning system commits has been frequently used to support software evolution research. Concomitantly, some researchers have tried to relate commits to certain activities, e.g., large commits are more likely to be originated from code management activities, while small ones are related to development activities. However, these characterizations are vague, because there is no consistent definition of what is a small or a large commit. In this paper, we study the nature of commits in two dimensions. First, we define the size of commits in terms of number of files, and then we classify commits based on the content of their comments. To perform this study, we use the history log of nine large open source projects.
    23rd IEEE/ACM International Conference on Automated Software Engineering - Workshop Proceedings (ASE Workshops 2008), 15-16 September 2008, L'Aquila, Italy; 01/2008

Full-text (2 Sources)

View
3 Downloads
Available from
30 Oct 2012

Keywords

'traditional' software
 
base metric
 
cases
 
concerns
 
convenience
 
files/modules counts
 
free
 
large sample
 
Lehman's
 
Lehman's laws
 
libre software evolution use SLOCs
 
libre software projects
 
low-level metrics
 
open source
 
size metrics
 
SLOCs
 
software evolution
 
source lines
 
traditional higher lever metrics
 
traditional studies use modules