Conference Proceeding
Comparison between SLOCs and number of files as size metrics for software evolution analysis
Grupo de Sistemas y Comunicaciones, Univ. Rey Juan Carlos
04/2006;
DOI:10.1109/CSMR.2006.17
ISBN: 0-7695-2536-9 pp.8 pp. - 213 In proceeding of: Software Maintenance and Reengineering, 2006. CSMR 2006. Proceedings of the 10th European Conference on
Source: DBLP
-
Citations (0)
- Cited In (3)
-
Conference Proceeding: Towards a Theoretical Model for Software Growth
[show abstract] [hide abstract]
ABSTRACT: Software growth (and more broadly, software evolution) is usually considered in terms of size or complexity of source code. However in different studies, usually different metrics are used, which make it difficult to compare approaches and results. In addition, not all metrics are equally easy to calculate for a given source code, which leads to the question of which one is the easiest to calculate without losing too much information. To address both issues, in this paper present a comprehensive study, based on the analysis of about 700,000 C source code files, calculating several size and complexity metrics for all of them. For this sample, we have found double Pareto statistical distributions for all metrics considered, and a high correlation between any two of them. This would imply that any model addressing software growth should produce this Pareto distributions, and that analysis based on any of the considered metrics should show a similar pattern, provided the sample of files considered is large enough.Mining Software Repositories, 2007. ICSE Workshops MSR '07. Fourth International Workshop on; 06/2007 -
Article: Software evolution in open source projects - a large-scale investigation.
Journal of Software Maintenance. 01/2007; 19:361-382. -
Conference Proceeding: On the nature of commits.
[show abstract] [hide abstract]
ABSTRACT: Information contained in versioning system commits has been frequently used to support software evolution research. Concomitantly, some researchers have tried to relate commits to certain activities, e.g., large commits are more likely to be originated from code management activities, while small ones are related to development activities. However, these characterizations are vague, because there is no consistent definition of what is a small or a large commit. In this paper, we study the nature of commits in two dimensions. First, we define the size of commits in terms of number of files, and then we classify commits based on the content of their comments. To perform this study, we use the history log of nine large open source projects.23rd IEEE/ACM International Conference on Automated Software Engineering - Workshop Proceedings (ASE Workshops 2008), 15-16 September 2008, L'Aquila, Italy; 01/2008
Data provided are for informational purposes only. Although carefully collected, accuracy cannot be guaranteed.
The impact factor represents a rough estimation of the journal's impact factor and does not reflect the actual
current impact factor.
Publisher conditions are provided by RoMEO. Differing provisions from the publisher's actual policy or licence
agreement may be applicable.
Keywords
'traditional' software
base metric
cases
concerns
convenience
files/modules counts
free
large sample
Lehman's
Lehman's laws
libre software evolution use SLOCs
libre software projects
low-level metrics
open source
size metrics
SLOCs
software evolution
source lines
traditional higher lever metrics
traditional studies use modules