A Novel Approach for Estimating Truck Factors

Guilherme Avelino; Leonardo Passos; Andre Hora; Marco Tulio Valente

arXiv:1604.06766·cs.SE·August 7, 2018

A Novel Approach for Estimating Truck Factors

Guilherme Avelino, Leonardo Passos, Andre Hora, Marco Tulio Valente

PDF

2 Repos

TL;DR

This paper introduces an automated method to estimate the Truck Factor in software projects, analyzing 133 GitHub projects and validating results through developer surveys to assess project knowledge concentration.

Contribution

It presents a novel, automated approach for calculating Truck Factors and provides empirical evidence from real-world projects and developer feedback.

Findings

01

65% of projects have TF <= 2

02

84% developer agreement on main authorship

03

53% positive feedback on estimated TFs

Abstract

Truck Factor (TF) is a metric proposed by the agile community as a tool to identify concentration of knowledge in software development environments. It states the minimal number of developers that have to be hit by a truck (or quit) before a project is incapacitated. In other words, TF helps to measure how prepared is a project to deal with developer turnover. Despite its clear relevance, few studies explore this metric. Altogether there is no consensus about how to calculate it, and no supporting evidence backing estimates for systems in the wild. To mitigate both issues, we propose a novel (and automated) approach for estimating TF-values, which we execute against a corpus of 133 popular project in GitHub. We later survey developers as a means to assess the reliability of our results. Among others, we find that the majority of our target systems (65%) have TF <= 2. Surveying…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.