Paper Title
Balanced Knowledge Distribution among Software Development Teams -- Observations from Open-Source and Closed-Source Software Development
Paper Authors
Abstract
In software development teams, developer turnover is among the primary reasons for project failure, as it leaves a great void of knowledge and puts strain on newcomers. Unfortunately, no established methods exist to measure how knowledge is distributed among development teams. Knowing how this knowledge evolves and how it is owned by key developers in a project helps managers reduce the risks caused by turnover. To this end, this paper introduces a novel, realistic representation of domain knowledge distribution: the ConceptRealm. To construct the ConceptRealm, we employ a latent Dirichlet allocation model to represent textual features obtained from 300k issues and 1.3M comments from 518 open-source projects. We analyze whether newly emerged issues and developers share similar concepts or how aligned the developers' concepts are with the team's over time. We also investigate the impact of leaving members on the frequency of concepts. Finally, we evaluate the soundness of our approach on closed-source software, which allows the results to be validated from a practical standpoint. We find that the ConceptRealm can represent the high-level domain knowledge within a team and can be used to predict the alignment of developers with issues. We also observe that projects exhibit many keepers independent of project maturity and that keepers who leave abruptly harm the team's concept familiarity.
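To make the idea of a developer's concepts being "aligned" with the team's more concrete, here is a minimal sketch. It assumes each developer is already summarized as a topic-probability vector, for example the averaged output of a latent Dirichlet allocation model over their issues and comments. The cosine measure, the function names `cosine_alignment` and `team_distribution`, and the sample vectors are all illustrative assumptions, not necessarily the measure or data used in the paper.

```python
import math


def cosine_alignment(dev_topics, team_topics):
    """Cosine similarity between two topic-probability vectors.

    Assumption: cosine similarity stands in for "alignment" here;
    the paper may use a different measure.
    """
    if len(dev_topics) != len(team_topics):
        raise ValueError("vectors must have the same number of topics")
    dot = sum(d * t for d, t in zip(dev_topics, team_topics))
    norm_dev = math.sqrt(sum(d * d for d in dev_topics))
    norm_team = math.sqrt(sum(t * t for t in team_topics))
    if norm_dev == 0 or norm_team == 0:
        return 0.0  # an all-zero vector carries no concept information
    return dot / (norm_dev * norm_team)


def team_distribution(developer_vectors):
    """Average the developers' topic vectors into one team-level vector."""
    n = len(developer_vectors)
    k = len(developer_vectors[0])
    return [sum(v[i] for v in developer_vectors) / n for i in range(k)]


# Hypothetical topic distributions over three concepts (made-up numbers).
developers = {
    "alice": [0.7, 0.2, 0.1],
    "bob": [0.1, 0.1, 0.8],
}

team = team_distribution(list(developers.values()))
for name, vec in developers.items():
    print(name, round(cosine_alignment(vec, team), 3))
```

Under these assumptions, a developer whose score stays low over time works on concepts the rest of the team rarely touches. That is one possible way to flag knowledge concentrated in a single person.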