Dealing with Data Challenges when Delivering Data-Intensive Software Solutions
Ulrike M. Graetsch, Hourieh Khalajzadeh, Mojtaba Shahin, Rashina Hoda, and John Grundy

TL;DR
This study explores the challenges faced by multi-disciplinary data-intensive software teams, identifying key issues with data access, quality, and understanding, and proposing strategies like governance and team development to address them.
Contribution
It presents a grounded theory of data challenges in MDSTs, detailing causes, consequences, and strategies, based on interviews with practitioners across roles and organizational levels.
Findings
Data access and quality are primary challenges.
Strategies include data governance and advanced tools.
Team dynamics and communication are crucial for success.
Abstract
The predicted increase in demand for data-intensive solution development is driving the need for software, data, and domain experts to effectively collaborate in multi-disciplinary data-intensive software teams (MDSTs). We conducted a socio-technical grounded theory study through interviews with 24 practitioners in MDSTs to better understand the challenges these teams face when delivering data-intensive software solutions. The interviews provided perspectives across different types of roles including domain, data and software experts, and covered different organisational levels from team members, team managers to executive leaders. We found that the key concern for these teams is dealing with data-related challenges. In this paper, we present the theory of dealing with data challenges that explains the challenges faced by MDSTs including gaining access to data, aligning data,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Techniques and Practices · Scientific Computing and Data Management · Software Engineering Research
