Define-ML: An Approach to Ideate Machine Learning-Enabled Systems
Silvio Alonso, Antonio Pedro Santos Alves, Lucas Romao, H\'elio Lopes, Marcos Kalinowski

TL;DR
Define-ML is a structured framework extending Lean Inception to better support early-stage ideation of machine learning-enabled systems by integrating data and technical constraints, improving alignment and feasibility.
Contribution
This paper introduces Define-ML, a novel extension of Lean Inception that systematically incorporates ML-specific considerations into product ideation.
Findings
Participants found Define-ML effective for clarifying data concerns.
The approach reduced ideation ambiguity and improved alignment.
Participants expressed strong intent to adopt Define-ML.
Abstract
[Context] The increasing adoption of machine learning (ML) in software systems demands specialized ideation approaches that address ML-specific challenges, including data dependencies, technical feasibility, and alignment between business objectives and probabilistic system behavior. Traditional ideation methods like Lean Inception lack structured support for these ML considerations, which can result in misaligned product visions and unrealistic expectations. [Goal] This paper presents Define-ML, a framework that extends Lean Inception with tailored activities - Data Source Mapping, Feature-to-Data Source Mapping, and ML Mapping - to systematically integrate data and technical constraints into early-stage ML product ideation. [Method] We developed and validated Define-ML following the Technology Transfer Model, conducting both static validation (with a toy problem) and dynamic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification
MethodsADaptive gradient method with the OPTimal convergence rate
