Capabilities for Better ML Engineering

Chenyang Yang; Rachel Brower-Sinning; Grace A. Lewis; Christian; K\"astner; Tongshuang Wu

arXiv:2211.06409·cs.AI·February 14, 2023·5 cites

Capabilities for Better ML Engineering

Chenyang Yang, Rachel Brower-Sinning, Grace A. Lewis, Christian, K\"astner, Tongshuang Wu

PDF

Open Access

TL;DR

This paper proposes a capability-based framework to unify and improve ML engineering practices by specifying model behaviors, with preliminary evidence showing its potential to enhance model generalizability and safety.

Contribution

It introduces a novel capability-based framework for ML engineering that unites various efforts and improves model safety, generalizability, and trustworthiness.

Findings

01

Capabilities reflect model generalizability

02

Framework guides ML engineering processes

03

Potential to build safer, more trustworthy models

Abstract

In spite of machine learning's rapid growth, its engineering support is scattered in many forms, and tends to favor certain engineering stages, stakeholders, and evaluation preferences. We envision a capability-based framework, which uses fine-grained specifications for ML model behaviors to unite existing efforts towards better ML engineering. We use concrete scenarios (model design, debugging, and maintenance) to articulate capabilities' broad applications across various different dimensions, and their impact on building safer, more generalizable and more trustworthy models that reflect human needs. Through preliminary experiments, we show capabilities' potential for reflecting model generalizability, which can provide guidance for ML engineering process. We discuss challenges and opportunities for capabilities' integration into ML engineering.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScientific Computing and Data Management · Software Engineering Research · Data Quality and Management