Final Report on MITRE Evaluations for the DARPA Big Mechanism Program
Matthew Peterson, Tonia Korves, Christopher Garay, Robyn Kozierok and, Lynette Hirschman

TL;DR
This report details the evaluation methodology for the DARPA Big Mechanism program, assessing systems' ability to extract, assemble, and explain cancer mechanisms from biomedical literature across three iterative phases.
Contribution
It introduces a comprehensive evaluation framework with innovative metrics and reference sets for assessing AI systems in biomedical knowledge extraction and modeling.
Findings
Systems improved in extracting mechanistic information across phases
Automated models increasingly able to explain experimental data
Evaluation approach guides future development of biomedical AI systems
Abstract
This report presents the evaluation approach developed for the DARPA Big Mechanism program, which aimed at developing computer systems that will read research papers, integrate the information into a computer model of cancer mechanisms, and frame new hypotheses. We employed an iterative, incremental approach to the evaluation of the three phases of the program. In Phase I, we evaluated the ability of system and human teams ability to read-with-a-model to capture mechanistic information from the biomedical literature, integrated with information from expert curated biological databases. In Phase II we evaluated the ability of systems to assemble fragments of information into a mechanistic model. The Phase III evaluation focused on the ability of systems to provide explanations of experimental observations based on models assembled (largely automatically) by the Big Mechanism process. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Scientific Computing and Data Management · Genetics, Bioinformatics, and Biomedical Research
