An Annotated Corpus for Machine Reading of Instructions in Wet Lab Protocols

Chaitanya Kulkarni; Wei Xu; Alan Ritter; Raghu Machiraju

arXiv:1805.00195·cs.CL·May 2, 2018

TL;DR

This paper introduces an annotated corpus of wet lab protocols to aid in converting natural language instructions into machine-readable formats, supporting automated biological research workflows.

Contribution

The paper presents a new annotated corpus of 622 wet lab protocols and demonstrates its utility for developing machine learning approaches to shallow semantic parsing of instructional text.

Findings

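01. Experiments show the corpus is useful for developing shallow semantic parsers for instructional text

02. Annotations support automatic or semi-automatic conversion of protocols into a machine-readable format

03. The annotated corpus is available to the research community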

Abstract

We describe an effort to annotate a corpus of natural language instructions consisting of 622 wet lab protocols to facilitate automatic or semi-automatic conversion of protocols into a machine-readable format and benefit biological research. Experimental results demonstrate the utility of our corpus for developing machine learning approaches to shallow semantic parsing of instructional texts. We make our annotated Wet Lab Protocol Corpus available to the research community.
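
To make "machine-readable format" concrete, here is a minimal sketch of how one instruction might be stored after shallow semantic parsing. It is an illustration only: the class names, the labels (`Action`, `Reagent`, `Location`), the role names (`acts_on`, `site`), and the example sentence are all hypothetical and are not taken from the paper's annotation scheme.

```python
import json
from dataclasses import dataclass, field, asdict
from typing import List


@dataclass
class Span:
    text: str
    label: str  # hypothetical entity label, e.g. "Action" or "Reagent"


@dataclass
class Argument:
    role: str  # hypothetical relation between the action and the span
    span: Span


@dataclass
class Step:
    sentence: str
    action: Span
    arguments: List[Argument] = field(default_factory=list)

    def to_json(self) -> str:
        # asdict() recursively converts nested dataclasses to plain dicts.
        return json.dumps(asdict(self), sort_keys=True)


def build_example() -> Step:
    # Invented sentence and hand-written labels, for illustration only.
    return Step(
        sentence="Add 5 ml of buffer to the tube.",
        action=Span("Add", "Action"),
        arguments=[
            Argument("acts_on", Span("5 ml of buffer", "Reagent")),
            Argument("site", Span("the tube", "Location")),
        ],
    )


if __name__ == "__main__":
    print(build_example().to_json())
```

A structure like this, one action plus its labeled arguments per instruction, is the kind of output a shallow semantic parser trained on an annotated corpus could be expected to produce; the actual label inventory should be taken from the released corpus.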
