# OpBerg: Discovering causal sentences using optimal alignments

**Authors:** Justin Wood, Nicholas J. Matiasz, Alcino J. Silva, William Hsu, Alexej, Abyzov, Wei Wang

arXiv: 1904.02032 · 2019-04-04

## TL;DR

This paper introduces OpBerg, a novel method that uses optimal sequence alignments to accurately extract causal sentences from biological literature, overcoming limitations of existing machine learning and database-based approaches.

## Contribution

The paper presents a new alignment-based approach for causal sentence extraction that outperforms existing methods in accuracy and efficiency.

## Key findings

- Improved accuracy over existing methods
- Efficient computational performance
- Effective in extracting causal relations from biological texts

## Abstract

The biological literature is rich with sentences that describe causal relations. Methods that automatically extract such sentences can help biologists to synthesize the literature and even discover latent relations that had not been articulated explicitly. Current methods for extracting causal sentences are based on either machine learning or a predefined database of causal terms. Machine learning approaches require a large set of labeled training data and can be susceptible to noise. Methods based on predefined databases are limited by the quality of their curation and are unable to capture new concepts or mistakes in the input. We address these challenges by adapting and improving a method designed for a seemingly unrelated problem: finding alignments between genomic sequences. This paper presents a novel and outperforming method for extracting causal relations from text by aligning the part-of-speech representations of an input set with that of known causal sentences. Our experiments show that when applied to the task of finding causal sentences in biological literature, our method improves on the accuracy of other methods in a computationally efficient manner.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.02032/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/1904.02032/full.md

## References

18 references — full list in the complete paper: https://tomesphere.com/paper/1904.02032/full.md

---
Source: https://tomesphere.com/paper/1904.02032