Predicting and Understanding Law-Making with Word Vectors and an   Ensemble Model

John J. Nay

arXiv:1607.02109·cs.CL·July 5, 2017

Predicting and Understanding Law-Making with Word Vectors and an Ensemble Model

John J. Nay

PDF

TL;DR

This paper presents a machine learning approach using word vectors and ensemble models to predict the likelihood of bills becoming law, analyzing the influence of text and context over time.

Contribution

It introduces a novel ensemble model combining text and contextual data to forecast legislative outcomes with temporal analysis.

Findings

01

Text-only models outperform context-only at bill introduction

02

Context-only models outperform text-only with updated data

03

Combined models yield the highest prediction accuracy

Abstract

Out of nearly 70,000 bills introduced in the U.S. Congress from 2001 to 2015, only 2,513 were enacted. We developed a machine learning approach to forecasting the probability that any bill will become law. Starting in 2001 with the 107th Congress, we trained models on data from previous Congresses, predicted all bills in the current Congress, and repeated until the 113th Congress served as the test. For prediction we scored each sentence of a bill with a language model that embeds legislative vocabulary into a high-dimensional, semantic-laden vector space. This language representation enables our investigation into which words increase the probability of enactment for any topic. To test the relative importance of text and context, we compared the text model to a context-only model that uses variables such as whether the bill's sponsor is in the majority party. To test the effect of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.