# Policy agendas of the American state legislatures

**Authors:** Ethan Dee, Alex Garlick

PMC · DOI: 10.1038/s41597-025-05621-5 · 2025-07-22

## TL;DR

This paper uses machine learning to categorize about 1.36 million state bills introduced since 2009 into 28 policy areas, helping researchers study U.S. state legislatures.

## Contribution

The paper introduces a transformer-based model, built on contextual word-piece embeddings, that classifies state bills into 28 policy areas and is validated against hand-coded labels.

## Key findings

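- The model coded the universe of bills introduced in the states since 2009 (about 1.36 million) into 28 policy areas.
- In validation exercises, the method compares favorably with hand-coded estimates of bill policy areas while offering far greater coverage than legacy human-supervised dictionary methods.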

## Abstract

State legislatures in the United States handle a number of important policy issues, but pose a challenge for researchers to observe because they are not organized by any central agency. We use a machine learning model based on the “transformer” architecture and contextual word-piece embeddings to code the universe of bills introduced in the states since 2009 (about 1.36 million bills) into 28 policy areas. Validation exercises show our method compares favorably with hand-coded estimates of bill policy areas while offering far greater coverage than legacy human-supervised “dictionary” methods. We explain how researchers can use these estimates to investigate sub-national governance in the United States.
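The abstract contrasts two quantities: coverage (how many bills receive any policy label) and agreement with hand-coded labels. The sketch below shows one way these could be computed for a toy keyword dictionary. It is not the authors' code or data: the `DICTIONARY` keywords, the bill titles, the hand-coded labels, and the function names are all hypothetical and chosen only for illustration.

```python
from typing import List, Optional

# Hypothetical keyword dictionary: policy area -> trigger words (illustrative only).
DICTIONARY = {
    "Health": {"hospital", "medicaid", "insurance"},
    "Education": {"school", "teacher", "tuition"},
}


def dictionary_label(title: str) -> Optional[str]:
    """Return the first policy area whose keywords appear in the title, else None."""
    words = set(title.lower().split())
    for area, keywords in DICTIONARY.items():
        if words & keywords:
            return area
    return None


def coverage(labels: List[Optional[str]]) -> float:
    """Share of bills that received any label at all."""
    return sum(label is not None for label in labels) / len(labels)


def agreement(predicted: List[Optional[str]], hand_coded: List[str]) -> float:
    """Share of labeled bills whose label matches the hand-coded one."""
    pairs = [(p, h) for p, h in zip(predicted, hand_coded) if p is not None]
    return sum(p == h for p, h in pairs) / len(pairs) if pairs else 0.0


# Made-up bill titles paired with made-up hand-coded policy areas.
BILLS = [
    ("An act relating to medicaid eligibility", "Health"),
    ("An act concerning teacher certification", "Education"),
    ("An act relating to telehealth services", "Health"),
    ("An act concerning charter academies", "Education"),
]

titles = [title for title, _ in BILLS]
hand = [label for _, label in BILLS]

dict_labels = [dictionary_label(title) for title in titles]
dict_coverage = coverage(dict_labels)
dict_agreement = agreement(dict_labels, hand)

print(f"dictionary coverage:  {dict_coverage:.2f}")
print(f"dictionary agreement: {dict_agreement:.2f}")
```

In this toy data the dictionary labels only the bills whose titles contain a listed keyword, so half of the bills go uncoded even though the labeled ones match the hand coding. A classifier that assigns a policy area to every bill has full coverage by construction, which is why the comparison with hand-coded labels then carries the weight of the validation.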

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12283944/full.md

---
Source: https://tomesphere.com/paper/PMC12283944