Quantifying A Firm's AI Engagement: Constructing Objective, Data-Driven, AI Stock Indices Using 10-K Filings
Lennart Ante, Aman Saggu

TL;DR
This paper introduces an objective, data-driven NLP method to classify AI-related stocks using 10-K filings, enabling the construction of AI indices that outperform existing ETFs and market benchmarks.
Contribution
It presents a novel NLP-based approach for quantifying AI engagement in firms and constructing transparent, market-responsive AI stock indices from corporate filings.
Findings
AI indices outperform existing ETFs and Nasdaq in risk-adjusted returns.
Higher AI engagement correlates with greater positive abnormal returns post-ChatGPT.
The methodology offers a cost-effective, transparent alternative for AI-themed investing.
Abstract
Following an analysis of existing AI-related exchange-traded funds (ETFs), we reveal the selection criteria for determining which stocks qualify as AI-related are often opaque and rely on vague phrases and subjective judgments. This paper proposes a new, objective, data-driven approach using natural language processing (NLP) techniques to classify AI stocks by analyzing annual 10-K filings from 3,395 NASDAQ-listed firms between 2011 and 2023. This analysis quantifies each company's engagement with AI through binary indicators and weighted AI scores based on the frequency and context of AI-related terms. Using these metrics, we construct four AI stock indices-the Equally Weighted AI Index (AII), the Size-Weighted AI Index (SAII), and two Time-Discounted AI Indices (TAII05 and TAII5X)-offering different perspectives on AI investment. We validate our methodology through an event study on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
