Discovering material information using hierarchical Reformer model on financial regulatory filings
Francois Mercier, Makesh Narsimhan

TL;DR
This paper introduces a hierarchical Reformer model that processes large financial regulatory documents to predict trade volume changes, revealing that regulatory filings contain material information relevant for market understanding.
Contribution
The paper presents a novel hierarchical Reformer model tailored for large document processing in finance, demonstrating its ability to predict market movements from regulatory filings.
Findings
Model successfully predicts trade volume changes from filings.
Attention patterns reveal detection of material information.
Processing regulatory filings benefits market understanding.
Abstract
Most applications of machine learning for finance are related to forecasting tasks for investment decisions. Instead, we aim to promote a better understanding of financial markets with machine learning techniques. Leveraging the tremendous progress in deep learning models for natural language processing, we construct a hierarchical Reformer ([15]) model capable of processing a large document level dataset, SEDAR, from canadian financial regulatory filings. Using this model, we show that it is possible to predict trade volume changes using regulatory filings. We adapt the pretraining task of HiBERT ([36]) to obtain good sentence level representations using a large unlabelled document dataset. Finetuning the model to successfully predict trade volume changes indicates that the model captures a view from financial markets and processing regulatory filings is beneficial. Analyzing the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Financial Markets and Investment Strategies · Banking stability, regulation, efficiency
MethodsAttention Is All You Need · Linear Layer · 1x1 Convolution · Byte Pair Encoding · Dense Connections · Multi-Head Attention · SentencePiece · Layer Normalization · Convolution · Adafactor
