TL;DR
This paper presents a fast neural network model with extensive feature engineering for predicting Twitter user engagement, achieving high accuracy within strict latency constraints in a large-scale challenge.
Contribution
The paper introduces a novel combination of feature engineering techniques and a simple residual neural network to meet real-time prediction constraints in a large-scale social media challenge.
Findings
Achieved second place in the RecSys 2021 Challenge.
Developed a model with inference time limited to 6ms per tweet.
Utilized advanced feature encoding methods like EMDE and Fourier features.
Abstract
In this paper we present our 2nd place solution to ACM RecSys 2021 Challenge organized by Twitter. The challenge aims to predict user engagement for a set of tweets, offering an exceptionally large data set of 1 billion data points sampled from over four weeks of real Twitter interactions. Each data point contains multiple sources of information, such as tweet text along with engagement features, user features, and tweet features. The challenge brings the problem close to a real production environment by introducing strict latency constraints in the model evaluation phase: the average inference time for single tweet engagement prediction is limited to 6ms on a single CPU core with 64GB memory. Our proposed model relies on extensive feature engineering performed with methods such as the Efficient Manifold Density Estimator (EMDE) - our previously introduced algorithm based on Locality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
