S&P 500 Trend Prediction
Shasha Yu, Qinchen Zhang, Yuwei Zhao

TL;DR
This study compares various machine learning models to predict S&P 500 market trends, finding KNN effective for short-term and XGBoost for long-term forecasts, with insights into feature importance and model limitations.
Contribution
It introduces a comprehensive approach using multiple models and feature engineering based on the '101 Formulaic Alphas' to predict market trends.
Findings
KNN performs best for short-term trend prediction.
XGBoost achieves highest accuracy for long-term forecasts.
Momentum and volume indicators are key features.
Abstract
This project aims to predict short-term and long-term upward trends in the S&P 500 index using machine learning models and feature engineering based on the "101 Formulaic Alphas" methodology. The study employed multiple models, including Logistic Regression, Decision Trees, Random Forests, Neural Networks, K-Nearest Neighbors (KNN), and XGBoost, to identify market trends from historical stock data collected from Yahoo! Finance. Data preprocessing involved handling missing values, standardization, and iterative feature selection to ensure relevance and variability. For short-term predictions, KNN emerged as the most effective model, delivering robust performance with high recall for upward trends, while for long-term forecasts, XGBoost demonstrated the highest accuracy and AUC scores after hyperparameter tuning and class imbalance adjustments using SMOTE. Feature importance analysis…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Forecasting Techniques and Applications · Financial Distress and Bankruptcy Prediction
MethodsLogistic Regression · Feature Selection · Synthetic Minority Over-sampling Technique.
