PHBench: A Benchmark for Predicting Startup Series A Funding from Product Hunt Launch Signals

Yagiz Ihlamur; Ben Griffin; Rick Chen

arXiv:2605.02974·q-fin.PR·May 6, 2026

TL;DR

PHBench is a benchmark dataset and evaluation framework for predicting Series A funding outcomes from Product Hunt launch signals. It shows that these signals carry statistically significant predictive power and provides reproducible tools.

Contribution

This work introduces PHBench, a new benchmark with a large dataset, engineered features, and evaluation tools for startup funding prediction from Product Hunt data.

Findings

01

The best ensemble model achieves F0.5 = 0.097 and AP = 0.037 (95% CI: 0.024-0.072) on the private held-out test set, a 4.7x lift over random.

02

A paired bootstrap confirms a statistically credible advantage over the logistic regression baseline (AP delta: +0.013, p < 0.001; F0.5 delta: +0.056, p = 0.016).

03

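The signal is not noise: across 67,292 featured Product Hunt posts (2019-2025) with 528 verified Series A raises within 18 months of launch (positive rate: 0.78%), launch signals carry statistically significant predictive information.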
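
The headline numbers can be sanity-checked with a little arithmetic. The sketch below assumes that the random baseline for AP equals the overall positive rate (528 of 67,292 posts); the page does not state the test-set size, so the paper's exact baseline may differ. The helper name `f_beta` is illustrative, not taken from the PHBench code.

```python
def f_beta(precision: float, recall: float, beta: float = 0.5) -> float:
    """Standard F-beta score; beta < 1 weights precision more than recall."""
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta ** 2
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Figures quoted in the abstract.
positives = 528
posts = 67_292
reported_ap = 0.037

# Assumption: a random ranker's expected AP is roughly the positive rate.
positive_rate = positives / posts
lift = reported_ap / positive_rate

print(f"positive rate: {positive_rate:.2%}")
print(f"lift over random: {lift:.1f}x")
```

Because beta = 0.5 favors precision, a model with precision 1.0 and recall 0.5 scores higher under F0.5 than under the balanced F1, which suits a setting where false positives are costly.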

Abstract

Structured launch signals on Product Hunt contain statistically significant predictive information for Series A funding outcomes. We construct PHBench from 67,292 featured Product Hunt posts spanning 2019-2025, linked to Crunchbase funding records via deterministic domain matching, identifying 528 verified Series A raises within 18 months of launch (positive rate: 0.78%). Our best-performing model, a three-component ensemble (ENS_avg, ENS_ISO, XGB) selected by validation F0.5, achieves F0.5 = 0.097 and AP = 0.037 (95% CI: 0.024-0.072; 4.7x lift over random) on the private held-out test set (103 positives). A paired bootstrap confirms a statistically credible advantage over the logistic regression baseline (AP delta: +0.013, 95% CI: [0.004, 0.039], p < 0.001; F0.5 delta: +0.056, 95% CI: [0.006, 0.122], p = 0.016). Validation-set metrics (F0.5 = 0.284, AP = 0.126) reflect best-of-144…
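
A paired bootstrap of the kind mentioned in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the resampling scheme, the number of resamples, and the percentile interval are all assumptions, and the function names `average_precision` and `paired_bootstrap_ap_delta` are hypothetical. The key idea is that both models are scored on the same resampled rows, so the difference in AP is compared pairwise.

```python
import random


def average_precision(labels, scores):
    """AP as the mean of precision@rank over the ranks of the positives."""
    order = sorted(range(len(labels)), key=lambda i: scores[i], reverse=True)
    hits, total = 0, 0.0
    for rank, i in enumerate(order, start=1):
        if labels[i] == 1:
            hits += 1
            total += hits / rank
    return total / hits if hits else 0.0


def paired_bootstrap_ap_delta(labels, scores_a, scores_b,
                              n_resamples=1000, seed=0):
    """Percentile 95% CI for AP(model A) - AP(model B) on shared resamples."""
    rng = random.Random(seed)
    n = len(labels)
    deltas = []
    for _ in range(n_resamples):
        # Draw one set of row indices and apply it to both models.
        idx = [rng.randrange(n) for _ in range(n)]
        y = [labels[i] for i in idx]
        if sum(y) == 0:
            continue  # AP is undefined without positives; skip this draw.
        ap_a = average_precision(y, [scores_a[i] for i in idx])
        ap_b = average_precision(y, [scores_b[i] for i in idx])
        deltas.append(ap_a - ap_b)
    deltas.sort()
    lo = deltas[int(0.025 * len(deltas))]
    hi = deltas[min(int(0.975 * len(deltas)), len(deltas) - 1)]
    return lo, hi


# Toy data (fabricated for illustration only, not PHBench results).
toy_labels = [1, 1, 0, 0, 0, 0]
toy_model_a = [0.9, 0.8, 0.3, 0.2, 0.1, 0.05]   # ranks positives first
toy_model_b = [0.05, 0.1, 0.9, 0.8, 0.3, 0.2]   # ranks positives last
ci_low, ci_high = paired_bootstrap_ap_delta(toy_labels, toy_model_a, toy_model_b)
```

With a positive rate under 1%, many resamples of a small set would contain no positives at all, which is why the sketch skips those draws; a stratified resample is another reasonable choice.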

Code & Models

Repositories

https://phbench.com