Data-driven Learning of Probabilistic Model of Binary Droplet Collision for Spray Simulation
Weiming Xu, Tao Yang, and Peng Zhang

TL;DR
This paper introduces a novel probabilistic machine learning model for binary droplet collision, trained on extensive experimental data, enabling more accurate and stochastic spray simulations.
Contribution
It develops the first data-driven probabilistic model for droplet collision that captures complex behaviors and transitions in spray simulations.
Findings
Model achieves 99.2% accuracy in regime classification.
Probabilistic form retains 93.2% accuracy and models transitions.
Biased-dice sampling converts probabilities into stochastic outcomes.
Abstract
Binary droplet collisions are ubiquitous in dense sprays. Traditional deterministic models cannot adequately represent transitional and stochastic behaviors of binary droplet collision. To bridge this gap, we developed a probabilistic model by using a machine learning approach, the Light Gradient-Boosting Machine (LightGBM). The model was trained on a comprehensive dataset of 33,540 experimental cases covering eight collision regimes across broad ranges of Weber number, Ohnesorge number, impact parameter, size ratio, and ambient pressure. The resulting machine learning classifier captures highly nonlinear regime boundaries with 99.2% accuracy and retains sensitivity in transitional regions. To facilitate its implementation in spray simulation, the model was translated into a probabilistic form, a multinomial logistic regression, which preserves 93.2% accuracy and maps continuous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
