FujiView: Multimodal Late-Fusion for Predicting Scenic Visibility
Bryceton Bible, Shah Md Nehal Hasnaeen, Hairong Qi

TL;DR
FujiView introduces a multimodal framework combining webcam imagery and weather data to predict scenic visibility of Mount Fuji, achieving high accuracy and establishing a new benchmark in environmental forecasting.
Contribution
The paper presents FujiView, a novel multimodal late-fusion approach and dataset for predicting scenic visibility, advancing environmental forecasting research.
Findings
Late fusion improves overall prediction accuracy.
Vision features dominate short-term visibility forecasts.
Weather data becomes more influential for longer-term predictions.
Abstract
Visibility of natural landmarks such as Mount Fuji is a defining factor in both tourism planning and visitor experience, yet it remains difficult to predict due to rapidly changing atmospheric conditions. We present FujiView, a multimodal learning framework and dataset for predicting scenic visibility by fusing webcam imagery with structured meteorological data. Our late-fusion approach combines image-derived class probabilities with numerical weather features to classify visibility into five categories. The dataset currently comprises over 100,000 webcam images paired with concurrent and forecasted weather conditions from more than 40 cameras around Mount Fuji, and continues to expand; it will be released to support further research in environmental forecasting. Experiments show that YOLO-based vision features dominate short-term horizons such as "nowcasting" and "samedaycasting",…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUrban Heat Island Mitigation · Diverse Aspects of Tourism Research · Spatial Cognition and Navigation
