Triplet Feature Fusion for Equipment Anomaly Prediction : An Open-Source Methodology Using Small Foundation Models
Takato Yasuno

TL;DR
This paper introduces a small, open-source, triplet feature fusion pipeline that combines statistical, time-series, and multilingual text features for accurate, edge-deployable equipment anomaly prediction.
Contribution
It presents a novel methodology integrating diverse open-source models into a compact, efficient classifier for industrial anomaly detection with high accuracy and low false positives.
Findings
Achieves 0.992 precision and 0.998 ROC-AUC at 30-day horizon.
Reduces false positive rate from 0.6% to 0.1%.
Embeddings align with fault archetypes, improving discrimination.
Abstract
Predicting equipment anomalies before they escalate into failures is a critical challenge in industrial facility management. Existing approaches rely either on hand-crafted threshold rules, which lack generalizability, or on large neural models that are impractical for on-site, air-gapped deployments. We present an industrial methodology that resolves this tension by combining open-source small foundation models into a unified 1,116-dimensional Triplet Feature Fusion pipeline. This pipeline integrates: (1) statistical features (x in ) derived from 90-day sensor histories, (2) time-series embeddings (y in ) from a LoRA-adapted IBM Granite TinyTimeMixer (TTM, 133K parameters), and (3) multilingual text embeddings (z in ) extracted from Japanese equipment master records via multilingual-e5-large. The concatenated triplet h = [x; y; z] is processed by a LightGBM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
