From Natural Language to PromQL: A Catalog-Driven Framework with Dynamic Temporal Resolution for Cloud-Native Observability
Twinkll Sisodia

TL;DR
This paper introduces a catalog-driven framework that translates natural language questions into PromQL queries for cloud-native observability, enabling faster and more accurate metric querying.
Contribution
It presents a hybrid metrics catalog, a multi-stage query pipeline, and a dynamic temporal resolution mechanism integrated with LLMs for improved natural language to PromQL translation.
Findings
Achieves sub-second metric discovery using pre-computed indices.
Full query pipeline completes in approximately 1.1 seconds.
Deployed on production Kubernetes clusters for AI workloads.
Abstract
Modern cloud-native platforms expose thousands of time series metrics through systems like Prometheus, yet formulating correct queries in domain-specific languages such as PromQL remains a significant barrier for platform engineers and site reliability teams. We present a catalog-driven framework that translates natural language questions into executable PromQL queries, bridging the gap between human intent and observability data. Our approach introduces three contributions: (1) a hybrid metrics catalog that combines a statically curated base of approximately 2,000 metrics with runtime discovery of hardware-specific signals across GPU vendors, (2) a multi-stage query pipeline with intent classification, category-aware metric routing, and multi-dimensional semantic scoring, and (3) a dynamic temporal resolution mechanism that interprets diverse natural language time expressions and maps…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
