Cerebra: Aligning Implicit Knowledge in Interactive SQL Authoring
Yunfan Zhou, Qiming Shi, Zhongsu Luo, Xiwen Cai, Yanwei Huang, Dae Hyun Kim, Di Weng, Yingcai Wu

TL;DR
Cerebra is an interactive NL-to-SQL tool that enhances SQL script accuracy by aligning and reviewing implicit knowledge during authoring, improving user control and reducing errors.
Contribution
It introduces a method to automatically retrieve and visualize implicit knowledge from SQL history, facilitating iterative refinement and better alignment with user intent.
Findings
Improved accuracy in SQL script generation.
Enhanced user understanding through interactive knowledge review.
Positive user feedback on iterative refinement process.
Abstract
LLM-driven tools have significantly lowered barriers to writing SQL queries. However, user instructions are often underspecified, assuming the model understands implicit knowledge, such as dataset schemas, domain conventions, and task-specific requirements, that isn't explicitly provided. This results in frequently erroneous scripts that require users to repeatedly clarify their intent. Additionally, users struggle to validate generated scripts because they cannot verify whether the model correctly applied implicit knowledge. We present Cerebra, an interactive NL-to-SQL tool that aligns implicit knowledge between users and LLMs during SQL authoring. Cerebra automatically retrieves implicit knowledge from historical SQL scripts based on user instructions, presents this knowledge in an interactive tree view for code review, and supports iterative refinement to improve generated scripts.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Web Application Security Vulnerabilities · Logic, programming, and type systems
