Understanding AI Data Repositories with Automatic Query Generation
Erik Altman

TL;DR
This paper introduces techniques for automatic query generation from ingested data to evaluate and improve AI knowledge without human input, enhancing scalability and domain coverage.
Contribution
It presents novel methods for generating domain-agnostic queries to assess and extend AI knowledge, reducing reliance on human experts.
Findings
Queries can identify gaps and errors in AI knowledge.
Techniques enable scalable AI deployment across domains.
Future work will assess the effectiveness of these methods.
Abstract
We describe a set of techniques to generate queries automatically based on one or more ingested, input corpuses. These queries require no a priori domain knowledge, and hence no human domain experts. Thus, these auto-generated queries help address the epistemological question of how we know what we know, or more precisely in this case, how an AI system with ingested data knows what it knows. These auto-generated queries can also be used to identify and remedy problem areas in ingested material -- areas for which the knowledge of the AI system is incomplete or even erroneous. Similarly, the proposed techniques facilitate tests of AI capability -- both in terms of coverage and accuracy. By removing humans from the main learning loop, our approach also allows more effective scaling of AI and cognitive capabilities to provide (1) broader coverage in a single domain such as health or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning and Data Classification · Neural Networks and Applications
