Large-Scale Data Mining of Rapid Residue Detection Assay Data From HTML and PDF Documents: Improving Data Access and Visualization for Veterinarians
Majid Jaberi-Douraki, Soudabeh Taghian Dinani, Nuwan Indika Millagaha, Gedara, Xuan Xu, Emily Richards, Fiona Maunsell, Nader Zad, Lisa Ann Tell

TL;DR
This paper presents a novel AI-based method for automatically extracting drug residue assay data from HTML and PDF documents to improve data access and visualization for veterinarians.
Contribution
The study introduces a new data-mining approach combining pattern recognition and software tools to extract assay data from unstructured documents, enhancing data accessibility.
Findings
Successfully extracted assay data from multiple document formats
Improved data access for veterinarians and regulatory agencies
Enhanced accuracy in retrieving assay parameters
Abstract
Extra-label drug use in food animal medicine is authorized by the US Animal Medicinal Drug Use Clarification Act (AMDUCA), and estimated withdrawal intervals are based on published scientific pharmacokinetic data. Occasionally there is a paucity of scientific data on which to base a withdrawal interval or a large number of animals being treated, driving the need to test for drug residues. Rapid assay commercial farm-side tests are essential for monitoring drug residues in animal products to protect human health. Active ingredients, sensitivity, matrices, and species that have been evaluated for commercial rapid assay tests are typically reported on manufacturers' websites or in PDF documents that are available to consumers but may require a special access request. Additionally, this information is not always correlated with FDA-approved tolerances. Furthermore, parameter changes for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsBalanced Selection
