Classifying with Uncertain Data Envelopment Analysis
Casey Garner, Allen Holder

TL;DR
This paper introduces a novel classification method based on uncertain data envelopment analysis that effectively handles imperfect data and overcomes computational challenges to classify stocks and medical treatments.
Contribution
It presents a new classification scheme that incorporates data uncertainty, along with algorithms to address non-convexity and large search spaces, demonstrated on financial and medical data.
Findings
Successfully classified Dow Jones stocks into performance tiers.
Effectively categorized prostate treatments by clinical efficacy.
Demonstrated robustness of the method with real-world uncertain data.
Abstract
Classifications organize entities into categories that identify similarities within a category and discern dissimilarities among categories, and they powerfully classify information in support of analysis. We propose a new classification scheme premised on the reality of imperfect data. Our computational model uses uncertain data envelopment analysis to define a classification's proximity to equitable efficiency, which is an aggregate measure of intra-similarity within a classification's categories. Our classification process has two overriding computational challenges, those being a loss of convexity and a combinatorially explosive search space. We overcome the first by establishing lower and upper bounds on the proximity value, and then by searching this range with a first-order algorithm. We overcome the second by adapting the p-median problem to initiate our exploration, and by then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEfficiency Analysis Using DEA · Economic and Technological Innovation
