DeepTopPush: Simple and Scalable Method for Accuracy at the Top

V\'aclav M\'acha; Luk\'a\v{s} Adam; V\'aclav \v{S}m\'idl

arXiv:2006.12293·cs.LG·March 28, 2023

DeepTopPush: Simple and Scalable Method for Accuracy at the Top

V\'aclav M\'acha, Luk\'a\v{s} Adam, V\'aclav \v{S}m\'idl

PDF

Open Access

TL;DR

DeepTopPush is a scalable deep learning method designed to optimize accuracy at the top, effectively handling non-decomposable loss functions for applications like drug discovery and malware detection.

Contribution

It introduces a novel end-to-end training approach for accuracy at the top using a modified stochastic gradient descent and threshold estimation from minibatches.

Findings

01

Achieves high accuracy in visual recognition tasks.

02

Successfully detects 46% malware with very low false alarms.

03

Demonstrates effectiveness on real-world datasets.

Abstract

Accuracy at the top is a special class of binary classification problems where the performance is evaluated only on a small number of relevant (top) samples. Applications include information retrieval systems or processes with manual (expensive) postprocessing. This leads to minimizing the number of irrelevant samples above a threshold. We consider classifiers in the form of an arbitrary (deep) network and propose a new method DeepTopPush for minimizing the loss function at the top. Since the threshold depends on all samples, the problem is non-decomposable. We modify the stochastic gradient descent to handle the non-decomposability in an end-to-end training manner and propose a way to estimate the threshold only from values on the current minibatch and one delayed value. We demonstrate the excellent performance of DeepTopPush on visual recognition datasets and two real-world…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Mass Spectrometry Techniques and Applications · Computational Drug Discovery Methods