Predictive Liability Models and Visualizations of High Dimensional   Retail Employee Data

Richard R. Yang; Mike Borowczak

arXiv:1707.04639·cs.LG·March 30, 2018

Predictive Liability Models and Visualizations of High Dimensional Retail Employee Data

Richard R. Yang, Mike Borowczak

PDF

TL;DR

This paper develops machine learning models to predict employee risk in retail using high-dimensional data, employing feature selection and visualization techniques to aid in risk assessment and management.

Contribution

It introduces optimized regression models and novel visualization methods tailored for high-dimensional retail employee data, enhancing risk analysis capabilities.

Findings

01

Effective feature selection identified key risk factors.

02

Dimension reduction improved interpretability of complex data.

03

Models achieved accurate risk prediction performance.

Abstract

Employee theft and dishonesty is a major contributor to loss in the retail industry. Retailers have reported the need for more automated analytic tools to assess the liability of their employees. In this work, we train and optimize several machine learning models for regression prediction and analysis on this data, which will help retailers identify and manage risky employees. Since the data we use is very high dimensional, we use feature selection techniques to identify the most contributing factors to an employee's assessed risk. We also use dimension reduction and data embedding techniques to present this dataset in a easy to interpret format.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.