Performance Evaluation of Classification Models for Household Income,   Consumption and Expenditure Data Set

Mersha Nigus; Dorsewamy

arXiv:2106.11055·cs.LG·June 22, 2021

Performance Evaluation of Classification Models for Household Income, Consumption and Expenditure Data Set

Mersha Nigus, Dorsewamy

PDF

Open Access

TL;DR

This paper evaluates ten machine learning algorithms for classifying household food security status, finding that Random Forest and Gradient Boosting achieve near-perfect accuracy on the dataset.

Contribution

It introduces a robust methodology for classifying household food security using multiple machine learning models and validates their performance on survey data.

Findings

01

Random Forest and Gradient Boosting achieve 99.97% accuracy.

02

All classifiers perform well with high accuracy metrics.

03

The methodology effectively categorizes household food security status.

Abstract

Food security is more prominent on the policy agenda today than it has been in the past, thanks to recent food shortages at both the regional and global levels as well as renewed promises from major donor countries to combat chronic hunger. One field where machine learning can be used is in the classification of household food insecurity. In this study, we establish a robust methodology to categorize whether or not a household is being food secure and food insecure by machine learning algorithms. In this study, we have used ten machine learning algorithms to classify the food security status of the Household. Gradient Boosting (GB), Random Forest (RF), Extra Tree (ET), Bagging, K-Nearest Neighbor (KNN), Decision Tree (DT), Support Vector Machine (SVM), Logistic Regression (LR), Ada Boost (AB) and Naive Bayes were the classification algorithms used throughout this study (NB). Then, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFood Security and Health in Diverse Populations · Food Waste Reduction and Sustainability · COVID-19 Pandemic Impacts

MethodsAdaptive Discriminator Augmentation · Support Vector Machine · Logistic Regression