Performance Evaluation of Classification Models for Household Income, Consumption and Expenditure Data Set
Mersha Nigus, Dorsewamy

TL;DR
This paper evaluates ten machine learning algorithms for classifying household food security status, finding that Random Forest and Gradient Boosting achieve near-perfect accuracy on the dataset.
Contribution
It introduces a robust methodology for classifying household food security using multiple machine learning models and validates their performance on survey data.
Findings
Random Forest and Gradient Boosting achieve 99.97% accuracy.
All classifiers perform well with high accuracy metrics.
The methodology effectively categorizes household food security status.
Abstract
Food security is more prominent on the policy agenda today than it has been in the past, thanks to recent food shortages at both the regional and global levels as well as renewed promises from major donor countries to combat chronic hunger. One field where machine learning can be used is in the classification of household food insecurity. In this study, we establish a robust methodology to categorize whether or not a household is being food secure and food insecure by machine learning algorithms. In this study, we have used ten machine learning algorithms to classify the food security status of the Household. Gradient Boosting (GB), Random Forest (RF), Extra Tree (ET), Bagging, K-Nearest Neighbor (KNN), Decision Tree (DT), Support Vector Machine (SVM), Logistic Regression (LR), Ada Boost (AB) and Naive Bayes were the classification algorithms used throughout this study (NB). Then, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFood Security and Health in Diverse Populations · Food Waste Reduction and Sustainability · COVID-19 Pandemic Impacts
MethodsAdaptive Discriminator Augmentation · Support Vector Machine · Logistic Regression
