Automated identification of hookahs (waterpipes) on Instagram: an application in feature extraction using Convolutional Neural Network and Support Vector Machine classification
Youshan Zhang, Jon-Patrick Allem, Jennifer B. Unger, Tess Boley Cruz

TL;DR
This study presents a combined CNN and SVM approach to accurately identify hookah images on Instagram, enhancing public health surveillance by automating image classification with high precision.
Contribution
It introduces a novel CNN + SVM method for improved accuracy in classifying waterpipe images, surpassing previous single-method approaches.
Findings
Achieved 99.5% accuracy in classifying images.
Increased features led to higher classification accuracy.
CNN enhances feature extraction for better SVM performance.
Abstract
Background: Instagram, with millions of posts per day, can be used to inform public health surveillance targets and policies. However, current research relying on image-based data often relies on hand coding of images which is time consuming and costly, ultimately limiting the scope of the study. Current best practices in automated image classification (e.g., support vector machine (SVM), Backpropagation (BP) neural network, and artificial neural network) are limited in their capacity to accurately distinguish between objects within images. Objective: This study demonstrates how convolutional neural network (CNN) can be used to extract unique features within an image and how SVM can then be used to classify the image. Methods: Images of waterpipes or hookah (an emerging tobacco product possessing similar harms to that of cigarettes) were collected from Instagram and used in analyses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSupport Vector Machine
