Machine Learning for Tangible Effects: Natural Language Processing for Uncovering the Illicit Massage Industry & Computer Vision for Tactile Sensing
Rui Ouyang

TL;DR
This thesis applies NLP to monitor the illicit massage industry and develops novel tactile sensors using computer vision, contributing new datasets, models, and low-cost sensing solutions for combating human trafficking and tactile sensing applications.
Contribution
It introduces datasets and NLP techniques for analyzing the illicit massage industry and presents innovative tactile sensors, including the Digger Finger and a low-cost force-torque sensor, with open-source designs.
Findings
NLP reveals labor pressures and societal factors in the IMI
Synthetic financial data can aid anti-trafficking efforts
Novel tactile sensors are effective and low-cost
Abstract
I explore two questions in this thesis: how can computer science be used to fight human trafficking? And how can computer vision create a sense of touch? I use natural language processing (NLP) to monitor the United States illicit massage industry (IMI), a multi-billion dollar industry that offers not just therapeutic massages but also commercial sexual services. Employees of this industry are often immigrant women with few job opportunities, leaving them vulnerable to fraud, coercion, and other facets of human trafficking. Monitoring spatiotemporal trends helps prevent trafficking in the IMI. By creating datasets with three publicly-accessible websites: Google Places, Rubmaps, and AMPReviews, combined with NLP techniques such as bag-of-words and Word2Vec, I show how to derive insights into the labor pressures and language barriers that employees face, as well as the income,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Spam and Phishing Detection · Cybercrime and Law Enforcement Studies
