What Did I Just Hear? Detecting Pornographic Sounds in Adult Videos   Using Neural Networks

Holy Lovenia; Dessi Puji Lestari; Rita Frieske

arXiv:2209.03711·cs.SD·September 9, 2022

What Did I Just Hear? Detecting Pornographic Sounds in Adult Videos Using Neural Networks

Holy Lovenia, Dessi Puji Lestari, Rita Frieske

PDF

TL;DR

This paper presents a neural network-based method for detecting pornographic sounds in videos, using spectral features and a voting technique to improve audio-level classification accuracy.

Contribution

It introduces a CNN trained on log mel spectrograms for pornographic sound detection and a voting segment-to-audio method for whole audio classification.

Findings

01

CNN on log mel spectrogram achieves top performance

02

Log mel spectrogram provides better feature representations

03

Voting segment-to-audio improves detection accuracy

Abstract

Audio-based pornographic detection enables efficient adult content filtering without sacrificing performance by exploiting distinct spectral characteristics. To improve it, we explore pornographic sound modeling based on different neural architectures and acoustic features. We find that CNN trained on log mel spectrogram achieves the best performance on Pornography-800 dataset. Our experiment results also show that log mel spectrogram allows better representations for the models to recognize pornographic sounds. Finally, to classify whole audio waveforms rather than segments, we employ voting segment-to-audio technique that yields the best audio-level detection results.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.