Publications

Detecting Poisoning Attacks against Speech Datasets using Variational Autoencoders

Abstract

In this paper, we address the threat of data poisoning attacks by proposing a novel method for detecting and isolating poisoned samples. Our approach uses a variational autoencoder (VAE) trained in an unsupervised fashion on the manipulated dataset. By performing per-class clustering and statistical analysis of the latent vectors, we can identify poisoned classes and separate clean and poisoned samples. We evaluate our method on an audio dataset and demonstrate that we outperform two popular baseline defenses. Furthermore, we show the generalizability of a single trained VAE model in exposing a variety of different poisoning attacks against the same dataset.

Date
2023
Authors
Nick Mehlman, Xuan Shi, Aditya Kommineni, Shrikanth Narayanan
Conference
Proc. SPSC 2023
Pages
34-40