Detecting Poisoning Attacks against Speech Datasets using Variational Autoencoders

Abstract

In this paper, we address the threat of data poisoning attacks by proposing a novel method for detecting and isolating poisoned samples. Our approach uses a variational autoencoder (VAE) trained in an unsupervised fashion on the manipulated dataset. By performing per-class clustering and statistical analysis of the latent vectors, we can identify poisoned classes and separate clean and poisoned samples. We evaluate our method on an audio dataset and demonstrate that we outperform two popular baseline defenses. Furthermore, we show the generalizability of a single trained VAE model in exposing a variety of different poisoning attacks against the same dataset.

Date: 2023
Authors: Nick Mehlman, Xuan Shi, Aditya Kommineni, Shrikanth Narayanan
Conference: Proc. SPSC 2023
Pages: 34-40

View Paper

Information Sciences Institute

Publications

Detecting Poisoning Attacks against Speech Datasets using Variational Autoencoders

Abstract