Things Multimodal LLMs Cannot See: Toward Discovering and Mitigating Perceptual Biases in Neural Networks through Visual Interventions

Artificial Intelligence Seminar

Things Multimodal LLMs Cannot See: Toward Discovering and Mitigating Perceptual Biases in Neural Networks through Visual Interventions

When

Thursday, May 23, 2024 11:00am - 12:00pm PDT

Add to calendar:

Presenter

Presented by:

Mahyar Khayatkhoei, USC | ISI

Location

ISI-MDR #1135 + 1137

Virtual URL

Online Link

This event is open to:

Everyone

Event Details

Speaker: Mahyar Khayatkhoei USC | ISI (SPECIAL AI SEMINAR)
Location: ISI Marina del Rey, Conference Room 1135 + 1137. In-person attendance for USC-ISI faculty, staff, and students only; please use your USC account to sign into Zoom. Not open to public.

Thursday, May 23, 2024 from 11am – 12pm PT.

Join Zoom meeting: https://usc.zoom.us/j/93179461297?pwd=d2RpNWlEblhxcHRFMU9RbnRxbWJBUT09

Zoom meeting ID: 931 7946 1297
Passcode: 909966

Hosted by: Adam Russell
POC: Justina Gilleland | Alma Nava

Abstract: In this talk, I will discuss our recent research on the use of pixel-space interventions for discovering and mitigating biases in visual neural networks, including in multimodal large language models (MLLMs). I will start by showcasing our discovered perceptual limitations and biases of MLLMs (including commercial ones such as GPT-4V and LLaVA). I will then discuss our simple yet effective intervention-based approach for mitigating such limitations, which can do so without requiring any training. Finally, I will more broadly discuss the problem of removing attribute-specific bias from neural networks, present our latest information theoretic bounds on this problem, and explain our adversarial input-intervention approach for removing strong attribute bias.

This event will be recorded but only shared with AI Division Leadership.

Speaker Bio

I am a Computer Scientist at the AI Division of the USC Information Sciences Institute. I received my Ph.D. and M.Sc. in computer science from Rutgers University working with Dr. Ahmed Elgammal, and my B.Sc. in electrical engineering from the University of Tehran. My research explores the theory and application of deep generative models, and has identified and resolved major bottlenecks in neural networks’ ability to learn from heterogeneous data (NeurIPS 2018), to learn high frequency features (AAAI 2022), and in their reliable evaluation (ICML 2023). My latest focus is on adopting large-scale generative neural networks to real-world mission-critical tasks. I am particularly interested in developing reliable and efficient data-driven computational models of real-world phenomena that would enhance our current physics-based models. My personal website is at https://mahyarkoy.github.io

Event Details

Zoom meeting ID: 931 7946 1297 Passcode: 909966

Speaker Bio

Zoom meeting ID: 931 7946 1297
Passcode: 909966