Name: Weakly Supervised Learning for Adaptive LLM Agents
Start: 2024-11-21T03:00:00-08:00
End: 2024-11-21T04:00:00-08:00
Location: Conference Rm #689 in-person attendance will be permitted for USC/ISI faculty, staff, students only. Open to the public virtually via Zoom.

ISI Natural Language Seminar

Weakly Supervised Learning for Adaptive LLM Agents

When

Thursday, November 21, 2024 11:00am - 12:00pm PDT

Add to calendar:

Presenter

Presented by:

Da Yin, UCLA

Location

Conference Rm #689 in-person attendance will be permitted for USC/ISI faculty, staff, students only. Open to the public virtually via Zoom.

Virtual Recording

This event is open to:

Everyone

Event Details

Speaker: Da Yin, UCLA

Conference Room Location: ISI-MDR CR#689

REMINDER:

Meeting hosts only admit on-line guests that they know to the Zoom meeting. Hence, you’re highly encouraged to use your USC account to sign into Zoom.

If you’re an outside visitor, please inform us at (nlg-seminar-host(at)isi.edu) to make us aware of your attendance so we can admit you. Specify if you will attend remotely or in person at least one business day prior to the event. Provide your: full name, job title and professional affiliation and arrive at least 10 minutes before the seminar begins.

If you do not have access to the 6th Floor for in-person attendance, please check in at the 10th floor main reception desk to register as a visitor and someone will escort you to the conference room location.

https://usc.zoom.us/j/98977559622?pwd=IK59VbdZJiGIPV9xjUjabrjEauRDai.1
Meeting ID: 989 7755 9622
Passcode: 307452

LLM agents are revolutionizing complex task-solving through multi-step planning, reasoning, and real-world or simulated interactions. However, their adaptability to unseen tasks and environments remains a challenge, especially with limited training resources. In this talk, I will first introduce Agent Lumos (ACL 2024), a foundational framework for training general-purpose, open-source LLM agents that enables better generalization across domains, by the unified training over the trajectories converted from the ubiquitous, unstructured annotated reasoning rationales. I will also discuss Trial and Error (ACL 2024) and Q* Agent, which foster self-exploration, and collect trajectories for preference optimization and process reward modeling based on environmental feedback. Finally, I will outline future directions, including agent critique and world models, to enhance LLM adaptability with minimal effort.

Speaker Bio

Da Yin is a final-year PhD student in Computer Science at UCLA, advised by Prof. Kai-Wei Chang, working in the UCLA NLP lab. He was awarded Amazon PhD Fellowship and Best Paper Award at EMNLP Pan-DL workshop in 2023. He was also the co-organizer of 1st ACL MML workshop, publicity chair of 4th SocalNLP Symposium, and area chair at ACL ARR from 2023. His research interest is building generalizable, adaptive, and inclusive language processing models that can be applied across applications and regions. If speaker approves to be recorded for this NL Seminar talk, it will be posted on the USC/ISI YouTube page within 1-2 business days: https://www.youtube.com/user/USCISI. Subscribe here to learn more about upcoming seminars: https://www.isi.edu/events/ For more information on the NL Seminar series and upcoming talks, please visit: https://www.isi.edu/research-groups-nlg/nlg-seminars/ Hosts: Jonathan May and Katy Felkner

This program is open to all eligible individuals. Information Sciences Institute operates all of its programs and activities consistent with the University’s Notice of Non-Discrimination. Eligibility is not determined based on race, sex, ethnicity, sexual orientation, or any other prohibited factor.