Seminar | November 23 | 11 a.m.-12 p.m. | WeWork Berkeley, Meeting Room A, 4th Floor | Note change in location

 2120 University Avenue, Berkeley, CA 94709

 Adam Gleave, PhD Candidate, UC Berkeley

 Electrical Engineering and Computer Sciences (EECS)

Machine learning has made remarkable progress towards building automated systems that achieve high average-case performance on procedurally specified objectives. However, real-world tasks often have complex objectives that are difficult to specify procedurally, and safety-critical tasks often demand worst-case guarantees. In this talk, I will first discuss how agent objectives can be inferred from human feedback, with a focus on how to test and validate the learned objective. I will then introduce methods for adversarially testing agents, concluding with methods to make agents more robust.

This is a hybrid event and may be also be attended via:

Zoom Location: https://berkeley.zoom.us/j/97650504829?pwd=QkhSUXg3UURiRytId3VFT1VtNHFuQT09
Zoom Meeting ID: 976 5050 4829
Zoom Passcode: 879671

 gleave@berkeley.edu

 Roxana Infante,  roxana@eecs.berkeley.edu,  510-643-3257

Event Date
-
Status
Location Changed
Primary Event Type
Seminar
Location
Meeting Room A, 4th Floor WeWork Berkeley
Performers
Adam Gleave, UC Berkeley (Speaker)
Event ID
149103