Talk Title: Rethinking Post Training
Talk Abstract: In this talk, I will discuss the limitations of current post-training methods and discuss recent advances in continual learning that holistically study: (i) What data should the agent learn from? (ii) What is the right learning algorithm? (iii) and finally questioning the pre-training / finetuning paradigm. This talk will span our papers RL Razor, Self-Distillation Learning, SEAL, and more recent work.
Bio: Pulkit Agrawal is an Associate Professor in the Department of Electrical Engineering and Computer Science at MIT. He earned his Ph.D. from UC Berkeley and co-founded SafelyYou Inc. Pulkit completed his bachelor’s from IIT Kanpur and was awarded the Director’s Gold Medal. His work has received multiple Best Paper Awards, the IEEE Early Career Award in Robotics and Automation, the IROS Toshio Fukuda Young Professional Award, the IIT Kanpur Young Alumnus Award, the Sony Faculty Research Award, the Salesforce Research Award, the Amazon Research Award, the Signatures Fellow Award, the Fulbright Science and Technology Award, and others.