SEMINAR

Generative Sequence Models for Sequential Decision Making

Thursday, May 5 2022 - 10:17 am (GMT + 7)
Speaker
Aditya Grover
Affiliation
University of California, Los Angeles (UCLA)
Timeline
Fri, May 06 2022 - 10:00 am (GMT + 7)
About Speaker

Aditya Grover is an Assistant Professor of Computer Science at UCLA. His goal is to develop efficient machine learning approaches for probabilistic reasoning under limited supervision, with a focus on deep generative modeling and sequential decision-making under uncertainty. He is also an affiliate faculty member at the UCLA Institute of the Environment and Sustainability, where he grounds his research in real-world applications in climate science and sustainable energy. His 35+ research works have been published at top-tier scientific conferences and journals, including Nature, deployed into production at major technology companies (Instagram, Twitter), and covered in major press venues such as the Wall Street Journal and Wired. Aditya’s research has been recognized with two best paper awards (NeurIPS, StarAI), several research fellowships (Google-Simons Institute, Microsoft Research, Lieberman, Adobe), and the ACM SIGKDD doctoral dissertation award. Aditya completed his postdoctoral training at UC Berkeley, his Ph.D. at Stanford, and his bachelor's degree at IIT Delhi, all in computer science.

Abstract

The ability to make decisions under uncertainty is a key component of intelligence. We introduce a framework that abstracts sequential decision making as a generative sequence modeling problem. This allows us to draw upon the simplicity and scalability of the Transformer architecture and on associated advances in language modeling, such as GPT-x. I will show how this framework permits learning from large offline datasets, uncertainty-guided online exploration, and generalization across multiple tasks. On benchmarks ranging from continuous control to game playing, our framework matches or exceeds the performance of state-of-the-art algorithms.
