SEMINAR

Video Understanding: from Representation Learning to Open-World, Long-term Reasoning

Friday, Jul 22 2022 - 2:00 pm (GMT + 7)
Speaker
Du Tran
Working
Meta AI Research
Timeline
Fri, Jul 22 2022 - 02:00 pm (GMT + 7)
About Speaker

Du Tran is a staff research scientist at Meta AI Research. He graduated with a Ph.D. in computer science from Dartmouth College and an M.S. in computer science from the University of Illinois at Urbana-Champaign, receiving the Dartmouth Presidential Fellowship and the Vietnam Education Fellowship. His research interests are in computer vision, machine learning, and computer graphics, with specific interests in video understanding, representation learning, and multimodal modeling.

Abstract

Video understanding is one of the fundamental problems in computer vision with various applications, including autonomous vehicles, robot learning, and visual perception. Although we have witnessed multiple works in video understanding in the last few years, there are many more challenging video understanding problems that are still unsolved. In this talk, I will present some of our recent work in video understanding, including cross-modal self-supervised learning of video and audio representations and open-world instance segmentation. Finally, I will speculate on several potential future research directions in this area.

Related seminars

Coming soon
Niranjan Balasubramanian

Stony Brook University

Towards Reliable Multi-step Reasoning in Question Answering
Fri, Nov 03 2023 - 10:00 am (GMT + 7)
Nghia Hoang

Washington State University

Robust Multivariate Time-Series Forecasting: Adversarial Attacks and Defense Mechanisms
Fri, Oct 27 2023 - 10:00 am (GMT + 7)
Jey Han Lau

University of Melbourne

Rumour and Disinformation Detection in Online Conversations
Thu, Sep 14 2023 - 10:00 am (GMT + 7)
Tan Nguyen

National University of Singapore

Principled Frameworks for Designing Deep Learning Models: Efficiency, Robustness, and Expressivity
Mon, Aug 28 2023 - 10:00 am (GMT + 7)