SEMINAR

Innovations in Text-Guided Visual Content Generation

Wednesday, Jul 5 2023 - 5:38 pm (GMT + 7)
Speaker
Wang Hao
Working
Nanyang Technological University
Timeline
Mon, Jul 17 2023 - 11:00 am (GMT + 7)
About Speaker

WANG Hao is a final year PhD candidate in the School of Computer Science and Engineering at Nanyang Technological University, Singapore. He received the B.E. degree from Huazhong University of Science and Technology, China. His research interest is developing AI-powered perception and generation algorithms for the multimodal domain. In particular, his recent work investigates the translation between visual and text data, to generate controllable contents with efficiency and robustness. He has published first-authored top-tier conference and journal work in computer vision and multimedia fields, including CVPR, ECCV, IEEE TPAMI, IEEE TIP, etc.

Abstract

Text-guided visual content generation is a significant task in generative AI, which focuses on translating semantic information from text to visual content. Generating complex and high-quality visuals while maintaining control is a key challenge in this domain. In this talk, we will introduce two innovative frameworks: StyleGAN-based inversion and online alignment. These frameworks aim to overcome the existing challenges, where we enable high-fidelity visual generation and cross-modal semantic matching simultaneously. With our approach, the inference phase allows for the direct generation of visual content from textual input, streamlining the process into a single step.

Related seminars

Coming soon
Niranjan Balasubramanian

Stony Brook University

Towards Reliable Multi-step Reasoning in Question Answering
Fri, Nov 03 2023 - 10:00 am (GMT + 7)
Nghia Hoang

Washington State University

Robust Multivariate Time-Series Forecasting: Adversarial Attacks and Defense Mechanisms
Fri, Oct 27 2023 - 10:00 am (GMT + 7)
Jey Han Lau

University of Melbourne

Rumour and Disinformation Detection in Online Conversations
Thu, Sep 14 2023 - 10:00 am (GMT + 7)
Tan Nguyen

National University of Singapore

Principled Frameworks for Designing Deep Learning Models: Efficiency, Robustness, and Expressivity
Mon, Aug 28 2023 - 10:00 am (GMT + 7)