SEMINAR

Learning/Optimization over abstract non-manifold structures: A case in biological data analysis

Friday, Oct 2 2020 - 5:57 pm (GMT + 7)
Speaker
Thien Le
Working
MIT CSAIL
Timeline
Fri, Oct 02 2020 - 10:00 am (GMT + 7)
About Speaker

Thien Le is a second year PhD student at Massachusetts Institute of Technology (MIT) Computer Science and Artificial Intelligence Laboratory (CSAIL) with Stefanie Jegelka group. He is interested broadly in theory of optimization over complex structures that arise in biological data and discrete/continuous algorithms used to address them. Prior to coming to MIT, he completed his undergraduate degree at the University of Illinois at Urbana-Champaign (UIUC) in May 2019, where he worked with Tandy Warnow on phylogeny research. He received the Most Outstanding Major Award in Mathematics and Computer Science from UIUC Department of Mathematics in April 2019.

Abstract

Biological data analysis often involves optimization problems where the objective function is efficient to compute, but the feasible set is of complex combinatorial and geometric nature. In particular, phylogenetic studies involve inferring tree structures that depict evolutionary relationships, from information at their leaf nodes (DNA sequences of existing species, for example). One approach that works well in theory and practice models evolution as a parametric stochastic process and performs maximum likelihood analysis. While the likelihood function is easy to compute, the set of all tree structures over a fix number of leaves has high dimension and is, in fact, a non-manifold. Despite these obstacles, there are known discrete algorithms that solve this problem with efficient runtime and sample complexity. In this talk, we visit known results on the algebro-geometric properties of this space and expand on how smooth optimization can be a novel tool to design even more efficient algorithms/heuristics for these problems. Based on on-going work with Stefanie Jegelka.

Related seminars

Coming soon
Niranjan Balasubramanian

Stony Brook University

Towards Reliable Multi-step Reasoning in Question Answering
Fri, Nov 03 2023 - 10:00 am (GMT + 7)
Nghia Hoang

Washington State University

Robust Multivariate Time-Series Forecasting: Adversarial Attacks and Defense Mechanisms
Fri, Oct 27 2023 - 10:00 am (GMT + 7)
Jey Han Lau

University of Melbourne

Rumour and Disinformation Detection in Online Conversations
Thu, Sep 14 2023 - 10:00 am (GMT + 7)
Tan Nguyen

National University of Singapore

Principled Frameworks for Designing Deep Learning Models: Efficiency, Robustness, and Expressivity
Mon, Aug 28 2023 - 10:00 am (GMT + 7)