On Provably Learning Sparse High-Dimensional Functions

Name: On Provably Learning Sparse High-Dimensional Functions
Start: 2024-03-08T11:00:00-05:00
End: 2024-03-08T12:00:00-05:00
Location: E18-304

March 8, 2024 @ 11:00 am - 12:00 pm

Joan Bruna, New York University

E18-304

Event Navigation

Abstract: Neural Networks are hailed for their ability to discover useful low-dimensional ‘features’ out of complex high-dimensional data, yet such ability remains mostly hand-wavy. Over the recent years, the class of sparse (or ‘multi-index’) functions has emerged as a model with both practical motivations and a rich mathematical structure, enabling a quantitative theory of ‘feature learning’. In this talk, I will present recent progress on this front, by describing (i) the ability of gradient-descent algorithms to efficiently learn the multi-index class over Gaussian data, and (ii) the tight Statistical-Query complexity for the gaussian single-index class.

Joint work with Loucas Pillaud-Vivien, Alex Damian, Alberto Bietti and Jason Lee.

Bio: Joan Bruna is an Associate Professor of Computer Science, Data Science and Mathematics (affiliated) at the Courant Institute and the Center for Data Science, New York University (NYU). He is also a visiting scholar at the Center for Computational Mathematics in the Flatiron Institute. His research interests are in the mathematical foundations of machine learning, including deep learning theory, and its interface with applied mathematics and problems in computational science. For his research contributions, he has been awarded a Sloan Research Fellowship, a NSF CAREER Award, and several best paper awards.

MIT Statistics + Data Science Center
Massachusetts Institute of Technology
77 Massachusetts Avenue
Cambridge, MA 02139-4307
617-253-1764

Accessibility

Events

On Provably Learning Sparse High-Dimensional Functions

March 8, 2024 @ 11:00 am - 12:00 pm

Event Navigation