Views Navigation

Event Views Navigation

Webinar: Inside the MITx MicroMasters Program in Statistics and Data Science

Devavrat Shah, Karene Chu
Online

<br> </br> Interested in starting your data science journey? <a href="https://event.on24.com/eventRegistration/EventLobbyServlet?target=reg20.jsp&amp;referrer=&amp;eventid=2170691&amp;sessionid=1&amp;key=02F897D60682F202E261E07985F9CB92&amp;regTag=&amp;sourcepage=register">Register for this special free virtual event.</a> You'll receive a confirmation e-mail with further details about the webinar. <br> </br> Demand for professionals skilled in data, analytics, and machine learning is exploding. A recent report by IBM and Burning Glass states that there will be 364K new job openings in data-driven professions this year in the US alone. Data scientists bring value to organizations across industries because they are able…

Find out more »

Diffusion K-means Clustering on Manifolds: provable exact recovery via semidefinite relaxations

Xiaohui Chen (University of Illinois at Urbana-Champaign)
E18-304

Abstract: We introduce the diffusion K-means clustering method on Riemannian submanifolds, which maximizes the within-cluster connectedness based on the diffusion distance. The diffusion K-means constructs a random walk on the similarity graph with vertices as data points randomly sampled on the manifolds and edges as similarities given by a kernel that captures the local geometry of manifolds. Thus the diffusion K-means is a multi-scale clustering tool that is suitable for data with non-linear and non-Euclidean geometric features in mixed dimensions. Given…

Find out more »

Predictive Inference with the Jackknife+

Rina Foygel Barber (University of Chicago)
E18-304

Abstract: We introduce the jackknife+, a novel method for constructing predictive confidence intervals that is robust to the distribution of the data. The jackknife+ modifies the well-known jackknife (leaveoneout cross-validation) to account for the variability in the fitted regression function when we subsample the training data. Assuming exchangeable training samples, we prove that the jackknife+ permits rigorous coverage guarantees regardless of the distribution of the data points, for any algorithm that treats the training points symmetrically (in contrast, such guarantees…

Find out more »

Tales of Random Projections

Kavita Ramanan (Brown University)
E18-304

Abstract: Properties of random projections of high-dimensional probability measures are of interest in a variety of fields, including asymptotic convex geometry, and potential applications to high-dimensional statistics and data analysis.   A particular question of interest is to identify what properties of the high-dimensional measure are captured by its lower-dimensional projections.   While fluctuations of these projections have been well studied over the past decade, we describe more recent work on the tail behavior of such projections, and various implications.  This talk is based on…

Find out more »

Does Revolution Work? Evidence from Nepal

Rohini Pande (Yale University)
E18-304

The last half century has seen the adoption of democratic institutions in much of the developing world. However, the conditions under which de jure democratization leads to the representation of historically disadvantaged groups remains debated as do the implications of descriptive representation for policy inclusion. Using detailed administrative and survey data from Nepal, we examine political selection in a new democracy, the implications for policy inclusion and the role of conflict in affecting political transformation. I situate these findings in the context…

Find out more »

Women in Data Science (WiDS) – Cambridge, MA

Microsoft NERD Center

This one-day technical conference brings together local academic leaders,  industrial professionals and students to hear about the latest data science-related research in a number of domains, to learn how leading-edge companies are leveraging data science for success, and to connect with potential mentors, collaborators, and others in the field.

Find out more »

[POSTPONED] Guido Imbens – The Applied Econometrics Professor and Professor of Economics, Graduate School of Business, Stanford University

E18-304

*Please note: this event has been POSTPONED until Fall 2020* See MIT’s COVID-19 policies for more details.   About the author: Prof. Guido Imbens’ primary field of interest is Econometrics. Research topics in which he is interested include: causality, program evaluation, identification, Bayesian methods, semi-parametric methods, instrumental variables. Guido Imbens does research in econometrics and statistics. His research focuses on developing methods for drawing causal inferences in observational studies, using matching, instrumental variables, and regression discontinuity designs. Guido Imbens is Professor…

Find out more »

Matrix Concentration for Products

Jonathan Niles-Weed (New York University)
online

Abstract: We develop nonasymptotic concentration bounds for products of independent random matrices. Such products arise in the study of stochastic algorithms, linear dynamical systems, and random walks on groups. Our bounds exactly match those available for scalar random variables and continue the program, initiated by Ahlswede-Winter and Tropp, of extending familiar concentration bounds to the noncommutative setting. Our proof technique relies on geometric properties of the Schatten trace class. Joint work with D. Huang, J. A. Tropp, and R. Ward.…

Find out more »

[POSTPONED] The Blessings of Multiple Causes

David Blei (Columbia University)
E18-304

  *Please note: this event has been POSTPONED until Fall 2020* See MIT’s COVID-19 policies for more details. Title: The Blessings of Multiple Causes Abstract: Causal inference from observational data is a vital problem, but it comes with strong assumptions. Most methods require that we observe all confounders, variables that affect both the causal variables and the outcome variables. But whether we have observed all confounders is a famously untestable assumption. We describe the deconfounder, a way to do causal…

Find out more »

On Using Graph Distances to Estimate Euclidean and Related Distances

Ery Arias-Castro (University of California, San Diego)
online

Abstract:  Graph distances have proven quite useful in machine learning/statistics, particularly in the estimation of Euclidean or geodesic distances. The talk will include a partial review of the literature, and then present more recent developments on the estimation of curvature-constrained distances on a surface, as well as on the estimation of Euclidean distances based on an unweighted and noisy neighborhood graph. - About the Speaker:  Ery Arias-Castro received his Ph.D. in Statistics from Stanford University in 2004. He then took…

Find out more »


MIT Statistics + Data Science Center
Massachusetts Institute of Technology
77 Massachusetts Avenue
Cambridge, MA 02139-4307
617-253-1764