Views Navigation

Event Views Navigation

Gaussian Differential Privacy, with Applications to Deep Learning

Weijie Su (University of Pennsylvania)
E18-304

Abstract:   Privacy-preserving data analysis has been put on a firm mathematical foundation since the introduction of differential privacy (DP) in 2006. This privacy definition, however, has some well-known weaknesses: notably, it does not tightly handle composition. This weakness has inspired several recent relaxations of differential privacy based on the Renyi divergences. We propose an alternative relaxation we term "f-DP", which has a number of nice properties and avoids some of the difficulties associated with divergence based relaxations. First, f-DP preserves…

Find out more »

Webinar: Inside the MITx MicroMasters Program in Statistics and Data Science

Devavrat Shah, Karene Chu
Online

<br> </br> Interested in starting your data science journey? <a href="https://event.on24.com/eventRegistration/EventLobbyServlet?target=reg20.jsp&amp;referrer=&amp;eventid=2170691&amp;sessionid=1&amp;key=02F897D60682F202E261E07985F9CB92&amp;regTag=&amp;sourcepage=register">Register for this special free virtual event.</a> You'll receive a confirmation e-mail with further details about the webinar. <br> </br> Demand for professionals skilled in data, analytics, and machine learning is exploding. A recent report by IBM and Burning Glass states that there will be 364K new job openings in data-driven professions this year in the US alone. Data scientists bring value to organizations across industries because they are able…

Find out more »

Diffusion K-means Clustering on Manifolds: provable exact recovery via semidefinite relaxations

Xiaohui Chen (University of Illinois at Urbana-Champaign)
E18-304

Abstract: We introduce the diffusion K-means clustering method on Riemannian submanifolds, which maximizes the within-cluster connectedness based on the diffusion distance. The diffusion K-means constructs a random walk on the similarity graph with vertices as data points randomly sampled on the manifolds and edges as similarities given by a kernel that captures the local geometry of manifolds. Thus the diffusion K-means is a multi-scale clustering tool that is suitable for data with non-linear and non-Euclidean geometric features in mixed dimensions. Given…

Find out more »

Predictive Inference with the Jackknife+

Rina Foygel Barber (University of Chicago)
E18-304

Abstract: We introduce the jackknife+, a novel method for constructing predictive confidence intervals that is robust to the distribution of the data. The jackknife+ modifies the well-known jackknife (leaveoneout cross-validation) to account for the variability in the fitted regression function when we subsample the training data. Assuming exchangeable training samples, we prove that the jackknife+ permits rigorous coverage guarantees regardless of the distribution of the data points, for any algorithm that treats the training points symmetrically (in contrast, such guarantees…

Find out more »

Tales of Random Projections

Kavita Ramanan (Brown University)
E18-304

Abstract: Properties of random projections of high-dimensional probability measures are of interest in a variety of fields, including asymptotic convex geometry, and potential applications to high-dimensional statistics and data analysis.   A particular question of interest is to identify what properties of the high-dimensional measure are captured by its lower-dimensional projections.   While fluctuations of these projections have been well studied over the past decade, we describe more recent work on the tail behavior of such projections, and various implications.  This talk is based on…

Find out more »

Does Revolution Work? Evidence from Nepal

Rohini Pande (Yale University)
E18-304

The last half century has seen the adoption of democratic institutions in much of the developing world. However, the conditions under which de jure democratization leads to the representation of historically disadvantaged groups remains debated as do the implications of descriptive representation for policy inclusion. Using detailed administrative and survey data from Nepal, we examine political selection in a new democracy, the implications for policy inclusion and the role of conflict in affecting political transformation. I situate these findings in the context…

Find out more »

Women in Data Science (WiDS) – Cambridge, MA

Microsoft NERD Center

This one-day technical conference brings together local academic leaders,  industrial professionals and students to hear about the latest data science-related research in a number of domains, to learn how leading-edge companies are leveraging data science for success, and to connect with potential mentors, collaborators, and others in the field.

Find out more »

[POSTPONED] Guido Imbens – The Applied Econometrics Professor and Professor of Economics, Graduate School of Business, Stanford University

E18-304

*Please note: this event has been POSTPONED until Fall 2020* See MIT’s COVID-19 policies for more details.   About the author: Prof. Guido Imbens’ primary field of interest is Econometrics. Research topics in which he is interested include: causality, program evaluation, identification, Bayesian methods, semi-parametric methods, instrumental variables. Guido Imbens does research in econometrics and statistics. His research focuses on developing methods for drawing causal inferences in observational studies, using matching, instrumental variables, and regression discontinuity designs. Guido Imbens is Professor…

Find out more »

Matrix Concentration for Products

Jonathan Niles-Weed (New York University)
online

Abstract: We develop nonasymptotic concentration bounds for products of independent random matrices. Such products arise in the study of stochastic algorithms, linear dynamical systems, and random walks on groups. Our bounds exactly match those available for scalar random variables and continue the program, initiated by Ahlswede-Winter and Tropp, of extending familiar concentration bounds to the noncommutative setting. Our proof technique relies on geometric properties of the Schatten trace class. Joint work with D. Huang, J. A. Tropp, and R. Ward.…

Find out more »


MIT Statistics + Data Science Center
Massachusetts Institute of Technology
77 Massachusetts Avenue
Cambridge, MA 02139-4307
617-253-1764