Statistics and Data Science Seminar
IDSS Distinguished Seminars
IDSS Special Seminar
SDSC Special Events
Online events
IDS.190 Topics in Bayesian Modeling and Computation
Past Events
LIDS & Stats Tea

10 events found.

Views Navigation

Event Views Navigation

List
Month

Select date.

February 2016

IDSS Special Seminar

Overcoming Overfitting with Algorithmic Stability

February 23, 2016 @ 2:00 pm

Most applications of machine learning across science and industry rely on the holdout method for model selection and validation. Unfortunately, the holdout method often fails in the now common scenario where the analyst works interactively with the data, iteratively choosing which methods to use by probing the same holdout data many times. In this talk, we apply the principle of algorithmic stability to design reusable holdout methods, which can be used many times without losing the guarantees of fresh data.…

Find out more »

IDSS Special Seminar

Learning in Strategic Environments: Theory and Data

February 24, 2016 @ 2:00 pm

The strategic interaction of multiple parties with different objectives is at the heart of modern large scale computer systems and electronic markets. Participants face such complex decisions in these settings that the classic economic equilibrium is not a good predictor of their behavior. The analysis and design of these systems has to go beyond equilibrium assumptions. Evidence from online auction marketplaces suggests that participants rather use algorithmic learning. In the first part of the talk, I will describe a theoretical…

Find out more »

Statistics and Data Science Seminar

On Shape Constrained Estimation

February 26, 2016 @ 11:00 am

Shape constraints such as monotonicity, convexity, and log-concavity are naturally motivated in many applications, and can offer attractive alternatives to more traditional smoothness constraints in nonparametric estimation. In this talk we present some recent results on shape constrained estimation in high and low dimensions. First, we show how shape constrained additive models can be used to select variables in a sparse convex regression function. In contrast, additive models generally fail for variable selection under smoothness constraints. Next, we introduce graph-structured…

Find out more »

March 2016

Statistics and Data Science Seminar

On Complex Supervised Learning Problems, and On Ranking and Choice Models

March 4, 2016 @ 11:00 am - 12:00 pm

Shivani Agarwal (Indian Institute of Science/Radcliffe)

32-123

While simple supervised learning problems like binary classification and regression are fairly well understood, increasingly, many applications involve more complex learning problems: more complex label and prediction spaces, more complex loss structures, or both. The first part of the talk will discuss recent advances in our understanding of such problems, including the notion of convex calibration dimension of a loss function, unified approaches for designing convex calibrated surrogates for arbitrary losses, and connections between supervised learning and property elicitation. The…

Find out more »

IDSS Distinguished Seminars

Randomized Controlled Trials and Policy Making in Developing Countries

March 8, 2016 @ 4:00 pm

Twenty years ago, randomized controlled trials testing social policies were essentially unheard of in developing countries, although there were prominent examples in developed economies. Today their number, scale and scope is much greater than could probably have been imagined. This talk will take stock of the role that randomized controlled trials have played to date, and can play in the future, in guiding policy. We will try to assess both successes and tribulations, challenges and promises.

Find out more »

IDSS Distinguished Seminars

Universal Laws and Architectures: Theory and Lessons from Brains, Nets, Hearts, Bugs, Grids, Flows, and Zombies

March 17, 2016 @ 1:00 pm

This talk will aim to accessibly describe progress on a theory of network architecture relevant to neuroscience, biology, medicine, and technology, particularly SDN/NFV and cyberphysical systems. Key ideas are motivated by familiar examples from neuroscience, including live demos using audience brains, and compared with examples from technology and biology. Background material and additional details are in online videos (accessible from website cds.caltech.edu/~doyle) for which this talk can be viewed as a short trailer. More specifically, my research is aimed at…

Find out more »

Statistics and Data Science Seminar

Pairwise Comparison Models for High-Dimensional Ranking

March 18, 2016 @ 11:00 am - 12:00 pm

Martin Wainwright (UC Berkeley)

32-123

Data in the form of pairwise comparisons between a collection of n items arises in many settings, including voting schemes, tournament play, and online search rankings. We study a flexible non-parametric model for pairwise comparisons, under which the probabilities of outcomes are required only to satisfy a natural form of stochastic transitivity (SST). The SST class includes a large family of classical parametric models as special cases, among them the Bradley-Terry-Luce and Thurstone models, but is substantially richer. We provide…

Find out more »

April 2016

Statistics and Data Science Seminar

Sub-Gaussian Mean Estimators

April 1, 2016 @ 11:00 am - 12:00 pm

Roberto Oliveira (IMPA)

32-123

We discuss the possibilities and limitations of estimating the mean of a real-valued random variable from independent and identically distributed observations from a non-asymptotic point of view. In particular, we define estimators with a sub-Gaussian behavior even for certain heavy-tailed distributions. We also prove various impossibility results for mean estimators. These results are in http://arxiv.org/abs/1509.05845, to appear in Ann Stat. (Joint work with L. Devroye, M. Lerasle, and G. Lugosi.)

Find out more »

Statistics and Data Science Seminar

Double Machine Learning: Improved Point and Interval Estimation of Treatment and Causal Parameters

April 5, 2016 @ 11:00 am

Most supervised machine learning (ML) methods are explicitly designed to solve prediction problems very well. Achieving this goal does not imply that these methods automatically deliver good estimators of causal parameters. Examples of such parameters include individual regression coefficients, average treatment effects, average lifts, and demand or supply elasticities. In fact, estimates of such causal parameters obtained via naively plugging ML estimators into estimating equations for such parameters can behave very poorly, for example, by formally having inferior rates of…

Find out more »

IDSS Distinguished Seminars

Distributed Learning Dynamics Convergence in Routing Games

April 5, 2016 @ 4:00 pm

With the emergence of smartphone based sensing for mobility as the main paradigm for sensing in the last decade, radically new information sets have become available for the driving public. This information enables commuters to make repeated decisions on a daily basis based on anticipated state of the network. This repeated decision-making process creates interesting patterns for the transportation network, in which users might (or might not) reach an equilibrium, depending on the information at their disposal (for example knowing…

Find out more »

Previous Events
Today
Next Events

Google Calendar
iCalendar
Outlook 365
Outlook Live
Export .ics file
Export Outlook .ics file

MIT Statistics + Data Science Center
Massachusetts Institute of Technology
77 Massachusetts Avenue
Cambridge, MA 02139-4307
617-253-1764

Accessibility

About
People
▼
Academics
▼
Research
News
Events
▼
Seminars
▼
- Upcoming
- Archive
  ▼
Jobs