IDSS Distinguished Seminars


list_alt
  • [POSTPONED] The Blessings of Multiple Causes

    On April 13, 2020 at 4:00 pm till 5:00 pm
    David Blei (Columbia University)
    E18-304

    Please note: this event has been POSTPONED until Fall 2020* See MIT’s COVID-19 policies for more details.

    David Blei

    Title: The Blessings of Multiple Causes

    Abstract: Causal inference from observational data is a vital problem, but it comes with strong assumptions. Most methods require that we observe all confounders, variables that affect both the causal variables and the outcome variables. But whether we have observed all confounders is a famously untestable assumption. We describe the deconfounder, a way to do causal inference with weaker assumptions than the classical methods require.

    How does the deconfounder work? While traditional causal methods measure the effect of a single cause on an outcome, many modern scientific studies involve multiple causes, different variables whose effects are simultaneously of interest. The deconfounder uses the correlation among multiple causes as evidence for unobserved confounders, combining unsupervised machine learning and predictive model checking to perform causal inference. We demonstrate the deconfounder on real-world data and simulation studies, and describe the theoretical requirements for the deconfounder to provide unbiased causal estimates.

    This is joint work with Yixin Wang.

    About the speaker: David Blei is a Professor of Statistics and Computer Science at Columbia University, and a member of the Columbia Data Science Institute. He studies probabilistic machine learning, including its theory, algorithms, and application. David has received several awards for his research, including a Sloan Fellowship (2010), Office of Naval Research Young Investigator Award (2011), Presidential Early Career Award for Scientists and Engineers (2011), Blavatnik Faculty Award (2013), ACM-Infosys Foundation Award (2013), a Guggenheim fellowship (2017), and a Simons Investigator Award (2019). He is the co-editor-in-chief of the Journal of Machine Learning Research. He is a fellow of the ACM and the IMS.

    Reception to follow.

    Find out more »: [POSTPONED] The Blessings of Multiple Causes
  • The Ethical Algorithm

    On May 19, 2020 at 4:00 pm till 5:00 pm
    Michael Kearns (University of Pennsylvania)
    online
    Michael Kearns

    Title: The Ethical Algorithm

    Abstract: Many recent mainstream media articles and popular books have raised alarms over anti-social algorithmic behavior, especially regarding machine learning and artificial intelligence. The concerns include leaks of sensitive personal data by predictive models, algorithmic discrimination as a side-effect of machine learning, and inscrutable decisions made by complex models. While standard and legitimate responses to these phenomena include calls for stronger and better laws and regulations, researchers in machine learning, statistics and related areas are also working on designing better-behaved algorithms. An explosion of recent research in areas such as differential privacy, algorithmic fairness and algorithmic game theory is forging a new science of socially aware algorithm design. I will survey these developments and attempt to place them in a broader societal context. This talk is based on the book The Ethical Algorithm, co-authored with Aaron Roth (Oxford University Press).

    About the speaker: Since 2002, Michael Kearns has been a professor in the Computer and Information Science Department at the University of Pennsylvania. He holds the National Center Chair and has secondary appointments in the department of Economics, and in the departments of Statistics and Operations, Information and Decisions (OID) in the Wharton School. He is the Founding Director of the Warren Center for Network and Data Sciences, the faculty founder and former director of Penn Engineering’s Networked and Social Systems Engineering (NETS) Program, and a faculty affiliate in Penn’s Applied Math and Computational Science graduate program. Kearns has worked extensively in quantitative and algorithmic trading on Wall Street (including at Lehman Brothers, Bank of America, and SAC Capital). He often served as an advisor to technology companies and venture capital firms. Kearns is also involved in the seed-stage fund Founder Collective and occasionally invests in early-stage technology startups. Kearns is a member of the Scientific Advisory Board of the Alan Turing Institute, and of the Market Surveillance Advisory Group of FINRA. Kearns is an elected Fellow of the American Academy of Arts and Sciences, the Association for Computing Machinery, the Association for the Advancement of Artificial Intelligence, and the Society for the Advancement of Economic Theory. He spent 1991-2001 in machine learning and AI research at AT&T Bell Labs. During his last four years there, Kearns was the head of the AI department. Before joining the Penn faculty in January 2002, Kearns spent 2001 as CTO of the European venture capital firm Syntek Capital and served as an advisor to various startups, including Yodle, Wealthfront, and Activate Networks. In the past Kearns has served as a member of the Advanced Technology Advisory Council of PJM Interconnection, the Scientific Advisory Board of Opera Solutions, and the Technical Advisory Board of Microsoft Research Cambridge.

    Zoom meeting ID: 949-0309-1399
    Join Zoom meeting: https://mit.zoom.us/j/94903091399
    YouTube livestream: https://youtu.be/IATv0m5U5z8

    Find out more »: The Ethical Algorithm
  • IDSS Distinguished Seminar Speaker – Arun Majumdar

    On October 11, 2016 at 4:00 pm till 5:00 pm
  • The Moral Character of Cryptographic Work

    On October 18, 2016 at 4:00 pm till 5:00 pm

    Cryptography rearranges power: it configures who can dowhat, from what. This makes cryptography an inherently political tool, and it confers on the field an intrinsically moral dimension. The Snowden revelations motivate a reassessment of the political and moral positioning of cryptography. They lead one to ask if our inability to effectively address mass surveillance constitutes a failure of our field. I believe that it does. I call for a community-wide effort to develop more effective means to resist mass surveillance. I plead for a reinvention of our disciplinary
    culture to attend not only to puzzles and math, but, also, to the societal
    implications of our work.

    Find out more »: The Moral Character of Cryptographic Work
  • Distributed Learning Dynamics Convergence in Routing Games

    On April 5, 2016 at 4:00 pm till 5:00 pm

    With the emergence of smartphone based sensing for mobility as the main paradigm for sensing in the last decade, radically new information sets have become available for the driving public. This information enables commuters to make repeated decisions on a daily basis based on anticipated state of the network. This repeated decision-making process creates interesting patterns for the transportation network, in which users might (or might not) reach an equilibrium, depending on the information at their disposal (for example knowing or not what other users of the network are experiencing or doing). The present talk starts with a brief presentation of the state of the art in traffic monitoring, leading to a new results in routing games. Routing games offer a simple yet powerful model of congestion in traffic networks, both in transportation and communication systems. The congestion in such systems is affected by the combined decision of the agents (drivers or routers), so modeling the decision process of the agents is important, not only to estimate and predict the behavior of the system, but also to be able to control it. This decision process is often called learning, as agents “learn” information about the system or about the other agents. We propose and study different models of learning with the following requirement: the joint learning dynamics should converge asymptotically to the Nash equilibrium of the game. In particular, we focus on two important properties: Is the model robust to stochastic perturbations (such as measurement noise)? And does the model allow heterogeneous learning (different agents may follow different learning strategies)? We study these questions using tools from online learning theory and stochastic approximation theory. We then present experimental results obtained with an online gaming application in which distributed players can play the routing game: they connect to the web app and participate in the game, by iteratively making decisions about their routes and observing outcomes. We show preliminary results from data collected from the application. In particular, we propose and solve a model estimation problem to estimate the learning dynamics of the players, and compare the predictions of the model to the actual behavior of the players, and discuss extensions and open questions.

    Find out more »: Distributed Learning Dynamics Convergence in Routing Games
  • Universal Laws and Architectures: Theory and Lessons from Brains, Nets, Hearts, Bugs, Grids, Flows, and Zombies

    On March 17, 2016 at 1:00 pm till 2:00 pm

    This talk will aim to accessibly describe progress on a theory of network architecture relevant to neuroscience, biology, medicine, and technology, particularly SDN/NFV and cyberphysical systems. Key ideas are motivated by familiar examples from neuroscience, including live demos using audience brains, and compared with examples from technology and biology. Background material and additional details are in online videos (accessible from website cds.caltech.edu/~doyle) for which this talk can be viewed as a short trailer. More specifically, my research is aimed at developing a more “unified” theory for complex networks motivated by and drawing lessons from neuroscience[4], cell biology[3], medical physiology[9], technology (internet, smartgrid, sustainable infrastructure)[1][8], and multiscale physics [2],[5],[6]. This theory involves several elements: hard limits, tradeoffs, and constraints on achievable robust, efficient performance ( “laws”), the organizing principles that succeed or fail in achieving them (“architectures” and protocols), the resulting high variability data and “robust yet fragile” behavior observed in real systems and case studies (behavior, data, statistics), the processes by which systems adapt and evolve (variation, selection, design), and their unavoidable fragilities (hijacking, parasites, predation, zombies). A final crucial element is scalable algorithms to allow study and design of complex networks using this theory. We will leverage a series of case studies with live demos from neuroscience, particularly the necessity of layered architectures due to speed accuracy tradeoffs (SAT, e.g. Fitts’ “law”) in vision, cognition, and sensorimotor control. The online videos compare similar laws and architectures from medicine, cell biology and modern computer and networking technology. Zombies emerge throughout as a ubiquitous, strangely popular, and annoying system fragility, particularly in the form of zombie science. In addition to the videos, papers [1] and [4] (and references therein) are the most accessible and broad introduction while the other papers give more domain specific details, most importantly [3] and [9] for biology and medicine. For math details a good place to start is Nikolai Matni’s website (cds.caltech.edu/~nmatni/) or his EECS/IDSS talk on March 9. I’ll be at MIT for several days, and I’m hoping to discuss the implications of this research direction for social systems with IDSS and other researchers. Selected recent references: [1] Alderson DL, Doyle JC (2010) Contrasting views of complexity and their implications for network-centric infrastructures. IEEE Trans Systems Man Cybernetics—Part A: Syst Humans 40:839-852. [2] Sandberg H, Delvenne JC, Doyle JC. On Lossless Approximations, the Fluctuation-Dissipation Theorem, and Limitations of Measurements, IEEE Trans Auto Control, Feb 2011 [3] Chandra F, Buzi G, Doyle JC (2011) Glycolytic oscillations and limits on robust efficiency. Science, Vol 333, pp 187-192. [4] Doyle JC, Csete ME(2011) Architecture, Constraints, and Behavior, P Natl Acad Sci USA, vol. 108, Sup 3 15624-15630 [5] Gayme DF, McKeon BJ, Bamieh B, Papachristodoulou P, Doyle JC (2011) Amplification and Nonlinear Mechanisms in Plane Couette Flow, Physics of Fluids, V23, Issue 6, 065108 [6] Page, M. T., D. Alderson, and J. Doyle (2011), The magnitude distribution of earthquakes near Southern California faults, J. Geophys. Res., 116, B12309, doi:10.1029/2010JB007933. [7] Namas R, Zamora R, An, G, Doyle, J et al, (2012) Sepsis: Something old, something new, and a systems view, Journal Of Critical Care Volume: 27 Issue: 3 [8] Chen, L; Ho, T; Chiang, M, Low S; Doyle J,(2012) Congestion Control for Multicast Flows With Network Coding, IEEE Trans On Information Theory Volume: 58 Issue: 9 Pages: 5908-5921 [9] Li, Cruz, Chien, Sojoudi, Recht, Stone, Csete, Bahmiller, Doyle (2014) Robust efficiency and actuator saturation explain healthy heart rate control and variability, P Natl Acad Sci USA 2014 111 (33) E3476-E3485

    Find out more »: Universal Laws and Architectures: Theory and Lessons from Brains, Nets, Hearts, Bugs, Grids, Flows, and Zombies
  • Randomized Controlled Trials and Policy Making in Developing Countries

    On March 8, 2016 at 4:00 pm till 5:00 pm

    Twenty years ago, randomized controlled trials testing social policies were essentially unheard of in developing countries, although there were prominent examples in developed economies. Today their number, scale and scope is much greater than could probably have been imagined. This talk will take stock of the role that randomized controlled trials have played to date, and can play in the future, in guiding policy. We will try to assess both successes and tribulations, challenges and promises.

    Find out more »: Randomized Controlled Trials and Policy Making in Developing Countries
  • Making Good Policies with Bad Causal Inference: The Role of Prediction and Machine Learning

    On September 15, 2015 at 4:00 pm till 5:00 pm

    In the last few decades, we have learned to be careful about causation, and have developed powerful tools for making causal inferences from data. Applying these tools has generated both policy impact and conceptual insights. Prof. Mullainathan will argue that there are a large class of problems where causal inference is largely unnecessary where, instead, prediction is the central challenge. These problems are ideally suited to machine learning and high dimensional data analysis tools. In this talk he will (1) try to delineate the difference between problems that require causation and problems that require prediction; (2) describe results from solving one such prediction problem in detail; (3) highlight the set of new statistical issues these problems raise; and (4) argue that solving these problems can also generate both policy impact and conceptual insights.

    Find out more »: Making Good Policies with Bad Causal Inference: The Role of Prediction and Machine Learning
  • A Big Data System for Things That Move

    On October 15, 2015 at 4:00 pm till 5:00 pm

    The world consists of many interesting things that move: people go to work, home, school, and shop in public transit buses and trains, or in cars and taxis; goods move on these networks and by trucks or by air each day; and food items travel a large distance to meet their eater. Thus, massive movement processes are underway in the world every day and it is critical to ensure their safe, timely and efficient operation. Towards this end, low-cost sensing and acquisition of the movement data is being achieved: from GPS devices, RFID and barcode scanners, to smart commuter cards and smartphones, snapshots of the movement process are becoming available. In this talk, I will present a system for stitching together these snapshots and reconstructing urban mobility at a very fine-grained level. The system, which we call the Space-Time Engine, provides an interactive dashboard and a querying engine for answering questions such as: what is the crowding at a train station? where’re packages held up and how can their delivery be sped up? how can the available supply of transport capacity be better used to address daily demand as well as the demand on exceptional days (such as rallies and severe weather events). I will describe the STE’s capabilities for operational and planning purposes, and as a learning system.

    Find out more »: A Big Data System for Things That Move
  • Gossip: Identifying Central Individuals in a Social Network

    On November 10, 2015 at 4:00 pm till 5:00 pm

    How can we identify the most influential nodes in a network for initiating diffusion? Are people able to easily identify those people in their communities who are best at spreading information, and if so, how? Using theory and recent data, we examine these questions and see how the structure of social networks affects information transmission ranging from gossip to the diffusion of new products. In particular, a model of diffusion is used to define centrality and shown to nest other measures of centrality as extreme special cases. Then it will be shown that by tracking gossip within a network, nodes can easily learn to rank the centrality of other nodes without knowing anything about the network itself. The theoretical predictions are consistent with new field experiments.

    Find out more »: Gossip: Identifying Central Individuals in a Social Network