Causal Inference on Outcomes Learned from Text
Abstract: (with Iman Modarressi and Amar Venugopal; arxiv.org/abs/2503.00725 We propose a machine-learning tool that yields causal inference on text in randomized trials. Based on a simple econometric framework in which text may capture outcomes of interest, our procedure addresses three questions: First, is the text affected by the treatment? Second, which outcomes is the effect on? And third, how complete is our description of causal effects? To answer all three questions, our approach uses large language models (LLMs) that suggest systematic…