Author
Abstract
This paper documents a fundamental problem in applied gravity estimation: the variance of gravity estimates becomes prohibitively large when policy regressors are parse. I define sparse regressors as dummy variables that equal one in fewer than 100 to 500 observations depending on the setting; a common characteristic of trade policies such as free trade agreements. Through Monte Carlo simulations calibrated to match previously established data generating processes in the literature, I demonstrate that the variance of coefficient estimates is approximately inversely proportional to the number of treated observations, making reliable statistical inference impossible when policy variables are infrequent. This variance problem is distinct from well-known issues related to high-dimensional fixed effects and affects both OLS and PPML estimators regardless of specification complexity. The severity of this variance problem depends on the magnitude of the true underlying coefficient: the variance problem is severe and practically prohibitive for moderate coefficients (such as those typically found for many trade policy effects), but becomes negligible for large effects. To address this issue, I propose Ridge regularization as a practical solution that reduces estimate variance while introducing minimal bias. The main contribution however is not advocating for Ridge regularization, but rather highlighting that variance is often the dominant source of uncertainty in gravity estimation when dealing with sparse policy variables, underscoring fundamental limitations of gravity models for evaluating infrequent policies with moderate effect sizes. These findings have implications not only for the international trade literature but also for other fields that employ gravitytype specifications, including migration and macroeconomics.
Suggested Citation
Camilo Umana Dajud, 2025.
"The Variance of Gravity,"
Working Papers
2025-12, CEPII research center.
Handle:
RePEc:cii:cepidt:2025-12
Download full text from publisher
More about this item
Keywords
;
;
;
;
JEL classification:
- F10 - International Economics - - Trade - - - General
- F14 - International Economics - - Trade - - - Empirical Studies of Trade
- C23 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Models with Panel Data; Spatio-temporal Models
Statistics
Access and download statistics
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cii:cepidt:2025-12. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/cepiifr.html .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.