IDEAS home Printed from https://ideas.repec.org/p/osf/socarx/t8wdg_v1.html

LLM-Based Measurement of Latent Attributes in Trade Data

Author

Listed:
  • DiGiuseppe, Matthew

    (Leiden University)

  • Fu, Xuelong
  • Flynn, Michael E

    (Kansas State University)

Abstract

Trade data are available at a high level of disaggregation, allowing scholars to examine flows of highly specific goods. Yet the sheer number of goods classifications (5,000+) makes it difficult to analyze trade flows and tariff policy at a mid-level of aggregation beyond a few existing categorizations. Here, we outline a method that can scale---not merely classify---traded goods on researcher-defined dimensions that are orthogonal to existing classification schemes. We propose that the embedded knowledge in large language models (LLMs) can be used to conduct pairwise comparisons (PWCs) of Harmonized System (HS) product descriptions by determining their relative proximity to a specific concept. A Bayesian Bradley--Terry model then uses these PWCs to place individual items on a latent scale of interest. These estimates and their associated uncertainty can then be used for downstream descriptive or causal analysis.

Suggested Citation

  • DiGiuseppe, Matthew & Fu, Xuelong & Flynn, Michael E, 2026. "LLM-Based Measurement of Latent Attributes in Trade Data," SocArXiv t8wdg_v1, Center for Open Science.
  • Handle: RePEc:osf:socarx:t8wdg_v1
    DOI: 10.31219/osf.io/t8wdg_v1
    as

    Download full text from publisher

    File URL: https://osf.io/download/69c5385fa382e3ec23254897/
    Download Restriction: no

    File URL: https://libkey.io/10.31219/osf.io/t8wdg_v1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. In Song Kim & Steven Liao & Kosuke Imai, 2020. "Measuring Trade Profile with Granular Product‐Level Data," American Journal of Political Science, John Wiley & Sons, vol. 64(1), pages 102-117, January.
    2. Kim, In Song & Londregan, John & Ratkovic, Marc, 2019. "The Effects of Political Institutions on the Extensive and Intensive Margins of Trade," International Organization, Cambridge University Press, vol. 73(4), pages 755-792, October.
    3. Carlson, David & Montgomery, Jacob M., 2017. "A Pairwise Comparison Framework for Fast, Flexible, and Reliable Human Coding of Political Texts," American Political Science Review, Cambridge University Press, vol. 111(4), pages 835-843, November.
    4. Matthew Blackwell & James Honaker & Gary King, 2017. "A Unified Approach to Measurement Error and Missing Data: Overview and Applications," Sociological Methods & Research, , vol. 46(3), pages 303-341, August.
    5. David Autor & David Dorn & Gordon Hanson & Kaveh Majlesi, 2020. "Importing Political Polarization? The Electoral Consequences of Rising Trade Exposure," American Economic Review, American Economic Association, vol. 110(10), pages 3139-3183, October.
    6. Marcus Buckmann & Quynh Anh Nguyen & Edward Hill, 2025. "Revealing economic facts: LLMs know more than they say," Papers 2505.08662, arXiv.org, revised Dec 2025.
    7. Wellhausen, Rachel L., 2025. "Tariffs as Environmental Protection: Evidence from the Global South after the China Garbage Shock," British Journal of Political Science, Cambridge University Press, vol. 55, pages 1-1, January.
    8. Bown, Chad P. & Crowley, Meredith A., 2007. "Trade deflection and trade depression," Journal of International Economics, Elsevier, vol. 72(1), pages 176-201, May.
    9. Kim, In Song, 2017. "Political Cleavages within Industry: Firm-level Lobbying for Trade Liberalization," American Political Science Review, Cambridge University Press, vol. 111(1), pages 1-20, February.
    10. Matthew Blackwell & James Honaker & Gary King, 2017. "A Unified Approach to Measurement Error and Missing Data: Details and Extensions," Sociological Methods & Research, , vol. 46(3), pages 342-369, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Matthew Blackwell & James Honaker & Gary King, 2017. "A Unified Approach to Measurement Error and Missing Data: Overview and Applications," Sociological Methods & Research, , vol. 46(3), pages 303-341, August.
    2. Meyer, Bruce D. & Mittag, Nikolas, 2019. "Combining Administrative and Survey Data to Improve Income Measurement," IZA Discussion Papers 12266, IZA Network @ LISER.
    3. repec:osf:osfxxx:2m9fy_v1 is not listed on IDEAS
    4. Ton de Waal & Arnout van Delden & Sander Scholtus, 2020. "Multi‐source Statistics: Basic Situations and Methods," International Statistical Review, International Statistical Institute, vol. 88(1), pages 203-228, April.
    5. Laura Alfaro & Paola Conconi & Fariha Kamal & Zachary Kroff, 2026. "Trade Within Multinational Boundaries," CESifo Working Paper Series 12394, CESifo.
    6. Joscha Legewie, 2018. "Living on the Edge: Neighborhood Boundaries and the Spatial Dynamics of Violent Crime," Demography, Springer;Population Association of America (PAA), vol. 55(5), pages 1957-1977, October.
    7. Kuwayama, Yusuke & Olmstead, Sheila & Zheng, Jiameng, 2022. "A more comprehensive estimate of the value of water quality," Journal of Public Economics, Elsevier, vol. 207(C).
    8. Chad Brown & Paola Conconi & Aksel Erbahar & Lorenzo Trimarchi, 2020. "Trade Protection Along Supply Chains," Working Papers ECARES 2020-52, ULB -- Universite Libre de Bruxelles.
    9. Cevat G. Aksoy & Sergei Guriev & Daniel S. Treisman, 2018. "Globalization, Government Popularity, and the Great Skill Divide," NBER Working Papers 25062, National Bureau of Economic Research, Inc.
    10. Michael Funke & Adrian Wende, 2023. "The US–China Phase One trade deal: An economic analysis of the managed trade agreement," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 56(2), pages 758-786, May.
    11. Meyer, Bruce D. & Mittag, Nikolas & Wu, Derek, 2024. "Race, Ethnicity, and Measurement Error," IZA Discussion Papers 17349, IZA Network @ LISER.
    12. Bruce D. Meyer & Nikolas Mittag, 2019. "Combining Administrative and Survey Data to Improve Income Measurement," NBER Working Papers 25738, National Bureau of Economic Research, Inc.
    13. Bruce D. Meyer & Nikolas Mittag & Derek Wu, 2024. "Race, Ethnicity, and Measurement Error," NBER Chapters, in: Race, Ethnicity, and Economic Statistics for the 21st Century, pages 327-381, National Bureau of Economic Research, Inc.
    14. Daniel L. Millimet & Christopher F. Parmeter, 2025. "The impact of measurement error on trends in earnings inequality in the USA," Empirical Economics, Springer, vol. 69(5), pages 2727-2753, November.
    15. Edmund Malesky & Markus Taussig, 2019. "How Do Firms Feel About Participation by Their Peers in the Regulatory Design Process? An Online Survey Experiment Testing the Substantive Change and Spillover Mechanisms," Strategy Science, INFORMS, vol. 4(2), pages 129-150, June.
    16. Raghul Gandhi Venkatesan & Bagavandas Mappillairaju, 2024. "Early student dropout detection in Indian secondary education with special reference to selected districts in Tamil Nadu: a machine learning-based survival analysis approach," Journal of Computational Social Science, Springer, vol. 7(3), pages 2309-2331, December.
    17. Lorenzo Trimarchi, 2020. "Trade Policy and the China Syndrome," SERIES 05-2020, Dipartimento di Economia e Finanza - Università degli Studi di Bari "Aldo Moro", revised May 2020.
    18. Brander, Michael & Bernauer, Thomas & Huss, Matthias, 2021. "Improved on-farm storage reduces seasonal food insecurity of smallholder farmer households – Evidence from a randomized control trial in Tanzania," Food Policy, Elsevier, vol. 98(C).
    19. Simon Grund & Oliver Lüdtke & Alexander Robitzsch, 2021. "On the Treatment of Missing Data in Background Questionnaires in Educational Large-Scale Assessments: An Evaluation of Different Procedures," Journal of Educational and Behavioral Statistics, , vol. 46(4), pages 430-465, August.
    20. Laura Alfaro & Paula Conconi & Fariha FK Kamal & Zachary ZK Kroll, 2026. "Trade Within Multinational Boundaries," Working Papers ECARES 2025-01, ULB -- Universite Libre de Bruxelles.
    21. Rousselière, Damien & Bouchard, Marie J. & Rousselière, Samira, 2024. "How does the social economy contribute to social and environmental innovation? Evidence of direct and indirect effects from a European survey," Research Policy, Elsevier, vol. 53(5).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:osf:socarx:t8wdg_v1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: OSF (email available below). General contact details of provider: https://arabixiv.org .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.