IDEAS home Printed from https://ideas.repec.org/a/spr/qualqt/v51y2017i3d10.1007_s11135-016-0335-0.html
   My bibliography  Save this article

Spurious relationships arising from aggregate variables in linear regression

Author

Listed:
  • David J. Armor

    (George Mason University)

  • Chenna Reddy Cotla

    (George Mason University)

  • Thomas Stratmann

    (George Mason University)

Abstract

Linear regressions that use aggregated values from a group variable such as a school or a neighborhood are commonplace in the social sciences. This paper uses Monte Carlo methods to demonstrate that aggregated variables produce spurious relationships with other dependent and independent variables in a model even when there are no underlying relationships among those variables. The size of the spurious relationships (or postulated effects) increases as the number of observations per group decreases. Although this problem is remedied by including the individual-level variable in the regression, the problem has not been discussed in the methodological literature. Accordingly, studies using aggregate variables must be interpreted with caution if the individual-level measurements are not available.

Suggested Citation

  • David J. Armor & Chenna Reddy Cotla & Thomas Stratmann, 2017. "Spurious relationships arising from aggregate variables in linear regression," Quality & Quantity: International Journal of Methodology, Springer, vol. 51(3), pages 1359-1379, May.
  • Handle: RePEc:spr:qualqt:v:51:y:2017:i:3:d:10.1007_s11135-016-0335-0
    DOI: 10.1007/s11135-016-0335-0
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11135-016-0335-0
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11135-016-0335-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Eric A. Hanushek & John F. Kain & Steven G. Rivkin, 2009. "New Evidence about Brown v. Board of Education: The Complex Effects of School Racial Composition on Achievement," Journal of Labor Economics, University of Chicago Press, vol. 27(3), pages 349-383, July.
    2. Jeffrey M. Wooldridge, 2003. "Cluster-Sample Methods in Applied Econometrics," American Economic Review, American Economic Association, vol. 93(2), pages 133-138, May.
    3. Moulton, Brent R, 1990. "An Illustration of a Pitfall in Estimating the Effects of Aggregate Variables on Micro Unit," The Review of Economics and Statistics, MIT Press, vol. 72(2), pages 334-338, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Massimiliano Giacalone & Demetrio Panarello & Raffaele Mattera, 2018. "Multicollinearity in regression: an efficiency comparison between Lp-norm and least squares estimators," Quality & Quantity: International Journal of Methodology, Springer, vol. 52(4), pages 1831-1859, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mitchell A. Petersen, 2009. "Estimating Standard Errors in Finance Panel Data Sets: Comparing Approaches," Review of Financial Studies, Society for Financial Studies, vol. 22(1), pages 435-480, January.
    2. Antoine Berthou & Lionel Fontagné, 2013. "How do Multiproduct Exporters React to a Change in Trade Costs?," Scandinavian Journal of Economics, Wiley Blackwell, vol. 115(2), pages 326-353, April.
    3. Rok Spruk & Mitja Kovac, 2018. "Inefficient Growth," Review of Economics and Institutions, Università di Perugia, vol. 9(2).
    4. Jere R. Behrman & John Hoddinott & John A. Maluccio, & Erica Soler-Hampejsek & Emily L. Behrman & Reynaldo Martorell & Manuel Ramirez-Zea & Aryeh D. Stein, 2006. "What Determines Adult Cognitive Skills? Impacts of Pre-Schooling, Schooling and Post-Schooling Experiences in Guatemala," PIER Working Paper Archive 06-027, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania.
    5. Fluvià, Modest & Rigall-I-Torrent, Ricard & Espinet, Josep Maria & Garriga, Anna & Saló, Albert, 2011. "Precios implícitos de los atributos de los productos turísticos: ¿Qué esconde el efecto de la localización?/Implicit Prices of the Attributes of Tourism Products: What is Hidden Behind Location?," Estudios de Economia Aplicada, Estudios de Economia Aplicada, vol. 29, pages 781-802, Diciembre.
    6. Klishchuk Bogdan & Zelenyuk Valentin, 2012. "Impact of Services LIberalization on Firm Level Productivity in Eastern Europe and Central Asia," EERC Working Paper Series 12/03e, EERC Research Network, Russia and CIS.
    7. Manuel Acosta & Joaqu�n M. Azagra-Caro & Daniel Coronado, 2016. "Access to Universities' Public Knowledge: Who is More Regionalist?," Regional Studies, Taylor & Francis Journals, vol. 50(3), pages 446-459, March.
    8. Alberto Abadie & Susan Athey & Guido W Imbens & Jeffrey M Wooldridge, 2023. "When Should You Adjust Standard Errors for Clustering?," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 138(1), pages 1-35.
    9. Liza G. Steele, 2015. "Income Inequality, Equal Opportunity, and Attitudes About Redistribution," Social Science Quarterly, Southwestern Social Science Association, vol. 96(2), pages 444-464, June.
    10. Thomas C. Buchmueller & John DiNardo & Robert G. Valletta, 2011. "The Effect of an Employer Health Insurance Mandate on Health Insurance Coverage and the Demand for Labor: Evidence from Hawaii," American Economic Journal: Economic Policy, American Economic Association, vol. 3(4), pages 25-51, November.
    11. A. Colin Cameron & Douglas L. Miller, 2010. "Robust Inference with Clustered Data," Working Papers 106, University of California, Davis, Department of Economics.
    12. Hansen, Bruce E. & Lee, Seojeong, 2019. "Asymptotic theory for clustered samples," Journal of Econometrics, Elsevier, vol. 210(2), pages 268-290.
    13. Boerner, Lars & Volckart, Oliver, 2011. "The utility of a common coinage: Currency unions and the integration of money markets in late Medieval Central Europe," Explorations in Economic History, Elsevier, vol. 48(1), pages 53-65, January.
    14. A. Colin Cameron & Jonah B. Gelbach & Douglas L. Miller, 2008. "Bootstrap-Based Improvements for Inference with Clustered Errors," The Review of Economics and Statistics, MIT Press, vol. 90(3), pages 414-427, August.
    15. Jacqueline Agesa & Richard U. Agesa, 2012. "Imports, unionization and racial wage discrimination in the US," Applied Economics, Taylor & Francis Journals, vol. 44(3), pages 339-350, January.
    16. Vikström, Johan, 2009. "Cluster sample inference using sensitivity analysis: the case with few groups," Working Paper Series 2009:15, IFAU - Institute for Evaluation of Labour Market and Education Policy.
    17. Michele Campolieti, 2004. "Disability Insurance Benefits and Labor Supply: Some Additional Evidence," Journal of Labor Economics, University of Chicago Press, vol. 22(4), pages 863-890, October.
    18. Jamal Ibrahim Haidar, 2022. "Internalization of externalities in international trade," Empirical Economics, Springer, vol. 63(1), pages 469-497, July.
    19. Fontagnã‰, Lionel & Mitaritonna, Cristina, 2013. "Assessing barriers to trade in the distribution and telecom sectors in emerging countries," World Trade Review, Cambridge University Press, vol. 12(1), pages 57-78, January.
    20. Jennifer Rice, 2009. "The influence of managed care on generic prescribing rates: an analysis of HMO physicians," Applied Economics, Taylor & Francis Journals, vol. 43(7), pages 787-796.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:qualqt:v:51:y:2017:i:3:d:10.1007_s11135-016-0335-0. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.