IDEAS home Printed from https://ideas.repec.org/a/jss/jstsof/v025i11.html

Invariant and Metric Free Proximities for Data Matching: An R Package

Author

Listed:
  • Iacus, Stefano
  • Porro, Giuseppe

Abstract

Data matching is a typical statistical problem in non experimental and/or observational studies or, more generally, in cross-sectional studies in which one or more data sets are to be compared. Several methods are available in the literature, most of which based on a particular metric or on statistical models, either parametric or nonparametric. In this paper we present two methods to calculate a proximity which have the property of being invariant under monotonic transformations. These methods require at most the notion of ordering. An open-source software in the form of a R package is also presented.

Suggested Citation

  • Iacus, Stefano & Porro, Giuseppe, 2008. "Invariant and Metric Free Proximities for Data Matching: An R Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 25(i11).
  • Handle: RePEc:jss:jstsof:v:025:i11
    DOI: http://hdl.handle.net/10.18637/jss.v025.i11
    as

    Download full text from publisher

    File URL: https://www.jstatsoft.org/index.php/jss/article/view/v025i11/v25i11.pdf
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v025i11/rrp_2.7.tar.gz
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v025i11/v25i11.R
    Download Restriction: no

    File URL: https://libkey.io/http://hdl.handle.net/10.18637/jss.v025.i11?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Iacus, Stefano M. & Porro, Giuseppe, 2007. "Missing data imputation, matching and other applications of random recursive partitioning," Computational Statistics & Data Analysis, Elsevier, vol. 52(2), pages 773-789, October.
    2. Giuseppe Porro & Stefano Maria Iacus, 2009. "Random Recursive Partitioning: a matching method for the estimation of the average treatment effect," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 24(1), pages 163-185.
    3. Ho, Daniel E. & Imai, Kosuke & King, Gary & Stuart, Elizabeth A., 2007. "Matching as Nonparametric Preprocessing for Reducing Model Dependence in Parametric Causal Inference," Political Analysis, Cambridge University Press, vol. 15(3), pages 199-236, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Doove, L.L. & Van Buuren, S. & Dusseldorp, E., 2014. "Recursive partitioning for missing data imputation in the presence of interaction effects," Computational Statistics & Data Analysis, Elsevier, vol. 72(C), pages 92-104.
    2. Humera Razzak & Christian Heumann, 2019. "Hybrid Multiple Imputation In A Large Scale Complex Survey," Statistics in Transition New Series, Polish Statistical Association, vol. 20(4), pages 33-58, December.
    3. Razzak Humera & Heumann Christian, 2019. "Hybrid Multiple Imputation In A Large Scale Complex Survey," Statistics in Transition New Series, Statistics Poland, vol. 20(4), pages 33-58, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Iacus, Stefano M. & King, Gary & Porro, Giuseppe, 2011. "Multivariate Matching Methods That Are Monotonic Imbalance Bounding," Journal of the American Statistical Association, American Statistical Association, vol. 106(493), pages 345-361.
    2. Anjani Kumar & Vinay K. Sonkar & K. S. Aditya, 2023. "Assessing the Impact of Lending Through Kisan Credit Cards in Rural India: Evidence from Eastern India," The European Journal of Development Research, Palgrave Macmillan;European Association of Development Research and Training Institutes (EADI), vol. 35(3), pages 602-622, June.
    3. Dettmann, E. & Becker, C. & Schmeißer, C., 2011. "Distance functions for matching in small samples," Computational Statistics & Data Analysis, Elsevier, vol. 55(5), pages 1942-1960, May.
    4. Srinivasa, Aditya Korekallu & Praveen, K.V. & Subash, S.P. & Nithyashree, ML & Jha, Girish Kumar, 2021. "Does a Farmer’s Knowledge of Minimum Support Price (MSP) Affect the Farm-Gate Price? Evidence from India," 2021 Conference, August 17-31, 2021, Virtual 315205, International Association of Agricultural Economists.
    5. Matthew Blackwell & Stefano Iacus & Gary King & Giuseppe Porro, 2009. "cem: Coarsened exact matching in Stata," Stata Journal, StataCorp LLC, vol. 9(4), pages 524-546, December.
    6. Zichen Deng & Maarten Lindeboom, 2021. "Early-life Famine Exposure, Hunger Recall and Later-life Health," Tinbergen Institute Discussion Papers 21-054/V, Tinbergen Institute.
    7. Leah Warfield Smith & Randall Lee Rose & Alex R. Zablah & Heath McCullough & Mohammad “Mike” Saljoughian, 2023. "Examining post-purchase consumer responses to product automation," Journal of the Academy of Marketing Science, Springer, vol. 51(3), pages 530-550, May.
    8. Baron, Opher & Callen, Jeffrey L. & Segal, Dan, 2023. "Does the bullwhip matter economically? A cross-sectional firm-level analysis," International Journal of Production Economics, Elsevier, vol. 259(C).
    9. Jian Jiu Chen & Sai Yin Ho & Wing Man Au & Man Ping Wang & Tai Hing Lam, 2015. "Family Smoking, Exposure to Secondhand Smoke at Home and Family Unhappiness in Children," IJERPH, MDPI, vol. 12(11), pages 1-14, November.
    10. Kube, Roland & von Graevenitz, Kathrine & Löschel, Andreas & Massier, Philipp, 2019. "Do voluntary environmental programs reduce emissions? EMAS in the German manufacturing sector," Energy Economics, Elsevier, vol. 84(S1).
    11. Iacus, Stefano M. & Porro, Giuseppe, 2007. "Missing data imputation, matching and other applications of random recursive partitioning," Computational Statistics & Data Analysis, Elsevier, vol. 52(2), pages 773-789, October.
    12. K. Poehlmann & R. Helm & O. Mauroner & J. Auburger, 2021. "Corporate spin-offs’ success factors: management lessons from a comparative empirical analysis with research-based spin-offs," Review of Managerial Science, Springer, vol. 15(6), pages 1767-1796, August.
    13. Renata Baborska & Emilio Hernandez & Emiliano Magrini & Cristian Morales-Opazo, 2020. "The impact of financial inclusion on rural food security experience: A perspective from low-and middle-income countries," Review of Development Finance Journal, Chartered Institute of Development Finance, vol. 10(2), pages 1-18.
    14. Michael A Ruderman & Deirdra F Wilson & Savanna Reid, 2015. "Does Prison Crowding Predict Higher Rates of Substance Use Related Parole Violations? A Recurrent Events Multi-Level Survival Analysis," PLOS ONE, Public Library of Science, vol. 10(10), pages 1-19, October.
    15. Elaine M. Wolf & Douglas A. Wolf, 2008. "Mixed Results in a Transitional Planning Program for Alternative School Students," Evaluation Review, , vol. 32(2), pages 187-215, April.
    16. Cappelletti, Matilde & Giuffrida, Leonardo M., 2022. "Targeted bidders in government tenders," ZEW Discussion Papers 22-030, ZEW - Leibniz Centre for European Economic Research.
    17. repec:osf:metaar:s42ba_v1 is not listed on IDEAS
    18. Gabriele Spilker & Tobias Böhmelt, 2013. "The impact of preferential trade agreements on governmental repression revisited," The Review of International Organizations, Springer, vol. 8(3), pages 343-361, September.
    19. Michael Funke & Helery Tasane, 2025. "Regional economic impacts of the Øresund cross-border fixed link: Cui Bono?," Regional Studies, Taylor & Francis Journals, vol. 59(1), pages 2573115-257, December.
    20. Reed, Deborah K. & Aloe, Ariel M., 2020. "Interpreting the effectiveness of a summer reading program: The eye of the beholder," Evaluation and Program Planning, Elsevier, vol. 83(C).
    21. Cauê Carrilho & Gabriela Demarchi & Amy Duchelle & Sven Wunder & Carla Morsello, 2022. "Permanence of avoided deforestation in a Transamazon REDD+ initiative (Pará, Brazil)," CEE-M Working Papers hal-03614704, CEE-M, Universtiy of Montpellier, CNRS, INRA, Montpellier SupAgro.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:jss:jstsof:v:025:i11. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F. Baum (email available below). General contact details of provider: http://www.jstatsoft.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.