IDEAS home Printed from https://ideas.repec.org/p/fip/fedpwp/101559.html
   My bibliography  Save this paper

Precision Without Labels: Detecting Cross-Applicants in Mortgage Data Using Unsupervised Learning

Author

Listed:
  • Hadi Elzayn

  • Simon Freyaldenhoven
  • Minchul Shin

Abstract

We develop a clustering-based algorithm to detect loan applicants who submit multiple applications (“cross-applicants”) in a loan-level dataset without personal identifiers. A key innovation of our approach is a novel evaluation method that does not require labeled training data, allowing us to optimize the tuning parameters of our machine learning algorithm. By applying this methodology to Home Mortgage Disclosure Act (HMDA) data, we create a unique dataset that consolidates mortgage applications to the individual applicant level across the United States. Our preferred specification identifies cross-applicants with 92.3% precision.

Suggested Citation

  • Hadi Elzayn & Simon Freyaldenhoven & Minchul Shin, 2025. "Precision Without Labels: Detecting Cross-Applicants in Mortgage Data Using Unsupervised Learning," Working Papers 25-25, Federal Reserve Bank of Philadelphia.
  • Handle: RePEc:fip:fedpwp:101559
    DOI: 10.21799/frbp.wp.2025.25
    as

    Download full text from publisher

    File URL: https://www.philadelphiafed.org/-/media/FRBP/Assets/working-papers/2025/wp25-25.pdf
    Download Restriction: no

    File URL: https://libkey.io/10.21799/frbp.wp.2025.25?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Yuliya Demyanyk & Otto Van Hemert, 2011. "Understanding the Subprime Mortgage Crisis," The Review of Financial Studies, Society for Financial Studies, vol. 24(6), pages 1848-1880.
    2. Bhutta, Neil & Hizmo, Aurel, 2020. "Paying Too Much? Borrower Sophistication and Overpayment in the US Mortgage Market," CEPR Discussion Papers 14924, C.E.P.R. Discussion Papers.
    3. Christopher Mayer & Karen Pence & Shane M. Sherlund, 2009. "The Rise in Mortgage Defaults," Journal of Economic Perspectives, American Economic Association, vol. 23(1), pages 27-50, Winter.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sumit Agarwal & Yongheng Deng & Jia He & Yonglin Wang & Qi Zhang, 2023. "Lenders’ pricing strategy: Do neighborhood risks matter?," Real Estate Economics, American Real Estate and Urban Economics Association, vol. 51(4), pages 1011-1047, July.
    2. Stefano Colonnello & Mariela Dal Borgo, 2024. "Raising Household Leverage: Evidence from Co-Financed Mortgages," Working Papers 2024: 01, Department of Economics, University of Venice "Ca' Foscari".
    3. Mocetti, Sauro & Viviano, Eliana, 2017. "Looking behind mortgage delinquencies," Journal of Banking & Finance, Elsevier, vol. 75(C), pages 53-63.
    4. Hanming Fang & You Suk Kim & Wenli Li, 2015. "The Dynamics of Adjustable-Rate Subprime Mortgage Default: A Structural Estimation," Finance and Economics Discussion Series 2015-114, Board of Governors of the Federal Reserve System (U.S.).
    5. Mundra, Kusum, 2013. "Minority and Immigrant Homeownership Experience: Evidence from the 2009 American Housing Survey," IZA Discussion Papers 7131, Institute of Labor Economics (IZA).
    6. Gene Amromin & Jennifer Huang & Clemens Sialm & Edward Zhong, 2018. "Complex Mortgages [Why don’t lenders renegotiate more home mortgages? Redefaults, self-cures, and securitization]," Review of Finance, European Finance Association, vol. 22(6), pages 1975-2007.
    7. Charles Grant, 2021. "The Evolution of Arrears among US Households 1995–2013," JRFM, MDPI, vol. 14(2), pages 1-40, January.
    8. Manuel Adelino & Antoinette Schoar & Felipe Severino, 2015. "Loan Originations and Defaults in the Mortgage Crisis: Further Evidence," NBER Working Papers 21320, National Bureau of Economic Research, Inc.
    9. Tomasz Piskorski & Amit Seru & James Witkin, 2015. "Asset Quality Misrepresentation by Financial Intermediaries: Evidence from the RMBS Market," Journal of Finance, American Finance Association, vol. 70(6), pages 2635-2678, December.
    10. Steve Sharpe & Shane Sherlund, 2016. "Crowding Out Effects of Refinancing on New Purchase Mortgages," Review of Industrial Organization, Springer;The Industrial Organization Society, vol. 48(2), pages 209-239, March.
    11. Patrick Bayer & Fernando Ferreira & Stephen L. Ross, 2014. "Race, Ethnicity and High-Cost Mortgage Lending," NBER Working Papers 20762, National Bureau of Economic Research, Inc.
    12. van Ooijen, Raun & van Rooij, Maarten C.J., 2016. "Mortgage risks, debt literacy and financial advice," Journal of Banking & Finance, Elsevier, vol. 72(C), pages 201-217.
    13. Hanming Fang & You Suk Kim & Wenli Li, 2016. "The dynamics of subprime adjustable-rate mortgage default: a structural estimation," Working Papers 16-2, Federal Reserve Bank of Philadelphia.
    14. Simlai, Prodosh, 2019. "Subprime credit, idiosyncratic risk, and foreclosures," The Quarterly Review of Economics and Finance, Elsevier, vol. 74(C), pages 175-189.
    15. Yongqiang Chu, 2014. "Credit constraints, inelastic supply, and the housing boom," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 17(1), pages 52-69, January.
    16. Levitin, Adam & Wachter, Susan, 2012. "Explaining the Housing Bubble," MPRA Paper 41920, University Library of Munich, Germany.
    17. W. Scott Frame & Lawrence J. White, 2009. "Technological change, financial innovation, and diffusion in banking," FRB Atlanta Working Paper 2009-10, Federal Reserve Bank of Atlanta.
    18. Abdullah Yavas & Shuang Zhu, 2024. "Misreporting of second liens in portfolio mortgages and privately securitized mortgages," Real Estate Economics, American Real Estate and Urban Economics Association, vol. 52(1), pages 73-109, January.
    19. Agatha M. Poroshina, 2014. "Credit Risk Modeling Of Residential Mortgage Lending In Russia," HSE Working papers WP BRP 30/FE/2014, National Research University Higher School of Economics.
    20. Beltratti, Andrea & Benetton, Matteo & Gavazza, Alessandro, 2017. "The role of prepayment penalties in mortgage loans," Journal of Banking & Finance, Elsevier, vol. 82(C), pages 165-179.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:fip:fedpwp:101559. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Beth Paul (email available below). General contact details of provider: https://edirc.repec.org/data/frbphus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.