Analyzing imputed financial data: a new approach to cluster analysis

Authors

  • Halima Bensmail
  • Ramon P. DeGennaro

Abstract

The authors introduce a novel statistical modeling technique to cluster analysis and apply it to financial data. Their two main goals are to handle missing data and to find homogeneous groups within the data. Their approach is flexible and handles large and complex data structures with missing observations and with quantitative and qualitative measurements. The authors achieve this result by mapping the data to a new structure that is free of distributional assumptions in choosing homogeneous groups of observations. Their new method also provides insight into the number of different categories needed for classifying the data. The authors use this approach to partition a matched sample of stocks. One group offers dividend reinvestment plans, and the other does not. Their method partitions this sample with almost 97 percent accuracy even when using only easily available financial variables. One interpretation of their result is that the misclassified companies are the best candidates either to adopt a dividend reinvestment plan (if they have none) or to abandon one (if they currently offer one). The authors offer other suggestions for applications in the field of finance.

Suggested Citation

  • Halima Bensmail & Ramon P. DeGennaro, 2004. "Analyzing imputed financial data: a new approach to cluster analysis," FRB Atlanta Working Paper 2004-20, Federal Reserve Bank of Atlanta.
  • Handle: RePEc:fip:fedawp:2004-20

    Download full text from publisher

    File URL: https://www.frbatlanta.org/-/media/documents/research/publications/wp/2004/wp0420.pdf
    Download Restriction: no

    Citations

    Cited by:

    1. Fan Cai & Nhien-An Le-Khac & Tahar Kechadi, 2016. "Clustering Approaches for Financial Data Analysis: a Survey," Papers 1609.08520, arXiv.org.

