IDEAS home Printed from https://ideas.repec.org/p/arx/papers/1406.6038.html
   My bibliography  Save this paper

Exact fit of simple finite mixture models

Author

Listed:
  • Dirk Tasche

Abstract

How to forecast next year's portfolio-wide credit default rate based on last year's default observations and the current score distribution? A classical approach to this problem consists of fitting a mixture of the conditional score distributions observed last year to the current score distribution. This is a special (simple) case of a finite mixture model where the mixture components are fixed and only the weights of the components are estimated. The optimum weights provide a forecast of next year's portfolio-wide default rate. We point out that the maximum-likelihood (ML) approach to fitting the mixture distribution not only gives an optimum but even an exact fit if we allow the mixture components to vary but keep their density ratio fix. From this observation we can conclude that the standard default rate forecast based on last year's conditional default rates will always be located between last year's portfolio-wide default rate and the ML forecast for next year. As an application example, then cost quantification is discussed. We also discuss how the mixture model based estimation methods can be used to forecast total loss. This involves the reinterpretation of an individual classification problem as a collective quantification problem.

Suggested Citation

  • Dirk Tasche, 2014. "Exact fit of simple finite mixture models," Papers 1406.6038, arXiv.org, revised Jul 2014.
  • Handle: RePEc:arx:papers:1406.6038
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/1406.6038
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Hofer, Vera & Krempl, Georg, 2013. "Drift mining in data: A framework for addressing drift in classification," Computational Statistics & Data Analysis, Elsevier, vol. 57(1), pages 377-391.
    2. Dirk Tasche, 2012. "The art of probability-of-default curve calibration," Papers 1212.3716, arXiv.org, revised Nov 2013.
    3. Daniel J. Hopkins & Gary King, 2010. "A Method of Automated Nonparametric Content Analysis for Social Science," American Journal of Political Science, John Wiley & Sons, vol. 54(1), pages 229-247, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dirk Tasche, 2014. "Exact Fit of Simple Finite Mixture Models," JRFM, MDPI, vol. 7(4), pages 1-15, November.
    2. Benjamin E. Bagozzi & Daniel Berliner & Zack W. Almquist, 2021. "When does open government shut? Predicting government responses to citizen information requests," Regulation & Governance, John Wiley & Sons, vol. 15(2), pages 280-297, April.
    3. Sinziana-Maria Rindasu & Ioan Dan Topor & Liliana Ionescu-Feleaga, 2023. "The Evolution of Management Accountants' Digital Skills in Industry 4.0: A Qualitative Approach," Oblik i finansi, Institute of Accounting and Finance, issue 1, pages 38-48, March.
    4. Giovanni Di Franco & Michele Santurro, 2021. "Machine learning, artificial neural networks and social research," Quality & Quantity: International Journal of Methodology, Springer, vol. 55(3), pages 1007-1025, June.
    5. Zhang, Han, 2021. "How Using Machine Learning Classification as a Variable in Regression Leads to Attenuation Bias and What to Do About It," SocArXiv 453jk, Center for Open Science.
    6. Mujtaba Ali Isani, 2021. "Methodological Problems of Using Arabic-Language Twitter as a Gauge for Arab Attitudes Toward Politics and Society," Contemporary Review of the Middle East, , vol. 8(1), pages 22-35, March.
    7. Pauwels, Koen & Aksehirli, Zeynep & Lackman, Andrew, 2016. "Like the ad or the brand? Marketing stimulates different electronic word-of-mouth content to drive online and offline performance," International Journal of Research in Marketing, Elsevier, vol. 33(3), pages 639-655.
    8. Aritz Bilbao-Jayo & Aitor Almeida, 2018. "Automatic political discourse analysis with multi-scale convolutional neural networks and contextual data," International Journal of Distributed Sensor Networks, , vol. 14(11), pages 15501477188, November.
    9. Rauh, Christian, 2018. "Validating a sentiment dictionary for German political language—a workbench note," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 15(4), pages 319-343.
    10. Andrea Sciandra & Alessio Surian & Livio Finos, 2021. "Supervised Machine Learning Methods to Disclose Action and Information in “U.N. 2030 Agenda” Social Media Data," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 156(2), pages 689-699, August.
    11. Aleti, Torgeir & Pallant, Jason I. & Tuan, Annamaria & van Laer, Tom, 2019. "Tweeting with the Stars: Automated Text Analysis of the Effect of Celebrity Social Media Communications on Consumer Word of Mouth," Journal of Interactive Marketing, Elsevier, vol. 48(C), pages 17-32.
    12. Martin Haselmayer & Marcelo Jenny, 2017. "Sentiment analysis of political communication: combining a dictionary approach with crowdcoding," Quality & Quantity: International Journal of Methodology, Springer, vol. 51(6), pages 2623-2646, November.
    13. Christopher D. Wirz & Michael A. Xenos & Dominique Brossard & Dietram Scheufele & Jennifer H. Chung & Luisa Massarani, 2018. "Rethinking Social Amplification of Risk: Social Media and Zika in Three Languages," Risk Analysis, John Wiley & Sons, vol. 38(12), pages 2599-2624, December.
    14. Tiziana CARPI & Airo HINO & Stefano Maria IACUS & Giuseppe PORRO, 2022. "A Japanese Subjective Well-Being Indicator Based on Twitter Data [‘Collective Smile: Measuring Societal Happiness from Geolocated Images’]," Social Science Japan Journal, University of Tokyo and Oxford University Press, vol. 25(2), pages 273-296.
    15. Oliver Blümke, 2020. "Estimating the probability of default for no‐default and low‐default portfolios," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 69(1), pages 89-107, January.
    16. Angela Chang & Peter J. Schulz & Angus Wenghin Cheong, 2020. "Online Newspaper Framing of Non-Communicable Diseases: Comparison of Mainland China, Taiwan, Hong Kong and Macao," IJERPH, MDPI, vol. 17(15), pages 1-15, August.
    17. Francesca Greco & Alessandro Polli, 2019. "Vaccines in Italy: the emotional text mining of social media," RIEDS - Rivista Italiana di Economia, Demografia e Statistica - The Italian Journal of Economic, Demographic and Statistical Studies, SIEDS Societa' Italiana di Economia Demografia e Statistica, vol. 73(1), pages 89-99, January-M.
    18. Marchand, André & Hennig-Thurau, Thorsten & Wiertz, Caroline, 2017. "Not all digital word of mouth is created equal: Understanding the respective impact of consumer reviews and microblogs on new product success," International Journal of Research in Marketing, Elsevier, vol. 34(2), pages 336-354.
    19. Lehotský, Lukáš & Černoch, Filip & Osička, Jan & Ocelík, Petr, 2019. "When climate change is missing: Media discourse on coal mining in the Czech Republic," Energy Policy, Elsevier, vol. 129(C), pages 774-786.
    20. Luigi Curini & Stefano Iacus & Luciano Canova, 2015. "Measuring Idiosyncratic Happiness Through the Analysis of Twitter: An Application to the Italian Case," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 121(2), pages 525-542, April.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:1406.6038. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.