Printed from https://ideas.repec.org/a/kap/qmktec/v11y2013i3d10.1007_s11129-013-9136-0.html

Multi level categorical data fusion using partially fused data

Authors

  • Zvi Gilula (Hebrew University)
  • Robert McCulloch (University of Chicago Booth School of Business)

Abstract

Data fusion poses challenging methodological issues for inferring the joint distribution of two random variables when the information available is mainly confined to the marginal distributions. When the variables are categorical, the challenges are even more severe. Applications of categorical data fusion are highly important in marketing, especially in advertising. Many categorical data fusion methods are confined to binary variables. In this paper we develop an innovative approach to categorical data fusion that extends previous methodologies and applies to categorical variables with any number of levels. We introduce a new concept of “evident dependence” that describes a variety of patterns of joint distributions given the marginals. Using information from partially fused data, our method smoothly accommodates a Bayesian approach based on mixtures of joint distributions constructed using evident dependence. The approach is illustrated using data from the advertising industry.
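To make the underlying problem concrete, the sketch below shows that fixed marginals for two multi-level categorical variables are compatible with many joint tables, and that a mixture of such tables keeps the same marginals. It is only an illustration under stated assumptions: it does not implement the paper's "evident dependence" construction, which the abstract does not define. The function names and the choice of two component tables (independence and a northwest-corner coupling) are hypothetical.

```python
import numpy as np


def independence_joint(p, q):
    """Joint table under independence: P(X=i, Y=j) = p[i] * q[j]."""
    return np.outer(p, q)


def comonotone_joint(p, q, tol=1e-12):
    """Northwest-corner coupling of p and q.

    Assumption for illustration: this is one extreme positive-dependence
    table with the given marginals, not the paper's construction.
    """
    p = np.asarray(p, dtype=float).copy()
    q = np.asarray(q, dtype=float).copy()
    joint = np.zeros((len(p), len(q)))
    i = j = 0
    while i < len(p) and j < len(q):
        # Put as much mass as both remaining marginals allow in cell (i, j).
        mass = min(p[i], q[j])
        joint[i, j] += mass
        p[i] -= mass
        q[j] -= mass
        # Move past whichever row/column has been exhausted.
        if p[i] <= tol:
            i += 1
        if j < len(q) and q[j] <= tol:
            j += 1
    return joint


def mixture_joint(p, q, w):
    """Convex mixture of the two tables; w is a hypothetical mixing weight in [0, 1]."""
    return (1.0 - w) * independence_joint(p, q) + w * comonotone_joint(p, q)


if __name__ == "__main__":
    p = [0.5, 0.3, 0.2]  # marginal of a 3-level variable (made-up numbers)
    q = [0.4, 0.6]       # marginal of a 2-level variable (made-up numbers)
    joint = mixture_joint(p, q, w=0.3)
    print(joint)
    print("row sums:", joint.sum(axis=1))
    print("column sums:", joint.sum(axis=0))
```

Every value of `w` yields a valid joint table with the same row and column sums, so the marginals alone cannot identify `w`. This is why additional information, such as the partially fused data mentioned in the abstract, is needed to learn about the dependence.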

Suggested Citation

  • Zvi Gilula & Robert McCulloch, 2013. "Multi level categorical data fusion using partially fused data," Quantitative Marketing and Economics (QME), Springer, vol. 11(3), pages 353-377, September.
  • Handle: RePEc:kap:qmktec:v:11:y:2013:i:3:d:10.1007_s11129-013-9136-0
    DOI: 10.1007/s11129-013-9136-0

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11129-013-9136-0
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.


    References listed on IDEAS

    1. Kiesl, Hans & Rässler, Susanne, 2006. "How valid can data fusion be?," IAB-Discussion Paper 200615, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany].
    2. Rubin, Donald B, 1986. "Statistical Matching Using File Concatenation with Adjusted Weights and Multiple Imputations," Journal of Business & Economic Statistics, American Statistical Association, vol. 4(1), pages 87-94, January.

    Citations


    Cited by:

    1. Zvi Gilula & Robert E. McCulloch & Yaacov Ritov & Oleg Urminsky, 2019. "A study into mechanisms of attitudinal scale conversion: A randomized stochastic ordering approach," Quantitative Marketing and Economics (QME), Springer, vol. 17(3), pages 325-357, September.
    2. Hyowon Kim & Greg M. Allenby, 2022. "Integrating Textual Information into Models of Choice and Scaled Response Data," Marketing Science, INFORMS, vol. 41(4), pages 815-830, July.
    3. Gessendorfer Jonathan & Beste Jonas & Drechsler Jörg & Sakshaug Joseph W., 2018. "Statistical Matching as a Supplement to Record Linkage: A Valuable Method to Tackle Nonconsent Bias?," Journal of Official Statistics, Sciendo, vol. 34(4), pages 909-933, December.
    4. Rajkumar Venkatesan & Alexander Bleier & Werner Reinartz & Nalini Ravishanker, 2019. "Improving customer profit predictions with customer mindset metrics through multiple overimputation," Journal of the Academy of Marketing Science, Springer, vol. 47(5), pages 771-794, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Andrea Cutillo & Mauro Scanu, 2020. "A Mixed Approach for Data Fusion of HBS and SILC," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 150(2), pages 411-437, July.
    2. François Gardes, 2021. "On the value of time and human life," Documents de travail du Centre d'Economie de la Sorbonne 21023, Université Panthéon-Sorbonne (Paris 1), Centre d'Economie de la Sorbonne.
    3. François Gardes, 2021. "A Solution to the Estimation of an Enlarged GDP Including Domestic Production: An Estimation on Micro Data," Post-Print halshs-03325362, HAL.
    4. Joost Ginkel & Pieter Kroonenberg, 2014. "Using Generalized Procrustes Analysis for Multiple Imputation in Principal Component Analysis," Journal of Classification, Springer;The Classification Society, vol. 31(2), pages 242-269, July.
    5. Peter van de Ven & Anne Harrison & Barbara Fraumeni & Dennis Fixler & David Johnson & Andrew Craig & Kevin Furlong, 2017. "A Consistent Data Series to Evaluate Growth and Inequality in the National Accounts Note: The views expressed in this research, including those related to statistical, methodological, technical, or op," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 63, pages 437-459, December.
    6. Norah Alyabs & Sy Han Chiou, 2022. "The Missing Indicator Approach for Accelerated Failure Time Model with Covariates Subject to Limits of Detection," Stats, MDPI, vol. 5(2), pages 1-13, May.
    7. Eugenio Zucchelli & Andrew M Jones & Nigel Rice, 2012. "The evaluation of health policies through dynamic microsimulation methods," International Journal of Microsimulation, International Microsimulation Association, vol. 5(1), pages 2-20.
    8. Michael S. Rendall & Bonnie Ghosh-Dastidar & Margaret M. Weden & Zafar Nazarov, 2011. "Multiple Imputation for Combined-Survey Estimation With Incomplete Regressors In One But Not Both Surveys," Working Papers WR-887-1, RAND Corporation.
    9. Joost R. Ginkel, 2020. "Standardized Regression Coefficients and Newly Proposed Estimators for $R^2$ in Multiply Imputed Data," Psychometrika, Springer;The Psychometric Society, vol. 85(1), pages 185-205, March.
    10. Arif Mamun & Ankita Patnaik & Michael Levere & Gina Livermore & Todd Honeycutt & Jacqueline Kauff & Karen Katz & AnnaMaria McCutcheon & Joseph Mastrianni & Brittney Gionfriddo, "undated". "Promoting Readiness of Minors in Supplemental Security Income (PROMISE): Technical Appendix to the Interim Services and Impact Report," Mathematica Policy Research Reports 24c37444a21d4046abb21395a, Mathematica Policy Research.
    11. Hao Dong & Daniel L. Millimet, 2020. "Propensity Score Weighting with Mismeasured Covariates: An Application to Two Financial Literacy Interventions," JRFM, MDPI, vol. 13(11), pages 1-24, November.
    12. Anil Alpman, 2015. "Implementing Rubin's Alternative Multiple Imputation Method for Statistical Matching in Stata," Post-Print hal-01159191, HAL.
    13. Brownstone, David, 1997. "Multiple Imputation Methodology for Missing Data, Non-Random Response, and Panel Attrition," University of California Transportation Center, Working Papers qt2zd6w6hh, University of California Transportation Center.
    14. Lamarche, Pierre, 2017. "Estimating consumption in the HFCS: Experimental results on the first wave of the HFCS," Statistics Paper Series 22, European Central Bank.
    15. Keane, Michael & Stavrunova, Olena, 2016. "Adverse selection, moral hazard and the demand for Medigap insurance," Journal of Econometrics, Elsevier, vol. 190(1), pages 62-78.
    16. Frethey-Bentham, Catherine, 2011. "Pseudo panels as an alternative study design," Australasian marketing journal, Elsevier, vol. 19(4), pages 281-292.
    17. Gina Yannitell Reinhardt, 2009. "Matching Donors and Nonprofits," Journal of Theoretical Politics, vol. 21(3), pages 283-309, July.
    18. Westermeier, Christian & Grabka, Markus M., 2016. "Longitudinal Wealth Data and Multiple Imputation: An Evaluation Study," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 10(3), pages 237-252.
    19. François Gardes, 2018. "On the value of time and human life," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) halshs-01903596, HAL.
    20. Marcello D’Orazio, 2015. "Integration and imputation of survey data in R: the StatMatch package," Romanian Statistical Review, Romanian Statistical Review, vol. 63(2), pages 57-68, June.

    More about this item

    Keywords

    Evident dependence; Mixture modeling; Copulas;

    JEL classification:

    • C39 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Other
    • C35 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Discrete Regression and Qualitative Choice Models; Discrete Regressors; Proportions
