
A control-function correction for endogeneity in random coefficients models: The case of choice-based recommender systems

Author

Listed:
  • Danaf, Mazen
  • Guevara, C. Angelo
  • Ben-Akiva, Moshe

Abstract

Applications of discrete choice models in personalization are becoming increasingly popular among researchers and practitioners. However, when users of such systems are presented with successive menus (or choice situations), the alternatives and attributes in each menu depend on the choices the user made in previous menus. This gives rise to endogeneity, which can result in inconsistent estimates. Our companion paper, Danaf et al. (2020), showed that the estimates are consistent only when the entire choice history of each user is included in estimation. However, this might not be feasible because of computational constraints or data availability. In this paper, we present a control-function (CF) correction for cases where the choice history cannot be included in estimation. Our method uses the attributes of non-personalized alternatives as instruments, and applies the CF correction by including interactions between the explanatory variables and the first-stage residuals. Estimation can be done either sequentially or simultaneously; the latter is more efficient if the model reflects the true data-generating process. This method is able to recover the population means of the distributed coefficients, especially with a long choice history. The variances are underestimated because part of the inter-consumer variability is explained by the residuals, which are included in the systematic utility. However, the population variances can be computed from the estimation results. The modified utility equations (which include the residuals) can be used in forecasting and model application, and provide superior fit and predictions.
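
The sequential variant described in the abstract can be illustrated with a toy two-stage sketch. This is not the authors' specification, which involves random coefficients and repeated menus per user. It assumes a single endogenous attribute `x` (the difference between two alternatives), a single hypothetical instrument `z`, a binary logit in place of a mixed logit, and simulated data in which the individual taste for `x` shifts with the same unobservable that drives `x`. All function names and numeric values are illustrative assumptions.

```python
import numpy as np


def first_stage_residuals(x, z):
    """Stage 1: OLS of the endogenous attribute x on the instrument z (plus intercept)."""
    Z = np.column_stack([np.ones(len(x)), z])
    coef, *_ = np.linalg.lstsq(Z, x, rcond=None)
    return x - Z @ coef


def cf_design(x, resid):
    """Stage 2 regressors: attribute, residual, and attribute-residual interaction."""
    return np.column_stack([x, resid, x * resid])


def fit_binary_logit(X, y, n_iter=50, tol=1e-8):
    """Maximum-likelihood binary logit (no intercept) via Newton-Raphson."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-(X @ beta)))
        grad = X.T @ (y - p)
        info = (X * (p * (1.0 - p))[:, None]).T @ X  # negative Hessian
        step = np.linalg.solve(info, grad)
        beta = beta + step
        if np.max(np.abs(step)) < tol:
            break
    return beta


def simulate_and_estimate(seed=0, n=20000):
    """Toy data: the taste for x varies with the same unobservable u that drives x."""
    rng = np.random.default_rng(seed)
    z = rng.normal(size=n)          # hypothetical instrument
    u = rng.normal(size=n)          # unobserved driver of both x and taste
    x = 0.8 * z + u                 # endogenous attribute
    taste = -1.0 + 0.5 * u          # individual coefficient, population mean -1
    eps = rng.logistic(size=n)      # logistic error gives a binary logit
    y = (taste * x + eps > 0).astype(float)
    resid = first_stage_residuals(x, z)
    return fit_binary_logit(cf_design(x, resid), y)


if __name__ == "__main__":
    # Coefficient order: [x, residual, x * residual]
    print(simulate_and_estimate())
```

In this simulated setup the coefficient on `x` plays the role of the population mean, while the interaction term absorbs the part of the taste variation that is explained by the first-stage residual. A simultaneous estimator would instead fit both stages in one likelihood, which this sketch does not attempt.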

Suggested Citation

  • Danaf, Mazen & Guevara, C. Angelo & Ben-Akiva, Moshe, 2023. "A control-function correction for endogeneity in random coefficients models: The case of choice-based recommender systems," Journal of Choice Modelling, Elsevier, vol. 46(C).
  • Handle: RePEc:eee:eejocm:v:46:y:2023:i:c:s1755534522000562
    DOI: 10.1016/j.jocm.2022.100399

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1755534522000562
    Download Restriction: Full text for ScienceDirect subscribers only

    References listed on IDEAS

    1. Joel Huber & Kenneth Train, 2000. "On the Similarity of Classical and Bayesian Estimates of Individual Mean Partworths," Economics Working Papers E00-289, University of California at Berkeley.
    2. Jeffrey M. Wooldridge, 2005. "Simple solutions to the initial conditions problem in dynamic, nonlinear panel data models with unobserved heterogeneity," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 20(1), pages 39-54, January.
    3. Hausman, Jerry, 2015. "Specification tests in econometrics," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 38(2), pages 112-134.
    4. David Card, 1993. "Using Geographic Variation in College Proximity to Estimate the Return to Schooling," Working Papers 696, Princeton University, Department of Economics, Industrial Relations Section.
    5. Olivier Toubia & Duncan I. Simester & John R. Hauser & Ely Dahan, 2003. "Fast Polyhedral Adaptive Conjoint Estimation," Marketing Science, INFORMS, vol. 22(3), pages 273-303.
    6. Alpaslan Akay, 2012. "Finite-sample comparison of alternative methods for estimating dynamic panel data models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 27(7), pages 1189-1204, November.
    7. Guevara, C. Angelo, 2018. "Overidentification tests for the exogeneity of instruments in discrete choice models," Transportation Research Part B: Methodological, Elsevier, vol. 114(C), pages 241-253.
    8. Koop, Gary & Poirier, Dale J. & Tobias, Justin L., 2007. "Bayesian Econometric Methods," Cambridge Books, Cambridge University Press, number 9780521671736, June.
    9. Fowkes, Tony, 2007. "The design and interpretation of freight stated preference experiments seeking to elicit behavioural valuations of journey attributes," Transportation Research Part B: Methodological, Elsevier, vol. 41(9), pages 966-980, November.
    10. Train, Kenneth E., 2009. "Discrete Choice Methods with Simulation," Cambridge Books, Cambridge University Press, number 9780521766555.
    11. Garen, John, 1984. "The Returns to Schooling: A Selectivity Bias Approach with a Continuous Choice Variable," Econometrica, Econometric Society, vol. 52(5), pages 1199-1218, September.
    12. Chan, Joshua & Koop, Gary & Poirier, Dale J. & Tobias, Justin L., 2019. "Bayesian Econometric Methods," Cambridge Books, Cambridge University Press, number 9781108423380.
    13. Heckman, James J, 1978. "Dummy Endogenous Variables in a Simultaneous Equation System," Econometrica, Econometric Society, vol. 46(4), pages 931-959, July.
    14. Hai Jiang & Xin Qi & He Sun, 2014. "Choice-Based Recommender Systems: A Unified Approach to Achieving Relevancy and Diversity," Operations Research, INFORMS, vol. 62(5), pages 973-993, October.
    15. Jeffrey M. Wooldridge, 2015. "Control Function Methods in Applied Econometrics," Journal of Human Resources, University of Wisconsin Press, vol. 50(2), pages 420-445.
    16. Train, Kenneth & Wilson, Wesley W., 2008. "Estimation on stated-preference experiments constructed from revealed-preference choices," Transportation Research Part B: Methodological, Elsevier, vol. 42(3), pages 191-203, March.
    17. Ben-Akiva, Moshe & McFadden, Daniel & Train, Kenneth, 2019. "Foundations of Stated Preference Elicitation: Consumer Behavior and Choice-based Conjoint Analysis," Foundations and Trends(R) in Econometrics, now publishers, vol. 10(1-2), pages 1-144, January.
    18. J. Miguel Villas-Boas & Russell S. Winer, 1999. "Endogeneity in Brand Choice Models," Management Science, INFORMS, vol. 45(10), pages 1324-1338, October.
    19. Danaf, Mazen & Guevara, Angelo & Atasoy, Bilge & Ben-Akiva, Moshe, 2020. "Endogeneity in adaptive choice contexts: Choice-based recommender systems and adaptive stated preferences surveys," Journal of Choice Modelling, Elsevier, vol. 34(C).
    20. Guevara, C. Angelo, 2015. "Critical assessment of five methods to correct for endogeneity in discrete-choice models," Transportation Research Part A: Policy and Practice, Elsevier, vol. 82(C), pages 240-254.
    21. Jain, Dipak C & Vilcassim, Naufel J & Chintagunta, Pradeep K, 1994. "A Random-Coefficients Logit Brand-Choice Model Applied to Panel Data," Journal of Business & Economic Statistics, American Statistical Association, vol. 12(3), pages 317-328, July.
    22. Cristian Angelo Guevara & Moshe E. Ben-Akiva, 2012. "Change of Scale and Forecasting with the Control-Function Method in Logit Models," Transportation Science, INFORMS, vol. 46(3), pages 425-437, August.
    23. Berry, Steven & Levinsohn, James & Pakes, Ariel, 1995. "Automobile Prices in Market Equilibrium," Econometrica, Econometric Society, vol. 63(4), pages 841-890, July.
    24. Chandra R. Bhat, 2000. "Incorporating Observed and Unobserved Heterogeneity in Urban Work Travel Mode Choice Modeling," Transportation Science, INFORMS, vol. 34(2), pages 228-238, May.
    25. Dan Horsky & Sanjog Misra & Paul Nelson, 2006. "Observed and Unobserved Preference Heterogeneity in Brand-Choice Models," Marketing Science, INFORMS, vol. 25(4), pages 322-335, July-August.
    26. Guevara, C. Angelo & Hess, Stephane, 2019. "A control-function approach to correct for endogeneity in discrete choice models estimated on SP-off-RP data and contrasts with an earlier FIML approach by Train & Wilson," Transportation Research Part B: Methodological, Elsevier, vol. 123(C), pages 224-239.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Danaf, Mazen & Guevara, Angelo & Atasoy, Bilge & Ben-Akiva, Moshe, 2020. "Endogeneity in adaptive choice contexts: Choice-based recommender systems and adaptive stated preferences surveys," Journal of Choice Modelling, Elsevier, vol. 34(C).
    2. Thomas E. Guerrero & C. Angelo Guevara & Elisabetta Cherchi & Juan de Dios Ortúzar, 2021. "Addressing endogeneity in strategic urban mode choice models," Transportation, Springer, vol. 48(4), pages 2081-2102, August.
    3. Gopalakrishnan, Raja & Guevara, C. Angelo & Ben-Akiva, Moshe, 2020. "Combining multiple imputation and control function methods to deal with missing data and endogeneity in discrete-choice models," Transportation Research Part B: Methodological, Elsevier, vol. 142(C), pages 45-57.
    4. Fernández-Antolín, Anna & Guevara, C. Angelo & de Lapparent, Matthieu & Bierlaire, Michel, 2016. "Correcting for endogeneity due to omitted attitudes: Empirical assessment of a modified MIS method using RP mode choice data," Journal of Choice Modelling, Elsevier, vol. 20(C), pages 1-15.
    5. Guevara, C. Angelo & Tang, Yue & Gao, Song, 2018. "The initial condition problem with complete history dependency in learning models for travel choices," Transportation Research Part B: Methodological, Elsevier, vol. 117(PB), pages 850-861.
    6. Guevara, C. Angelo & Hess, Stephane, 2019. "A control-function approach to correct for endogeneity in discrete choice models estimated on SP-off-RP data and contrasts with an earlier FIML approach by Train & Wilson," Transportation Research Part B: Methodological, Elsevier, vol. 123(C), pages 224-239.
    7. Lurkin, Virginie & Garrow, Laurie A. & Higgins, Matthew J. & Newman, Jeffrey P. & Schyns, Michael, 2017. "Accounting for price endogeneity in airline itinerary choice models: An application to Continental U.S. markets," Transportation Research Part A: Policy and Practice, Elsevier, vol. 100(C), pages 228-246.
    8. Guevara, C. Angelo, 2018. "Overidentification tests for the exogeneity of instruments in discrete choice models," Transportation Research Part B: Methodological, Elsevier, vol. 114(C), pages 241-253.
    9. Guevara, C. Angelo & Tirachini, Alejandro & Hurtubia, Ricardo & Dekker, Thijs, 2020. "Correcting for endogeneity due to omitted crowding in public transport choice using the Multiple Indicator Solution (MIS) method," Transportation Research Part A: Policy and Practice, Elsevier, vol. 137(C), pages 472-484.
    10. Walker, Joan L. & Ehlers, Emily & Banerjee, Ipsita & Dugundji, Elenna R., 2011. "Correcting for endogeneity in behavioral choice models with social influence variables," Transportation Research Part A: Policy and Practice, Elsevier, vol. 45(4), pages 362-374, May.
    11. Watanabe, Hajime & Maruyama, Takuya, 2023. "A Bayesian instrumental variable model for multinomial choice with correlated alternatives," Journal of Choice Modelling, Elsevier, vol. 46(C).
    12. Gregory L. Rosston & Scott J. Savage & Bradley S. Wimmer, 2018. "Price competition in the market for business telecommunications services," Journal of Regulatory Economics, Springer, vol. 54(1), pages 81-104, August.
    13. Herriges, Joseph A. & Phaneuf, Daniel J. & Tobias, Justin L., 2008. "Estimating demand systems when outcomes are correlated counts," Journal of Econometrics, Elsevier, vol. 147(2), pages 282-298, December.
    14. David A. Hensher & Edward Wei & Wen Liu & Loan Ho & Chinh Ho, 2023. "Development of a practical aggregate spatial road freight modal demand model system for truck and commodity movements with an application of a distance-based charging regime," Transportation, Springer, vol. 50(3), pages 1031-1071, June.
    15. Sarrias, Mauricio, 2021. "A two recursive equation model to correct for endogeneity in latent class binary probit models," Journal of Choice Modelling, Elsevier, vol. 40(C).
    16. Amil Petrin & Kenneth Train, 2003. "Omitted Product Attributes in Discrete Choice Models," NBER Working Papers 9452, National Bureau of Economic Research, Inc.
    17. Helveston, John Paul & Feit, Elea McDonnell & Michalek, Jeremy J., 2018. "Pooling stated and revealed preference data in the presence of RP endogeneity," Transportation Research Part B: Methodological, Elsevier, vol. 109(C), pages 70-89.
    18. Louis Grange & Felipe González & Ignacio Vargas & Rodrigo Troncoso, 2015. "A Logit Model With Endogenous Explanatory Variables and Network Externalities," Networks and Spatial Economics, Springer, vol. 15(1), pages 89-116, March.
    19. Sungho Park & Sachin Gupta, 2012. "Comparison of SML and GMM estimators for the random coefficient logit model using aggregate data," Empirical Economics, Springer, vol. 43(3), pages 1353-1372, December.
    20. Nikolay Archak & Anindya Ghose & Panagiotis G. Ipeirotis, 2011. "Deriving the Pricing Power of Product Features by Mining Consumer Reviews," Management Science, INFORMS, vol. 57(8), pages 1485-1509, August.
