IDEAS home Printed from https://ideas.repec.org/a/inm/orijoc/v34y2022i6p3023-3041.html
   My bibliography  Save this article

A Computational Framework for Solving Nonlinear Binary Optimization Problems in Robust Causal Inference

Author

Listed:
  • Md Saiful Islam

    (Mechanical and Industrial Engineering, Northeastern University, Boston, Massachusetts 02115)

  • Md Sarowar Morshed

    (Mechanical and Industrial Engineering, Northeastern University, Boston, Massachusetts 02115)

  • Md. Noor-E-Alam

    (Mechanical and Industrial Engineering, Northeastern University, Boston, Massachusetts 02115)

Abstract

Identifying cause-effect relations among variables is a key step in the decision-making process. Whereas causal inference requires randomized experiments, researchers and policy makers are increasingly using observational studies to test causal hypotheses due to the wide availability of data and the infeasibility of experiments. The matching method is the most used technique to make causal inference from observational data. However, the pair assignment process in one-to-one matching creates uncertainty in the inference because of different choices made by the experimenter. Recently, discrete optimization models have been proposed to tackle such uncertainty; however, they produce 0-1 nonlinear problems and lack scalability. In this work, we investigate this emerging data science problem and develop a unique computational framework to solve the robust causal inference test instances from observational data with continuous outcomes. In the proposed framework, we first reformulate the nonlinear binary optimization problems as feasibility problems. By leveraging the structure of the feasibility formulation, we develop greedy schemes that are efficient in solving robust test problems. In many cases, the proposed algorithms achieve a globally optimal solution. We perform experiments on real-world data sets to demonstrate the effectiveness of the proposed algorithms and compare our results with the state-of-the-art solver. Our experiments show that the proposed algorithms significantly outperform the exact method in terms of computation time while achieving the same conclusion for causal tests. Both numerical experiments and complexity analysis demonstrate that the proposed algorithms ensure the scalability required for harnessing the power of big data in the decision-making process. Finally, the proposed framework not only facilitates robust decision making through big-data causal inference, but it can also be utilized in developing efficient algorithms for other nonlinear optimization problems such as quadratic assignment problems.

Suggested Citation

  • Md Saiful Islam & Md Sarowar Morshed & Md. Noor-E-Alam, 2022. "A Computational Framework for Solving Nonlinear Binary Optimization Problems in Robust Causal Inference," INFORMS Journal on Computing, INFORMS, vol. 34(6), pages 3023-3041, November.
  • Handle: RePEc:inm:orijoc:v:34:y:2022:i:6:p:3023-3041
    DOI: 10.1287/ijoc.2022.1226
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/ijoc.2022.1226
    Download Restriction: no

    File URL: https://libkey.io/10.1287/ijoc.2022.1226?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Cafieri, Sonia & Omheni, Riadh, 2017. "Mixed-integer nonlinear programming for aircraft conflict avoidance by sequentially applying velocity and heading angle changes," European Journal of Operational Research, Elsevier, vol. 260(1), pages 283-290.
    2. Alexander G. Nikolaev & Sheldon H. Jacobson & Wendy K. Tam Cho & Jason J. Sauppe & Edward C. Sewell, 2013. "Balance Optimization Subset Selection (BOSS): An Alternative Approach for Causal Inference with Observational Data," Operations Research, INFORMS, vol. 61(2), pages 398-412, April.
    3. Lu Zhen & Wenya Lv & Kai Wang & Chengle Ma & Ziheng Xu, 2020. "Consistent vehicle routing problem with simultaneous distribution and collection," Journal of the Operational Research Society, Taylor & Francis Journals, vol. 71(5), pages 813-830, May.
    4. Quinn McNemar, 1947. "Note on the sampling error of the difference between correlated proportions or percentages," Psychometrika, Springer;The Psychometric Society, vol. 12(2), pages 153-157, June.
    5. Iacus, Stefano M. & King, Gary & Porro, Giuseppe, 2011. "Multivariate Matching Methods That Are Monotonic Imbalance Bounding," Journal of the American Statistical Association, American Statistical Association, vol. 106(493), pages 345-361.
    6. Glover, Fred & Alidaee, Bahram & Rego, Cesar & Kochenberger, Gary, 2002. "One-pass heuristics for large-scale unconstrained binary quadratic problems," European Journal of Operational Research, Elsevier, vol. 137(2), pages 272-287, March.
    7. Jason J. Sauppe & Sheldon H. Jacobson, 2017. "The role of covariate balance in observational studies," Naval Research Logistics (NRL), John Wiley & Sons, vol. 64(4), pages 323-344, June.
    8. Carlo Alberto Bernardi & Enrico Miglierina & Elena Molho, 2019. "Stability of a convex feasibility problem," Journal of Global Optimization, Springer, vol. 75(4), pages 1061-1077, December.
    9. Alexis Diamond & Jasjeet S. Sekhon, 2013. "Genetic Matching for Estimating Causal Effects: A General Multivariate Matching Method for Achieving Balance in Observational Studies," The Review of Economics and Statistics, MIT Press, vol. 95(3), pages 932-945, July.
    10. Amitabh Basu & Jesús A. De Loera & Mark Junod, 2014. "On Chubanov's Method for Linear Programming," INFORMS Journal on Computing, INFORMS, vol. 26(2), pages 336-350, May.
    11. Yew Low, Rand Kwong & Faff, Robert & Aas, Kjersti, 2016. "Enhancing mean–variance portfolio selection by modeling distributional asymmetries," Journal of Economics and Business, Elsevier, vol. 85(C), pages 49-72.
    12. S R Howard & S D Pimentel, 2021. "The uniform general signed rank test and its design sensitivity [A measure of asymptotic efficiency for tests of a hypothesis based on the sum of observations]," Biometrika, Biometrika Trust, vol. 108(2), pages 381-396.
    13. Zhen, Lu & Ma, Chengle & Wang, Kai & Xiao, Liyang & Zhang, Wei, 2020. "Multi-depot multi-trip vehicle routing problem with time windows and release dates," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 135(C).
    14. José R. Zubizarreta, 2015. "Stable Weights that Balance Covariates for Estimation With Incomplete Outcome Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(511), pages 910-922, September.
    15. S. Lucidi & F. Rinaldi, 2010. "Exact Penalty Functions for Nonlinear Integer Programming Problems," Journal of Optimization Theory and Applications, Springer, vol. 145(3), pages 479-488, June.
    16. Lu Zhen & Ziheng Xu & Chengle Ma & Liyang Xiao, 2020. "Hybrid electric vehicle routing problem with mode selection," International Journal of Production Research, Taylor & Francis Journals, vol. 58(2), pages 562-576, January.
    17. Jason J. Sauppe & Sheldon H. Jacobson & Edward C. Sewell, 2014. "Complexity and Approximation Results for the Balance Optimization Subset Selection Model for Causal Inference in Observational Studies," INFORMS Journal on Computing, INFORMS, vol. 26(3), pages 547-566, August.
    18. Walter Murray & Kien-Ming Ng, 2010. "An algorithm for nonlinear optimization problems with binary variables," Computational Optimization and Applications, Springer, vol. 47(2), pages 257-288, October.
    19. Beau Coker & Cynthia Rudin & Gary King, 2021. "A Theory of Statistical Inference for Ensuring the Robustness of Scientific Results," Management Science, INFORMS, vol. 67(10), pages 6174-6197, October.
    20. Colin B. Fogarty, 2020. "Studentized Sensitivity Analysis for the Sample Average Treatment Effect in Paired Observational Studies," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(531), pages 1518-1530, July.
    21. José R. Zubizarreta, 2012. "Using Mixed Integer Programming for Matching in an Observational Study of Kidney Failure After Surgery," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(500), pages 1360-1371, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jason J. Sauppe & Sheldon H. Jacobson, 2017. "The role of covariate balance in observational studies," Naval Research Logistics (NRL), John Wiley & Sons, vol. 64(4), pages 323-344, June.
    2. Md Saiful Islam & Md Sarowar Morshed & Gary J Young & Md Noor-E-Alam, 2019. "Robust policy evaluation from large-scale observational studies," PLOS ONE, Public Library of Science, vol. 14(10), pages 1-19, October.
    3. Cousineau, Martin & Verter, Vedat & Murphy, Susan A. & Pineau, Joelle, 2023. "Estimating causal effects with optimization-based methods: A review and empirical comparison," European Journal of Operational Research, Elsevier, vol. 304(2), pages 367-380.
    4. Marco Morucci & Md. Noor-E-Alam & Cynthia Rudin, 2022. "A Robust Approach to Quantifying Uncertainty in Matching Problems of Causal Inference," INFORMS Joural on Data Science, INFORMS, vol. 1(2), pages 156-171, October.
    5. Jason J. Sauppe & Sheldon H. Jacobson & Edward C. Sewell, 2014. "Complexity and Approximation Results for the Balance Optimization Subset Selection Model for Causal Inference in Observational Studies," INFORMS Journal on Computing, INFORMS, vol. 26(3), pages 547-566, August.
    6. Hochbaum, Dorit S. & Rao, Xu & Sauppe, Jason, 2022. "Network flow methods for the minimum covariate imbalance problem," European Journal of Operational Research, Elsevier, vol. 300(3), pages 827-836.
    7. Martin Cousineau & Vedat Verter & Susan A. Murphy & Joelle Pineau, 2022. "Estimating causal effects with optimization-based methods: A review and empirical comparison," Papers 2203.00097, arXiv.org.
    8. María de los Angeles Resa & José R. Zubizarreta, 2020. "Direct and stable weight adjustment in non‐experimental studies with multivalued treatments: analysis of the effect of an earthquake on post‐traumatic stress," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(4), pages 1387-1410, October.
    9. Hee Youn Kwon & Jason J. Sauppe & Sheldon H. Jacobson, 2019. "Treatment Effect Decomposition and Bootstrap Hypothesis Testing in Observational Studies," Annals of Data Science, Springer, vol. 6(3), pages 491-511, September.
    10. Dutta Shouvik & Jacobson Sheldon H. & Sauppe Jason J., 2017. "Identifying NCAA tournament upsets using Balance Optimization Subset Selection," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 13(2), pages 79-93, June.
    11. Cui, Weiwei & Yang, Yiran & Di, Lei, 2023. "Modeling and optimization for static-dynamic routing of a vehicle with additive manufacturing equipment," International Journal of Production Economics, Elsevier, vol. 257(C).
    12. Datta, Nirupam, 2015. "Evaluating Impacts of Watershed Development Program on Agricultural Productivity, Income, and Livelihood in Bhalki Watershed of Bardhaman District, West Bengal," World Development, Elsevier, vol. 66(C), pages 443-456.
    13. Tulasi Malini Maharatha & Sumirtha Gandhi & Umakant Dash, 2021. "Has the Demand and Supply-side Components of Janani Suraksha Yojana Augmented the Uptake of Maternal Health Care Services among Poor Women in India ? : An Application of Hybrid Matching Technique," BASE University Working Papers 08/2021, BASE University, Bengaluru, India.
    14. Shouvik Dutta & Jason Sauppe & Sheldon Jacobson, 2016. "Targeted Marketing Using Balance Optimization Subset Selection," Annals of Data Science, Springer, vol. 3(4), pages 423-444, December.
    15. Marianna De Santis & Stefano Lucidi & Francesco Rinaldi, 2013. "A new class of functions for measuring solution integrality in the Feasibility Pump approach: Complete Results," DIAG Technical Reports 2013-05, Department of Computer, Control and Management Engineering, Universita' degli Studi di Roma "La Sapienza".
    16. Huber, Martin, 2019. "An introduction to flexible methods for policy evaluation," FSES Working Papers 504, Faculty of Economics and Social Sciences, University of Freiburg/Fribourg Switzerland.
    17. Ma, Cheng & Zhang, Liansheng, 2015. "On an exact penalty function method for nonlinear mixed discrete programming problems and its applications in search engine advertising problems," Applied Mathematics and Computation, Elsevier, vol. 271(C), pages 642-656.
    18. Brett R. Gordon & Florian Zettelmeyer & Neha Bhargava & Dan Chapsky, 2019. "A Comparison of Approaches to Advertising Measurement: Evidence from Big Field Experiments at Facebook," Marketing Science, INFORMS, vol. 38(2), pages 193-225, March.
    19. Mehrnaz Bathaee & Hamed Nozari & Agnieszka Szmelter-Jarosz, 2023. "Designing a New Location-Allocation and Routing Model with Simultaneous Pick-Up and Delivery in a Closed-Loop Supply Chain Network under Uncertainty," Logistics, MDPI, vol. 7(1), pages 1-33, January.
    20. Richard A. Nielsen, 2016. "Case Selection via Matching," Sociological Methods & Research, , vol. 45(3), pages 569-597, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:orijoc:v:34:y:2022:i:6:p:3023-3041. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.