IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2603.16729.html

GeMA: Learning Latent Manifold Frontiers for Benchmarking Complex Systems

Author

Listed:
  • Jia Ming Li
  • Anupriya
  • Daniel J. Graham

Abstract

Benchmarking the performance of complex systems such as rail networks, renewable generation assets and national economies is central to transport planning, regulation and macroeconomic analysis. Classical frontier methods, notably Data Envelopment Analysis (DEA) and Stochastic Frontier Analysis (SFA), estimate an efficient frontier in the observed input-output space and define efficiency as distance to this frontier, but rely on restrictive assumptions on the production set and only indirectly address heterogeneity and scale effects. We propose Geometric Manifold Analysis (GeMA), a latent manifold frontier framework implemented via a productivity-manifold variational autoencoder (ProMan-VAE). Instead of specifying a frontier function in the observed space, GeMA represents the production set as the boundary of a low-dimensional manifold embedded in the joint input-output space. A split-head encoder learns latent variables that capture technological structure and operational inefficiency. Efficiency is evaluated with respect to the learned manifold, endogenous peer groups arise as clusters in latent technology space, a quotient construction supports scale-invariant benchmarking, and a local certification radius, derived from the decoder Jacobian and a Lipschitz bound, quantifies the geometric robustness of efficiency scores. We validate GeMA on synthetic data with non-convex frontiers, heterogeneous technologies and scale bias, and on four real-world case studies: global urban rail systems (COMET), British rail operators (ORR), national economies (Penn World Table) and a high-frequency wind-farm dataset. Across these domains GeMA behaves comparably to established methods when classical assumptions hold, and provides additional insight in settings with pronounced heterogeneity, non-convexity or size-related bias.

Suggested Citation

  • Jia Ming Li & Anupriya & Daniel J. Graham, 2026. "GeMA: Learning Latent Manifold Frontiers for Benchmarking Complex Systems," Papers 2603.16729, arXiv.org.
  • Handle: RePEc:arx:papers:2603.16729
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2603.16729
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Robert C. Feenstra & Robert Inklaar & Marcel P. Timmer, 2015. "The Next Generation of the Penn World Table," American Economic Review, American Economic Association, vol. 105(10), pages 3150-3182, October.
    2. Keshvari, Abolfazl & Kuosmanen, Timo, 2013. "Stochastic non-convex envelopment of data: Applying isotonic regression to frontier estimation," European Journal of Operational Research, Elsevier, vol. 231(2), pages 481-491.
    3. Dominique Deprins & Léopold Simar & Henry Tulkens, 2006. "Measuring Labor-Efficiency in Post Offices," Springer Books, in: Parkash Chander & Jacques Drèze & C. Knox Lovell & Jack Mintz (ed.), Public goods, environmental externalities and fiscal competition, chapter 0, pages 285-309, Springer.
    4. Edward H. Nieh & Manuel Schottdorf & Nicolas W. Freeman & Ryan J. Low & Sam Lewallen & Sue Ann Koay & Lucas Pinto & Jeffrey L. Gauthier & Carlos D. Brody & David W. Tank, 2021. "Geometry of abstract learned knowledge in the hippocampus," Nature, Nature, vol. 595(7865), pages 80-84, July.
    5. Charnes, A. & Cooper, W. W. & Rhodes, E., 1978. "Measuring the efficiency of decision making units," European Journal of Operational Research, Elsevier, vol. 2(6), pages 429-444, November.
    6. Léopold Simar & Paul W. Wilson, 2015. "Statistical Approaches for Non-parametric Frontier Models: A Guided Tour," International Statistical Review, International Statistical Institute, vol. 83(1), pages 77-110, April.
    7. Sickles,Robin C. & Zelenyuk,Valentin, 2019. "Measurement of Productivity and Efficiency," Cambridge Books, Cambridge University Press, number 9781107687653, Enero-Abr.
    8. Léopold Simar, 2007. "How to improve the performances of DEA/FDH estimators in the presence of noise?," Journal of Productivity Analysis, Springer, vol. 28(3), pages 183-201, December.
    9. Smith, Andrew S.J., 2005. "The role of efficiency estimates in UK regulatory price reviews: The case of rail," Utilities Policy, Elsevier, vol. 13(4), pages 294-301, December.
    10. Meeusen, Wim & van den Broeck, Julien, 1977. "Efficiency Estimation from Cobb-Douglas Production Functions with Composed Error," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 18(2), pages 435-444, June.
    11. Aigner, Dennis & Lovell, C. A. Knox & Schmidt, Peter, 1977. "Formulation and estimation of stochastic frontier production function models," Journal of Econometrics, Elsevier, vol. 6(1), pages 21-37, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bao Hoang Nguyen & Valentin Zelenyuk, 2020. "Robust efficiency analysis of public hospitals in Queensland, Australia," CEPA Working Papers Series WP052020, School of Economics, University of Queensland, Australia.
    2. Sickles, Robin C. & Song, Wonho & Zelenyuk, Valentin, 2018. "Econometric Analysis of Productivity: Theory and Implementation in R," Working Papers 18-008, Rice University, Department of Economics.
    3. Bao Hoang Nguyen & Valentin Zelenyuk, 2021. "Aggregation of Outputs and Inputs for DEA Analysis of Hospital Efficiency: Economics, Operations Research and Data Science Perspectives," International Series in Operations Research & Management Science, in: Joe Zhu & Vincent Charles (ed.), Data-Enabled Analytics, pages 123-158, Springer.
    4. Christopher F. Parmeter & Valentin Zelenyuk, 2019. "Combining the Virtues of Stochastic Frontier and Data Envelopment Analysis," Operations Research, INFORMS, vol. 67(6), pages 1628-1658, November.
    5. Mike Tsionas & Christopher F. Parmeter & Valentin Zelenyuk, 2021. "Bridging the Divide? Bayesian Artificial Neural Networks for Frontier Efficiency Analysis," CEPA Working Papers Series WP082021, School of Economics, University of Queensland, Australia.
    6. Mike Tsionas & Valentin Zelenyuk, 2021. "Goodness-of-fit in Optimizing Models of Production: A Generalization with a Bayesian Perspective," CEPA Working Papers Series WP182021, School of Economics, University of Queensland, Australia.
    7. Daraio, Cinzia & Kerstens, Kristiaan & Nepomuceno, Thyago & Sickles, Robin C., 2019. "Empirical Surveys of Frontier Applications: A Meta-Review," Working Papers 19-005, Rice University, Department of Economics.
    8. Quaranta, Anna Grazia & Raffoni, Anna & Visani, Franco, 2018. "A multidimensional approach to measuring bank branch efficiency," European Journal of Operational Research, Elsevier, vol. 266(2), pages 746-760.
    9. Davtalab-Olyaie, Mostafa & Asgharian, Masoud & Nia, Vahid Partovi, 2019. "Stochastic ranking and dominance in DEA," International Journal of Production Economics, Elsevier, vol. 214(C), pages 125-138.
    10. Jean-François Brun & Constantin Thierry Compaore, 2021. "Public Expenditures Efficiency On Education Distribution in Developing Countries," CERDI Working papers hal-03116615, HAL.
    11. Kuosmanen, Timo & Johnson, Andrew, 2017. "Modeling joint production of multiple outputs in StoNED: Directional distance function approach," European Journal of Operational Research, Elsevier, vol. 262(2), pages 792-801.
    12. W. Cooper & C. Lovell, 2011. "History lessons," Journal of Productivity Analysis, Springer, vol. 36(2), pages 193-200, October.
    13. Zhichao Wang & Bao Hoang Nguyen & Valentin Zelenyuk, 2024. "Performance analysis of hospitals in Australia and its peers: a systematic and critical review," Journal of Productivity Analysis, Springer, vol. 62(2), pages 139-173, October.
    14. Simar, Léopold & Zelenyuk, Valentin & Zhao, Shirong, 2024. "Inference for aggregate efficiency: Theory and guidelines for practitioners," European Journal of Operational Research, Elsevier, vol. 316(1), pages 240-254.
    15. Ljubica Nedelkoska, 2010. "Occupations at risk: The task content and job stability," Jena Economics Research Papers 2010-024, Friedrich-Schiller-University Jena.
    16. Stefan Seifert, 2016. "Semi-Parametric Measures of Scale Characteristics of German Natural Gas-Fired Electricity Generation," Discussion Papers of DIW Berlin 1571, DIW Berlin, German Institute for Economic Research.
    17. Guillen, Maria D. & Aparicio, Juan & Kapelko, Magdalena & Esteve, Miriam, 2025. "Measuring environmental inefficiency through machine learning: An approach based on efficiency analysis trees and by-production technology," European Journal of Operational Research, Elsevier, vol. 321(2), pages 529-542.
    18. Holvad, Torben, 2020. "Efficiency analyses for the railway sector: An overview of key issues," Research in Transportation Economics, Elsevier, vol. 82(C).
    19. Mehmet Ali Cengiz & Talat Şenel, 2025. "Bayesian Robust Data Envelopment Analysis With Heavy‐Tailed Priors," Journal of Mathematics, John Wiley & Sons, vol. 2025(1).
    20. Caitlin T. O’Loughlin & Paul W. Wilson, 2021. "Benchmarking the performance of US Municipalities," Empirical Economics, Springer, vol. 60(6), pages 2665-2700, June.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2603.16729. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.