IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2512.15088.html

SigMA: Path Signatures and Multi-head Attention for Learning Parameters in fBm-driven SDEs

Author

Listed:
  • Xianglin Wu
  • Chiheb Ben Hammouda
  • Cornelis W. Oosterlee

Abstract

Stochastic differential equations (SDEs) driven by fractional Brownian motion (fBm) are increasingly used to model systems with rough dynamics and long-range dependence, such as those arising in quantitative finance and reliability engineering. However, these processes are non-Markovian and lack a semimartingale structure, rendering many classical parameter estimation techniques inapplicable or computationally intractable beyond very specific cases. This work investigates two central questions: (i) whether integrating path signatures into deep learning architectures can improve the trade-off between estimation accuracy and model complexity, and (ii) what constitutes an effective architecture for leveraging signatures as feature maps. We introduce SigMA (Signature Multi-head Attention), a neural architecture that integrates path signatures with multi-head self-attention, supported by a convolutional preprocessing layer and a multilayer perceptron for effective feature encoding. SigMA learns model parameters from synthetically generated paths of fBm-driven SDEs, including fractional Brownian motion, fractional Ornstein-Uhlenbeck, and rough Heston models, with a particular focus on estimating the Hurst parameter and on joint multi-parameter inference, and it generalizes robustly to unseen trajectories. Extensive experiments on synthetic data and two real-world datasets (i.e., equity-index realized volatility and Li-ion battery degradation) show that SigMA consistently outperforms CNN, LSTM, vanilla Transformer, and Deep Signature baselines in accuracy, robustness, and model compactness. These results demonstrate that combining signature transforms with attention-based architectures provides an effective and scalable framework for parameter inference in stochastic systems with rough or persistent temporal structure.

Suggested Citation

  • Xianglin Wu & Chiheb Ben Hammouda & Cornelis W. Oosterlee, 2025. "SigMA: Path Signatures and Multi-head Attention for Learning Parameters in fBm-driven SDEs," Papers 2512.15088, arXiv.org, revised Mar 2026.
  • Handle: RePEc:arx:papers:2512.15088
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2512.15088
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Henry Stone, 2020. "Calibrating rough volatility models: a convolutional neural network approach," Quantitative Finance, Taylor & Francis Journals, vol. 20(3), pages 379-392, March.
    2. Omar El Euch & Mathieu Rosenbaum, 2019. "The characteristic function of rough Heston models," Mathematical Finance, Wiley Blackwell, vol. 29(1), pages 3-38, January.
    3. Christian Bayer & Peter Friz & Jim Gatheral, 2016. "Pricing under rough volatility," Quantitative Finance, Taylor & Francis Journals, vol. 16(6), pages 887-904, June.
    4. M.L. Kleptsyna & A. Le Breton, 2002. "Statistical Analysis of the Fractional Ornstein–Uhlenbeck Type Process," Statistical Inference for Stochastic Processes, Springer, vol. 5(3), pages 229-248, October.
    5. Jingtang Ma & Haofei Wu, 2022. "A fast algorithm for simulation of rough volatility models," Quantitative Finance, Taylor & Francis Journals, vol. 22(3), pages 447-462, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nicola F. Zaugg & Lech A. Grzelak, 2025. "Lifted Heston Model: Efficient Monte Carlo Simulation with Large Time Steps," Papers 2510.08805, arXiv.org.
    2. Yang, Wensheng & Ma, Jingtang & Cui, Zhenyu, 2025. "A general valuation framework for rough stochastic local volatility models and applications," European Journal of Operational Research, Elsevier, vol. 322(1), pages 307-324.
    3. Li, Yicun & Teng, Yuanyang, 2023. "Statistical inference in discretely observed fractional Ornstein–Uhlenbeck processes," Chaos, Solitons & Fractals, Elsevier, vol. 177(C).
    4. Christian Bayer & Peter K. Friz & Paul Gassiat & Jorg Martin & Benjamin Stemper, 2020. "A regularity structure for rough volatility," Mathematical Finance, Wiley Blackwell, vol. 30(3), pages 782-832, July.
    5. Eduardo Abi Jaber, 2022. "The characteristic function of Gaussian stochastic volatility models: an analytic expression," Working Papers hal-02946146, HAL.
    6. Eduardo Abi Jaber, 2022. "The characteristic function of Gaussian stochastic volatility models: an analytic expression," Finance and Stochastics, Springer, vol. 26(4), pages 733-769, October.
    7. Florian Aichinger & Sascha Desmettre, 2025. "Pricing of geometric Asian options in the Volterra-Heston model," Review of Derivatives Research, Springer, vol. 28(1), pages 1-30, April.
    8. Archil Gulisashvili, 2022. "Multivariate Stochastic Volatility Models and Large Deviation Principles," Papers 2203.09015, arXiv.org, revised Nov 2022.
    9. Blanka Horvath & Antoine Jacquier & Aitor Muguruza & Andreas Søjmark, 2024. "Functional central limit theorems for rough volatility," Finance and Stochastics, Springer, vol. 28(3), pages 615-661, July.
    10. Siu Hin Tang & Mathieu Rosenbaum & Chao Zhou, 2023. "Forecasting Volatility with Machine Learning and Rough Volatility: Example from the Crypto-Winter," Papers 2311.04727, arXiv.org, revised Feb 2024.
    11. Daniel Bartl & Michael Kupper & David J. Prömel & Ludovic Tangpi, 2019. "Duality for pathwise superhedging in continuous time," Finance and Stochastics, Springer, vol. 23(3), pages 697-728, July.
    12. Martin Friesen & Stefan Gerhold & Kristof Wiedermann, 2024. "Small-time central limit theorems for stochastic Volterra integral equations and their Markovian lifts," Papers 2412.15971, arXiv.org.
    13. Mathieu Rosenbaum & Jianfei Zhang, 2022. "On the universality of the volatility formation process: when machine learning and rough volatility agree," Papers 2206.14114, arXiv.org.
    14. David J. Promel & David Scheffels, 2022. "Pathwise uniqueness for singular stochastic Volterra equations with H\"older coefficients," Papers 2212.08029, arXiv.org, revised Jul 2024.
    15. Wenyuan Li & Haoqi Lyu, 2026. "Valuation of variable annuities under the Volterra mortality and rough Heston models," Papers 2604.00472, arXiv.org, revised Apr 2026.
    16. Ranieri Dugo & Giacomo Giorgio & Paolo Pigato, 2024. "The Multivariate Fractional Ornstein-Uhlenbeck Process," CEIS Research Paper 581, Tor Vergata University, CEIS, revised 28 Aug 2024.
    17. Ying Jiao & Chunhua Ma & Simone Scotti & Chao Zhou, 2021. "The Alpha‐Heston stochastic volatility model," Mathematical Finance, Wiley Blackwell, vol. 31(3), pages 943-978, July.
    18. Giulia Di Nunno & Kk{e}stutis Kubilius & Yuliya Mishura & Anton Yurchenko-Tytarenko, 2023. "From constant to rough: A survey of continuous volatility modeling," Papers 2309.01033, arXiv.org, revised Aug 2025.
    19. Christa Cuchiero & Sara Svaluto-Ferro, 2021. "Infinite-dimensional polynomial processes," Finance and Stochastics, Springer, vol. 25(2), pages 383-426, April.
    20. Giulia Di Nunno & Anton Yurchenko-Tytarenko, 2022. "Sandwiched Volterra Volatility model: Markovian approximations and hedging," Papers 2209.13054, arXiv.org, revised Jul 2024.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2512.15088. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.