IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v157y2021ics0167947320302395.html

Embedding and learning with signatures

Author

Listed:
  • Fermanian, Adeline

Abstract

Sequential and temporal data arise in many fields of research, such as quantitative finance, medicine, or computer vision. A novel approach for sequential learning, called the signature method and rooted in rough path theory, is considered. Its basic principle is to represent multidimensional paths by a graded feature set of their iterated integrals, called the signature. This approach relies critically on an embedding principle, which consists in representing discretely sampled data as paths, i.e., functions from [0,1] to Rd. After a survey of machine learning methodologies for signatures, the influence of embeddings on prediction accuracy is investigated with an in-depth study of three recent and challenging datasets. It is shown that a specific embedding, called lead–lag, is systematically the strongest performer across all datasets and algorithms considered. Moreover, an empirical study reveals that computing signatures over the whole path domain does not lead to a loss of local information. It is concluded that, with a good embedding, combining signatures with other simple algorithms achieves results competitive with state-of-the-art, domain-specific approaches.

Suggested Citation

  • Fermanian, Adeline, 2021. "Embedding and learning with signatures," Computational Statistics & Data Analysis, Elsevier, vol. 157(C).
  • Handle: RePEc:eee:csdana:v:157:y:2021:i:c:s0167947320302395
    DOI: 10.1016/j.csda.2020.107148
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947320302395
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2020.107148?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Flint, Guy & Hambly, Ben & Lyons, Terry, 2016. "Discretely sampled signals and the rough Hoff process," Stochastic Processes and their Applications, Elsevier, vol. 126(9), pages 2593-2614.
    2. Lajos Gergely Gyurk'o & Terry Lyons & Mark Kontkowski & Jonathan Field, 2013. "Extracting information from the signature of a financial data stream," Papers 1307.7244, arXiv.org, revised Jul 2014.
    3. Kokoszka, Piotr & Oja, Hanny & Park, Byeong & Sangalli, Laura, 2017. "Special issue on functional data analysis," Econometrics and Statistics, Elsevier, vol. 1(C), pages 99-100.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Mihriban Ceylan & David J. Promel, 2026. "Global universality via discrete-time signatures," Papers 2603.09773, arXiv.org.
    2. Hugo Inzirillo, 2024. "Clustering Digital Assets Using Path Signatures: Application to Portfolio Construction," Papers 2410.23297, arXiv.org.
    3. Christos Merkatas & Simo Särkkä, 2023. "System identification using autoregressive Bayesian neural networks with nonparametric noise models," Journal of Time Series Analysis, Wiley Blackwell, vol. 44(3), pages 319-330, May.
    4. Samuel N. Cohen & Giulia Mantoan & Lars Nesheim & 'Aureo de Paula & Arthur Turrell & Lingyi Yang, 2023. "Nowcasting using regression on signatures," Papers 2305.10256, arXiv.org, revised Dec 2025.
    5. Eduardo Abi Jaber & Louis-Amand G'erard, 2024. "Signature volatility models: pricing and hedging with Fourier," Papers 2402.01820, arXiv.org, revised Jun 2025.
    6. Xin Guo & Binnan Wang & Ruixun Zhang & Chaoyi Zhao, 2025. "On Consistency of Signature Using Lasso," Operations Research, INFORMS, vol. 73(5), pages 2530-2549, September.
    7. Chung I Lu & Julian Sester, 2024. "Generative modelling of financial time series with structured noise and MMD-based signature learning," Papers 2407.19848, arXiv.org, revised Nov 2025.
    8. Eduardo Abi Jaber & Louis-Amand Gérard, 2025. "Signature volatility models: pricing and hedging with Fourier," Post-Print hal-04435238, HAL.
    9. Fermanian, Adeline, 2022. "Functional linear regression with truncated signatures," Journal of Multivariate Analysis, Elsevier, vol. 192(C).
    10. Herv'e Andr`es & Alexandre Boumezoued & Benjamin Jourdain, 2022. "Signature-based validation of real-world economic scenarios," Papers 2208.07251, arXiv.org, revised Apr 2024.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hans Buhler & Blanka Horvath & Terry Lyons & Imanol Perez Arribas & Ben Wood, 2020. "A Data-driven Market Simulator for Small Data Environments," Papers 2006.14498, arXiv.org.
    2. Febrero-Bande, Manuel & González-Manteiga, Wenceslao & Prallon, Brenda & Saporito, Yuri F., 2023. "Functional classification of bitcoin addresses," Computational Statistics & Data Analysis, Elsevier, vol. 181(C).
    3. Febrero-Bande, Manuel & Galeano, Pedro & González-Manteiga, Wenceslao, 2019. "Estimation, imputation and prediction for the functional linear model with scalar response with responses missing at random," Computational Statistics & Data Analysis, Elsevier, vol. 131(C), pages 91-103.
    4. Łukasz Smaga & Hidetoshi Matsui, 2018. "A note on variable selection in functional regression via random subspace method," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 27(3), pages 455-477, August.
    5. Takanori Adachi & Yusuke Naritomi, 2021. "Discrete signature and its application to finance," Papers 2112.09342, arXiv.org, revised Jan 2022.
    6. Aneiros, Germán & Cao, Ricardo & Fraiman, Ricardo & Genest, Christian & Vieu, Philippe, 2019. "Recent advances in functional data analysis and high-dimensional statistics," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 3-9.
    7. Linze Li & William Ferreira, 2025. "Follow the Leader: Enhancing Systematic Trend-Following Using Network Momentum," Papers 2501.07135, arXiv.org.
    8. Eduardo Abi Jaber & Louis-Amand Gérard, 2025. "Hedging with memory: shallow and deep learning with signatures," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) hal-05197836, HAL.
    9. Eduardo Abi Jaber & Louis-Amand G'erard, 2025. "Hedging with memory: shallow and deep learning with signatures," Papers 2508.02759, arXiv.org.
    10. Stefanos Bennett & Mihai Cucuringu & Gesine Reinert, 2022. "Lead-lag detection and network clustering for multivariate time series with an application to the US equity market," Papers 2201.08283, arXiv.org.
    11. Jiang, Qing & Hušková, Marie & Meintanis, Simos G. & Zhu, Lixing, 2019. "Asymptotics, finite-sample comparisons and applications for two-sample tests with functional data," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 202-220.
    12. Zacharia Issa & Blanka Horvath, 2023. "Non-parametric online market regime detection and regime clustering for multidimensional and path-dependent data structures," Papers 2306.15835, arXiv.org.
    13. Terry Lyons & Sina Nejad & Imanol Perez Arribas, 2019. "Nonparametric pricing and hedging of exotic derivatives," Papers 1905.00711, arXiv.org.
    14. Vieu, Philippe, 2018. "On dimension reduction models for functional data," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 134-138.
    15. Aneiros, Germán & Horová, Ivana & Hušková, Marie & Vieu, Philippe, 2022. "On functional data analysis and related topics," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    16. Archimbaud, Aurore & Boulfani, Feriel & Gendre, Xavier & Nordhausen, Klaus & Ruiz-Gazen, Anne & Virta, Joni, 2025. "ICS for multivariate functional anomaly detection with applications to predictive maintenance and quality control," Econometrics and Statistics, Elsevier, vol. 33(C), pages 282-303.
    17. Ofelia Bonesini & Emilio Ferrucci & Ioannis Gasteratos & Antoine Jacquier, 2024. "Rough differential equations for volatility," Papers 2412.21192, arXiv.org, revised Mar 2026.
    18. Terry Lyons & Sina Nejad & Imanol Perez Arribas, 2019. "Numerical method for model-free pricing of exotic derivatives using rough path signatures," Papers 1905.01720, arXiv.org, revised Feb 2020.
    19. Imanol Perez Arribas & Cristopher Salvi & Lukasz Szpruch, 2020. "Sig-SDEs model for quantitative finance," Papers 2006.00218, arXiv.org, revised Jun 2020.
    20. Marc Sabate-Vidales & David v{S}iv{s}ka & Lukasz Szpruch, 2020. "Solving path dependent PDEs with LSTM networks and path signatures," Papers 2011.10630, arXiv.org.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:157:y:2021:i:c:s0167947320302395. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.