IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v157y2021ics0167947320302395.html
   My bibliography  Save this article

Embedding and learning with signatures

Author

Listed:
  • Fermanian, Adeline

Abstract

Sequential and temporal data arise in many fields of research, such as quantitative finance, medicine, or computer vision. A novel approach for sequential learning, called the signature method and rooted in rough path theory, is considered. Its basic principle is to represent multidimensional paths by a graded feature set of their iterated integrals, called the signature. This approach relies critically on an embedding principle, which consists in representing discretely sampled data as paths, i.e., functions from [0,1] to Rd. After a survey of machine learning methodologies for signatures, the influence of embeddings on prediction accuracy is investigated with an in-depth study of three recent and challenging datasets. It is shown that a specific embedding, called lead–lag, is systematically the strongest performer across all datasets and algorithms considered. Moreover, an empirical study reveals that computing signatures over the whole path domain does not lead to a loss of local information. It is concluded that, with a good embedding, combining signatures with other simple algorithms achieves results competitive with state-of-the-art, domain-specific approaches.

Suggested Citation

  • Fermanian, Adeline, 2021. "Embedding and learning with signatures," Computational Statistics & Data Analysis, Elsevier, vol. 157(C).
  • Handle: RePEc:eee:csdana:v:157:y:2021:i:c:s0167947320302395
    DOI: 10.1016/j.csda.2020.107148
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947320302395
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2020.107148?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Flint, Guy & Hambly, Ben & Lyons, Terry, 2016. "Discretely sampled signals and the rough Hoff process," Stochastic Processes and their Applications, Elsevier, vol. 126(9), pages 2593-2614.
    2. Lajos Gergely Gyurk'o & Terry Lyons & Mark Kontkowski & Jonathan Field, 2013. "Extracting information from the signature of a financial data stream," Papers 1307.7244, arXiv.org, revised Jul 2014.
    3. Kokoszka, Piotr & Oja, Hanny & Park, Byeong & Sangalli, Laura, 2017. "Special issue on functional data analysis," Econometrics and Statistics, Elsevier, vol. 1(C), pages 99-100.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Christos Merkatas & Simo Särkkä, 2023. "System identification using autoregressive Bayesian neural networks with nonparametric noise models," Journal of Time Series Analysis, Wiley Blackwell, vol. 44(3), pages 319-330, May.
    2. Chung I Lu & Julian Sester, 2024. "Generative model for financial time series trained with MMD using a signature kernel," Papers 2407.19848, arXiv.org, revised Jul 2024.
    3. Fermanian, Adeline, 2022. "Functional linear regression with truncated signatures," Journal of Multivariate Analysis, Elsevier, vol. 192(C).
    4. Hugo Inzirillo, 2024. "Clustering Digital Assets Using Path Signatures: Application to Portfolio Construction," Papers 2410.23297, arXiv.org.
    5. Samuel N. Cohen & Silvia Lui & Will Malpass & Giulia Mantoan & Lars Nesheim & 'Aureo de Paula & Andrew Reeves & Craig Scott & Emma Small & Lingyi Yang, 2023. "Nowcasting with signature methods," Papers 2305.10256, arXiv.org.
    6. Eduardo Abi Jaber & Louis-Amand G'erard, 2024. "Signature volatility models: pricing and hedging with Fourier," Papers 2402.01820, arXiv.org.
    7. Herv'e Andr`es & Alexandre Boumezoued & Benjamin Jourdain, 2022. "Signature-based validation of real-world economic scenarios," Papers 2208.07251, arXiv.org, revised Apr 2024.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hans Buhler & Blanka Horvath & Terry Lyons & Imanol Perez Arribas & Ben Wood, 2020. "A Data-driven Market Simulator for Small Data Environments," Papers 2006.14498, arXiv.org.
    2. Febrero-Bande, Manuel & Galeano, Pedro & González-Manteiga, Wenceslao, 2019. "Estimation, imputation and prediction for the functional linear model with scalar response with responses missing at random," Computational Statistics & Data Analysis, Elsevier, vol. 131(C), pages 91-103.
    3. Stefanos Bennett & Mihai Cucuringu & Gesine Reinert, 2022. "Lead-lag detection and network clustering for multivariate time series with an application to the US equity market," Papers 2201.08283, arXiv.org.
    4. Jiang, Qing & Hušková, Marie & Meintanis, Simos G. & Zhu, Lixing, 2019. "Asymptotics, finite-sample comparisons and applications for two-sample tests with functional data," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 202-220.
    5. Terry Lyons & Sina Nejad & Imanol Perez Arribas, 2019. "Nonparametric pricing and hedging of exotic derivatives," Papers 1905.00711, arXiv.org.
    6. Terry Lyons & Sina Nejad & Imanol Perez Arribas, 2019. "Numerical method for model-free pricing of exotic derivatives using rough path signatures," Papers 1905.01720, arXiv.org, revised Feb 2020.
    7. Imanol Perez Arribas & Cristopher Salvi & Lukasz Szpruch, 2020. "Sig-SDEs model for quantitative finance," Papers 2006.00218, arXiv.org, revised Jun 2020.
    8. Marc Sabate-Vidales & David v{S}iv{s}ka & Lukasz Szpruch, 2020. "Solving path dependent PDEs with LSTM networks and path signatures," Papers 2011.10630, arXiv.org.
    9. Zhang, Xiaoke & Wang, Jane-Ling, 2018. "Optimal weighting schemes for longitudinal and functional data," Statistics & Probability Letters, Elsevier, vol. 138(C), pages 165-170.
    10. Febrero-Bande, Manuel & González-Manteiga, Wenceslao & Prallon, Brenda & Saporito, Yuri F., 2023. "Functional classification of bitcoin addresses," Computational Statistics & Data Analysis, Elsevier, vol. 181(C).
    11. Łukasz Smaga & Hidetoshi Matsui, 2018. "A note on variable selection in functional regression via random subspace method," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 27(3), pages 455-477, August.
    12. Takanori Adachi & Yusuke Naritomi, 2021. "Discrete signature and its application to finance," Papers 2112.09342, arXiv.org, revised Jan 2022.
    13. Aneiros, Germán & Cao, Ricardo & Fraiman, Ricardo & Genest, Christian & Vieu, Philippe, 2019. "Recent advances in functional data analysis and high-dimensional statistics," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 3-9.
    14. Vieu, Philippe, 2018. "On dimension reduction models for functional data," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 134-138.
    15. Aneiros, Germán & Horová, Ivana & Hušková, Marie & Vieu, Philippe, 2022. "On functional data analysis and related topics," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    16. Bongiorno, E.G. & Goia, A. & Vieu, P., 2020. "Estimating the complexity index of functional data: Some asymptotics," Statistics & Probability Letters, Elsevier, vol. 161(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:157:y:2021:i:c:s0167947320302395. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.