IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1004948.html
   My bibliography  Save this article

Neuroprosthetic Decoder Training as Imitation Learning

Author

Listed:
  • Josh Merel
  • David Carlson
  • Liam Paninski
  • John P Cunningham

Abstract

Neuroprosthetic brain-computer interfaces function via an algorithm which decodes neural activity of the user into movements of an end effector, such as a cursor or robotic arm. In practice, the decoder is often learned by updating its parameters while the user performs a task. When the user’s intention is not directly observable, recent methods have demonstrated value in training the decoder against a surrogate for the user’s intended movement. Here we show that training a decoder in this way is a novel variant of an imitation learning problem, where an oracle or expert is employed for supervised training in lieu of direct observations, which are not available. Specifically, we describe how a generic imitation learning meta-algorithm, dataset aggregation (DAgger), can be adapted to train a generic brain-computer interface. By deriving existing learning algorithms for brain-computer interfaces in this framework, we provide a novel analysis of regret (an important metric of learning efficacy) for brain-computer interfaces. This analysis allows us to characterize the space of algorithmic variants and bounds on their regret rates. Existing approaches for decoder learning have been performed in the cursor control setting, but the available design principles for these decoders are such that it has been impossible to scale them to naturalistic settings. Leveraging our findings, we then offer an algorithm that combines imitation learning with optimal control, which should allow for training of arbitrary effectors for which optimal control can generate goal-oriented control. We demonstrate this novel and general BCI algorithm with simulated neuroprosthetic control of a 26 degree-of-freedom model of an arm, a sophisticated and realistic end effector.Author Summary: There are various existing methods for rapidly learning a decoder during closed-loop brain computer interface (BCI) tasks. While many of these methods work well in practice, there is no clear theoretical foundation for parameter learning. We offer a unification of closed-loop decoder learning setting as an imitation learning problem. This has two major consequences: first, our approach clarifies how to derive “intention-based” algorithms for any BCI setting, most notably more complex settings like control of an arm; and second, this framework allows us to provide theoretical results, building from an existing literature on the regret of related algorithms. After first demonstrating algorithmic performance in simulation on the well-studied setting of a user trying to reach targets by controlling a cursor on a screen, we then simulate a user controlling an arm with many degrees of freedom in order to grasp a wand. Finally, we describe how extensions in the online-imitation learning literature can improve BCI in additional settings.

Suggested Citation

  • Josh Merel & David Carlson & Liam Paninski & John P Cunningham, 2016. "Neuroprosthetic Decoder Training as Imitation Learning," PLOS Computational Biology, Public Library of Science, vol. 12(5), pages 1-24, May.
  • Handle: RePEc:plo:pcbi00:1004948
    DOI: 10.1371/journal.pcbi.1004948
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1004948
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1004948&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1004948?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Zheng Li & Joseph E O'Doherty & Timothy L Hanson & Mikhail A Lebedev & Craig S Henriquez & Miguel A L Nicolelis, 2009. "Unscented Kalman Filter for Brain-Machine Interfaces," PLOS ONE, Public Library of Science, vol. 4(7), pages 1-18, July.
    2. Josh Merel & Donald M Pianto & John P Cunningham & Liam Paninski, 2015. "Encoder-Decoder Optimization for Brain-Computer Interfaces," PLOS Computational Biology, Public Library of Science, vol. 11(6), pages 1-25, June.
    3. Leigh R. Hochberg & Daniel Bacher & Beata Jarosiewicz & Nicolas Y. Masse & John D. Simeral & Joern Vogel & Sami Haddadin & Jie Liu & Sydney S. Cash & Patrick van der Smagt & John P. Donoghue, 2012. "Reach and grasp by people with tetraplegia using a neurally controlled robotic arm," Nature, Nature, vol. 485(7398), pages 372-375, May.
    4. Mijail D. Serruya & Nicholas G. Hatsopoulos & Liam Paninski & Matthew R. Fellows & John P. Donoghue, 2002. "Instant neural control of a movement signal," Nature, Nature, vol. 416(6877), pages 141-142, March.
    5. Mark M. Churchland & John P. Cunningham & Matthew T. Kaufman & Justin D. Foster & Paul Nuyujukian & Stephen I. Ryu & Krishna V. Shenoy, 2012. "Neural population dynamics during reaching," Nature, Nature, vol. 487(7405), pages 51-56, July.
    6. Eric A Pohlmeyer & Babak Mahmoudi & Shijia Geng & Noeline W Prins & Justin C Sanchez, 2014. "Using Reinforcement Learning to Provide Stable Brain-Machine Interface Control Despite Neural Input Reorganization," PLOS ONE, Public Library of Science, vol. 9(1), pages 1-12, January.
    7. Jonathan C. Kao & Paul Nuyujukian & Stephen I. Ryu & Mark M. Churchland & John P. Cunningham & Krishna V. Shenoy, 2015. "Single-trial dynamics of motor cortex and their applications to brain-machine interfaces," Nature Communications, Nature, vol. 6(1), pages 1-12, November.
    8. Meel Velliste & Sagi Perel & M. Chance Spalding & Andrew S. Whitford & Andrew B. Schwartz, 2008. "Cortical control of a prosthetic arm for self-feeding," Nature, Nature, vol. 453(7198), pages 1098-1101, June.
    9. Leigh R. Hochberg & Mijail D. Serruya & Gerhard M. Friehs & Jon A. Mukand & Maryam Saleh & Abraham H. Caplan & Almut Branner & David Chen & Richard D. Penn & John P. Donoghue, 2006. "Neuronal ensemble control of prosthetic devices by a human with tetraplegia," Nature, Nature, vol. 442(7099), pages 164-171, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jonathan A Michaels & Benjamin Dann & Hansjörg Scherberger, 2016. "Neural Population Dynamics during Reaching Are Better Explained by a Dynamical System than Representational Tuning," PLOS Computational Biology, Public Library of Science, vol. 12(11), pages 1-22, November.
    2. Andrey Eliseyev & Tetiana Aksenova, 2016. "Penalized Multi-Way Partial Least Squares for Smooth Trajectory Decoding from Electrocorticographic (ECoG) Recording," PLOS ONE, Public Library of Science, vol. 11(5), pages 1-19, May.
    3. Nuri F Ince & Rahul Gupta & Sami Arica & Ahmed H Tewfik & James Ashe & Giuseppe Pellizzer, 2010. "High Accuracy Decoding of Movement Target Direction in Non-Human Primates Based on Common Spatial Patterns of Local Field Potentials," PLOS ONE, Public Library of Science, vol. 5(12), pages 1-11, December.
    4. Yasuhiko Nakanishi & Takufumi Yanagisawa & Duk Shin & Ryohei Fukuma & Chao Chen & Hiroyuki Kambara & Natsue Yoshimura & Masayuki Hirata & Toshiki Yoshimine & Yasuharu Koike, 2013. "Prediction of Three-Dimensional Arm Trajectories Based on ECoG Signals Recorded from Human Sensorimotor Cortex," PLOS ONE, Public Library of Science, vol. 8(8), pages 1-9, August.
    5. Zheng Li & Joseph E O'Doherty & Timothy L Hanson & Mikhail A Lebedev & Craig S Henriquez & Miguel A L Nicolelis, 2009. "Unscented Kalman Filter for Brain-Machine Interfaces," PLOS ONE, Public Library of Science, vol. 4(7), pages 1-18, July.
    6. Andrés Úbeda & Enrique Hortal & Eduardo Iáñez & Carlos Perez-Vidal & Jose M Azorín, 2015. "Assessing Movement Factors in Upper Limb Kinematics Decoding from EEG Signals," PLOS ONE, Public Library of Science, vol. 10(5), pages 1-12, May.
    7. Han-Lin Hsieh & Maryam M Shanechi, 2018. "Optimizing the learning rate for adaptive estimation of neural encoding models," PLOS Computational Biology, Public Library of Science, vol. 14(5), pages 1-34, May.
    8. Hong Gi Yeom & June Sic Kim & Chun Kee Chung, 2014. "High-Accuracy Brain-Machine Interfaces Using Feedback Information," PLOS ONE, Public Library of Science, vol. 9(7), pages 1-7, July.
    9. Ujwal Chaudhary & Bin Xia & Stefano Silvoni & Leonardo G Cohen & Niels Birbaumer, 2017. "Brain–Computer Interface–Based Communication in the Completely Locked-In State," PLOS Biology, Public Library of Science, vol. 15(1), pages 1-25, January.
    10. Shinsuke Koyama & Uri Eden & Emery Brown & Robert Kass, 2010. "Bayesian decoding of neural spike trains," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 62(1), pages 37-59, February.
    11. Tomislav Milekovic & Tonio Ball & Andreas Schulze-Bonhage & Ad Aertsen & Carsten Mehring, 2013. "Detection of Error Related Neuronal Responses Recorded by Electrocorticography in Humans during Continuous Movements," PLOS ONE, Public Library of Science, vol. 8(2), pages 1-20, February.
    12. Eric A Pohlmeyer & Babak Mahmoudi & Shijia Geng & Noeline W Prins & Justin C Sanchez, 2014. "Using Reinforcement Learning to Provide Stable Brain-Machine Interface Control Despite Neural Input Reorganization," PLOS ONE, Public Library of Science, vol. 9(1), pages 1-12, January.
    13. Javier M Antelis & Luis Montesano & Ander Ramos-Murguialday & Niels Birbaumer & Javier Minguez, 2013. "On the Usage of Linear Regression Models to Reconstruct Limb Kinematics from Low Frequency EEG Signals," PLOS ONE, Public Library of Science, vol. 8(4), pages 1-14, April.
    14. Sanaya N. Shroff & Eric Lowet & Sudiksha Sridhar & Howard J. Gritton & Mohammed Abumuaileq & Hua-An Tseng & Cyrus Cheung & Samuel L. Zhou & Krishnakanth Kondabolu & Xue Han, 2023. "Striatal cholinergic interneuron membrane voltage tracks locomotor rhythms in mice," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    15. Shan Yu & Andreas Klaus & Hongdian Yang & Dietmar Plenz, 2014. "Scale-Invariant Neuronal Avalanche Dynamics and the Cut-Off in Size Distributions," PLOS ONE, Public Library of Science, vol. 9(6), pages 1-12, June.
    16. Ian S Howard & David W Franklin, 2015. "Neural Tuning Functions Underlie Both Generalization and Interference," PLOS ONE, Public Library of Science, vol. 10(6), pages 1-21, June.
    17. Linlin Li & Shufang Zhao & Wenhao Ran & Zhexin Li & Yongxu Yan & Bowen Zhong & Zheng Lou & Lili Wang & Guozhen Shen, 2022. "Dual sensing signal decoupling based on tellurium anisotropy for VR interaction and neuro-reflex system application," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    18. Pierre O. Boucher & Tian Wang & Laura Carceroni & Gary Kane & Krishna V. Shenoy & Chandramouli Chandrasekaran, 2023. "Initial conditions combine with sensory evidence to induce decision-related dynamics in premotor cortex," Nature Communications, Nature, vol. 14(1), pages 1-28, December.
    19. Adrian M Haith & David M Huberdeau & John W Krakauer, 2015. "Hedging Your Bets: Intermediate Movements as Optimal Behavior in the Context of an Incomplete Decision," PLOS Computational Biology, Public Library of Science, vol. 11(3), pages 1-21, March.
    20. Tobias Pistohl & Thomas Sebastian Benedikt Schmidt & Tonio Ball & Andreas Schulze-Bonhage & Ad Aertsen & Carsten Mehring, 2013. "Grasp Detection from Human ECoG during Natural Reach-to-Grasp Movements," PLOS ONE, Public Library of Science, vol. 8(1), pages 1-11, January.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1004948. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.