IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0087253.html
   My bibliography  Save this article

Using Reinforcement Learning to Provide Stable Brain-Machine Interface Control Despite Neural Input Reorganization

Author

Listed:
  • Eric A Pohlmeyer
  • Babak Mahmoudi
  • Shijia Geng
  • Noeline W Prins
  • Justin C Sanchez

Abstract

Brain-machine interface (BMI) systems give users direct neural control of robotic, communication, or functional electrical stimulation systems. As BMI systems begin transitioning from laboratory settings into activities of daily living, an important goal is to develop neural decoding algorithms that can be calibrated with a minimal burden on the user, provide stable control for long periods of time, and can be responsive to fluctuations in the decoder’s neural input space (e.g. neurons appearing or being lost amongst electrode recordings). These are significant challenges for static neural decoding algorithms that assume stationary input/output relationships. Here we use an actor-critic reinforcement learning architecture to provide an adaptive BMI controller that can successfully adapt to dramatic neural reorganizations, can maintain its performance over long time periods, and which does not require the user to produce specific kinetic or kinematic activities to calibrate the BMI. Two marmoset monkeys used the Reinforcement Learning BMI (RLBMI) to successfully control a robotic arm during a two-target reaching task. The RLBMI was initialized using random initial conditions, and it quickly learned to control the robot from brain states using only a binary evaluative feedback regarding whether previously chosen robot actions were good or bad. The RLBMI was able to maintain control over the system throughout sessions spanning multiple weeks. Furthermore, the RLBMI was able to quickly adapt and maintain control of the robot despite dramatic perturbations to the neural inputs, including a series of tests in which the neuron input space was deliberately halved or doubled.

Suggested Citation

  • Eric A Pohlmeyer & Babak Mahmoudi & Shijia Geng & Noeline W Prins & Justin C Sanchez, 2014. "Using Reinforcement Learning to Provide Stable Brain-Machine Interface Control Despite Neural Input Reorganization," PLOS ONE, Public Library of Science, vol. 9(1), pages 1-12, January.
  • Handle: RePEc:plo:pone00:0087253
    DOI: 10.1371/journal.pone.0087253
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0087253
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0087253&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0087253?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Leigh R. Hochberg & Daniel Bacher & Beata Jarosiewicz & Nicolas Y. Masse & John D. Simeral & Joern Vogel & Sami Haddadin & Jie Liu & Sydney S. Cash & Patrick van der Smagt & John P. Donoghue, 2012. "Reach and grasp by people with tetraplegia using a neurally controlled robotic arm," Nature, Nature, vol. 485(7398), pages 372-375, May.
    2. Leigh R. Hochberg & Mijail D. Serruya & Gerhard M. Friehs & Jon A. Mukand & Maryam Saleh & Abraham H. Caplan & Almut Branner & David Chen & Richard D. Penn & John P. Donoghue, 2006. "Neuronal ensemble control of prosthetic devices by a human with tetraplegia," Nature, Nature, vol. 442(7099), pages 164-171, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Gerard Derosiere & Solaiman Shokur & Pierre Vassiliadis, 2025. "Reward signals in the motor cortex: from biology to neurotechnology," Nature Communications, Nature, vol. 16(1), pages 1-15, December.
    2. Josh Merel & David Carlson & Liam Paninski & John P Cunningham, 2016. "Neuroprosthetic Decoder Training as Imitation Learning," PLOS Computational Biology, Public Library of Science, vol. 12(5), pages 1-24, May.
    3. repec:osf:thesis:4j3fu_v1 is not listed on IDEAS
    4. Burkhart, Michael C., 2019. "A Discriminative Approach to Bayesian Filtering with Applications to Human Neural Decoding," Thesis Commons 4j3fu, Center for Open Science.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ujwal Chaudhary & Bin Xia & Stefano Silvoni & Leonardo G Cohen & Niels Birbaumer, 2017. "Brain–Computer Interface–Based Communication in the Completely Locked-In State," PLOS Biology, Public Library of Science, vol. 15(1), pages 1-25, January.
    2. Andrés Úbeda & Enrique Hortal & Eduardo Iáñez & Carlos Perez-Vidal & Jose M Azorín, 2015. "Assessing Movement Factors in Upper Limb Kinematics Decoding from EEG Signals," PLOS ONE, Public Library of Science, vol. 10(5), pages 1-12, May.
    3. Xingzhao Wang & Shun Wu & Hantao Yang & Yu Bao & Zhi Li & Changchun Gan & Yuanyuan Deng & Junyan Cao & Xue Li & Yun Wang & Chi Ren & Zhigang Yang & Zhengtuo Zhao, 2024. "Intravascular delivery of an ultraflexible neural electrode array for recordings of cortical spiking activity," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    4. repec:osf:thesis:4j3fu_v1 is not listed on IDEAS
    5. Josh Merel & David Carlson & Liam Paninski & John P Cunningham, 2016. "Neuroprosthetic Decoder Training as Imitation Learning," PLOS Computational Biology, Public Library of Science, vol. 12(5), pages 1-24, May.
    6. Hong Gi Yeom & June Sic Kim & Chun Kee Chung, 2014. "High-Accuracy Brain-Machine Interfaces Using Feedback Information," PLOS ONE, Public Library of Science, vol. 9(7), pages 1-7, July.
    7. Sai Deepesh Pokala & Marie Bernert & Takuya Nanami & Takashi Kohno & Timothée Lévi & Blaise Yvert, 2025. "A frugal Spiking Neural Network for unsupervised multivariate temporal pattern classification and multichannel spike sorting," Nature Communications, Nature, vol. 16(1), pages 1-16, December.
    8. No-Sang Kwak & Klaus-Robert Müller & Seong-Whan Lee, 2017. "A convolutional neural network for steady state visual evoked potential classification under ambulatory environment," PLOS ONE, Public Library of Science, vol. 12(2), pages 1-20, February.
    9. Baoguo Xu & Wenlong Li & Deping Liu & Kun Zhang & Minmin Miao & Guozheng Xu & Aiguo Song, 2022. "Continuous Hybrid BCI Control for Robotic Arm Using Noninvasive Electroencephalogram, Computer Vision, and Eye Tracking," Mathematics, MDPI, vol. 10(4), pages 1-20, February.
    10. Ke Yu & Hasan AI-Nashash & Nitish Thakor & Xiaoping Li, 2014. "The Analytic Bilinear Discrimination of Single-Trial EEG Signals in Rapid Image Triage," PLOS ONE, Public Library of Science, vol. 9(6), pages 1-10, June.
    11. Fan Li & Jazlyn Gallego & Natasha N. Tirko & Jenna Greaser & Derek Bashe & Rudra Patel & Eric Shaker & Grace E. Valkenburg & Alanoud S. Alsubhi & Steven Wellman & Vanshika Singh & Camila Garcia Padill, 2024. "Low-intensity pulsed ultrasound stimulation (LIPUS) modulates microglial activation following intracortical microelectrode implantation," Nature Communications, Nature, vol. 15(1), pages 1-21, December.
    12. Xiao-yu Sun & Bin Ye, 2023. "The functional differentiation of brain–computer interfaces (BCIs) and its ethical implications," Humanities and Social Sciences Communications, Palgrave Macmillan, vol. 10(1), pages 1-9, December.
    13. Lukas K. Amann & Virginia Casasnovas & Alexander Gail, 2025. "Visual target and task-critical feedback uncertainty impair different stages of reach planning in motor cortex," Nature Communications, Nature, vol. 16(1), pages 1-15, December.
    14. Dimitris Boufidis & Raghav Garg & Eugenia Angelopoulos & D. Kacy Cullen & Flavia Vitale, 2025. "Bio-inspired electronics: Soft, biohybrid, and “living” neural interfaces," Nature Communications, Nature, vol. 16(1), pages 1-17, December.
    15. Fuji Ren & Yanwei Bao, 2020. "A Review on Human-Computer Interaction and Intelligent Robots," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 19(01), pages 5-47, February.
    16. repec:plo:pcbi00:1007118 is not listed on IDEAS
    17. Elisa Donati & Giacomo Valle, 2024. "Neuromorphic hardware for somatosensory neuroprostheses," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    18. Javier León & Juan José Escobar & Andrés Ortiz & Julio Ortega & Jesús González & Pedro Martín-Smith & John Q Gan & Miguel Damas, 2020. "Deep learning for EEG-based Motor Imagery classification: Accuracy-cost trade-off," PLOS ONE, Public Library of Science, vol. 15(6), pages 1-30, June.
    19. Javier M Antelis & Luis Montesano & Ander Ramos-Murguialday & Niels Birbaumer & Javier Minguez, 2013. "On the Usage of Linear Regression Models to Reconstruct Limb Kinematics from Low Frequency EEG Signals," PLOS ONE, Public Library of Science, vol. 8(4), pages 1-14, April.
    20. repec:plo:pone00:0059049 is not listed on IDEAS
    21. Jonathan A Michaels & Benjamin Dann & Hansjörg Scherberger, 2016. "Neural Population Dynamics during Reaching Are Better Explained by a Dynamical System than Representational Tuning," PLOS Computational Biology, Public Library of Science, vol. 12(11), pages 1-22, November.
    22. Yidan Ding & Chalisa Udompanyawit & Yisha Zhang & Bin He, 2025. "EEG-based brain-computer interface enables real-time robotic hand control at individual finger level," Nature Communications, Nature, vol. 16(1), pages 1-20, December.
    23. Longshun Li & Aniya Hartzler & Dhariyat M. Menendez-Lustri & Jichu Zhang & Alex Chen & Danny V. Lam & Baylee Traylor & Emma Quill & David E. Nethery & George F. Hoeferlin & Christa L. Pawlowski & Mich, 2025. "Dexamethasone-loaded platelet-inspired nanoparticles improve intracortical microelectrode recording performance," Nature Communications, Nature, vol. 16(1), pages 1-15, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0087253. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.