IDEAS home Printed from https://ideas.repec.org/a/gam/jftint/v15y2023i12p384-d1289522.html
   My bibliography  Save this article

Enhancements in BlenderBot 3: Expanding Beyond a Singular Model Governance and Boosting Generational Performance

Author

Listed:
  • Ondrej Kobza

    (Czech Institute of Informatics, Robotics and Cybernetics, Czech Technical University, 156 00 Prague, Czech Republic
    These authors contributed equally to this work.)

  • David Herel

    (Czech Institute of Informatics, Robotics and Cybernetics, Czech Technical University, 156 00 Prague, Czech Republic
    These authors contributed equally to this work.)

  • Jan Cuhel

    (Czech Institute of Informatics, Robotics and Cybernetics, Czech Technical University, 156 00 Prague, Czech Republic
    These authors contributed equally to this work.)

  • Tommaso Gargiani

    (Czech Institute of Informatics, Robotics and Cybernetics, Czech Technical University, 156 00 Prague, Czech Republic
    These authors contributed equally to this work.)

  • Jan Pichl

    (Czech Institute of Informatics, Robotics and Cybernetics, Czech Technical University, 156 00 Prague, Czech Republic
    These authors contributed equally to this work.)

  • Petr Marek

    (Czech Institute of Informatics, Robotics and Cybernetics, Czech Technical University, 156 00 Prague, Czech Republic
    These authors contributed equally to this work.)

  • Jakub Konrad

    (Czech Institute of Informatics, Robotics and Cybernetics, Czech Technical University, 156 00 Prague, Czech Republic
    These authors contributed equally to this work.)

  • Jan Sedivy

    (Czech Institute of Informatics, Robotics and Cybernetics, Czech Technical University, 156 00 Prague, Czech Republic)

Abstract

This paper provides a pioneering examination and enhancement of generative chat models, with a specific focus on the BlenderBot 3 model. Through meticulous interaction with a diverse set of human participants, we dissected the fundamental components of these models, unveiling several deficiencies, including long-term memory and entity recognition. Leveraging these insights, we engineered refined, streamlined iterations, culminating in a chatbot that transcends the capabilities of all existing models. Our work follows Occam’s razor principle and proves that, for tasks with relatively low complexity, using large overparameterized models instead of smaller ones does not bring significant benefits but increases latency, which may result in a lowered overall user experience. In upholding our commitment to transparency and the progression of shared knowledge, we have made our improved model universally accessible through open-source distribution.

Suggested Citation

  • Ondrej Kobza & David Herel & Jan Cuhel & Tommaso Gargiani & Jan Pichl & Petr Marek & Jakub Konrad & Jan Sedivy, 2023. "Enhancements in BlenderBot 3: Expanding Beyond a Singular Model Governance and Boosting Generational Performance," Future Internet, MDPI, vol. 15(12), pages 1-15, November.
  • Handle: RePEc:gam:jftint:v:15:y:2023:i:12:p:384-:d:1289522
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/15/12/384/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/15/12/384/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jftint:v:15:y:2023:i:12:p:384-:d:1289522. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.